Skip to main content

Identifying co-targets to fight drug resistance based on a random walk model

Abstract

Background

Drug resistance has now posed more severe and emergent threats to human health and infectious disease treatment. However, wet-lab approaches alone to counter drug resistance have so far still achieved limited success due to less knowledge about the underlying mechanisms of drug resistance. Our approach apply a heuristic search algorithm in order to extract active network under drug treatment and use a random walk model to identify potential co-targets for effective antibacterial drugs.

Results

We use interactome network of Mycobacterium tuberculosis and gene expression data which are treated with two kinds of antibiotic, Isoniazid and Ethionamide as our test data. Our analysis shows that the active drug-treated networks are associated with the trigger of fatty acid metabolism and synthesis and nicotinamide adenine dinucleotide (NADH)-related processes and those results are consistent with the recent experimental findings. Efflux pumps processes appear to be the major mechanisms of resistance but SOS response is significantly up-regulation under Isoniazid treatment. We also successfully identify the potential co-targets with literature confirmed evidences which are related to the glycine-rich membrane, adenosine triphosphate energy and cell wall processes.

Conclusions

With gene expression and interactome data supported, our study points out possible pathways leading to the emergence of drug resistance under drug treatment. We develop a computational workflow for giving new insights to bacterial drug resistance which can be gained by a systematic and global analysis of the bacterial regulation network. Our study also discovers the potential co-targets with good properties in biological and graph theory aspects to overcome the problem of drug resistance.

Background

Drug resistance has been posing an emergent threat to human health and infectious disease treatment. Drug resistance is a natural survival mechanism for bacteria when the cell is exposed to drug exposure. Mycobacterium tuberculosis (Mtb) remained to be one of the leading and widely spread killer infectious diseases. In 2008, estimated 390000-510000 cases of multidrug resistant Mtb recorded in WHO 2010 and this problem is worsened significantly by the emergence of drug resistance under clinical drug used. Up to now, methods employed to tackle the problem of drug resistance are rather arbitrary. Several wet-lab experiments and clinical decisions like rotation of antibiotic combinations, identification of new targets and chemical entities that may be less mutable are being explored to counter this problem by inhibiting the resistance mechanism [1]. However, those strategies are still not effective enough and have so far achieved limited success due to limited knowledge about how the resistance mechanisms are triggered in bacteria upon antibiotic drug treatment [2].

The gene expression depended upon the mechanism of action of the drug in the cell as a consequence of the action through metabolic and regulatory adjustments or triggering drug resistance more explicitly [3]. The high-throughput of microarray technology has led to explosion of data concerning the expression levels of the genes but most of statistical methods such as fold change and t-test identify genes with significant changes. However, based solely on the patterns of variations in terms of the increase or decrease in the expression levels of individual genes, it is in general hard to know the related processes involved in the mechanisms of the drug response and resistance.

Due to the increasing availability of protein interaction networks, network-based analysis provides an opportunity to discover an active (significant) network. The network provides a systems-level view of how genes and their products interact within the cell and explain the biological actions under specific condition. However, one weakness of the protein-protein interaction data is that it contains no information about the conditions under which the interactions may take place which means it is not a real snapshot of the interactions in vivo, but a union of the interactions activated under various conditions. Except protein interaction network supported, the network will be much more biologically insightful if the expression data is incorporated with them. Recently some various network-based approaches [4–8] based on protein interaction networks have obtained much better performance than traditional statistical approaches only based on the gene expression values. They first applied a scoring scheme to evaluate an active level of the network based on the gene expression of each gene or its interactions. In the second step, a search procedure is implemented to find the node connected in a sub-network has a highest score and to form a maximum-scoring sub-network. Due to this kind of the problem is NP-hard, several heuristic or approximate methods such as simulated annealing, locally greedy search and mathematical programming methods were proposed. Ideker et al. first formulated the problem of the active pathway detection, where the scoring function is given by a summation function of all genes' differentially expressed p-value within the sub-networks [9]. Dittrich et al. used an additive function of p-values based on a mixture model [10]. Breitling et al. proposed a method to score active sub-networks in terms of genes' order of their differential expression significance [11]. Sohler et al. searched for active networks by spanning the networks with a given set of seed proteins [7]. Such approaches as the vertex-based methods usually do not further select the active interaction relationships among the identified proteins. However, taking all the interactions among those 'active' proteins detected by those methods is inadequate, because under a particular condition, only a part of the interactions may be active. Guo et al. proposed a novel edge-based scoring to extract an active sub-network related to some investigated gene expression profiles under specific condition [12]. Han et al. calculated the average Pearson correlation coefficients between the hubs and their neighbors in the protein interaction networks [13]. Zhao proposed an integer linear programming (ILP) model to find the signal pathways by utilizing both protein interactions and microarray data [14]. Some edge-based scoring methods use statistical measures such as Pearson correlation coefficient for analyzing the pair relationships which do not work well in the small set of the microarray data and could be also unsuitable to explore true relationships because they are overly sensitive to the expression values.

With the availability of high throughput microarray data and interactome network, it is feasible to address the issue of drug resistance from the system's perspective. Wu et al. integrated those two kinds of the information to present a novel network-based approach to identify effective combination of drugs by comparing the sub-network affected by individual drugs [15]. Typically, the target of a drug inhibits the pathogen or arrests its growth but the resistant machinery is established via certain pathways. A recent idea is to counter the drug resistance that called a "co-target" instead of being the ancillary or secondary targets that have a critical physiological function for the survival of a cell and it helps a primary drug to inhibit the resistance mechanism [2]. Thus, a co-target could be either essential or non-essential but it is necessary to have a strong influence in the resistance network. Raman and Chandra formulated the problem of identifying a co-target as a search for the shortest paths obtained from the bacteria protein interaction network and calculated the "betweenness" attribute of genes to identify the potential co-target based on the gene expression values [2]. Although the shortest path analysis may yield a higher coverage than observed directly neighbors locally from the protein interaction data, this approach considers only a single length which ignores the potential contribution of the other paths with longer length. Due to the small world property of biological network, the shortest path length in a biological network is typically very small and most of the time there will be additional "relatedness" between two gene nodes [16, 17]. However, the shortest paths are the only routes of drug resistance and there are some "back-up" ways to make the robustness of the mechanism in bacteria [18]. Without any essential pathways related to the mechanism of drug resistance, Ayati et al. applied balanced network bipartition method to discover the co-targets which separate interaction network into disconnected pieces to effectively disrupt the survival of a bacterium when it has multiple pathways to trigger the drug resistance [18]. However, all of them simply take all the interactions in the public database as the edges and did not take the active interactions under antibiotic drug treatment into consideration.

We apply a computational approach that uses both gene expression data and interactome network to identify the active networks under antibiotic drug treatment. Then, we apply a random walk model to discover co-targets that are highly likely to affect the genes related to the mechanisms of the drug resistance through the main and back-up paths.

Methods

The overall workflow of our methods consists six steps which is shown in Figure 1. We use interactome network from STRING database in step 1 and assign weight values to the interactions based on the confidence scores and gene expression values from antibiotic drug treatment and control samples in step 2. We present an A* heuristic search method to identify the active networks under antibiotic drug treatment in step 3 and then classify different functional drug resistance pathways using known annotated curated resistance proteins [19] in step 4. We apply random walk method to discover potential co-targets in step 5 and 6.

Figure 1
figure 1

The overall workflow of our method.

Network construction from microarray data and protein-protein interactions database

The microarray data implies gene expression either up-regulated or down-regulated can be revealed in terms of two colored channel in the microarray data representing the intensity of the antibiotic treatment and control samples. The gene expression ratios are calculated as the median value of the pixels minus background pixel median value for one color channel divided by those for the other channel. We extract the median value of the log base 2 of each gene in experimental dataset because the median value of the normalized ratio is much harder to be affected by noise than the mean value. We derive a genome-scale interactome network from STRING database http://string-db.org/ where the interactions extracted from published literature including experimentally studied interactions from genome analysis [20]. The latter is based on well established bioinformatics concepts and methods including structural and functional linkages, genes belonging to a single operon or common neighborhood, pairs of proteins which share metabolites between them and suggested associations based on co-expression, domain fusion or conserved in a number of species [20]. In STRING database, a continuous confidence score is assigned to each interaction which is derived by benchmarking the performance of the predictions against a common reference set of trusted and true associations [20]. A higher confidence score Cuv is assigned while an interaction between two proteins u and v is supported by more types of evidences.

We formulate an undirected interactome network defined as G(V, E) where the node set V represents protein which is the product of gene v (v∈V) and edge set E represents the interactions e (e∈E) in the network. Because the network contains some false positives, we use the absolute value of the expression profile of each gene and the larger value denotes more significant differential expressed genes under drug treatment. We apply the weight to each edge which is defined in Equation (1) as the product of the confidence score C uv and the sum of the absolute value of gene expression values between two corresponding genes u and v in the edge.

w ( e ) = w u , v = A u , v = C u v × E x p u + E x p v
(1)

Exp u and Exp v are the average of the gene expression values of node u and v in the microarray. The higher the expression of the Exp u or Exp v denotes the larger differential expression changes of the genes. Then, we use the adjacent matrices A = (a uv )n*n to represent interactome network among n nodes where a uv denotes the weights of interactions between nodes u and v.

A* algorithm as heuristic search

In order to study the active networks relevant to drug response and resistance, it is required to define the source nodes to understand the flow of drug actions. DrugBank database http://www.drugbank.ca/ provides drug-related information and also determines the drug targets of an antibiotic drug [21]. Previous studies showed that the metabolic adjustments often occur so as to minimize the effect of inhibition on the particular protein and denoted that multiple proteins in the drug-related functional mechanism may be also targeted [22, 23]. According to the effect of such adjustments, it is reasonable to consider the proteins involved in the whole pathway as the source nodes rather than an individual protein [2]. Therefore, we use the drug targets from DrugBank database and the genes associated with the drug-related function as the source nodes for searching active networks.

In the protein interaction network, we try to search the linear or tree type active pathways and then assemble those paths to form the sub-network. However, searching linear paths may expand a large collection of new nodes while traversing new level of tree structure. In order to determine the range of path lengths in the network we would detect, we apply the heap-based Dijkstra's algorithm for each node to get the maximum length of the shortest path of all pairs of nodes in the network [24]. This information shows if any pair of nodes in the network can link to others at most the length and we thus use the length of the longest shortest path as the maximum length in the path searching. Therefore, we assume that the active network extraction issue is a minimum score linear path searching problem with the fixed length [25]. First, we normalize the weight w(e) of the edge e calculated by Equation (1) to be the range [0-1]. Then, we transfer the larger weight of the edge to be a smaller score and the score of the edge e between two corresponding genes u and v is calculated as score(e) = score(u, v) = -log(w(u,v)). The negative logarithm makes larger weight become smaller score and so on. We define the score of a path as the sum of scores of edges in the path and the formula is defined in Equation (2):

s c o r e ( p ) = ∑ e ∈ p s c o r e ( e )
(2)

Where score(e) is the score of an edge e in the path p.

To speed up the procedure in search of the minimum score linear path, it needs to prune the unexplored new nodes heuristically. We use the idea of A* search to design a pruning strategy and the heuristic function is to determine the weight of a pathway that reflects significance to some extent. We first run the search procedure 5000 times to determine the scores of all paths in the experiments formed a normal distribution. And then, the error rate based on the standard deviation score std , minimum score as score min and an average score of edges as score avg in the distribution help us to find the optimal pathway in estimating a bound heuristic function h(x) for a node x. We employ an A* search method that can explore heuristically after searching a fix length d in the paths and calculate the weight of a path form root node to the current node x as function g(x). The overall heuristic function of f(x) is defined in Equation (3) for finding a pathway with an optimal (minimum) score.

f x = g x + h x = s c o r e ( P d ) + s c o r e min × l - d
(3)

where l means the length of a path, d means the length from the source node that we have already traversed in the network, score(P d ) means the sum of the score up to the current node x with a length parameter d, score min means the minimum edge score in the network.

Because the lower f(x) a node is estimated, the more likely is it to be searched first. We set a bound score for a path p with length l that is defined as Equation (4) to control the quality of the path we could find:

B o u n d ( p ) = s c o r e a v g - α × s c o r e s t d × l
(4)

α is a constant factor to control the bound, score avg and score std means the average score and the standard deviation.

While we move to the next node through the edge in each search process, we compute heuristic function f(x) and compare it with the initially-set bound score. If f(x) exceeds the initially-set bound score, we do not expand the node further. For the nodes that are allowed to expand, their children nodes are expanded and their heuristic functions are computed and compared with the bound score again until the search reaches the end node. As the example in Figure 2, we consider finding a pathway with length l = 7 from the initial node A to the end node H. First we explore a fix length d = 2 from initial node A that lead us to node C, we start to estimate the score of a path with an additional length of 5 that yields a total weight 11 from current node C. The estimated score of the path is smaller than the bound score 11.13, therefore, we continue to traverse its children. The function of f(x) of current node D is 13.2 and therefore we cannot search into its children. We apply heuristic method to prune the search space instead of exhaustively searching for all the edges in the network.

Figure 2
figure 2

An example for A* searching method.

The known drug resistance genes reported in the previous research further help in classification of the paths [19]. After searching the paths using our heuristic method, we identify the potential drug resistance pathways with at least one curated resistance protein within the paths and assemble them into the active resistance network with active drug resistance gene set GDR.

Random walk to discover co-target

Random walk (RW) is a ranking algorithm which simulates a random walker starts on a set of seed nodes and moves to its immediate neighbors randomly at each step [26]. Each node in the graph is ranked by its probability of the random walker reaching the other nodes and the procedure of the RW model provides the basic idea to estimate the influence from the drug target to the other genes in the weighted network.

Initial probability for primary drug treatment using RW

Based on the characteristic of RW, we apply it to discover potential co-targets that have the maximum probability to affect the genes related to the drug resistance mechanisms. First, for every node v (v∈V), we define adj(v) which describes the set of nodes u with direct interaction with node v in the network G, and ws(v) as the sum of the weight associated from node v to its neighbors u in adjacency matrix A, their formal definitions are in Equations (5) and (6), respectively. The transition matrix M for RW is computed using the adjacency matrix A and ws(v) and the transition probability M uv from node v to node u is defined in Equation (7) where w(v,u) is calculated by Equation (1)

a d j ( v ) = u | ( v , u ) ∈ E
(5)
w s ( v ) = ∑ u ∈ a d j ( v ) w ( v , u )
(6)
M v u = p r o b a b i l i t y ( v - u ) = w ( v , u ) / w s ( v )
(7)

Let P 0 be the initial probability vector constructed in such a way that equal probabilities were assigned to all the source nodes with sum of their probabilities equal to 1. Let P s be a vector in which a node in the network holds the probability of finding itself in the random walk process up to the step s, the probability of P s+1 can be derived by

P s + 1 = M T P s
(8)

We plunge the transition matrix M and initial probability vector P 0 into the iterative Equation (8). After certain steps, the probabilities will reach a steady state which is obtained by performing the iteration until the difference between P s and P s+1 measured by L1 norm falls below a very small number such as 10-8. We define the vector P reference (d) as the reference probability vector represents a steady state probability vector of nodes for the treatment merely by the drug target d.

Discovering potential co-target

A combination of primary drug target and co-target should disrupt network and reduce the emergence of drug resistance thus allowing the main drug to kill the bacteria. The potentially predict co-target by looking at the topological properties of nodes in a drug-treated network in which a co-targeted protein is deleted or inhibited corresponding to remove or hard to reach it [27, 28]. It is possible to analyze the consequences on network structure by looking to the variations of the probability of all the other nodes while inhibiting the co-target protein. Given the weighted network of the primary drug treated network, we modify the transition matrix in order to determinate the probabilities of the resulting interactions after deleting or inhibiting the candidate co-targets. It is reasonable to assume the following constraints to specify the new transition matrix M':

(1) To inhibit proteins that are co-targets, the probability of the co-target nodes in the transition matrix should be set to a small value ε. In order to represent the inhibition of node in the network, the transition probability from node to the co-target protein should smaller than previous values. Therefore, we set the range of the parameter ε from 0 to TP min which is the minimum probability of the edge in the original transition matrix M.

(2) The sum of the probability of the nodes should be equal to 1, so the probabilities of the rest of nodes must be re-adjusted accordingly if at least one of the edges is set to ε.

In order to satisfy the above constraints, let ct(v) be a set of proteins whose nodes belong to adj(v) which are also co-targets, we have the definition in Equation (9). If ct(v) is an empty set, it denotes that node v does not direct interact with any co-target that we set in the experiment.

c t ( v ) = u | a d j ( v ) ∧ u  is a co - target 
(9)

For every node v in the network, if a node u in adj(v) belongs to ct(v), we reset the probability of walking into a co-target node with a small value ε (since it is assumed to be inhibited), else, let the number of the nodes in ct(v) as |ct(v)| and the sum of the weights of the nodes in adj(v) which are not in ct(v) as ws'(v) defined in Equation (10), we adjust the weight to each node which is not in ct(v) based on the weight ratio of w(v, u) and ws'(v) and one minus the probabilities attributed to co-target nodes as in Equation (11).

w s ′ ( v ) = ∑ w ∈ a d j ( v ) - c t ( v ) w ( v , u )
(10)
M v u ′ = p r o b a i l i t y ( v → u ) = ε u ∈ c t ( v ) w ( v , u ) w s ′ ( v ) 1 - c t ( v ) ε u ∉ c t ( v )
(11)

The small undirected network is represented in Figure 3(A) where node A is a primary drug target and all the weights of the edges are equal to one. Figure 3(B) is the adjacent matrix A and entries of the original transition matrix M are calculated according to Equation (7). While we choose the node C to be a co-target, the modified transition matrix M' is calculated according to Equations (9)-(11). Take node B as an example, first we get adj(B) = {A, C, E} and ct(B)={C} from Equation (9) and then we set the probability of M'BC and M'DC to be ε based on Equation (11). The probability of M'BA is calculated by

Figure 3
figure 3

An example of the transition matrix in the co-target assignment.

M B A ′ = p r o b a i l i t y ( B → A ) = 1 3 1 3 + 1 3 1 - ( 1 ) ε = 1 2 1 - ε
(12)

The probabilities of M'BE, M'DE, and M'DF are set in a similar manner. The initial probability P 0 is formed such that equal probabilities are assigned to the nodes that are targeted by the drug and its co-target with the sum equal to 1. In Figure 3(C) example, the initial probabilities for the pair of the primary drug target and its co-target are both set as 0.5 respectively. After certain steps, the probability will reach a steady state to the probability P cotarget (d, t) under the treatment of the primary antibiotic target d and its co-target t. Finally, we obtain a function F(d, t) which is shown in Equation (13) for every primary drug target and co-target pair. The function F(d, t) denotes the relative visit frequency of the gene set GDR for the co-target P cotarget (d, t) against the reference probability P reference (d). The larger value of the F(d, t) denotes that this co-target has a stronger influence to the active resistance gene set GDR with respect to the reference probability where only the drug target is under treatment.

F d , t = ∑ g ∈ G D R P c o t a r g e t d , t g / P r e f e r e n c e d g
(13)

where P cotarget (d, t) g denotes the probability of the gth gene which has the drug resistance in the vector P cotarget (d, t). After we calculate each gene t in active network using Equation (11), we rank gene sorted by the value of F(d, t) and try to identify potential co-targets that tend to have larger impact to GDR.

Results and Discussion

Our interactome networks of Mtb H37Rv extracted from STRING database which contains 3764 proteins with 179920 undirected interactions among them. We extract microarray data from Gene Expression Omnibus (GEO) at NCBI with accession number GSE1642 [29] which contains the most frequently used drugs for the treatment of Mtb, Isoniazid (INH) and Ethionamide (ETA). INH is a first-line drug used worldwide to treat Mtb and ETA is a second line drug. In the experiments, H37Rv treated with 0.2 mg/mL and 0.4 mg/mL INH (in 1 μL/mL EtOH) for 6 h with MIC (0.02 μg/mL) and control cells treated with equivalent amount of EtOH for 6 h. H37Rv also treated with 12 μg/mL and 40 μg/mL ETA with MIC (0.5 μg/mL). It must be noted that it is possible that the high concentration may lead to abnormal expression but there may be also a higher probability to develop drug resistance [2]. INH is known to be an inhibitor of mycolic acid biosynthesis (MAP) and ETA is a structural analog of INH that is also thought to inhibit the same biosynthesis. The MAP model contains 219 reactions and 197 metabolites, mediated through 28 proteins based on the complete and accurate with annotations from the latest literature [30]. After mapping to the microarray data, we use 21 proteins as source nodes for A* search to extract active networks.

Gene expression analysis in treated Mtb: Isoniazid and Ethionamide

Previous studies [31, 32] discovered 71 genes relevant to resistance mechanisms were classified into four types (a) efflux pumps which transport drugs out of the cell, (b) cytochromes and other target-modifying enzymes that cause potential chemical modification of drug molecules, (c) SOS-response and related genes leading to mutations or its regulatory region, (d) proteins involved in horizontal gene transfer (HGT) to import a target modifying protein from its environment. The efflux pump proteins directly pump the drug molecules out of the site of the cell [33, 34]. The cytochromes in the resistome are known to modify the drug structure. The proteins classified under the SOS category are believed to be important in mediating mechanisms important for DNA repair and hence in the emergence of mutations that give rise to drug resistance [35]. The proteins in the last category give rise to capabilities of reducing the physiological burden of the drug, typically by degrading the drug in a suitable manner [36–38].

The variation of the gene expression in the microarray data have shown that the lists of genes in fact were either increased or decreased upon exposure to the drug [3]. Table 1 shows the number of the up and down regulated genes belongs to curated resistance proteins microarray data [19]. There are 1920 up-regulated genes, 1806 down-regulated genes and the expression values of the 38 genes are equal to zero in INH samples. On the other hand, 1946 up-regulated genes, 1777 down-regulated genes and the expression values of the 41 genes are equal to zero in ETA samples. In the antibiotic efflux pumps category, there are 7 and 8 up-regulated genes while 10 and 9 genes are in the down-regulated set in INH and ETA samples. In SOS processes category, 5 up-regulated and 4 down-regulated genes are in INH samples while only 1 up-regulated gene and 8-down regulated genes in ETA samples. We take the absolute value of expression value to capture inhibitory activity (negative correlation) as well as activation activity (positive correlation) and we find that 31% (22/71) and 32.4% (23/71) of the genes' absolute expression values are larger than the average of the absolute expression values of all the genes in INH and ETA samples. Two genes with expression values which are larger than two standard deviations are iniA and efpA in both INH and ETA data.

Table 1 The number of the up and down regulated genes that belong to the curated resistance proteins in INH and ETA

The drug response and resistance networks under antibiotic treatment

We extract the paths with the length from three to seven where seven is the maximum length of the all pair shortest paths. We suppose there be a probability density function of the score of the active networks and there be parameters that can maximize the likelihood function to fit the density function. A simple and rapid method to calculate an approximate confidence interval is based on the application of the central limit theorem. We use the mean and variance to calculate the top 5% area as lower limits of the 95th percentile confidence interval in the distribution. We extract 681 and 679 active genes involved in the active networks from INH and ETA samples, respectively. Then, we identify the potential drug resistance pathways under drug treatment where at least one of curated resistance proteins is in the paths and assemble them to the active networks. There are 53 and 235 genes in GDR in INH and ETA active resistance networks. The paths to different functional resistance mechanisms for different drugs suggested that a given target may have a higher propensity for eliciting a specific mechanism of resistance [18]. From the functional enrichment point of view, we apply the Database for Annotation, Visualization and Integrated Discovery (DAVID) Gene Functional Classification Tool http://david.abcc.ncifcrf.gov/ to find the annotation for active gene set [39]. DAVID can condense a list of genes or associated biological terms into organized classes of related genes or biological modules by using a novel agglomeration algorithm. This organization is accomplished by mining the complex biological co-occurrences found in multiple sources of functional annotation. We focus on the functional processes and pathway enrichment of the active networks and select 3 out of 14 annotation categories related to the functional pathways and processes including Biological Process and Molecular Function in Gene Ontology (GO) and KEGG pathways. We filter the results that have at least 3 genes in each functional category with P-value < 0.05 and FDR < 0.25 see Additional file 1. Both of the genes treated by INH and ETA would facilitate the survival is seen through fatty acid metabolic and synthase activity and microarray experiments also show up-regulation of the genes involved in these pathways to a great extent [40]. Biotin-requiring enzymes including acetyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, propionyl-CoA carboxylase, and pyruvate carboxylase play essential roles in cell metabolism for the survival and pathogenesis of Mtb [41]. The co-factors facilitate the transfer of CO2 during carboxylation, decarboxylation, and transcarboxylation reactions in fatty acid and carbohydrate metabolism. Glycerophospholipid metabolism and ethylbenzene degradation decreasing their gene expression profiles indicate down-regulation in the mechanism of the ETA drug resistance.

We find that fatty acid metabolism activity and membrane metabolism are closely related to the cell wall biosynthesis in both INH and ETA active networks. Mtb has a very complex cell wall which is composed of the cross-linked peptidoglycans linked to arabinogalactans, and mycolic acids [42]. Lipoarabinomannan is presented in the outer layer of the cell envelope which is anchored in the cell membrane [43]. In addition, mycolic acids have a distinctive chemical nature and a lack of mycolic acid synthesis eventually results in loss of cellular integrity and the bacteria death [44]. Therefore, we focus on those processes and cell wall to explain the drug response and resistance mechanism in Figure 4. Nodes are labeled by their gene symbols and nodes in white color denote the overlapping genes in both INH and ETA active networks. Nodes in black color only exist in INH active network and nodes in gray color only in ETA active network. The dash arrow denotes the different gene expression values in ETA samples.

Figure 4
figure 4

Part of the active networks treated with INH and ETA.

The effect of INH is captured by modifying the interaction of its known target, NADH-dependent enoyl-[acyl-carrier-protein] reductase, inhA (Rv1484) [45]. Fatty acid degradation pathway is activated for the degradation of cell-membrane lipids due to the interruption of their biosynthesis by the inactivation of drug target inhA [45]. We reveal that the mechanism of INH is known to inhibit the biosynthesis of mycolic acid to break the cell wall by disrupting the fatty acid synthesis-II (FAS-II) pathways in Figure 4. The up-regulated FAS-II pathway is closely connected to a similar up-regulated fatty acid degradation pathway assembled by the enoyl-hydratases and acyl-CoA ligases as well as by the putative acyl-CoA dehydrogenase [45]. Gene inhA formed a complex between the kasA-acpM proteins was proposed to be the primary targets in both INH and ETA samples. The kasA genomic consists of five genes in an operon (fabD, acpM, kasA, kasB and accD6) and all of them encode for enzymes that are involved in the fatty acid biosynthesis pathway [46–49]. The NADH dehydrogenase (ndh) gene is bound to the active site of inhA and we also discover that emb proteins (embA, embB, embC) encoded a functional arabinosyltransferase are also involved in the biogenesis of the mycobacterial cell wall. They may play important roles in the cellular homeostasis of the cell and the expressions of these genes activate certain pathways which could link to the resistance. Although ETA and INH are similar to work against the drug treatment, we find that ETA as a thioamide drug which has been shown to be metabolized by a specific fad enzyme, Rv3854c (ethA) and different routes to the FAS-II pathway. The major DNA polymerase dnaE2 participates in DNA repair synthesis and SOS-related genes which is up-regulated in INH active network while it is down-regulated in ETA active network. On the other hand, we assume that Mtb treated with INH has stronger relationship with the SOS process to trigger drug resistance.

Except the global view of the processed in drug response and resistance mechanism, we list the paths with small scores which belong to different functional resistance mechanisms in Tables 2 and 3 where the average value Savg(P) denotes the overall score(p) divided by the number of genes involved in the path p. We observe that the multidrug efflux pumps appear to be one of the major resistance mechanisms in both INH and ETA resistance network. We show that the linear paths with small scores where fadE family are negatively controlled genes that binds to the transcription factors for regulating the expression of fatty acid biosynthesis [46]. Drug resistance related genes efpA, ccsA, cysG and dnaE2 are strongly associated with fadE family which can contribute directly to the emergence of drug resistance. We observe that efpA exists in both active networks and previous experimental observations from a time-kill kinetics study of Mtb showed that the efflux pumps are the predominant mechanism of drug resistance for INH and ETA [49, 50]. The expression of the drug transporting genes (efpA, iniA, iniB, cysG, and ccsA) cause pumping out of the active drug from the cell where cysG and ccsA catalyze the NADPH-dependent processes strongly significant in INH active network. Our findings suggest consistency with the recent experimental results and numbers of the proteins refer to the mechanism of the resistance are identified from literature see Additional file 2. We suggest that different drug targets cause different mechanism of the drug resistance and hence different drugs could trigger resistance through different routes.

Table 2 Top paths of INH resistance in active sub-networks
Table 3 Top paths of ETA resistance in active sub-networks

The variations of F(d, t) while setting different values of the parameter epsilon

In the experiments, TP min in INH and ETA samples are both 0.0025. We run our method using the values of the parameter ε between 0 and 0.0025 with step 0.0005 and get the set of the probabilities of the nodes calculated by Equation (13). Then, we show the absolute value of the variation of probability of each gene among different values of the parameter ε in INH and ETA samples in Figure 5. The results in both INH and ETA samples have small mean and standard deviation and also denote that the range of the parameter ε we set does not affect the probability of each gene.

Figure 5
figure 5

The absolute values of the variations of F(d, t) among different values of the parameter epsilon in INH and ETA samplesTables.

The potential co-targets discovered by random walk model

The unusual multimethyl-branched fatty acids in Mtb are important for the resistance processes of antibiotic resistance, pathogen survival, and virulence [51–53]. Therefore, inhibitors aimed at those processes have high potential to become new anti-tuberculosis therapeutics [54]. We recently suggest the concept of primary drug target and co-target pairs where the co-target could be a key protein in mediating drug resistance. We apply our method to find the potential co-targets with the larger values of the F(d, t) in Equation (13). Upon analysis of the potential co-targets only be the up-regulated genes, this indicates that the up-regulated genes seem to have higher influence and play a more important role for a specific purpose. We display 10 and 15 potential co-targets at the top 3% of the function F(d, t) in Table 4 and 5, respectively.

Table 4 Top 10 co-targets for countering drug resistance under INH treatment
Table 5 Top 15 co-targets for countering drug resistance under ETA treatment

Gene radA, Rv2165c, fadA, pgk, Rv3802c, and atpB are all annotated as potential targets in a comprehensive target identification pipeline including a network analysis of the protein-protein interactome, a flux balance analysis of the reactome, experimentally derived phenotype essentiality data, antibiotic resistance, sequence analyses and a structural assessment of targetability [55]. Large numbers of the enzymes responsible for fatty acid metabolism are also profiled in both cell wall and membrane fractions and our results observe several potential co-targets belong to FadEs, acyl-CoA and enoyl-CoA which are related processes on the fatty acid breakdown in Table 4[56, 57]. One of the fadEs, fadE14 has shown the distinct homology to proteins involved in lipid metabolism [58]. Previous studies have shown that acetyl coenzyme A (acetyl-CoA) carboxylase is a key bacterial component in ATPase enzymatic activity and essential protein-protein interaction which catalyzes the first step in fatty acid synthesis and cell growth [59]. One of the AccD5 (5th subunit of acyl-CoA carboxylases (ACCase), has been strongly implicated as one of the essential ACCases important for cell envelope lipid biosynthesis and also identified as inhibitor [60]. The other acetyl-CoA carboxylase-related proteins involved in the biosynthesis of unique cell wall lipids may be explored as possible targets for new drug targets. The conserved secretion (Sec) pathways perform the protein export which is essential participating in inserting integral membrane proteins into the cytoplasmic membrane for virulence with the help of the signal recognition particle (SRP) [61, 62]. We show that potential co-target Ffh is a SRP subunit which is associated with adenosine triphosphate (ATP) binding cassette transporter type of proteins are known to be involved in the efflux of drugs in bacterial systems [63]. DnaE2 polymerase response is up-regulated in INH active network and it is strongly associated with error-prone DNA repair such as radA. SOS response is significant up-regulated in INH samples therefore our method suggest that inhibition of the DNA repair genes may reduce the survival of the bacteria. Although Rv2137c is a hypothetical protein, it strongly interacts with the plasma membrane proteins lipoproteins, adrenodoxin oxidoreductase and cell wall processes which is annotated in STRING database [20]. The potential co-target cysG in INH active network is strongly related to the efflux pump processes and efflux inhibitor is also annotated in potential targets to combat antibiotic resistance [64].

In Table 5, we discover Rv1343c (lprD) which was previously shown to be cell wall-associated by proteomics and it could be a specific inhibitor to counter ETA resistance [65]. Lipoproteins such like lprD carry out important functions efficiently at the membrane aqueous interface and its biosynthetic pathway is also essential for bacterial viability [66]. Bacteria may be inherently resistant with a particular type of cell wall structure with an outer membrane that establishes a permeability barrier against the antibiotic. Previous works denoted that FadA5 appears more times in the shortest paths and also considered it as a hub in the resistance pathways [2]. ATP hydrolysis inhibitor such as atpB would inhibit the generation of cellular energy and it would play a role in the bacteria's defence against cell damage. TMC207 (formerly known as R207910) specifically inhibits Mtb by inhibition of ATP synthesis was recently reported to have a potent and selective anti-mycobacterial activity in Phase II clinical trials as a second-line TB drug [67]. Our identification of mycobacterial acid phosphatase may a new target preventing the establishment of intracellular in Mtb and this finding is also characterized in previous papers [68]. Boshoff et al. studies demonstrated that the genes in the NAD biosynthetic pathway offers an important view to understand of a resistance mechanism and also can be an attractive potential targets [69].

The essential genes are required to sustain cellular life and they make excellent to be drug targets. In Table 4 and 5, there are ~30% of genes in the co-targets set are essential based on the essentiality information retrieved from TB [70] and DEG v5.0 database [71]. It is noticeable that those essential genes in the co-target sets have fundamental roles in cell and inhibition of each of them is harmful for cell survival. On the other hand, the selected co-targets have good properties in biological and graph theory aspects. The biological validation for the predicated results from our method is difficult, but it turns out that some of our predicted results had been reported in the public literature for validation. According to our results, we focus on the strategies adapted by Mtb to counter the drug resistance by assessing mechanical barrier and ATP energy. Mechanical barrier is one of the protective strategies employed by Mtb against drug resistance by the alteration of the lipoglycans or fatty acid processes. ATP energy-based and cell wall-related inhibitors may make bacteria loss energy to survive against cell damage.

Conclusions

To tackle the problem from a drug resistance perspective, it is essential to understand the molecular mechanisms by which bacteria acquire drug resistance using a network-based approach. We develop a computational workflow for giving new insights to bacterial drug resistance which can be gained by a systems-level analysis of bacterial resistance networks. In our approach, we utilize information on STRING database and expression data to construct a weighted network and to decipher the active networks related to drug resistance using A* search method. We discover that genes in both INH and ETA active networks would facilitate survival related to trigger the processes in cell wall, fatty acid metabolism and synthesis, and NADH-related processes. Efflux pumps appear to be the major mechanisms of resistance under INH and ETA drug treatment in Mtb and SOS response is significant involved in INH active network. Several correlations for the predicted resistance paths corresponding to the experimental data are available in literature suggesting that information flows through the identified routes are probable and biologically significant.

We globally identify the potential co-targets which have higher probabilities to affect the genes related to the drug resistance mechanism through the main and back-up paths using our random walk model. Those co-targets related to lipo-rich membrane, ATP energy and cell wall-related processes for countering drug resistance. Knowledge of the active networks under drug treatment help us address more systematic and novel ways to discover the potential co-targets with good properties in biological and graph theory aspects for overcoming the problem of drug resistance. It is important to bear in mind that studies on a theoretically derived network may have some limitations due to missing out some important interactions have not been identified. In the future, the genome of the drug-resistant strain and non-drug-resistant strain can be compared to identify genes which also worth considering as significant feature for co-targets in the sequence level. Inhibition of the primary target and the co-target simultaneously seems to be a feasible and novel way to overcome the problem of drug resistance for other diseases.

References

  1. Tan YT, Tillett DJ, McKay IA: Molecular strategies for overcoming antibiotic resistance in bacteria. Molecular medicine today 2000,6(8):309-314. 10.1016/S1357-4310(00)01739-1

    Article  Google Scholar 

  2. Raman K, Chandra N: Mycobacterium tuberculosis interactome analysis unravels potential pathways to drug resistance. BMC Microbiology 2008,8(234):1471-2180.

    Google Scholar 

  3. Simon JW, Richard AS, Ken L, Laurent K, Robert CR, Gurdyal SB: The use of microarray analysis to determine the gene expression profiles of mycobacterium tuberculosis in response to anti-bacterial compounds. Tuberculosis (Edinburgh, Scotland) 2004,84(3):263-274. 10.1016/j.tube.2003.12.005

    Google Scholar 

  4. Nacu S, Rebecca CT, Lee P, Holmes S: Gene expression network analysis and applications to immunology. Bioinformatics 2007,23(7):850-858. 10.1093/bioinformatics/btm019

    Article  Google Scholar 

  5. Qiu YQ, Zhang S, Zhang XS: Uncovering differentially expressed pathways with protein interaction and gene expression data. The Second International Symposium on Optimization and Systems Biology 2008, 74-82.

    Google Scholar 

  6. Scott J, Ideker T, Karp RM, Sharan R: Efficient algorithms for detecting signaling pathways in protein interaction networks. Ninth Annual International Conference on Research in Computational Molecular Biology 2005, 3500: 1-13.

    Google Scholar 

  7. Sohler F, Hanisch D, Zimmer R: New methods for joint analysis of biological networks and expression data. Bioinformatics 2004,20(10):1517-1521. 10.1093/bioinformatics/bth112

    Article  Google Scholar 

  8. Zhao X, Wang R, Chen L, Aihara K: Automatic modeling of signal pathways from protein-protein interaction networks. J Bioinform Comput Biol 2009,7(2):309-322. 10.1142/S0219720009004138

    Article  Google Scholar 

  9. ldeker T, Ozier O, Schwikowski B, Siegel AF: Discovering regulatory and signaling circuits in molecular interaction networks. Bioinformatics 2002, 18: S233-S240. 10.1093/bioinformatics/18.suppl_1.S233

    Article  Google Scholar 

  10. Dittrich MT, Klau GW, Rosenwald A, Dandekar T, Müller T: Identifying functional modules in protein-protein interaction networks. Bioinformatics 2008,24(13):i223-i231. 10.1093/bioinformatics/btn161

    Article  Google Scholar 

  11. Breitling R, Amtmann A, Herzyk P: Graph-based iterative Group Analysis enhances microarray interpretation. BMC Bioinformatics 2004, 5: 100. 10.1186/1471-2105-5-100

    Article  Google Scholar 

  12. Guo Z, Li Y, Gong X, Yao C, Ma W, Wang D, Li Y, Zhu J, Zhang M, Yang D, Wang J: Edge-based scoring and searching method for identifying condition-responsive protein-protein interaction sub-network. Bioinformatics 2007,23(16):2121-2128. 10.1093/bioinformatics/btm294

    Article  Google Scholar 

  13. Han J, Bertin N, Hao T, Goldberg DS, Berriz GF, Zhang LV, Dupuy D, Walhout AJM, Cusick ME, Roth FP, Vidal M: Evidence for dynamically organized modularity in the yeast protein-protein interaction network. Nature 2004, 430: 88-93. 10.1038/nature02555

    Article  Google Scholar 

  14. Zhao XM, Wang RS, Chen L, Aihara K: Uncovering signal transduction networks from high-throughput data by integer linear programming. Nucl Acids Res 2008,36(9):e48. 10.1093/nar/gkn145

    Article  Google Scholar 

  15. Wu Z, Zhao XM, Chen L: A systems biology approach to identify effective cocktail drugs. BMC Systems Biology 2010,4(Suppl 2):S7. 10.1186/1752-0509-4-S2-S7

    Article  Google Scholar 

  16. Maslov S, Sneppen K: Specificity and Stability in Topology of Protein Networks. Science 2002,296(5569):910-913. 10.1126/science.1065103

    Article  Google Scholar 

  17. Yook S, Oltvai Z, Barabasi A: Functional and topological characterization of protein interaction networks. Proteomics 2004, 4: 928-942. 10.1002/pmic.200300636

    Article  Google Scholar 

  18. Ayati M, Taheri G, Arab S, Wong L, Eslahchi C: Overcoming Drug Resistance by Co-Targeting. IEEE International Conference on Bioinformatics & Biomedicine 2010, 198-202.

    Google Scholar 

  19. Smith PA, Romesberg FE: Combating bacteria and drug resistance by inhibiting mechanisms of persistence and adaptation. Nature chemical biology 2007,3(9):549-556. 10.1038/nchembio.2007.27

    Article  Google Scholar 

  20. Von-mering C, Huynen M, Jaeggi D, Schmidt S, Bork P, Snel B: STRING: a database of predicted functional associations between proteins. Nucleic Acids Research 2003,31(1):258-261. 10.1093/nar/gkg034

    Article  Google Scholar 

  21. Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, Gautam B, Hassanali M: DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Research 2008, 36: D901-D906.

    Article  Google Scholar 

  22. Nguyen L, Thompson CJ: Foundations of antibiotic resistance in bacterial physiology: the mycobacterial paradigm. TRENDS in Microbiology 2006,14(7):304-312. 10.1016/j.tim.2006.05.005

    Article  Google Scholar 

  23. Mdluli K, Slayden RA, Zhu Y, Ramaswamy S, Pan X, Mead D, Crane DD, Musser JM, Barry CE III: Inhibition of a Mycobacterium tuberculosis β-ketoacyl ACP Synthase by Isoniazid. Science 1998, 280: 1607-1610. 10.1126/science.280.5369.1607

    Article  Google Scholar 

  24. Dijkstra EW: A Note on Two Problems in Connexion with Graphs. Numerische Mathematik 1959, 1: 269-271. 10.1007/BF01386390

    Article  Google Scholar 

  25. Yeh CY, Yeh HY, Arias CR, Soo VW: Pathway detection from protein interaction networks and gene expression data using color-coding methods and A* search algorithms. The Scientific World Journal 2011.

    Google Scholar 

  26. Köhler S, Bauer S, Horn D, Robinson PN: Walking the Interactome for Prioritization of Candidate Disease Genes. The American Journal of Human Genetics 2008,82(4):949-958. 10.1016/j.ajhg.2008.02.013

    Article  Google Scholar 

  27. Bhalla US, Iyengar R: Emergent properties of networks of biological signaling pathways. Science 1999,283(5400):381-387. 10.1126/science.283.5400.381

    Article  Google Scholar 

  28. Scardon G, Laudanna C: Network centralities Interference and Robustness. Int J Complex Systems in Science 2011,1(2):164-168.

    Google Scholar 

  29. Boshoff HIM, Myers TG, Copp BR, McNeil MR, Wilson MA, Barry CE: The transcriptional responses of Mycobacterium tuberculosis to inhibitors of metabolism: novel insights into drug mechanisms of action. The Journal of Biological Chemistry 2004,279(38):40174-40184. 10.1074/jbc.M406796200

    Article  Google Scholar 

  30. Raman K, Rajagopalan P, Chandra N: Flux Balance Analysis of Mycolic Acid Pathway: Targets for Anti-Tubercular Drugs. PLoS Comput Biol 2005,1(5):e46. 10.1371/journal.pcbi.0010046

    Article  Google Scholar 

  31. Nguyen L, Thompson CJ: Foundations of antibiotic resistance in bacterial physiology: the mycobacterial paradigm. Trends Microbiol 2006,14(7):304-312. 10.1016/j.tim.2006.05.005

    Article  Google Scholar 

  32. Wright GD: The antibiotic resistome: the nexus of chemical and genetic diversity. Nat Rev Microbiol 2007,5(3):175-186. 10.1038/nrmicro1614

    Article  Google Scholar 

  33. Gupta AK, Katoch VM, Chauhan DS, Sharma R, Singh M, Venkatesan K, Sharma VD: Microarray analysis of efflux pump genes in multidrug-resistant mycobacterium tuberculosis during stress induced by common anti-tuberculous drugs. Microb Drug Resist 2009,16(1):21-28.

    Article  Google Scholar 

  34. McKeegan KS, Borges-Walmsley MI, Walmsley AR: Microbial and viral drug resistance mechanisms. Trends Microbiol 2002,10(10 Suppl):S8-S14.

    Article  Google Scholar 

  35. O'Sullivan DM, Hinds J, Butcher PD, Gillespie SH, McHugh TD: Mycobacterium tuberculosis DNA repair in response to subinhibitory concentrations of ciprofloxacin. J Antimicrob Chemother 2008,62(6):1199-1202. 10.1093/jac/dkn387

    Article  Google Scholar 

  36. Kurland CG, Canback B, Berg OG: Horizontal gene transfer: a critical view. Proc Natl Acad Sci USA 2003,100(17):9658-9662. 10.1073/pnas.1632870100

    Article  Google Scholar 

  37. Coros A, DeConno E, Derbyshire KM: Is6110, a mycobacterium tuberculosis complex-specific insertion sequence, is also present in the genome of mycobacterium smegmatis, suggestive of lateral gene transfer among mycobacterial species. J Bacteriol 2008,190(9):3408-3410. 10.1128/JB.00009-08

    Article  Google Scholar 

  38. Alekshun MN, Levy SB: Molecular mechanisms of antibacterial multidrug resistance. Cell 2007,128(6):1037-1050. 10.1016/j.cell.2007.03.004

    Article  Google Scholar 

  39. Huang D, Sherman B, Lempicki R: Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. Nat Protoc 2009,4(1):44-57.

    Article  Google Scholar 

  40. Bhat AG, Vashisht R, Chandra N: Modeling metabolic adjustment in Mycobacterium tuberculosis upon treatment with isoniazid. Systems and Synthetic Biology 2011,4(4):299-309.

    Article  Google Scholar 

  41. Savvi S, Warner DF, Kana BD, McKinney JD, Mizrahi V, Dawes SS: Functional Characterization of a Vitamin B12-Dependent Methylmalonyl Pathway in Mycobacterium tuberculosis: Implications for Propionate Metabolism during Growth on Fatty Acids. Journal of Bacteriology 2008,190(11):3886-3895. 10.1128/JB.01767-07

    Article  Google Scholar 

  42. Morita YS, Velasquez R, Taig E, Waller RF, Patterson JH, Tull D, Williams SJ, Billman-Jacobe H, McConville MJ: Compartmentalization of lipid biosynthesis in mycobacteria. J Biol Chem 2005, 280: 21645-21652. 10.1074/jbc.M414181200

    Article  Google Scholar 

  43. Besra GS, Brennan PJ: The mycobacterial cell wall: biosynthesis of arabinogalactan and lipoarabinomannan. Biochem Soc Trans 1997, 25: 845-850.

    Article  Google Scholar 

  44. Barry CE, Lee RE, Mdluli K, Sampson AE, Schroeder BG, Slayden RA, Yuan Y: Mycolic acids: structure, biosynthesis and physiological functions. Prog Lipid Res 1998, 37: 143-179. 10.1016/S0163-7827(98)00008-3

    Article  Google Scholar 

  45. Ehrt S, Schnappinger D: Mycobacterial survival strategies in the phagosome: defence against host stresses. Cell Microbiol 2009, 11: 1170-1178. 10.1111/j.1462-5822.2009.01335.x

    Article  Google Scholar 

  46. Banerjee A, Dubnau E, Quemard A, Balasubramanian V, Urn KS, Wilson T, Collins D, de Lisle G, Jacobs WR: inhA, a gene encoding a target for isoniazid and ethionamide in Mycobacterium tuberculosis. Science 1994,263(5144):227-230. 10.1126/science.8284673

    Article  Google Scholar 

  47. Wilson M, DeRisi J, Kristensen HH, Imboden P, Rane S, Brown PO, Schoolnik GK: Exploring drug-induced alterations in gene expression in Mycobacterium tuberculosis by microarray hybridization. PANS 1999,96(22):12833-12838. 10.1073/pnas.96.22.12833

    Article  Google Scholar 

  48. Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, Gordon SV, Eiglmeier K, Gas S, Barry CE, Tekaia F, Badcock K, Basham D, Brown D, Chillingworth T, Connor R, Davies R, Devlin K, Feltwell T, Gentles S, Hamlin N, Holroyd S, Hornsby T, Jagels K, Krogh A, McLean J, Moule S, Murphy L, Oliver K, Osborne J, Quail MA, Rajandream MA, Rogers J, Rutter S, Seeger K, Skelton J, Squares R, Squares S, Sulston JE, Taylor K, Whitehead S, Barrell BG: Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 1998,393(6685):537-544. 10.1038/31159

    Article  Google Scholar 

  49. de Steenwinkel JE, de Knegt GJ, ten Kate MT, van Belkum A, Verbrugh HA, Kremer K, van Soolingen D, Bakker-Woudenberg IA: Time-kill kinetics of anti-tuberculosis drugs, and emergence of resistance, in relation to metabolic activity of mycobacterium tuberculosis. J Antimicrob Chemother 2010,65(12):2582-2589. 10.1093/jac/dkq374

    Article  Google Scholar 

  50. Ouellet H, Podust LM, de Montellano PR: Mycobacterium tuberculosis cyp130: crystal structure, biophysical characterization, and interactions with antifungal azole drugs. J Biol Chem 2008,283(8):5069-5080.

    Article  Google Scholar 

  51. Minnikin DE, Kremer L, Dover LG, Besra GS: The methyl-branched fortifications of. Mycobacterium tuberculosis. Chem Biol 2002, 9: 545-553. 10.1016/S1074-5521(02)00142-4

    Article  Google Scholar 

  52. Trivedi OA, Arora P, Vats A, Ansari MZ, Tickoo R, Sridharan V, Mohanty D, Gokhale RS: Dissecting the mechanism and assembly of a complex virulence mycobacterial lipid. Mol Cell 2005,17(5):631-643. 10.1016/j.molcel.2005.02.009

    Article  Google Scholar 

  53. Rainwater DL, Kolattukudy PE: Synthesis of mycocerosic acids from methylmalonyl coenzyme A by cell-free extracts of Mycobacterium tuberculosis var. bovis BCG. Biological Chemistry 1983, 258: 2979-2985.

    Google Scholar 

  54. Gerdes SY, Scholle MD, D'Souza M, Bernal A, Baev MV, Farrell M, Kurnasov OV, Daugherty MD, Mseeh F, Polanuyer BM, Campbell JW, Anantha S, Shatalin KY, Chowdhury SA, Fonstein MY, Osterman AL: From Genetic Footprinting to Antimicrobial Drug Targets: Examples in Cofactor Biosynthetic Pathways. J Bacteriol 2002,184(16):4555-4572. 10.1128/JB.184.16.4555-4572.2002

    Article  Google Scholar 

  55. Raman K, Yeturu K, Chandra N: targetTB: A target identification pipeline for Mycobacterium tuberculosis through an interactome, reactome and genome-scale structural analysis. BMC Syst Biol 2008, 2: 109.

    Article  Google Scholar 

  56. Mann S, Ploux O: 7,8-Diaminoperlargonic acid aminotransferase from Mycobacterium tuberculosis, a potential therapeutic target. Characterization and inhibition studies 2006,273(20):4778-4789.

    Google Scholar 

  57. Ramaswamy SV, Reich R, Dou SJ, Jasperse L, Pan X, Wanger A, Quitugua T, Graviss EA: Single Nucleotide Polymorphisms in Genes Associated with Isoniazid Resistance in Mycobacterium tuberculosis. Antimicrob Agents Chemother 2003,47(4):1241-1250. 10.1128/AAC.47.4.1241-1250.2003

    Article  Google Scholar 

  58. Krithika R, Marathe U, Saxena P, Ansari MZ, Mohanty D, Gokhale RS: A genetic locus required for iron acquisition in Mycobacterium tuberculosis. Proc Natl Acad Sci USA 2006, 103: 2069-2074. 10.1073/pnas.0507924103

    Article  Google Scholar 

  59. Amer FA, El-Behedy EM, Mohtady HA: New Targets for Antibacterial Agents. Biotechnology and Molecular Biology Reviews 2008,3(3):46-57.

    Google Scholar 

  60. Lin TW, Melgar MM, Kurth D, Swamidass SJ, Purdon J, Tseng T, Gago G, Baldi P, Gramajo H, Tsai SC: Structure-based inhibitor design of AccD5, an essential acyl-CoA carboxylase carboxyltransferase domain of Mycobacterium tuberculosis. Proc Natl Acad Sci USA 2006, 103: 3072-3077. 10.1073/pnas.0510580103

    Article  Google Scholar 

  61. Xie K, Dalbey RE: Inserting proteins into the bacterial cytoplasmic membrane using the Sec and YidC translocases. Nature Reviews Microbiology 2008, 6: 234-244.

    Google Scholar 

  62. Feltcher ME, Sullivan JT, Braunstein M: Protein export systems of Mycobacterium tuberculosis: novel targets for drug development? Future Microbiol 2010,5(10):1581-1597. 10.2217/fmb.10.112

    Article  Google Scholar 

  63. Mir MA, Rajeswari HS, Veeraraghavan U, Ajitkumar P: Molecular characterisation of ABC transporter type FtsE and FtsX proteins of Mycobacterium tuberculosis. Arch Microbiol 2006, 185: 147-158. 10.1007/s00203-005-0079-z

    Article  Google Scholar 

  64. Hughes D: Exploiting genomics, genetics and chemistry to combat antibiotic resistance. Nat Rev Genet 2003,4(6):432-441.

    Article  Google Scholar 

  65. McDonough JA, McCann JR, Tekippe EM, Silverman JS, Rigel NW, Braunstein M: Identification of functional Tat signal sequences in Mycobacterium tuberculosis proteins. Journal of Bacteriology 2008,190(19):6428-6438. 10.1128/JB.00749-08

    Article  Google Scholar 

  66. Morita YS, Velasquez R, Taig E, Waller RF, Patterson JH, Tull D, Williams SJ, Billman-Jacobe H, McConville MJ: Compartmentalization of lipid biosynthesis in mycobacteria. J Biol Chem 2005, 280: 21645-21652. 10.1074/jbc.M414181200

    Article  Google Scholar 

  67. Andries K, Verhasselt P, Guillemont J, Gohlmann HW, Neefs JM, Winkler H, Van Gestel J, Timmerman P, Zhu M, Lee E, Williams P, de Chaffoy D, Huitric E, Hoffner S, Cambau E, Truffot-Pernot C, Lounis N, Jarlier V: A diarylquinoline drug active on the ATP synthase of Mycobacterium tuberculosis. Science 2005,307(5707):223-227. 10.1126/science.1106753

    Article  Google Scholar 

  68. Vergne I, Chua J, Lee HH, Lucas M, Belisle J, Deretic V: Mechanism of phagolysosome biogenesis block by viable Mycobacterium tuberculosis. Proc Natl Acad Sci USA 2005,102(11):4033-4038. 10.1073/pnas.0409716102

    Article  Google Scholar 

  69. Boshoff HI, Xu X, Tahlan K, Dowd CS, Pethe K, Camacho LR, Park TH, Yun CS, Schnappinger D, Ehrt S, Williams KJ, Barry CE III: Biosynthesis and recycling of nicotinamide cofactors in mycobacterium tuberculosis. An essential role for NAD in nonreplicating bacilli. J Biol Chem 2008,283(28):19329-19341. 10.1074/jbc.M800694200

    Article  Google Scholar 

  70. Reddy TB, Riley R, Wymore F, Montgomery P, DeCaprio D, Engels R, Gellesch M, Hubble J, Jen D, Jin H, Koehrsen M, Larson L, Mao M, Nitzberg M, Sisk P, Stolte C, Weiner B, White J, Zachariah ZK, Sherlock G, Galagan JE, Ball CA, Schoolnik GK: TB database: an integrated platform for tuberculosis research. Nucleic Acids Res 2009, (37 Database):D499-508.

  71. Zhang R, Lin Y: DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes. Nucleic Acids Res 2009, (37 Database):D455-D458.

Download references

Acknowledgements

This research is partially supported by the Bioresources Collection and Research Center of Linko Chang Gung Memorial Hospital and National Tsing Hua University of Taiwan R. O. C. under the grant number 98N2424E1.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Hsiang-Yuan Yeh or Von-Wun Soo.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

LC participated in algorithm design, performed program and statistical analysis. HY carried out the design of the workflow, algorithm and molecular studies and drafted the manuscript. CY participated in program design. CRA carried out algorithm design. VW participated in its overall design and coordination of the research and helped to draft the manuscript. All authors read and approved the final manuscript.

Liang-Chun Chen, Hsiang-Yuan Yeh contributed equally to this work.

Electronic supplementary material

12918_2011_842_MOESM1_ESM.PDF

Additional file 1:The functional enrichment analysis of the genes treated by INH and ETA using David toolkit with p-value < 0.05 and FDR < 0.25. File name: additionalfile_1.pdf. We do the functional enrichment analysis of the genes in the drug resistance network treated by INH and ETA using DAVID toolkit. The first column denotes the functional terms. The second and third columns denote the p-value and FDR values. The last column "Cum(exp)" gives the cumulative expression value of the genes annotated in the functional term. (PDF 76 KB)

12918_2011_842_MOESM2_ESM.PDF

Additional file 2:Known genes related to the drug response and resistance under INH and ETA treatment. File name: additionalfile_2.pdf. We list the drug response and resistance genes which have received considerable attentions from public literature. (PDF 75 KB)

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Chen, LC., Yeh, HY., Yeh, CY. et al. Identifying co-targets to fight drug resistance based on a random walk model. BMC Syst Biol 6, 5 (2012). https://doi.org/10.1186/1752-0509-6-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1752-0509-6-5

Keywords