The extraction of drug-disease correlations based on module distance in incomplete human interactome

Yu, Liang; Wang, Bingbo; Ma, Xiaoke; Gao, Lin

doi:10.1186/s12918-016-0364-2

Volume 10 Supplement 4

Proceedings of the 27th International Conference on Genome Informatics: systems biology

Research
Open access
Published: 23 December 2016

The extraction of drug-disease correlations based on module distance in incomplete human interactome

Liang Yu¹,
Bingbo Wang¹,
Xiaoke Ma¹ &
…
Lin Gao¹

BMC Systems Biology volume 10, Article number: 111 (2016) Cite this article

2706 Accesses
20 Citations
Metrics details

Abstract

Background

Extracting drug-disease correlations is crucial in unveiling disease mechanisms, as well as discovering new indications of available drugs, or drug repositioning. Both the interactome and the knowledge of disease-associated and drug-associated genes remain incomplete.

Results

We present a new method to predict the associations between drugs and diseases. Our method is based on a module distance, which is originally proposed to calculate distances between modules in incomplete human interactome. We first map all the disease genes and drug genes to a combined protein interaction network. Then based on the module distance, we calculate the distances between drug gene sets and disease gene sets, and take the distances as the relationships of drug-disease pairs. We also filter possible false positive drug-disease correlations by p-value. Finally, we validate the top-100 drug-disease associations related to six drugs in the predicted results.

Conclusion

The overlapping between our predicted correlations with those reported in Comparative Toxicogenomics Database (CTD) and literatures, and their enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways demonstrate our approach can not only effectively identify new drug indications, but also provide new insight into drug-disease discovery.

Background

Drug development is expensive, time consuming and has a high risk of failures. On average, it now takes around 14 years [1] and $800 ~ $1000 million to bring a single drug to market [2]. To overcome these problems, more and more researchers have focused on inferring drug-disease relationships by computational approaches, commonly referred to as “Drug Repositioning” or “Drug Repurposing”. Drug repositioning is the application of known drugs and compounds to new indications (i.e., new diseases) [3]. Using drug repositioning, pharmaceutical companies have achieved a number of successes, for example Pfizer's Viagra in erectile dysfunction [4] and Celgene's thalidomide in severe erythema nodosum leprosum [5].

With the dramatic expansion of large-scale genomic, transcriptomic and proteomic data, computational approaches to predict new drug-disease associations have become one of the leading ways. For example, in 2016, Huang et al. [6] developed a novel pipeline of drug repositioning to analyze four lung cancer microarray datasets, enriched biological processes, potential therapeutic drugs and targeted genes for non-small cell lung cancer (NSCLC) treatments. They integrated two approaches: machine learning algorithms and topological parameter-based classification. Zheng et al. [7] proposed a novel weighted ensemble similarity (WES) algorithm to predict the drug-target direct interactions, which provided a potential in silico model for drug repositioning and discovery. Wang et al. [8] developed a new strategy in 2015, which integrated two types of drug repositioning methods. Based on integration of chemical, gene and disease networks, Cheng et al. [9] inferred chemical hazard profiles, identified exposure data gaps, and incorporated genes and disease networks into chemical safety evaluations. With increasing evidence in genetic and molecular biology, we find most diseases reflect the interaction of multiple molecular components [10–13]. Therefore, we should consider the relevant interactions of disease-associated genes in the context of the human interactome [14–17], which point out the therapeutic importance of modules. In 2016, Luo et al. [18] utilized some comprehensive similarity measures and Bi-Random walk (BiRW) to develop a method named MBiRW to identify potential novel indications for a given drug. Yu et al. [19] proposed a method based on known protein complexes to infer drug-disease associations in 2015. PREDICT (PREdicting Drug IndiCaTions) [20] is based on the observation that similar drugs are indicated for similar diseases, and utilizes multiple drug–drug and disease–disease similarity measures for the prediction task.

However, high-throughput methods currently include less than 20% of all potential pairwise protein interactions in the human cell [21–26], which means that we seek to discover drug and disease associations relying on interactome maps that are 80% incomplete. Additionally, the gene lists of diseases and drugs remain incomplete [21–26]. Because of the incompleteness of the interactome and the limited knowledge of disease- and drug-associated genes, it is not clear if the available data have sufficient coverage to map out modules associated with each disease and each drug. Therefore, in order to identify the location of disease modules within the incomplete interactome, Menche et al. [27] presented a new module distance and used the overlap between the modules to predict disease-disease relationships. The module distance can be extended to address other questions at the forefront of network medicine. Furthermore, it discriminates known drug-disease pairs from unknown drug-disease pairs better than most of the existing similarity-based methods, such as the shortest path distance between their targets in the interactome, common targets, chemical similarity, etc. [28]. Hence based on the module distance [27], we propose a new network-based framework to extract drug-disease correlations. First, we map all the disease- and drug-associated genes to a combined protein interaction network. Then based on the module distance [27], we calculate the distances between each pair of drug gene set and disease gene set, and take the distances as the relationships of drug-disease pairs. We also filter possible false positive drug-disease correlations by p-value. Finally, we validate the top-100 drug-disease associations related to six drugs in the predicted results. The overlapping between our predicted correlations with those reported in Comparative Toxicogenomics Database (CTD) [29] and literatures, and their enriched KEGG pathways [30, 31] demonstrate our approach can effectively identify new drug indications. Furthermore, it can offer new insight into drug discovery.

Methods

Datasets

Drug and target data

Drugs and their corresponding targets are downloaded from KEGG database [30, 31] and DrugBank [32]. We combine two datasets and get 3,613 drugs, 1,504 targets, and 11,170 drug-target pairs. Each drug is represented by its KEGG Drug ID and each target is represented by its Entrez gene ID.

Disease and gene data

Diseases and their related genes are downloaded from KEGG database. In this study, we focus on cancers, so we get 55 cancer diseases, 2,255 associated genes, and 3,800 disease-gene pairs in all. Diseases are represented by its KEGG Disease IDs and genes are represented by Entrez gene IDs.

Human interaction network data

We download a complete and currently feasible interactome from ref. [27], which combines seven different interactions. Their details are shown in the supplementary files of ref. [27]. The combined network is scale-free, which includes 13,460 human proteins and 141,296 unique pairwise binary interactions. It is well connected and has small mean clustering coefficient and short shortest path [27]. Its topological properties are shown in Table 1.

Table 1 Network topological properties of the combined interaction network

Full size table

Benchmark of drug-disease associations

All the known associations between chemicals (or equivalently, drugs) and disorders or its descendants are got from Comparative Toxicogenomics Database (CTD) in May 2014 as our benchmark [29]. CTD contains two kinds of chemical–disease associations: curated and inferred. Curated associations are extracted from the published literature by CTD biocurators and inferred associations are established via CTD–curated chemical–gene interactions. In our study, we extract both curated and inferred associations, which can help researchers develop hypotheses about environmental diseases and their underlying mechanisms.

Functional enrichment analysis

In order to validate our method further, we utilize the Database for Annotation, Visualization and Integrated Discovery (DAVID) to perform functional enrichment analysis [33, 34] on the gene sets of predicted drug-disease pairs. With the genes as inputs, we observe the overlapping of enriched KEGG pathways between drugs and diseases. With Benjamin multiple testing correction method [35], the enrichment p-value is corrected to control family-wide false discovery rate under certain rate (e.g. ≥ 0.05).

Compute distance between modules

The disease- or drug-associated genes interacting with each other suggests that they tend to cluster in the same neighborhood of the interactome and form a disease module or a drug module, a connected subgraph that contains all molecular mechanisms of a disease or a drug. Therefore, the accurate evaluation of relationships between disease modules and drug modules is a very important step to identify potential drug-disease associations. Because the interactome remains incomplete, Menche et al. [27] proposed a new definition of module distance in 2015. Here, it is named as Module Distance for convenience. Given two modules marked as A and B, the Module Distance between them is defined as s _AB [27]:

$$ {s}_{\mathrm{AB}}\equiv <{d}_{\mathrm{AB}}>-\frac{<{d}_{\mathrm{AA}}>+<{d}_{\mathrm{BB}}>}{2} $$

(1)

< d _AA > represents mean shortest distance between each node and all the other nodes within module A. < d _BB > represents mean shortest distance between each node and all the other nodes within module B. < d _AB > represents mean shortest distance between nodes within module A and nodes within module B.

A simple example for calculating the distance between two disease modules A and B is shown in Fig. 1 [27]. In Fig. 1, the four nodes within disease module A, {a, b, c, d}, are labeled by blue and the other five nodes within disease module B, {c, e, f, g, h}, are labeled by red. For node a in module A as an example, its shortest distances to b, c and d are 1, 2 and 5 respectively, so its shortest distance with all the other nodes within module A is 1. Similarly, the shortest distances of b, c and d in module A are 1, 1 and 3 respectively (see Fig. 1). Therefore, the mean shortest distance within module A, <d _AA>, is (1 + 1 + 1 + 3)/4 = 2/3. In this way, the shortest distance in module B, <d _BB>, is (1 + 1 + 1 + 2 + 2)/5 = 7/5. Then we calculate the mean shortest distance between modules A and B, <d _AB>. Firstly, the shortest distances for all the node pairs between module A and module B are calculated. As shown in Fig. 1, node a in module A is closest to node c in module B, so the shortest distance between node a and module B is 2. In the same way, the mean shortest distance between modules A and B, <d _AB>, can be got and shown in Fig. 1. Finally, according to formula (1), the distance between modules A and B, s _AB, is calculated and its value is negative. The reason is that module A and module B share a common node c.

Construct drug-disease associations based on Module Distance scores

Based on Module Distance, we calculate the distances between 55 cancer modules and 3,594 drug modules. First, all the genes related to drugs and diseases are mapped to the combined protein network. For each drug and each disease, their related genes form a drug module and a disease module respectively. Then, using the formula (1), we can calculate the distance between each drug-disease module pair. Finally, in order to make the distances score be proportional to the drug-disease correlations, we process the distance scores as follows. At the beginning, we turn all distances into positive by adding the minimum distance score to each distance, and then we get their reciprocals. At last, we use maximum-minimum to normalize all the distances. Consequently, the larger the distance score, the more related between drug and disease. Eventually, we obtain (55 × 3594) disease-drug associations. In order to obtain more meaningful results and filter possible false positive correlations, we will filter the distances by p-value in the following section.

Filter drug-disease distances by p-value

Based on the combined protein interaction network, we generate 10,000 random networks which keep the degrees of nodes in the original network. Then in each of the random networks, we calculate the distances between drug modules and disease modules by using Module Distance (see formula (1)). Finally, for each one in 55 × 3594 disease-drug associations, we can get its corresponding p-value. We discard all the edges whose p-values are not lower than 0.01. As a result, we obtain 3,027 drug-disease associations and they are presented in Fig. 2.

Results and discussion

CTD benchmark verification

We rank the 3,027 remained drug-disease associations in descending order on the basis of their scores. According to the definition of the distance between a drug-disease pair, the drug-disease pairs with higher scores are what we need. In order to analyze our results more targeted and find more valuable associations, we focus on the top-100 drug-disease associations for further analysis by CTD benchmark. Their scores are more than 0.67.

For the top-100 drug-disease relationships, they relate to 6 drugs and 35 diseases in all. Their connected network shown in Fig. 3 is a drug-disease bipartite graph with 100 links between 6 drugs and 35 diseases. The green triangle nodes represent drugs and the red circle nodes represent diseases. From Fig. 3, we find D09539 (drug name: Gabapentin enacarbil), D00750 (drug name: Levamisole hydrochloride) and D02315 (drug name: Oleic acid), are associated with 35, 27 and 18 diseases respectively. The other three drugs, D00226 (drug name: Amifostine), D01993(drug name: Polidocanol), and D07564 (drug name: Allopurinol), are associated with the remained 20 associations. Table 2 gives the summary information of the six drugs based on CTD, including the number of existing diseases (represented by Ne), the number of predicted diseases (represented by Np) and the percentage, i.e. Ne/(Ne + Np).

Table 2 The summary information of D09539, D00750 and D02315 based on CTD

Full size table

In Table 2, we can find in the top-100 results, the 10 associations related to D00226 and 5 ones related to D07564 are all found in CTD database, i.e. their percentages are 100%. In a certain degree, the exciting results show the reliability of our algorithm. For D01993, it only relates to three diseases in CTD database: “Dermatitis, Allergic Contact”, “Facial Dermatoses” and “Hand Dermatoses”, so it is hard to find its existing diseases. The reason may be the interactome and the drug gene list remain incomplete and biased toward much-studied drugs genes and drug mechanisms. Furthermore, for D09539, D00750 and D02315, there is a total of 80 associations in the top 100 relationships related to them. Therefore, in the following sections, we will make a further analysis on D09539, D00750 and D02315 and their related diseases one by one.

For the first drug D09539 (drug name: Gabapentin enacarbil), its connections with related diseases are shown in Fig. 4. In the following figures, Figs. 4, 5 and 6, green triangle nodes represent drugs, gray hexagonal nodes represent existing diseases in CTD and red circular nodes represent predicted related diseases. There are 35 diseases connected to D09539 (Gabapentin enacarbil) and 26 of them are recorded in CTD database. The percentage reaches up to 74.3%. Therefore, the remaining 9 diseases are likely to be related to D09539 (Gabapentin enacarbil). They may be new indications of Gabapentin enacarbil or its side effects.

The second drug D00750 (drug name: Levamisole hydrochloride) is connected to 27 diseases and their connections are shown in Fig. 5. By verifying in CTD database, we find 18 of 27 diseases are known associations with Levamisole hydrochloride and only 9 diseases are newly predicted results. The prediction accuracy is more than 50%, i.e. 66.7%. We estimate that Levamisole hydrochloride may treat some of the nine predicted diseases or cause some of them.

Figure 6 shows the associations of the third drug D02315 (drug name: Oleic acid) and its related disease. In the same way, we use CTD benchmark to analyze our results. We find 18 diseases are related to Oleic acid: 6 of them are predicted ones and the other 12 diseases have been recorded in CTD database. The percentage also reaches up to 66.7%. No matter what kind of relationship between Oleic acid and the six new diseases, the results are helpful in drug discovery and disease treatment.

Through analyzing our results based on CTD benchmark, we find the prediction accuracies of three drugs (D09539, D00750 and D02315) are all relatively high, more than 50%. On the other hand, the facts indicate that those diseases having no records in CTD are likely to be the new indications of drugs. Therefore, in the following section, we will use KEGG functional enrichment analysis and literature mining to further verify the reliability of our predicted potential associations.

KEGG pathway functional enrichment analysis and literature verification

In the above section, the top-100 results are validated by CTD benchmark. We mainly analyze three drugs, whose associated diseases are 80% of the top-100 results. After our analysis, we obtain 9, 9 and 6 potential diseases for D09539 (drug name: Gabapentin enacarbil), D00750 (drug name: Levamisole hydrochloride) and D02315 (drug name: Oleic acid) respectively. Their details are shown in Table 3. We perform KEGG pathway enrichment analysis on the target sets of drugs and their related diseases with the functional annotation tool of DAVID [33, 34]. If a drug has overlapped KEGG pathways with a disease, the drug and the disease may have great relevance. The drug can probably treat or cause the disease through acting on the overlapping pathways. For DAVID, EASE Score, a modified Fisher Exact P-Value, is used as a threshold for gene-enrichment analysis [35]. It ranges from 0 to 1. When Fisher Exact P-Value is 0, it represents perfect enrichment. We set it as 0.01.

Table 3 Three drugs, their corresponding targets and related diseases

Full size table

Gabapentin enacarbil (KEGG DrugID: D09539) is a prodrug for the anticonvulsant and analgesic drug gabapentin [36]. It is used for treating restless leg syndrome (RLS) and postherpetic neuralgia (PHN) [37, 38]. Although the exact mechanism of action of gabapentin in RLS and PHN is unknown, it is presumed to involve the descending noradrenergic system, resulting in the activation of spinal alpha2-adrenergic receptors. There are five caners, H00025 (Penile cancer), H00028 (Choriocarcinoma), H00016 (Oral cancer), H00041 (Kaposi's sarcoma) and H00047 (Gallbladder cancer), have overlapped KEGG pathways with Gabapentin enacarbil (shown in Table 3 marked as boldface). "MAPK signaling pathway" is their overlapped pathway (shown in Table 4 marked as boldface), which has been found related to multiple human diseases, including cancer [39]. In fact, Gabapentin enacarbil was denied approval by the U.S. Food and Drug Administration (FDA) in February 2010, citing concerns about possible increased cancer risk shown by some animal studies. KEGG enrichment analysis shows that four caners still have no overlapping with Gabapentin enacarbil (D09539) and also have not found relationships through literature mining. The reason is possible that the studies on these four diseases are still very limited.

Table 4 Gabapentin enacarbil and its related KEGG pathways

Full size table

For the remaining two drugs Levamisole hydrochloride (D00750) and Oleic acid (D02315), they have no overlapped KEGG pathways with their related diseases because the two drugs have no related KEGG pathways. Levamisole is a drug used to treat parasitic worm infections [40]. It has also been studied as a method to stimulate the immune system as part of the treatment of cancer [41]. Its nine related diseases are all cancers. Furthermore, studies demonstrate that the role of levamisole immunotherapy is as an adjuvant to radiotherapy in Oral cancer [42, 43]. For Malignant melanoma, the degree of improvement experienced by the patients that were treated by levamisole is of sufficient magnitude to warrant further investigation of this dose of levamisole as adjuvant treatment in patients with melanoma [44]. The results of Pulay and Csömör [45] and reference to pertinent literature indicate the possible effects of levamisole are discussed, as well as possibilities and place of the drug in the therapy of cervical cancer.

The last drug Oleic acid is a common monounsaturated fat that occurs naturally in various animal and vegetable fats and oils. Monounsaturated fat has been related to decreased low-density lipoprotein (LDL) cholesterol [46], so Oleic acid may be effective for the hypotensive (blood pressure reducing) [47]. Shannon et al. [48] found Monounsaturated fatty acids and the alpha-linolenic:eicosapentaenoic ratio were associated with reduced risk of prostate cancer. However, oleic and monounsaturated fatty acid levels in the membranes of red blood cells are associated with increased risk of breast cancer [49], although the consumption of oleate in olive oil is associated with a decreased risk of breast cancer [50].

Conclusions

Because of the incompleteness of protein interactomes and the limited knowledge of disease genes and drug genes, we propose a new method based on a distance between two modules to predict drug-disease association. The distance is named Module Distance for convenience, which is originally defined to solve the incompleteness of human interactome. First, we project disease genes and drug genes to a combined protein interaction network respectively. Then based on Module Distance, we calculate the distances between drug genes and disease genes, and make a further processing to the distances before being the relationships of drug-disease pairs. Also, we filter possible false positive drug-disease correlations by p-value. Finally, we validate the top 100 associations related to six drugs by CTD benchmark. Three main drugs are further analyzed by KEGG pathway enrichment and literature mining, because they are related to 80 associations. The experimental results are encouraging. Both the positive and negative associations can be predicted. Our study offers opportunities for future toxicogenomics and drug-disease discovery.

Abbreviations

BiRW:: Bi-Random walk
CTD:: Comparative Toxicogenomics Database
DAVID:: the Database for Annotation, Visualization and Integrated Discovery
FDA:: Food and Drug Administration
KEGG:: Kyoto Encyclopedia of Genes and Genomes
LDL:: Low-Density Lipoprotein
NSCLC:: Non-Small Cell Lung Cancer
PHN:: PostHerpetic Neuralgia
PREDICT:: PREdicting Drug IndiCaTions
RLS:: Restless Leg Syndrome
WES:: Weighted Ensemble Similarity

References

Dimasi JA. New drug development in the United States from 1963 to 1999. Clin Pharmacol Ther. 2001;69(5):286–96.
Article CAS PubMed Google Scholar
Adams CP, Brantner VV. Estimating the cost of new drug development: is it really $802 million? Health Aff (Millwood). 2006;25(2):420–8.
Article Google Scholar
Sleigh SH, Barton CL. Repurposing Strategies for Therapeutics. Pharm Med. 2010;24(3):151–9.
Article Google Scholar
Novac N. Challenges and opportunities of drug repositioning. Trends Pharmacol Sci. 2013;34(5):267–72.
Article CAS PubMed Google Scholar
Walker SL, Waters MF, Lockwood DN. The role of thalidomide in the management of erythema nodosum leprosum. Lepr Rev. 2007;78(3):197–215.
PubMed Google Scholar
Huang CH, Chang PM, Hsu CW, Huang CY, Ng KL. Drug repositioning for non-small cell lung cancer by using machine learning algorithms and topological graph theory. BMC Bioinformatics. 2016;17 Suppl 1:2.
Article PubMed PubMed Central Google Scholar
Zheng C, Guo Z, Huang C, Wu Z, Li Y, Chen X, Fu Y, Ru J, Ali Shar P, Wang Y, Wang Y. Large-scale Direct Targeting for Drug Repositioning and Discovery. Sci Rep. 2015;5:11970.
Article PubMed PubMed Central Google Scholar
Wang H, Gu Q, Wei J, Cao Z, Liu Q. Mining Drug-disease Relationships As a Complement to Medical Genetics-based Drug Repositioning: Where A Rec-ommendation System Meets Genome-wide Association Studies. Clin Pharmacol Ther. 2015;97(5):451–4.
Article CAS PubMed Google Scholar
Cheng F, Li W, Zhou Y, Li J, Shen J, Lee PW, Tang Y. Prediction of human genes and diseases targeted by xenobiotics using predictive toxicogenomic-derived models (PTDMs). Mol Biosyst. 2013;9(6):1316–25.
Article CAS PubMed Google Scholar
Schadt EE. Molecular networks as sensors and drivers of common human diseases. Nature. 2009;461(7261):218–23.
Article CAS PubMed Google Scholar
Califano A, Butte AJ, Friend S, Ideker T, Schadt E. Leveraging models of cell regulation and GWAS data in integrative network-based association studies. Nat Genet. 2012;44(8):841–7.
Article CAS PubMed PubMed Central Google Scholar
Zanzoni A, Soler-López M, Aloy P. A network medicine approach to human disease. FEBS Lett. 2009;583(11):1759–65.
Article CAS PubMed Google Scholar
Barabási AL, Gulbahce N, Loscalzo J. Network medicine: A network-based approach to human disease. Nat Rev Genet. 2011;12(1):56–68.
Article PubMed PubMed Central Google Scholar
Goh KI, Cusick ME, Valle D, Childs B, Vidal M, Barabási AL. The human disease network. Proc Natl Acad Sci U S A. 2007;104(21):8685–90.
Article CAS PubMed PubMed Central Google Scholar
Lage K, Møllgård K, Greenway S, Wakimoto H, Gorham JM, Workman CT, Bendsen E, Hansen NT, Rigina O, Roque FS, Wiese C, Christoffels VM, Roberts AE, Smoot LB, Pu WT, Donahoe PK, Tommerup N, Brunak S, Seidman CE, Seidman JG, Larsen LA. Dissecting spatio-temporal protein networks driving human heart development and related disorders. Mol Syst Biol. 2010;6:381.
Article PubMed PubMed Central Google Scholar
Chuang HY, Lee E, Liu YT, Lee D, Ideker T. Network-based classification of breast cancer metastasis. Mol Syst Biol. 2007;3:140.
Article PubMed PubMed Central Google Scholar
Rolland T, et al. A proteome-scale map of the human interactome network. Cell. 2014;159(5):1212–26.
Article CAS PubMed PubMed Central Google Scholar
Luo H, Wang J, Li M, Luo J, Peng X, Wu FX, Pan Y. Drug repositioning based on comprehensive similarity measures and Bi-Random walk algorithm. Bioinformatics. 2016;32(17):2664-71.
Yu L, Huang J, Ma Z, Zhang J, Zou Y, Gao L. Inferring drug-disease associations based on known protein complexes. BMC Med Genomics. 2015;8 Suppl 2:S2.
Article PubMed PubMed Central Google Scholar
Gottlieb A, Stein GY, Ruppin E, Sharan R. PREDICT: a method for inferring novel drug indications with application to personalized medicine. Mol Syst Biol. 2011;7:496.
Article PubMed PubMed Central Google Scholar
Mosca R, Pons T, Céol A, Valencia A, Aloy P. Towards a detailed atlas of protein-protein interactions. Curr Opin Struct Biol. 2013;23(6):929–40.
Article CAS PubMed Google Scholar
Mohammadi S, Grama A. A convex optimization approach for identification of human tissue-specific interactomes. Bioinformatics. 2016;32(12):i243–52.
Article CAS PubMed PubMed Central Google Scholar
Hart GT, Ramani AK, Marcotte EM. How complete are current yeast and human protein-interaction networks? Genome Biol. 2006;7(11):120.
Article PubMed PubMed Central Google Scholar
Venkatesan K, Rual JF, Vazquez A, Stelzl U, Lemmens I, Hirozane-Kishikawa T, Hao T, Zenkner M, Xin X, Goh KI, Yildirim MA, Simonis N, Heinzmann K, Gebreab F, Sahalie JM, Cevik S, Simon C, de Smet AS, Dann E, Smolyar A, Vinayagam A, Yu H, Szeto D, Borick H, Dricot A, Klitgord N, Murray RR, Lin C, Lalowski M, Timm J, Rau K, Boone C, Braun P, Cusick ME, Roth FP, Hill DE, Tavernier J, Wanker EE, Barabási AL, Vidal M. An empirical framework for binary interactome mapping. Nat Methods. 2009;6(1):83–90.
Article CAS PubMed Google Scholar
Stumpf MP, Thorne T, de Silva E, Stewart R, An HJ, Lappe M, Wiuf C. Estimating the size of the human interactome. Proc Natl Acad Sci U S A. 2008;105(19):6959–64.
Article CAS PubMed PubMed Central Google Scholar
Wass MN, David A, Sternberg MJ. Challenges for the prediction of macromolecular interactions. Curr Opin Struct Biol. 2011;21(3):382–90.
Article CAS PubMed Google Scholar
Menche J, Sharma A, Kitsak M, Ghiassian SD, Vidal M, Loscalzo J, Barabási AL. Uncovering disease-disease relationships through the incomplete interactome. Science. 2015;347(6224):1257601.
Article PubMed PubMed Central Google Scholar
Emre G, Jörg M, Marc V, Barábasi AL. Network-based in silico drug efficacy screening. Nat Commun. 2016;7:10331.
Article Google Scholar
Davis AP, Grondin CJ, Lennon-Hopkins K, Saraceni-Richards C, Sciaky D, King BL, Wiegers TC, Mattingly CJ. The Comparative Toxicogenomics Database's 10th year anniversary: update 2015. Nucleic Acids Res. 2015;43(Database issue):D914–20.
Article PubMed Google Scholar
Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44(D1):D457–62.
Article PubMed Google Scholar
Kanehisa M, Goto S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000;28(1):27–30.
Article CAS PubMed PubMed Central Google Scholar
Law V, Knox C, Djoumbou Y, Jewison T, Guo AC, Liu Y, Maciejewski A, Arndt D, Wilson M, Neveu V, Tang A, Gabriel G, Ly C, Adamjee S, Dame ZT, Han B, Zhou Y, Wishart DS. DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 2014;42(Database issue):D1091–7.
Article CAS PubMed Google Scholar
Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57.
Article CAS Google Scholar
Huang DW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37(1):1–13.
Article Google Scholar
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc B. 1995;57(1):289–300.
Google Scholar
Landmark CJ, Johannessen SI. Modifications of antiepileptic drugs for improved tolerability and efficacy. Perspect Medicin Chem. 2008;2:21–39.
PubMed PubMed Central Google Scholar
Merlino G, Serafini A, Lorenzut S, Sommaro M, Gigli GL, Valente M. Gabapentin enacarbil in restless legs syndrome. Drugs Today (Barc). 2010;46(1):3–11.
Article Google Scholar
Jeffrey Susan. FDA Approves Gabapentin Enacarbil for Postherpetic Neuralgia. 2012.
Kim EK, Choi EJ. Pathological roles of MAPK signaling pathways in human diseases. Biochim Biophys Acta. 2010;1802(4):396–405.
Article CAS PubMed Google Scholar
Keiser J, Utzinger J. Efficacy of current drugs against soil-transmitted helminth infections: systematic review and meta-analysis. JAMA. 2008;299(16):1937–48.
Article CAS PubMed Google Scholar
Dillman RO. Cancer immunotherapy. Cancer Biother Radiopharm. 2011;26(1):1–64.
Article CAS PubMed Google Scholar
Balaram P, Remani P, Padmanabhan TK, Vasudevan DM. Role of levamisole immunotherapy as an adjuvant to radiotherapy in oral cancer. I. A three-year clinical follow up. Neoplasma. 1988;35(6):617–25.
CAS PubMed Google Scholar
Balaram P, Padmanabhan TK, Vasudevan DM. Role of levamisole immunotherapy as an adjuvant to radiotherapy in oral cancer. II. Lymphocyte subpopulations. Neoplasma. 1988;35(2):235–42.
CAS PubMed Google Scholar
Quirt IC, Shelley WE, Pater JL, Bodurtha AJ, McCulloch PB, McPherson TA, Paterson AH, Prentice R, Silver HK, Willan AR, et al. Improved survival in patients with poor-prognosis malignant melanoma treated with adjuvant levamisole: a phase III study by the National Cancer Institute of Canada Clinical Trials Group. J Clin Oncol. 1991;9(5):729–35.
CAS PubMed Google Scholar
Pulay TA, Csömör S. Effect of levamisole treatment on immunological parameters and the early course of cervical cancer. Neoplasma. 1982;29(1):81–6.
CAS PubMed Google Scholar
"You Can Control Your Cholesterol: A Guide to Low-Cholesterol Living". Krames Communications. 1989.
Teres S, Barcelo-Coblijn G, Benet M, Alvarez R, Bressani R, Halver JE, Escriba PV. Oleic acid content is responsible for the reduction in blood pressure induced by olive oil. Proc Natl Acad Sci. 2008;105(37):13811–6.
Article CAS PubMed PubMed Central Google Scholar
Shannon J, O’Malley J, Mori M, Garzotto M, Palma AJ, King IB. Erythrocyte fatty acids and prostate cancer risk: A comparison of methods. Prostaglandins Leukot Essent Fatty Acids. 2010;83(3):161–9.
Article CAS PubMed PubMed Central Google Scholar
Pala V, Krogh V, Muti P, Chajes V, Riboli E, Micheli A, Saadatian M, Sieri S, Berrino F. Erythrocyte Membrane Fatty Acids and Subsequent Breast Cancer: A Prospective Italian Study. J Natl Cancer Inst. 2001;93(14):1088–95.
Article CAS PubMed Google Scholar
Martin-Moreno JM, Willett WC, Gorgojo L, Banegas JR, Rodriguez-Artalejo F, Fernandez-Rodriguez JC, Maisonneuve P, Boyle P. Dietary fat, olive oil intake and breast cancer risk. Int J Cancer. 1994;58(6):774–80.
Article CAS PubMed Google Scholar

Download references

Acknowledgments

We thank three anonymous reviewers for their insightful and constructive critique.

Declarations

This article has been published as part of BMC Systems Biology Volume 10 Supplement 4, 2016: Proceedings of the 27th International Conference on Genome Informatics: systems biology. The full contents of the supplement are available online at http://bmcsystbiol.biomedcentral.com/articles/supplements/volume-10-supplement-4.

Funding

This work was supported in part by the National Natural Science Foundation of China (Nos. 61672406, 61532014, 91530113, 61502363, 61303118, 61303122, and 61402349), the Natural Science Basic Research Plan in Shaanxi Province of China (No. 2016JQ6057).

Availability of data and materials

The datasets supporting the results of the article are included within the article. The source datasets used in the article are available from the corresponding author on reasonable request.

Authors’ contributions

LY designed and performed experiments, analyzed data and wrote the paper; BBW, XKM and LG wrote the paper. All authors read and approved the final version of the manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Author information

Authors and Affiliations

School of Computer Science and Technology, Xidian University, Xi’an, 710071, People’s Republic of China
Liang Yu, Bingbo Wang, Xiaoke Ma & Lin Gao

Authors

Liang Yu
View author publications
You can also search for this author in PubMed Google Scholar
Bingbo Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoke Ma
View author publications
You can also search for this author in PubMed Google Scholar
Lin Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Liang Yu.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Yu, L., Wang, B., Ma, X. et al. The extraction of drug-disease correlations based on module distance in incomplete human interactome. BMC Syst Biol 10 (Suppl 4), 111 (2016). https://doi.org/10.1186/s12918-016-0364-2

Download citation

Published: 23 December 2016
DOI: https://doi.org/10.1186/s12918-016-0364-2

Proceedings of the 27th International Conference on Genome Informatics: systems biology

The extraction of drug-disease correlations based on module distance in incomplete human interactome