Characterization of the regulation of herpesvirus miRNAs from the view of human protein interaction network
© Li et al; licensee BioMed Central Ltd. 2011
Received: 21 January 2011
Accepted: 13 June 2011
Published: 13 June 2011
miRNAs are a class of non-coding RNA molecules that play crucial roles in the regulation of virus-host interactions. The ever-increasing data on known viral miRNAs and the human protein interaction network (PIN) have made it possible to study the targeting characteristics of viral miRNAs in the context of these networks.
We performed topological analysis to explore the targeting propensities of herpesvirus miRNAs from the view of the human PIN and found that (1) herpesvirus miRNAs target significantly more hubs; moreover, compared with non-hubs (non-bottlenecks), hubs (bottlenecks) are targeted by many more virus miRNAs and virus types; (2) there are significant differences in degree and betweenness centrality between common and specific targets; specifically, we observed a significant positive correlation between the number of virus types targeting a node and the proportion of hubs; and (3) k-core and excess retention (ER) analysis showed that common targets are closer to the global PIN center. Compared with random conditions, the giant connected component (GCC) and the density of the sub-network formed by common targets have significantly higher values, indicating the modular character of these targets.
Herpesvirus miRNAs preferentially target hubs and bottlenecks. There are significant differences between common and specific targets. Moreover, common targets are more intensely connected and occupy the central part of the network. These results will help unravel the complex mechanism of herpesvirus-host interactions and may provide insight into the development of novel anti-herpesvirus drugs.
Herpesviruses are members of the Herpesviridae family, a large family of DNA viruses that cause chronic, latent and recurrent infections in animals and humans. Herpesviruses are double-stranded DNA viruses with large genomes encoding complex virus particles and enzymes involved in a variety of cellular processes, including nucleic acid metabolism, DNA synthesis, and protein processing. In addition to herpesvirus proteins associated with pathogenic processes, herpesvirus-encoded microRNAs (miRNAs) have also been shown to play an indispensable role in herpesvirus pathogenesis. miRNAs are a class of endogenous, single-stranded RNAs, approximately 22 nucleotides long, that bind to the 3' untranslated regions of transcripts, causing degradation of their targets or blocking protein translation. Since the discovery of virus-encoded miRNAs in Epstein-Barr virus (EBV), the roles of virus-encoded miRNAs in the regulation of the viral life cycle and in mediating interactions between viruses and their hosts have been examined in some detail.
With the emergence of versatile miRNA target prediction algorithms and the availability of proteome-wide protein-protein interaction data sets, manually curated or derived from high-throughput experiments (such as yeast two-hybrid screens), it has become possible to investigate regulation of the whole human PIN by miRNAs. Since protein-protein interactions constitute the basis of most life processes, such studies might provide important clues for a thorough understanding of biological mechanisms at the systems level. In recent years, human miRNA-regulated cellular networks, such as signal transduction networks, gene regulatory networks, the PIN and metabolic networks, have been studied in great detail [5–9]. Some of the results highlight an interesting commonality: miRNAs tend to target nodes with high topological complexity, such as hubs and bottlenecks. In signal transduction networks, miRNAs preferentially target downstream network components, positively linked network motifs and downstream components of adaptors that have the potential to recruit additional downstream components. Genes in regulatory networks with more transcription factor binding sites have, on average, more miRNA-binding sites and a higher probability of being targeted by miRNAs. Protein degree in the human PIN correlates with the number of miRNA target-site types of the gene encoding the respective protein. In addition, analysis of the human PIN and the human metabolic network showed that human-encoded miRNAs preferentially target hubs and bottlenecks [8, 9].
miRNAs are key regulators of various biological processes; for example, they play an important role in virus-host interactions [2–4]. This applies to both human-encoded and virus-encoded miRNAs. We need to examine the mechanisms involved in such interactions to gain insight into this complex process. To date, only one study has systematically examined the functional characteristics of human herpesvirus miRNAs. The results of that study showed a statistically significant preferential targeting of host genes involved in cellular signalling and adhesion junction pathways. The other studies mentioned above revealed some of the regulatory characteristics of human-encoded miRNAs in biological networks; however, in the field of virus miRNA-mediated virus-host interactions, few studies have been conducted at the systems level. In this report, we explored the topological characteristics of human herpesvirus miRNAs that target the human PIN. We believe that determining which human proteins are targeted by viruses will provide insight into molecular processes shared by related viruses. Taking into account the large differences between miRNAs encoded by different viruses, it is reasonable to expect that the analysis of one virus group, in our case the herpesviruses, will yield informative results. As essential cellular building blocks, proteins perform a variety of functions by interacting with other proteins. To achieve a comprehensive understanding of herpesvirus-host interactions and of the molecular basis of viral pathogenesis, it is therefore important to study the function of herpesvirus miRNAs in the framework of the PIN. The results of these studies are also likely to provide new means of developing therapeutic strategies for the treatment and prevention of viral infections.
Herpesvirus miRNAs preferentially target hubs and bottlenecks
[Table: Herpesvirus miRNAs targeting propensity for hubs and bottlenecks. Columns: miRNA-targeted hub proportions, miRNA-targeted bottleneck proportions. Row: randomly chosen nodes (mean).]
The relationship between the targeting of herpesvirus miRNAs and human miRNAs
Characteristics of common and specific targets
[Table: The GO enrichment results of common targets. GO terms: nervous system development; multicellular organismal development; multicellular organismal process; anatomical structure development; transmission of nerve impulse; regulation of signaling pathway; regulation of cell communication.]
Topological characteristics of common herpesvirus miRNA targets
[Table: The modularity of the sub-network formed by common targets. Row: random nodes (mean).]
It is well understood that cellular functions are carried out by various specialized groups of molecules interacting via intricate networks. No approach to complex systems can succeed without exploiting network topology. In this study, we investigated characteristics associated with the targeting of herpesvirus miRNAs to proteins in the human PIN. Virus-encoded miRNAs have unique advantages: they can function at the RNA level, affecting the expression of many genes rapidly and extensively. The results of this study will contribute to a better understanding of the complex herpesvirus-host interactions at the miRNA level.
We found that herpesvirus miRNAs preferentially target PIN hubs and bottlenecks, a pattern similar to that of human-encoded miRNAs. Biological networks display scale-free characteristics, i.e. most of the nodes have a relatively low degree, which makes the networks resistant to attacks on random nodes. It seems that the vulnerability of human protein networks (only a few nodes have a high degree) is successfully exploited by herpesviruses, suggesting that these viruses may have evolved to target key nodes preferentially, allowing them to take maximum control of the human protein network during infection. Although the various roles carried out by virus-encoded proteins have been extensively studied over the past few decades, it is only recently that viral protein-regulated PINs have been studied in detail. Dyer et al. examined human-pathogen protein-protein interactions and found that both viral and bacterial pathogens interacted with human PIN hubs and bottlenecks.
The results of our comparison between common and specific targets suggested that some topological differences existed between nodes related to processes associated with common and specific virus pathogenesis mechanisms. Furthermore, the significant hub and bottleneck proportions for common targets validated the preference of viral miRNAs for hubs and bottlenecks. These results provided valuable information that will help unravel mechanisms associated with herpesvirus pathogenesis.
We also characterized the modularity of common PIN targets. We found that common targets tend to form a larger module and have a higher density than randomly chosen nodes, and that they are located in the global central core of the PIN. During virus-host interactions, viruses use their limited resources to exploit fundamental cellular processes to complete their life cycles. From that perspective, targeting nodes located in the central part of the PIN seems a reasonable strategy for affecting nodes in other parts of the network efficiently and rapidly. The nodes in the central part represent fundamental components of the cell, control over which might be necessary for successful infection. As our GO analysis confirmed, these nodes are related to processes associated with fundamental cellular regulation and development that are indispensable to viral survival.
To test the robustness of our results, two additional algorithms, miRanda and TargetScan, were used to predict the herpesvirus miRNA targets; in addition, a high-confidence protein-protein interaction data set, HPRD (Human Protein Reference Database)-filtered, was used to construct the PIN. Most results agree with those obtained using the PITA algorithm and the HPRD dataset. The detailed results are described in the additional files (see additional files 1, 2 and 3).
In this paper, we focused on the description of statistically significant, functional characteristics of herpesvirus miRNAs involved in the regulation of the human PIN. The analysis described in this report has several limitations. First, the PIN used was incomplete and subject to considerable error rates. Second, the large number of predicted miRNA targets makes experimental validation difficult; moreover, the results of different prediction methods do not fully agree with each other, and our collection of herpesvirus miRNAs might not be complete, since improved miRNA prediction algorithms and a wider implementation of high-throughput techniques might identify new miRNAs. Third, the herpesvirus miRNA-mediated human protein interaction network was not analyzed in the context of herpesvirus protein and human protein interactions, because the available herpesvirus-human protein interaction data are unbalanced and scarce. Despite these limitations, our analysis of herpesvirus miRNA interactions with the human PIN should help to reveal a broader picture of their functional mechanisms at the systems level and add to our knowledge of viral pathogenesis.
In this study, we explored the ability of herpesvirus miRNAs to target the human PIN. Viral miRNAs preferentially target PIN hubs and bottlenecks, behaviour similar to that of human-encoded miRNAs. Topological comparison between specific and common targets showed that common targets have significantly higher degree and betweenness centrality. K-core and ER analysis revealed that common targets occupy the global central part of the PIN. Furthermore, the common targets showed significant modularity. Their crucial topological position in the PIN suggests that they might play a key role in herpesvirus pathogenesis. These results add to our understanding of herpesvirus miRNA functions, give new insights into the complex process of herpesvirus-host interactions, and provide information that can be used in the development of novel antiviral drugs.
Source of miRNAs
miRNA sequences from six herpesviruses (HSV1, HSV2, EBV, KSHV, HCMV and BV) were downloaded from miRBase, including 86 precursor sequences. A total of 138 mature miRNA sequences were used to predict the targets.
miRNA target prediction
We used three miRNA target prediction tools to identify miRNA targets: PITA, miRanda and TargetScan. Using PITA, we followed the standard seed parameter settings and took seeds 6-8 bases long, beginning at position 2 of the miRNA. No mismatches or loops were allowed, but a single G:U wobble was allowed in 7- or 8-mers. We parsed all 3'UTRs from the reference sequences of human mRNAs downloaded from NCBI.
miRanda (version 3.3a) was used with the following parameters: score cutoff = 140, energy cutoff ≤ -7.0, gap opening = -9.0, gap extension = -4.0, 5' scaling = 4.
TargetScan (version 5.0) was also used, without considering the conservation of genes; sites with high context score percentiles (between 50 and 100) were chosen.
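The seed rules stated above can be illustrated with a short Python sketch. This is not PITA itself, which additionally scores site accessibility; the function name `find_seed_sites` and its return format are hypothetical, and the sketch only encodes the stated constraints: seeds of 6-8 bases starting at position 2 of the miRNA, no mismatches or loops, and at most one G:U wobble in 7- or 8-mers.

```python
# Watson-Crick and G:U wobble pairs as (miRNA base, target base), RNA alphabet.
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}


def find_seed_sites(mirna, utr, seed_lengths=(6, 7, 8)):
    """Return (utr_start, seed_length, n_wobbles) for every seed match.

    The seed is mirna[1:1 + k] (position 2 onward, 1-based). The miRNA pairs
    antiparallel to its target, so seed base i pairs with site base k - 1 - i.
    """
    sites = []
    for k in seed_lengths:
        seed = mirna[1:1 + k]
        if len(seed) < k:
            continue
        max_wobbles = 1 if k >= 7 else 0  # wobble only allowed in 7- or 8-mers
        for start in range(len(utr) - k + 1):
            site = utr[start:start + k]
            wobbles = 0
            matched = True
            for i in range(k):
                pair = (seed[i], site[k - 1 - i])
                if pair in WATSON_CRICK:
                    continue
                if pair in WOBBLE and wobbles < max_wobbles:
                    wobbles += 1
                else:
                    matched = False  # mismatch, or too many wobbles
                    break
            if matched:
                sites.append((start, k, wobbles))
    return sites
```

Both sequences are expected in the RNA alphabet (A, C, G, U); a 3'UTR given as DNA would need T replaced by U before calling the function.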
Protein interaction data
HPRD, Release 9, with 9,673 nodes and 39,204 protein-protein interactions (PPIs), was used to analyze the targeting propensity of virus-encoded miRNAs. Among the PPI databases derived exclusively from experiments, HPRD is the most complete and overlaps well with other PPI databases, suggesting that it is the most likely to represent the full panorama of the human PIN.
To obtain a high-confidence data set, we filtered the HPRD data by choosing the interactions supported by at least two experimental conditions or two papers, which yielded the 'HPRD-filtered' set of 6,101 proteins and 14,583 interactions. The 'HPRD-filtered' data were also used to test the robustness of the results.
We obtained the GCC (HPRD: 9,270 nodes and 38,855 interactions and HPRD-filtered: 5,527 nodes and 14,158 interactions) by removing small clusters and single nodes. All topological parameters were computed using GCC.
Topological parameter definitions and computations
Degree denotes the number of edges linked to the specified node in the network.
Betweenness centrality $C_b(n)$ of a node n is computed as

$$C_b(n) = \sum_{s \neq n \neq t} \frac{\sigma_{st}(n)}{\sigma_{st}}$$

where s and t are nodes in the PIN different from n, $\sigma_{st}$ specifies the number of shortest paths from s to t, and $\sigma_{st}(n)$ denotes the number of shortest paths from s to t that pass through n. Betweenness centrality was normalized by the number of node pairs excluding n, so the value of betweenness centrality for each node lies between 0 and 1. In the PIN, proteins bridging two functional modules can gain higher values than proteins within a module.
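As an illustration of this definition, the following Python sketch computes normalized betweenness centrality for an undirected, unweighted network. It uses Brandes' accumulation scheme, a standard algorithm chosen here for brevity and not necessarily what the original analysis software does; the function name and the adjacency-dictionary input format are assumptions.

```python
from collections import deque


def betweenness_centrality(adj):
    """Normalized betweenness for an undirected, unweighted graph.

    adj maps each node to the set of its neighbours (hypothetical format).
    """
    cb = {v: 0.0 for v in adj}
    for s in adj:
        # Breadth-first search from s, counting shortest paths (sigma).
        order = []
        preds = {v: [] for v in adj}
        sigma = {v: 0 for v in adj}
        dist = {v: -1 for v in adj}
        sigma[s] = 1
        dist[s] = 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Back-propagate pair dependencies, farthest nodes first.
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1.0 + delta[w])
            if w != s:
                cb[w] += delta[w]
    n = len(adj)
    pairs = (n - 1) * (n - 2) / 2.0  # unordered node pairs excluding the node
    # Every unordered pair was visited twice (once from each endpoint).
    return {v: (cb[v] / 2.0) / pairs if pairs > 0 else 0.0 for v in adj}
```

On a toy network of two triangles joined by a single edge, the two nodes at the ends of the joining edge obtain the highest values, in line with the remark that proteins bridging modules score higher than proteins inside a module.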
Randomization tests 
To test whether herpesvirus miRNAs had a targeting propensity for hubs, we randomly chose a group of nodes (equal in number to the nodes targeted by the herpesvirus miRNAs) and computed its miRNA-targeted hub proportion. We repeated this procedure 10,000 times. We defined the p-value as the fraction of random groups whose miRNA-targeted hub proportion was greater than the actual miRNA-targeted hub proportion.
We tested the statistical significance of the GCC and of the density of the sub-network formed by common targets by randomly choosing the same number of nodes as there were common targets and recomputing the GCC and the density of the resulting sub-network. We defined the p-value as the fraction of random node sets whose GCC node number (or density) was greater than the actual GCC node number (or density).
Permutation tests were used to examine the significance of the herpesvirus miRNA regulation strength for hubs and non-hubs. We first computed the difference (Ds) between the mean number of miRNAs (or virus types) targeting hubs and that targeting non-hubs; we then shuffled the numbers of miRNAs (or virus types) among all nodes and recomputed the difference (Dr) between hubs and non-hubs. We repeated this procedure 10,000 times and defined the p-value as the fraction of Dr values under random conditions that were greater than Ds. Similar procedures were used to test the significance of the differences in herpesvirus miRNA regulation strength between bottlenecks and non-bottlenecks, and of the difference in degree between common targets and specific targets.
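The hub randomization test can be sketched as follows. The function name and arguments are hypothetical, and the hub set is supplied by the caller. The sketch assumes the proportion is the fraction of the selected nodes that are hubs; because the group size is fixed, the alternative reading (the fraction of all hubs that fall in the group) would give the same p-value.

```python
import random


def hub_enrichment_pvalue(all_nodes, hubs, targets, n_iter=10000, seed=0):
    """Empirical p-value that targets contain more hubs than random groups."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    nodes = list(all_nodes)
    hubs = set(hubs)
    targets = set(targets)
    observed = len(targets & hubs) / len(targets)
    exceed = 0
    for _ in range(n_iter):
        # Random group with the same number of nodes as the real targets.
        sample = rng.sample(nodes, len(targets))
        proportion = sum(1 for v in sample if v in hubs) / len(sample)
        if proportion > observed:  # strictly greater, as stated in the text
            exceed += 1
    return observed, exceed / n_iter
```

With 10,000 iterations the smallest non-zero p-value that can be reported is 0.0001, so a returned value of 0 is best read as p < 0.0001.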
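A minimal sketch of this test is given below, under two assumptions that the text does not spell out: the sub-network is the sub-graph induced by the chosen nodes, and density is the number of edges divided by the number of possible node pairs in that sub-graph. All function names are hypothetical, and `adj` is assumed to map each node to the set of its neighbours, without self-loops.

```python
import random
from collections import deque


def induced_subgraph(adj, nodes):
    """Keep only the chosen nodes and the edges running between them."""
    nodes = set(nodes)
    return {v: adj[v] & nodes for v in nodes}


def giant_component_size(adj):
    """Number of nodes in the largest connected component."""
    seen, best = set(), 0
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        queue, size = deque([start]), 0
        while queue:
            v = queue.popleft()
            size += 1
            for w in adj[v]:
                if w not in seen:
                    seen.add(w)
                    queue.append(w)
        best = max(best, size)
    return best


def density(adj):
    """Edges divided by possible node pairs (assumed definition)."""
    n = len(adj)
    if n < 2:
        return 0.0
    edges = sum(len(nbrs) for nbrs in adj.values()) / 2
    return edges / (n * (n - 1) / 2)


def modularity_pvalues(adj, targets, n_iter=10000, seed=0):
    """Return observed GCC size and density with their empirical p-values."""
    rng = random.Random(seed)
    nodes = list(adj)
    sub = induced_subgraph(adj, targets)
    obs_gcc, obs_den = giant_component_size(sub), density(sub)
    gcc_hits = den_hits = 0
    for _ in range(n_iter):
        rand = induced_subgraph(adj, rng.sample(nodes, len(sub)))
        gcc_hits += giant_component_size(rand) > obs_gcc
        den_hits += density(rand) > obs_den
    return obs_gcc, obs_den, gcc_hits / n_iter, den_hits / n_iter
```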
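The permutation procedure can be sketched as follows. The function name and the two parallel-list inputs (a per-node count and a per-node hub flag) are assumptions made for illustration.

```python
import random


def permutation_pvalue(values, is_hub, n_iter=10000, seed=0):
    """values: per-node counts (miRNAs or virus types); is_hub: parallel bools.

    Returns the observed difference Ds and the fraction of shuffles with Dr > Ds.
    """
    rng = random.Random(seed)

    def mean_difference(vals):
        hub = [x for x, h in zip(vals, is_hub) if h]
        non = [x for x, h in zip(vals, is_hub) if not h]
        return sum(hub) / len(hub) - sum(non) / len(non)

    ds = mean_difference(values)
    shuffled = list(values)
    exceed = 0
    for _ in range(n_iter):
        rng.shuffle(shuffled)  # reassign counts to nodes at random
        if mean_difference(shuffled) > ds:
            exceed += 1
    return ds, exceed / n_iter
```

The same function could serve for the bottleneck comparison by passing a bottleneck flag, and for the comparison of common and specific targets by passing node degrees as `values` and a common-target flag in place of `is_hub`.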
K-core analysis of the protein interaction network 
The k-core of a graph is defined as the maximum sub-graph obtained by recursively pruning all nodes with a degree lower than k; a series of k-cores is obtained by increasing the k value. The excess retention of nodes with property A in the k-core is defined in two steps: (1) computing the proportion of nodes with property A in the whole network, $E_A = N_A/N$, and in the k-core, $e_k^A = N_k^A/N_k$, where $N_k$ is the number of nodes in the k-core and $N_k^A$ the number of those with property A; and (2) obtaining the excess retention (ER) as $ER_k^A = e_k^A / E_A$. ER can be used to measure the distance of a group of nodes from the global center of the network. With the aid of k-cores, nodes are classified by considering both their degree and their placement in the network. By recursively removing nodes with degrees lower than k, network layers can be investigated systematically. Combined with ER analysis, this procedure reveals the extent to which a group of nodes is enriched in the k-core sub-graph and hints at their functional importance.
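The pruning and the excess-retention ratio can be sketched in Python as below. The function names are hypothetical, `adj` is assumed to map each node to the set of its neighbours, and ER is taken as the proportion of the group inside the k-core divided by its proportion in the whole network.

```python
def k_core(adj, k):
    """Return the node set of the k-core of an undirected graph."""
    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    alive = set(adj)
    stack = [v for v in alive if degree[v] < k]
    while stack:
        v = stack.pop()
        if v not in alive:
            continue  # already pruned via another path
        alive.remove(v)
        for w in adj[v]:
            if w in alive:
                degree[w] -= 1
                if degree[w] < k:
                    stack.append(w)  # pruning v pushed w below the threshold
    return alive


def excess_retention(adj, group, k):
    """ER of a node group in the k-core; None if the core or group is empty."""
    core = k_core(adj, k)
    group = set(group) & set(adj)
    if not core or not group:
        return None
    overall = len(group) / len(adj)          # proportion in the whole network
    in_core = len(group & core) / len(core)  # proportion in the k-core
    return in_core / overall
```

An ER above 1 that keeps growing with k indicates that the group is progressively enriched toward the innermost layers, which is the sense in which common targets are described as lying close to the global center.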
GO enrichment analysis
The Cytoscape plugin BiNGO (version 2.42) was used to perform GO enrichment analysis. We selected the nodes in the GCC of HPRD as the reference set and chose 0.01 as the significance level. Hypergeometric tests were used for statistical analysis, and the Benjamini and Hochberg False Discovery Rate (FDR) procedure was used for multiple testing correction.
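The two statistical steps named here can be illustrated with a short Python sketch. This is not BiNGO's code; the function names and argument names are hypothetical. For one GO term, `N` is the size of the reference set, `K` the number of reference genes annotated with the term, `n` the number of common targets and `x` the number of annotated common targets.

```python
from math import comb


def hypergeom_pvalue(N, K, n, x):
    """P(X >= x) when drawing n genes from N, of which K carry the GO term."""
    upper = min(K, n)
    total = comb(N, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(x, upper + 1))
    return hits / total


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest p-value down
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

A GO term would then be reported as enriched when its adjusted p-value falls below the chosen significance level of 0.01.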
This work was supported by the National Key Technologies R&D Program for New Drugs (2009ZX09301-002), National Major Science and Technology Special Project for Infectious Diseases of China (2008ZX10002-011) and the National High Technology Research and Development Program (863 program) of China (No. 2007AA02Z108). The authors also thank anonymous reviewers for their valuable comments and suggestions to improve the quality of the paper.
- Cann AJ: Principles of Molecular Virology. 1997, Academic Press, 4
- Grey F, Hook L, Nelson J: The functions of herpesvirus-encoded microRNAs. Med Microbiol Immunol. 2008, 197: 261-267. 10.1007/s00430-007-0070-1
- Pfeffer S, Zavolan M, Grässer FA, Chien M, Russo JJ, Ju J, John B, Enright AJ, Marks D, Sander C, Tuschl T: Identification of virus-encoded microRNAs. Science. 2004, 304: 734-736. 10.1126/science.1096781
- Scaria V, Hariharan M, Maiti S, Pillai B, Brahmachari SK: Host-virus interaction: a new role for microRNAs. Retrovirology. 2006, 3: 68. 10.1186/1742-4690-3-68
- Cui Q, Yu Z, Purisima EO, Wang E: Principles of microRNA regulation of a human cellular signaling network. Mol Syst Biol. 2006, 2: 46.
- Cui Q, Yu Z, Pan Y, Purisima EO, Wang E: MicroRNAs preferentially target the genes with high transcriptional regulation complexity. Biochem Biophys Res Commun. 2007, 352: 733-738. 10.1016/j.bbrc.2006.11.080
- Liang H, Li W: MicroRNA regulation of human protein protein interaction network. RNA. 2007, 13: 1402-1408. 10.1261/rna.634607
- Hsu C, Juan H, Huang H: Characterization of microRNA-regulated protein-protein interaction network. Proteomics. 2008, 8: 1975-1979. 10.1002/pmic.200701004
- Tibiche C, Wang E: MicroRNA Regulatory Patterns on the Human Metabolic Network. The Open Systems Biology Journal. 2008, 1: 1-8. 10.2174/1876392800801010001
- Gao G, Li J, Kong L, Tao L, Wei L: Human herpesvirus miRNAs statistically preferentially target host genes involved in cell signaling and adhesion/junction pathways. Cell Res. 2009, 19: 665-667. 10.1038/cr.2009.45
- Gottwein E, Cullen BR: Viral and Cellular MicroRNAs as Determinants of Viral Pathogenesis and Immunity. Cell Host & Microbe. 2008, 3: 375-87. 10.1016/j.chom.2008.05.002
- Kampstra P: Beanplot: A boxplot alternative for visual comparison of distributions. Journal of Statistical Software. 2008, 28: 1-9.
- Bartel DP: MicroRNAs: target recognition and regulatory functions. Cell. 2009, 136: 215-233. 10.1016/j.cell.2009.01.002
- Kertesz M, Iovino N, Unnerstall U, Gaul U, Segal E: The role of site accessibility in microRNA target recognition. Nat Genet. 2007, 39: 1278-1284. 10.1038/ng2135
- Wuchty S, Almaas E: Peeling the yeast protein network. Proteomics. 2005, 5: 444-449. 10.1002/pmic.200400962
- Barabasi A: Scale-Free Networks: A Decade and Beyond. Science. 2009, 325: 412-413. 10.1126/science.1173299
- Barabási A, Oltvai ZN: Network biology: understanding the cell's functional organization. Nat Rev Genet. 2004, 5: 101-113. 10.1038/nrg1272
- Dyer MD, Murali TM, Sobral BW: The Landscape of Human Proteins Interacting with Viruses and Other Pathogens. PLoS Pathogens. 2008, 4: e32. 10.1371/journal.ppat.0040032
- John B, Enright AJ, Aravin A, Tuschl T, Sander C, Marks DS: Human MicroRNA targets. PLoS Biol. 2004, 2: e363. 10.1371/journal.pbio.0020363
- Grimson A, Farh KK, Johnston WK, Garrett-Engele P, Lim LP, Bartel DP: MicroRNA targeting specificity in mammals: determinants beyond seed pairing. Mol Cell. 2007, 27: 91-105. 10.1016/j.molcel.2007.06.017
- Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, Balakrishnan L, Marimuthu A, Banerjee S, Somanathan DS, Sebastian A, Rani S, Ray S, Harrys Kishore CJ, Kanth S, Ahmed M, Kashyap MK, Mohmood R, Ramachandra YL, Krishna V, Rahiman BA, Mohan S, Ranganathan P, Ramabadran S, Chaerkady R, Pandey A: Human Protein Reference Database--2009 update. Nucleic Acids Res. 2009, 37: D767-772. 10.1093/nar/gkn892
- Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic Acids Res. 2008, 36: D154-158. 10.1093/nar/gkn221
- Mathivanan S, Periaswamy B, Gandhi TKB, Kandasamy K, Suresh S, Mohmood R, Ramachandra YL, Pandey A: An evaluation of human protein-protein interaction data in the public domain. BMC Bioinformatics. 2006, 7 (Suppl 5): S19. 10.1186/1471-2105-7-S5-S19
- Assenov Y, Ramírez F, Schelhorn S, Lengauer T, Albrecht M: Computing topological parameters of biological networks. Bioinformatics. 2008, 24: 282-284. 10.1093/bioinformatics/btm554
- Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13: 2498-2504. 10.1101/gr.1239303
- Wang E, Purisima E: Network motifs are enriched with transcription factors whose transcripts have short half-lives. Trends in Genetics. 2005, 21: 492-495. 10.1016/j.tig.2005.06.013
- Maere S, Heymans K, Kuiper M: BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics. 2005, 21: 3448-3449. 10.1093/bioinformatics/bti551