The integrated analysis of metabolic and protein interaction networks reveals novel molecular organizing principles
© Durek and Walther; licensee BioMed Central Ltd. 2008
Received: 28 July 2008
Accepted: 25 November 2008
Published: 25 November 2008
The study of biological interaction networks is a central theme of systems biology. Here, we investigate the relationships between two distinct types of interaction networks: the metabolic pathway map and the protein-protein interaction network (PIN). It has long been established that successive enzymatic steps are often catalyzed by physically interacting proteins forming permanent or transient multi-enzyme complexes. Inspecting high-throughput PIN data, it was shown recently that, indeed, enzymes involved in successive reactions are generally more likely to interact than other protein pairs. In our study, we expanded this line of research to include comparisons of the underlying respective network topologies as well as to investigate whether the spatial organization of enzyme interactions correlates with metabolic efficiency.
Analyzing yeast data, we detected long-range correlations between shortest paths between proteins in both network types suggesting a mutual correspondence of both network architectures. We discovered that the organizing principles of physical interactions between metabolic enzymes differ from the general PIN of all proteins. While physical interactions between proteins are generally dissortative, enzyme interactions were observed to be assortative. Thus, enzymes frequently interact with other enzymes of similar rather than different degree. Enzymes carrying high flux loads are more likely to physically interact than enzymes with lower metabolic throughput. In particular, enzymes associated with catabolic pathways as well as enzymes involved in the biosynthesis of complex molecules were found to exhibit high degrees of physical clustering. Single proteins were identified that connect major components of the cellular metabolism and may thus be essential for the structural integrity of several biosynthetic systems.
Our results reveal topological equivalences between the protein interaction network and the metabolic pathway network. Evolved protein interactions may contribute significantly towards increasing the efficiency of metabolic processes by permitting higher metabolic fluxes. Thus, our results shed further light on the unifying principles shaping the evolution of both the functional (metabolic) as well as the physical interaction network.
To ensure stable and efficient operation of metabolic processes in cells, highly coordinated molecular interactions of the involved enzymes and metabolites are necessary. The study of the spatial organizing principles of metabolic pathways has long been a research focus of cellular and molecular biology. Organelle compartmentalization and the organization of enzymatic pathways in so-called metabolons have been discussed as the main cellular-scale and molecular-scale organizational units, respectively, that orchestrate the multiple metabolic processes inside cells and separate as well as integrate them in space and time. First introduced by Srere, the term metabolon describes a non-covalent association of several sequential enzymes involved in a metabolic pathway. Similar to industrial assembly lines, intermediates are passed on from one enzyme to the next, a process referred to as metabolic channeling, leading to an optimized metabolic flux. The stability and structural integrity of metabolons vary greatly, ranging from temporary associations that form dynamically in response to environmental changes to stable, permanent enzyme complexes [2, 3]. Furthermore, it was found that enzyme complexes are often associated with intra-cellular membrane systems [4–6], demonstrating that the spatial organization of the metabolic network is not limited to direct physical interaction of participating enzymes, but also involves mediating structural cellular components that are passive in the context of enzymatic pathways.
Metabolic channeling provides several advantages such as an increase of catalytic efficiency by shorter transition times between the consecutive active sites [7, 8], local enrichment of substrates, protection from toxic intermediates by shielding them from the cellular environment, prevention of decomposition of unstable chemical compounds, overcoming of thermodynamically unfavorable equilibria [10–12], as well as avoidance of competitive pathways [1, 13–17]. Although the concept of metabolic channeling has been discussed controversially at times, it is now supported by metabolic control analysis as well as experimental evidence [5, 18–23].
Recently, Huthmacher and co-workers analyzed the metabolic networks of yeast and Escherichia coli in the context of direct protein interactions as observed in newly available, large-scale protein-protein interaction surveys, allowing a systematic scan for direct protein interactions of consecutive metabolic pathway enzymes [24, 25]. They found higher frequencies of physical interactions between enzymes sharing at least one common metabolite in the network. The chance for enzymes to physically interact was observed to be negatively correlated with the distance between enzymes in the metabolic network in E. coli and, to a lesser degree, in yeast as well. In addition, they reported a higher probability of regulating enzymes to interact with other proteins, where regulating enzymes were defined either by a threshold of Gibbs free energy change of the associated reaction or by their position within the network as being located at highly connecting branching points. Furthermore, the analysis of high-throughput protein-protein interaction data yielded a number of novel candidates for metabolic channeling. Thus, the functional significance of protein-protein interactions for the metabolic pathway organization has been established and is supported by many experimental observations.
Here, we aim to expand the view on protein interactions in the context of metabolic pathways by treating both levels of molecular organization as network graphs and to investigate global as well as local network properties. The representation of complex biological networks as graphs and the study of their properties have contributed to an emerging system-wide approach towards studying the organizing principles of cellular and molecular processes. Global topological graph properties such as the degree distribution have received particular attention and have been discussed in the context of network stability and information exchange within networks [26–31].
The integrated analysis of different network types for different levels or domains of molecular organization has been applied to transfer evidence supporting particular interactions from one network type to another. Ge et al. showed that gene expression and protein interaction data are correlated. Kemmeren and co-workers as well as Deane et al. used gene expression data to assess confidence levels for protein interaction networks [33, 34]. Goldberg and Roth predicted genetic interactions by utilizing protein interaction data, while Kelly and Ideker predicted the physical context of genes. Rhodes et al. used GO annotations, integrated interologs, and expression data, as well as data on protein domains known to interact, to predict protein interactions. The use of gene co-expression data to identify protein interactions has also been demonstrated recently. Finally, Lee et al. integrated expression, gene-fusion, phylogenetic profile, literature co-citation as well as protein interaction data to predict functional associations.
In this study, we expand on the study of Huthmacher and co-workers by investigating the entire protein interaction network and its significance for metabolic networks and metabolic pathways. We extended enzymatic physical interactions to also include non-enzymatic proteins, as metabolic relationships between enzymes may also be mediated by metabolically inactive interface proteins. Specifically, we investigate whether large-scale topological equivalences of both the metabolic and the protein interaction network can be detected. Furthermore, as the physical organization of metabolic pathways is likely to have been under evolutionary optimization to increase metabolic throughput, we study whether available flux data can be correlated with the protein interaction data in support of this hypothesis. So far, protein interaction data have been analyzed primarily across all functional categories. Here, we compare the general organization principles with those observed for the enzymatic protein subset, and report that, indeed, specific differences do exist. The significance of topological parameter distributions has largely been analyzed within the context of the examined network type itself, but not across different network types. For example, Macdonald and co-workers discovered a defined relationship between fluxes going through metabolic network edges and the degree product of the connected nodes. Here, we explore whether such relationships can be established across network types, in particular protein interaction and metabolic networks.
Thus, our investigations aim to establish whether unifying principles shaping the evolution of both protein interactions as well as metabolic pathways can be detected.
Topological Properties of Interaction Networks
We start our investigations by first characterizing the global network properties of the various types of molecular networks examined in this study. Besides the two main network types, the protein interaction network and the metabolic network, further filtering and different construction methodologies were applied to reveal organizational differences between raw networks including all interactions and networks designed specifically to capture aspects of metabolism, and to safeguard against possible artifacts resulting from a particular reconstruction scheme.
Protein Interaction Networks (PIN)
The raw PIN (rPIN, see Methods) derived from the merged databases of DIP and BIOgrid comprises 5,438 proteins involved in 39,766 physical interactions. The network does not differentiate whether the interaction between two proteins is transient or permanent, or under which conditions the proteins were found to interact, or the functional relevance of the association. As the PINs used in this study are represented as undirected graphs, the functionality of an interaction cannot be resolved. A kinase interacting with a protease may activate the protease or be degraded by it.
To analyze aspects of the protein interaction network that are specifically associated with metabolic functions, we identified proteins with primarily non-metabolic functions and processes, together with their associated interactions. The rPIN comprises 1,186 proteins related to DNA processing functions with 21,952 associated interactions, 297 protein-degradation related proteins involved in 4,999 interactions, and 267 kinase-phosphatase associated proteins with 8,251 associations, as well as 2,300 other non-metabolic, rather unspecific proteins involved in 34,230 interactions. These interaction sets partially overlap, as proteins from different groups were also reported to interact. After removing these interactions, the remaining nodes span a graph of 1,517 proteins, which can be considered to be the key molecular components responsible for maintaining the metabolic machinery. We will refer to this graph as the filtered PIN (fPIN). Of the 1,517 proteins, 522 represent enzymes annotated with an EC-number. The fPIN comprises 1,086 links, with 289 interactions between enzyme pairs. One third of all nodes are included in the graph's giant component, the largest connected sub-graph. We left unconnected nodes in the graph as the absence of interactions of such proteins may also be significant. In comparison to the raw network, the number of enzymes (869 in the rPIN) is lower, because in the fPIN, non-metabolic enzymes such as protein kinases and protein phosphatases have been excluded.
Further removal of proteins not assigned to at least one EC-number led to the enzyme-only-PIN (ePIN), a graph comprising only enzymes and the interactions between them. Its giant component contains 19% of all nodes. Thus, with applied filtering, the PIN became progressively disintegrated.
As observed for the rPIN, and even more convincingly, the connectivity distribution of both networks, the fPIN and ePIN, follows a power law behavior with respective scale-free exponents of 2.0 for the fPIN, and 1.6 for the ePIN (Figure 1A). Compared to the rPIN and explained by the removal of many non-specific interactions, the fraction of highly-connected nodes is reduced in the fPIN and ePIN with a simultaneous increase of unconnected nodes. The characteristic length (CL) of the fPIN is 8.16 and for the ePIN 6.22, which is approximately twice as long as the CL associated with rPIN (3.49) suggesting that, in particular, highly connected nodes providing shortcuts have been removed in the fPIN and ePIN compared to the rPIN even though the networks as such are smaller as nodes have been deleted. Note that impossible paths (no connection between nodes) have not been included in the calculation of CL.
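The characteristic length described above, computed only over node pairs that are actually connected, can be illustrated with a minimal sketch. The function names and the toy graph are illustrative assumptions and are not taken from the original analysis; breadth-first search is used here simply because the graphs are unweighted.

```python
from collections import deque


def bfs_distances(adj, source):
    """Return hop distances from source to every node reachable from it."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nb in adj[node]:
            if nb not in dist:
                dist[nb] = dist[node] + 1
                queue.append(nb)
    return dist


def characteristic_length(edges, nodes=None):
    """Mean shortest-path length over connected node pairs only.

    Pairs without a connecting path are skipped, mirroring the note that
    impossible paths are not included in the calculation of CL.
    """
    adj = {n: set() for n in (nodes or [])}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    total, pairs = 0, 0
    for source in adj:
        for target, d in bfs_distances(adj, source).items():
            if target != source:
                total += d
                pairs += 1
    return total / pairs if pairs else float("nan")


# Hypothetical toy graph: a path A-B-C-D, a separate pair E-F, an isolated node G.
toy_edges = [("A", "B"), ("B", "C"), ("C", "D"), ("E", "F")]
toy_cl = characteristic_length(toy_edges, nodes=["G"])
```

In this toy graph the isolated node and the unreachable pairs do not contribute, so removing shortcut nodes can raise CL even while the network shrinks.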
Thus, the organizing principles governing protein interactions between proteins involved in metabolic functions appear to be different than for other functional categories. While PINs generally are dissortative, protein interactions associated with metabolic functions appear to be assortative; i.e. enzymes preferentially interact with other enzymes of similar degree (connectivity).
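To make the notion of assortativity concrete, the following sketch computes a degree correlation coefficient as the Pearson correlation between the degrees found at the two ends of each edge. This is a common definition and is assumed here for illustration; the function name and toy graphs are hypothetical, and the input is assumed to be a simple graph without duplicate edges or self-loops.

```python
def degree_assortativity(edges):
    """Pearson correlation of the degrees at both ends of every edge."""
    degree = {}
    for u, v in edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1
    # Each undirected edge is counted in both directions so that the
    # measure is symmetric in the two end points.
    xs, ys = [], []
    for u, v in edges:
        xs += [degree[u], degree[v]]
        ys += [degree[v], degree[u]]
    n = len(xs)
    mean = sum(xs) / n  # xs and ys hold the same values, so one mean suffices
    cov = sum((x - mean) * (y - mean) for x, y in zip(xs, ys)) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return cov / var if var > 0 else float("nan")


# One-to-many hub (for example a regulator and its targets): disassortative.
star = [("hub", "t1"), ("hub", "t2"), ("hub", "t3"), ("hub", "t4")]

# Two separate complexes in which all members interact: assortative.
complexes = [("a", "b"), ("a", "c"), ("b", "c"),
             ("w", "x"), ("w", "y"), ("w", "z"),
             ("x", "y"), ("x", "z"), ("y", "z")]

r_star = degree_assortativity(star)
r_complexes = degree_assortativity(complexes)
```

The two toy graphs mark the extremes: a hub contacting low-degree partners gives a negative coefficient, whereas complexes of equally connected members give a positive one.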
The Metabolic Interaction Networks (MIN)
We analyzed three different realizations of metabolic interaction networks (MIN), each representing metabolic pathways from a different perspective. The first two representations of a metabolic interaction network are the Enzyme Interaction Network (EIN) and the EIN derived from KEGG pathway maps (mapEIN), where the nodes of the graph are enzymes with assigned EC-numbers. In the EIN, two enzymes are linked if they are associated by at least one product-substrate relationship. For constructing the mapEIN, we extracted relations from KEGG pathway maps directly rather than scanning reaction lists for product-substrate relationships as done for the EIN. While the EIN comprises a large number of enzymes and their relations, the mapEIN may better capture the established biochemical knowledge of metabolic pathways. The Compound (Metabolite) Interaction Network (CIN) is the third representation of MINs. In this graph, nodes are metabolites, and links are drawn between them if they are connected by at least one reaction.
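The two reaction-list based constructions can be sketched as follows. The reaction records, the currency set, and the function names are illustrative assumptions; in particular, this sketch links every remaining substrate to every remaining product in the compound graph, which is a simplification and not necessarily the curated main-metabolite relation used for the CIN. Reversibility is not modeled here.

```python
from itertools import combinations

# Illustrative currency metabolites only; not the list used in the study.
CURRENCY = frozenset({"H2O", "H+", "CO2", "ATP", "ADP", "NADH", "NAD+"})

# Hypothetical toy reaction list: (EC number, substrates, products).
REACTIONS = [
    ("2.7.1.1", {"glucose", "ATP"}, {"glucose-6-phosphate", "ADP"}),
    ("5.3.1.9", {"glucose-6-phosphate"}, {"fructose-6-phosphate"}),
    ("2.7.1.11", {"fructose-6-phosphate", "ATP"},
     {"fructose-1,6-bisphosphate", "ADP"}),
    ("2.7.1.40", {"phosphoenolpyruvate", "ADP"}, {"pyruvate", "ATP"}),
]


def build_ein(reactions, currency=frozenset()):
    """Link two enzymes if a product of one is a substrate of the other."""
    edges = set()
    for (ec1, s1, p1), (ec2, s2, p2) in combinations(reactions, 2):
        if ec1 == ec2:
            continue
        shared = ((p1 & s2) | (p2 & s1)) - currency
        if shared:
            edges.add(frozenset((ec1, ec2)))
    return edges


def build_cin(reactions, currency=frozenset()):
    """Link two compounds if at least one reaction converts one into the other."""
    edges = set()
    for _, substrates, products in reactions:
        for s in substrates - currency:
            for p in products - currency:
                if s != p:
                    edges.add(frozenset((s, p)))
    return edges


ein_main = build_ein(REACTIONS, CURRENCY)   # currency metabolites ignored
ein_all = build_ein(REACTIONS)              # ATP/ADP create extra enzyme links
cin_main = build_cin(REACTIONS, CURRENCY)
```

Even in this four-reaction example, ATP and ADP double the number of enzyme-enzyme links, which is the effect that motivates filtering ubiquitous compounds.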
The EIN comprises 3,435 nodes representing unique EC-numbers. The connectivity distribution, P(k), of the graph approximately follows a scale-free distribution with an estimated scale-free exponent γ of 1.8 (Figure 1B). As observed for the rPIN, a deviation from a simple power law behavior is evident (see above). Notably, the distribution follows a power law only if small, ubiquitously occurring, so-called currency metabolites, such as H+, NH3, H2O, CO2, and metal ions, as well as co-enzymes and co-substrates, like CoA, NADH+, FAD, SAM, are excluded (Figure 1D). Including these compounds significantly increases the degree of the enzymes interacting with them, resulting in a distribution P(k) that deviates from the power law distribution for high connectivity values (Figure 1D). Upon including currency metabolites, the total number of edges increases from 60,622 to 140,260 and the characteristic length, CL, decreases from 3.64 to 3.00.
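As an illustration of how a scale-free exponent γ can be read off a connectivity distribution, the sketch below fits a straight line to log P(k) versus log k by ordinary least squares. The fitting procedure actually used for the reported exponents is not described in this section, so this is an assumption made for illustration only; least-squares fits on log-log data are known to be sensitive to noise in the tail, and the function name and synthetic degree list are hypothetical.

```python
from collections import Counter
from math import log


def estimate_gamma(degrees):
    """Estimate gamma in P(k) ~ k**(-gamma) by a log-log least-squares fit."""
    counts = Counter(k for k in degrees if k > 0)  # degree 0 has no logarithm
    if len(counts) < 2:
        raise ValueError("need at least two distinct positive degrees")
    n = sum(counts.values())
    xs = [log(k) for k in counts]
    ys = [log(c / n) for c in counts.values()]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return -slope


# Synthetic degree list in which the number of nodes of degree k is 3600 / k**2.
synthetic_degrees = [k for k in range(1, 7) for _ in range(3600 // (k * k))]
gamma = estimate_gamma(synthetic_degrees)
```

For the synthetic list, which follows a power law exactly, the fit recovers an exponent of 2 up to floating-point error.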
The mapEIN comprises 1,957 nodes connected by 6,395 relations. The scale-free exponent of the connectivity distribution, γ, was computed as 1.2 with an increased probability of nodes to be less connected as compared to the EIN, where many more relationships between enzymes are possible simply via their possible substrate-product relationships. The CL of the mapEIN network was determined as 6.62.
The third representation, the CIN, comprises 3,702 metabolites connected by 4,868 links. As done for the EIN, the currency metabolites, co-enzymes and co-substrates have been removed prior to analysis. The connectivity distribution of the CIN exhibits a scale-free exponent, γ, of 2.4 and CL of 12.3 (Figure 1C).
All three MIN graphs are assortative with assortativity values, r_d, of 0.43, 0.26, and 0.09 in the EIN, mapEIN, and CIN, respectively. Correspondingly, an increasing neighbor connectivity, NC(k), was observed for increasing connectivity, k (Figure 2B, C). The high assortativity value for the EIN probably results from the construction procedure. The EIN was constructed by scanning for product-substrate relationships. As reactions are generally treated as reversible, so that the lists of substrates and products are interchangeable, all enzymes sharing a metabolite may be linked through substrate-product relations and form a complete sub-graph. While the high assortativity of the EIN may originate from the reconstruction method, possibly resulting in too many connections, this may not be the case for the mapEIN as the interactions have been curated manually. However, many reactions in the KEGG-maps are known to be performed by isoenzymes carrying different EC numbers. Since reactions are treated as reversible, isoenzymes will be considered connected as the product of one isoenzyme can be the substrate of another, even though it is the same reaction they are catalyzing. Thus, a set of isoenzymes will form a fully-connected sub-graph, also including the enzymes of the preceding or subsequent reaction step as each isoenzyme is connected to them. The reconstruction of the CIN avoids this problem. This third representation of MINs is closest to the biological and intuitive understanding of metabolic pathways. A pathway in this sense is the path from a first substrate to a final product. The difference in the respective construction methods is also reflected by the average clustering coefficient (CC), which was 0.67 for the EIN, 0.47 for the EIN from KEGG-Maps (mapEIN), and 0.06 for the CIN.
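The effect of isoenzymes on clustering can be reproduced on a toy example. The sketch below computes an average clustering coefficient and compares an enzyme-centered view, in which three hypothetical isoenzymes I1 to I3 and their preceding (P) and subsequent (S) enzymes form overlapping complete sub-graphs, with the corresponding compound-centered view. The names are illustrative, and nodes with fewer than two neighbors are assumed to contribute a coefficient of zero, which is one of several conventions in use.

```python
from itertools import combinations


def average_clustering(edges):
    """Mean local clustering coefficient; nodes of degree < 2 count as 0."""
    adj = {}
    for u, v in edges:
        if u != v:
            adj.setdefault(u, set()).add(v)
            adj.setdefault(v, set()).add(u)
    if not adj:
        return float("nan")
    coefficients = []
    for neighbors in adj.values():
        k = len(neighbors)
        if k < 2:
            coefficients.append(0.0)
            continue
        # Count how many pairs of neighbors are themselves linked.
        links = sum(1 for a, b in combinations(neighbors, 2) if b in adj[a])
        coefficients.append(2.0 * links / (k * (k - 1)))
    return sum(coefficients) / len(coefficients)


iso = ["I1", "I2", "I3"]
# Enzyme view: every enzyme sharing a metabolite with another is linked to it.
enzyme_view = set(combinations(["P"] + iso, 2)) | set(combinations(iso + ["S"], 2))
# Compound view of the same three reaction steps: a simple chain A -> B -> C -> D.
compound_view = [("A", "B"), ("B", "C"), ("C", "D")]

cc_enzyme = average_clustering(enzyme_view)
cc_compound = average_clustering(compound_view)
```

The same three reaction steps yield a densely clustered enzyme graph but an unclustered compound chain, in line with the ordering of the CC values reported for the EIN and the CIN.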
Summary of global network properties associated with the different types of PINs and MINs investigated in this study. Listed properties include the number of nodes, the number of edges, and the characteristic length, CL.
Correlation of protein interaction networks (PINs) and associated metabolic interaction networks (MINs)
Nodes in the EIN and mapEIN represent enzymes. It is therefore possible to link enzymes found in PINs to the EIN and mapEIN via their annotated EC-numbers. Enzymes from PINs can be linked to metabolites from the CIN network via the enzyme (EC-number)-substrates and -product relationships. Thus, it is possible to directly relate network distances of proteins (enzymes) across both network types (PINs and MINs) allowing us to study, how metabolic network or pathway distances are reflected in protein interaction networks.
A direct correspondence between the protein interaction networks and metabolic networks; i.e. the physical organization of enzyme interactions follows directly their reaction pathway network, would be reflected by red-colored squares – indicating increased occurrence compared to random expectation – along the diagonal in the pair-distance matrices shown in Figure 3. Indeed, the distributions of the enrichments and depletions of the distance-pairs reveal an overall correlation of the shortest paths in PINs and MINs. All PINs show a strong enrichment of direct interactions; i.e. distance 1, in relation to the EIN and mapEIN. Furthermore, an overall correlation of distance pairs with increased numbers of observations relative to the random background (red squares along the diagonal, blue squares primarily off-diagonally) for all PIN-MIN comparisons is evident, especially for the fPIN (Figure 3, central column). Interestingly, enzymes appear more closely related (shorter distances between them) in the fPIN in comparison to their distances derived from their metabolic network association (mapEIN and CIN), as the off-diagonal pattern of red-colored squares indicates a skewed distribution towards larger shortest paths between linked proteins in the MIN compared to their distances in PINs. Thus, it appears that enzymes catalyzing enzymatic steps of some medium distance, i.e. not directly subsequent to each other, are physically brought into contact via protein-protein interactions involving proteins that are not necessarily directly participating in the actual metabolic pathway. For the EIN, above-diagonal enrichments were not observed when compared to the fPIN and ePIN, possibly a consequence of the network reconstruction procedure that allows very many interactions leading to a highly connected metabolic network. 
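The pair-distance comparison can be sketched as follows: enzyme pairs are binned by their shortest-path distance in the PIN and in the MIN, and each bin is compared against a random background. The randomization used here, shuffling the labels of the shared nodes, is an assumption made for illustration and may differ from the procedure behind Figure 3; all function names and the toy networks are hypothetical.

```python
import random
from collections import Counter, deque
from statistics import mean, pstdev


def all_distances(edges):
    """Shortest-path distances for all connected ordered node pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    dist = {}
    for source in adj:
        seen = {source: 0}
        queue = deque([source])
        while queue:
            node = queue.popleft()
            for nb in adj[node]:
                if nb not in seen:
                    seen[nb] = seen[node] + 1
                    queue.append(nb)
        for target, d in seen.items():
            if target != source:
                dist[(source, target)] = d
    return dist


def pair_counts(dist_pin, dist_min, mapping=None):
    """Count pairs by (PIN distance, MIN distance); mapping relabels MIN nodes."""
    mapping = mapping or {}
    counts = Counter()
    for (a, b), d_pin in dist_pin.items():
        if a < b:  # count each unordered pair once
            key = (mapping.get(a, a), mapping.get(b, b))
            if key in dist_min:
                counts[(d_pin, dist_min[key])] += 1
    return counts


def distance_zscores(pin_edges, min_edges, n_random=200, seed=0):
    """Observed distance-pair counts and their z-scores versus label shuffles."""
    rng = random.Random(seed)
    d_pin, d_min = all_distances(pin_edges), all_distances(min_edges)
    observed = pair_counts(d_pin, d_min)
    shared = sorted({n for p in d_pin for n in p} & {n for p in d_min for n in p})
    samples = []
    for _ in range(n_random):
        shuffled = shared[:]
        rng.shuffle(shuffled)
        samples.append(pair_counts(d_pin, d_min, dict(zip(shared, shuffled))))
    zscores = {}
    for cell, obs in observed.items():
        values = [s[cell] for s in samples]
        sd = pstdev(values)
        zscores[cell] = (obs - mean(values)) / sd if sd > 0 else float("nan")
    return observed, zscores


# Hypothetical toy case: four enzymes in a chain in both networks.
toy_pin = [("E1", "E2"), ("E2", "E3"), ("E3", "E4")]
toy_min = [("E1", "E2"), ("E2", "E3"), ("E3", "E4"), ("E4", "E5")]
observed, zscores = distance_zscores(toy_pin, toy_min)
```

In the toy case all observed pairs fall on the diagonal, which corresponds to the pattern of red squares along the diagonal described above; positive z-scores mark enrichment and negative ones depletion.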
As the fPIN contains both enzymes and structural proteins, some proteins included in this network may function as connector or bridging proteins holding distant parts of metabolic pathways together. In the ePIN, such proteins have been filtered out, leaving only enzymes in the PIN. Here, the enrichment pattern follows the main diagonal, but at weaker significance as the absolute numbers are smaller. A direct comparison of shortest paths between enzyme pairs connected via valid paths in both the fPIN and the ePIN yielded mean distances of 5.3 for the fPIN and 6.2 for the ePIN (p = 0; paired, two-tailed t-test, N = 5,531). Thus, metabolic enzymes are brought into spatial proximity, by way of protein-protein interactions, via interactions mediated by non-metabolically active proteins.
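The paired comparison of shortest paths can be illustrated on a toy network in which a single non-enzymatic protein bridges the two ends of an enzyme chain. The sketch derives the enzyme-only graph as the sub-graph induced by the enzymes and averages distances over enzyme pairs that are connected in both graphs; names and data are hypothetical, and the significance test itself is not reproduced.

```python
from collections import deque
from itertools import combinations


def build_adj(edges):
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def bfs(adj, source):
    """Hop distances from source; nodes absent from adj have no neighbors."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nb in adj.get(node, ()):
            if nb not in dist:
                dist[nb] = dist[node] + 1
                queue.append(nb)
    return dist


def paired_mean_distances(fpin_edges, enzymes):
    """Mean enzyme-enzyme distance in the full and in the enzyme-only graph.

    Only pairs with a valid path in both graphs are used, so the two means
    refer to exactly the same set of enzyme pairs.
    """
    enzymes = sorted(set(enzymes))
    keep = set(enzymes)
    epin_edges = [(u, v) for u, v in fpin_edges if u in keep and v in keep]
    adj_f, adj_e = build_adj(fpin_edges), build_adj(epin_edges)
    dist_f = {e: bfs(adj_f, e) for e in enzymes}
    dist_e = {e: bfs(adj_e, e) for e in enzymes}
    pairs = [(a, b) for a, b in combinations(enzymes, 2)
             if b in dist_f[a] and b in dist_e[a]]
    if not pairs:
        return float("nan"), float("nan"), 0
    mean_f = sum(dist_f[a][b] for a, b in pairs) / len(pairs)
    mean_e = sum(dist_e[a][b] for a, b in pairs) / len(pairs)
    return mean_f, mean_e, len(pairs)


# Enzyme chain E1..E5 plus a hypothetical structural protein S linking E1 and E5.
toy_fpin = [("E1", "E2"), ("E2", "E3"), ("E3", "E4"), ("E4", "E5"),
            ("E1", "S"), ("S", "E5")]
mean_f, mean_e, n_pairs = paired_mean_distances(toy_fpin, ["E1", "E2", "E3", "E4", "E5"])
```

Removing the bridging protein lengthens the mean enzyme-enzyme distance in the toy case, the same direction of effect as the difference between 5.3 and 6.2 reported above.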
Correlation between pairwise network distances of proteins in MINs compared to their respective distance in PINs.
The correlation of metabolic fluxes carried by enzymes and their Protein Interaction Network properties
Correlations between the relative flux rates between neighboring (physically interacting) enzymes and their PIN properties. Listed correlations: connectivity versus flux rate of nodes, centrality versus flux rate of nodes, and flux rate versus flux rate of neighbors.
Physical interactions in high-throughput catabolic pathways and synthesis pathways of complex metabolites
Only a minor fraction of pyruvate (flux rate of 6 relative to a glucose uptake flux rate set to 100) appears to be channeled to the TCA-cycle; that is, 3% of the initial glucose influx is processed by the TCA-cycle enzymes, as one glucose molecule may lead to the formation of two pyruvate molecules. The production of pre-stage substrates of amino acids rather than energy production is the main function of the TCA-cycle. The flux rates decrease to 1 beyond the succinate dehydrogenase (SDH) reaction step. This path leads through PYK1, PDH and the following enzymes of the TCA-cycle: citrate synthase (CIT), isocitrate dehydrogenase (IDH), 2-oxoglutarate dehydrogenase complex (KGD), succinyl-CoA synthetase (LSC) and SDH. The enzymes of the TCA-cycle exhibit a relatively low number of physical interactions (Figure 5C). The interactions are mainly pooled in the enzyme complexes SDH (SDH1/2/3, TCM62), LSC (LSC1/2) and KGD (KGD1/2, LPD1), which perform the individual reaction steps of the TCA-cycle. Only the reactions of malate dehydrogenase (MDH1) and CIT are physically connected. However, taking the prior reactions of PYK1 and PDH into account, the TCA reactions reveal a denser interaction cluster. PDH interacts with KGD, sharing the common subunit lipoamide dehydrogenase (LPD1). PYK1 interacts with PDH, KGD as well as IDH (Figure 5C).
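Written out, the 3% figure follows from the stoichiometry of two pyruvate molecules per glucose molecule. The symbols v_TCA and v_glc are introduced here only for illustration:

```latex
\frac{v_{\mathrm{TCA}}}{2\, v_{\mathrm{glc}}} = \frac{6}{2 \times 100} = 0.03 = 3\%
```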
The remaining reported direct physical interactions contained in the fPIN between enzymes detected within metabolic pathways are distributed throughout anabolic pathways. Most interactions are found in the biosynthesis of ergosterol, ubiquinone, sphingolipid and glycogen (Additional File 3). Single links between enzymes can be found in biosynthetic pathways of pyrimidine, leucine, isoleucine, and lysine. Within the fatty acid synthesis pathway, the malic enzyme (MAE1) interacts with the alpha subunit of FAS and Acetyl-CoA carboxylase (ACC1) (Additional File 3).
Figures 5B, C and the Additional File 3 provide a comprehensive account of all reported protein interactions mapped to canonical metabolic pathways from the SGD database; i.e. for pathways not included in this figure, no protein interaction was contained in the PIN.
Central proteins in the fPIN
Ten most central proteins in the fPIN.
ATP synthase H chain, mitochondrial
Enoyl reductase, very long fatty acid elongation
Oxidoreductase, fatty acid elongation
Cytochrome c oxidase subunit 1
D-3-phosphoglycerate dehydrogenase 1
Cytochrome c oxidase polypeptide Vb, mitochondrial [Precursor]
Bifunctional glutamine-dependent carbamoyl-phosphate synthase, Aspartate carbamoyl-transferase
Phosphoglycerate mutase 1
NAD-dependent malic enzyme, mitochondrial [Precursor]
TSC13 and IFA38, which are responsible for the elongation of very long fatty acids, connect enzymes involved in the biosynthesis of membrane lipids by interacting with enzymes from the biosynthesis of fatty acids, steroids and related metabolites, phosphatidylcholine, -serine and -ethanolamine, suggesting that the pathways are brought into spatial proximity via protein-protein interactions (Figure 7B).
For comparison, other centrality measures for the top 10 most influencing proteins are listed in Table 4. While an overall correlation between the centrality measures is evident (correlation coefficients and associated p-values are provided in the legend of Table 4), each centrality measure identifies particular aspects of centrality and does not correspond directly to the robustness measure used here.
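One simple way to quantify how strongly a single protein holds a network together is to measure how much the largest connected component shrinks when that protein is removed. The sketch below implements this idea; it is an illustrative assumption and not necessarily the robustness measure used to rank the proteins in the tables, and the node names are hypothetical.

```python
def largest_component(adj, removed=frozenset()):
    """Size of the largest connected component, ignoring removed nodes."""
    seen = set(removed)
    best = 0
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        stack, size = [start], 0
        while stack:
            node = stack.pop()
            size += 1
            for nb in adj[node]:
                if nb not in seen:
                    seen.add(nb)
                    stack.append(nb)
        best = max(best, size)
    return best


def removal_impact(edges):
    """Drop in giant-component size caused by deleting each node in turn."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    base = largest_component(adj)
    return {node: base - largest_component(adj, {node}) for node in adj}


# Two tightly connected modules (A, B, C and D, E, F) joined only through X.
toy_edges = [("A", "B"), ("B", "C"), ("A", "C"),
             ("C", "X"), ("X", "D"),
             ("D", "E"), ("E", "F"), ("D", "F")]
impact = removal_impact(toy_edges)
most_critical = max(impact, key=impact.get)
```

In the toy network the connector X has only two interaction partners, yet its removal splits the graph, which illustrates why such a measure need not agree with degree-based centrality.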
Our investigations integrated protein interaction networks with metabolic networks to study the extent to which metabolic pathways; i.e. functional processes, are pre-formed in the underlying structural interaction network, i.e. the "plumbing" of cellular components. The networks examined here were derived from different sources of information and provide different views on the metabolic as well as protein interaction systems.
We discovered that sub-systems of the entire protein-protein interaction network may follow specific organizing principles. While interactions associated with signaling and other regulatory processes (e.g. transcriptional regulation via DNA-interaction associated proteins) were found to be dissortative, i.e. proteins of high degree interact with proteins of low degree, interactions between metabolic enzymes were observed to be assortative such that enzymes frequently interact with other enzymes of similar degree (Table 1). Regulatory processes may often involve hierarchical one-to-many associations such as master regulators (e.g. kinases) and their respective individual target proteins. Physical interactions between metabolic enzymes, on the other hand, appear to generally follow a more horizontal organization with enzymes participating in larger complexes or sequential one-to-one interaction chains. Nonetheless, we identified interaction hub enzymes that are located at central positions integrating several metabolic systems and whose removal would severely impact the structural integrity of larger portions of the metabolism-focused protein interaction network (fPIN, Table 4, Figure 7A, B).
When dealing with characteristics of protein-protein interaction networks, possible technological biases as well as biases introduced by targeted scientific interest are always a concern. To best avoid this problem, it would be ideal to use strictly unbiased datasets for analysis. However, such fully unbiased datasets are not (yet) available, as this would require nothing less than an identification of all true and relevant protein-protein interactions occurring inside cells. Presently, we have to resort to the best available unbiased datasets generated by high-throughput screens. As the BIOGRID data contains information about the source of each interaction, it is possible to evaluate the assortativity of the biggest subsets in the database that were obtained from high-throughput experiments, namely Krogan et al. comprising 1,669 nodes involved in 2,682 interactions, and Gavin et al. with 2,682 nodes involved in 8,138 interactions. Reducing the filtered PIN to these subsets yields two sub-fPINs comprising 364 nodes involved in 411 interactions, and 757 nodes involved in 475 interactions, respectively. The reduced fPINs exhibit an assortativity of 0.31 and 0.15, respectively, confirming the results obtained for the whole fPIN. Correspondingly, for the rPIN, a reduced assortativity was obtained for both datasets with 0.15 for the Gavin set (Nnodes = 1,669, Ninteractions = 10,992) and -0.01 (Nnodes = 2,682 and Ninteractions = 8,138) for the Krogan set. Thus, given the available datasets, the increased positive assortativity of filtered/enzyme PINs does not appear to result from a bias towards well-studied enzymes.
We generated three different versions of the metabolic interaction network (MIN). The enzyme interaction network, EIN, was introduced to capture all possible metabolic interactions between enzymes, whereas the mapEIN transformed the pathway knowledge available in KEGG into a metabolic network. The compound interaction network, CIN, was created as an alternative and focuses on main metabolites as network nodes rather than enzymes. With regard to our main research focus, the topological equivalence of protein interactions and metabolic pathways, all three versions yielded significant positive correlations between the respective shortest paths across both network types (Table 2). Thus, the reported results proved robust against details of the network reconstruction approach. All three MIN-versions were reported here with positive assortativity (Table 1), while a negative assortativity of metabolic networks has been reported elsewhere (degree correlation coefficient of -0.24). We note that this difference is caused by the elimination of ubiquitous (currency) metabolites and the inclusion of only main metabolic substrates and products in this work. Including all metabolites in the CIN yielded a degree correlation coefficient, r_d, of -0.3 and an increased mean cluster coefficient of 0.7. As currency metabolites such as ATP follow a one-to-many network motif, thereby also introducing many more edges in the network, the dissortativity obtained when including them as well as the increased mean cluster coefficient can be rationalized.
The decision on the exact procedure to generate metabolic networks must remain operational and may depend on the objective of the study at hand. Defining metabolic networks based on carbon atomic traces in metabolic reactions resulted in different topological characteristics of metabolic networks than for the commonly used approaches.
The interaction networks investigated in this study vary regarding their graph parameters, such as characteristic length (CL) values and scale-free exponents, and also differ from some previously reconstructed networks. Generally, biological networks tend to be scale-free with associated scale-free exponents reported below two, which was suggested to result from an evolutionary mechanism driven by gene duplication. However, larger exponents have been reported for the CIN. Jeong et al. analyzed compound interaction networks of 43 organisms. The average CL was observed as 3.29 ± 0.11 and the average scale-free exponent as 2.18 ± 0.09. While the scale-free exponent reported here (2.4) is in line with the reported average value, the CL reported here is much larger (12.3). The difference can be attributed mainly to the different approaches taken to reconstruct the CIN, and to the filtering mechanisms applied to remove compounds that are not directly relevant for main biochemical pathway routes, such as co-factors. Here, we followed the concept of main metabolite relations introduced by Kotera and co-workers and annotated accordingly in the KEGG database [52, 53]. By contrast, the networks of Jeong et al. comprised all relations between all substrates and products, including currency metabolites such as H2O or ATP, rendering the CL much smaller.
In their analysis of the E. coli metabolic pathway network, Wagner and Fell reported a mean CL of 3.8. This value compares favorably with our value (3.64) for the yeast EIN, which corresponds to the network analyzed by Wagner and Fell. A similar value has also been reported by Huthmacher et al. (CL = 3). Similarly, the scale-free exponents agree well (-1.3 Wagner and Fell; -1.2 reported here for yeast). Kotera et al. reported a CL of 9 for the equivalent of our CIN. The larger value we obtained (CL = 12.3) is explained by the exclusion of currency metabolites in our analysis. The newly introduced mapEIN (CL = 6.62) is not directly comparable to previous studies. It was constructed to capture our accumulated knowledge of biochemical pathways represented in KEGG, with nodes representing enzymes rather than compounds.
For the rPIN, our reported values for graph properties such as CL and scale-free exponent agree well with previously reported data [31, 54]. For the other PIN types studied by us, no comparative data are available.
In their analysis of protein interaction data in the context of metabolic pathways, Huthmacher and co-workers focused on direct interactions between enzymes catalyzing consecutive metabolic reaction steps. Here, we expanded the scope of an integrative analysis by also showing that such correlation between metabolic and protein interaction data is discernible even at larger distances. Of course, an increased probability for consecutive enzymes to interact naturally leads to correlations at larger distances as well, even though the significance can be expected to drop. We showed that such large-scale topological correspondence between the PIN and the MIN indeed exists, adding further evidence for the significance of physical interactions for the functioning of metabolic reactions. Our analysis also revealed that shortest paths between two enzymes appear to be shorter in the PIN than in the metabolic network (Figure 3, elevated z-scores above the diagonal), especially when the allowed physical interactions also include proteins not actively participating in enzymatic reactions (fPIN). A direct comparison of shortest paths between enzyme pairs connected via valid paths in both the fPIN and the ePIN yielded a mean distance of 5.3 for the fPIN, where non-metabolically active proteins are still included, compared to 6.2 for the ePIN. Thus, our analyses suggest that such metabolically passive proteins may function as interface components to spatially organize enzymatic pathways.
The functional significance of topological parameters of molecular networks has largely been analyzed within the context of the examined network type itself, such as the reported relationship between fluxes passing through metabolic network edges (reactions) and the degree product of the connected nodes, but not across different network types. Here, we showed that such relationships can also be established across different network types, such that topological parameters of enzymes within the context of protein interactions have relevance for their functional, metabolic context. In particular, we observed that metabolic flux rates are positively correlated with degree and centrality of enzymes in their PIN (Table 3). We interpret this observation as evidence for a co-evolutionary adaptation of both network types. High-flux enzymes physically interact with many other enzymes such that metabolic substrates and products can be passed on to subsequent enzymes quickly and efficiently.
On the technical side, it has to be borne in mind that our knowledge of protein interactions is certainly incomplete and may contain many false positive interactions [55, 56], and the employed technologies may skew the datasets towards particular interactions. Furthermore, we used sub-cellular localization information to eliminate potential false positive protein associations, and this information, too, is to some degree based on predictions alone and may contain erroneous assignments. However, the fact that we did observe significant correlations between protein interactions and metabolic pathways despite the noise in the data may suggest that the true topological correspondence is even stronger than reported here.
Our results reveal topological equivalences between the protein interaction network and the metabolic pathway network. Evolved protein interactions may contribute significantly towards rendering metabolic processes more efficient by permitting increased metabolic fluxes. Thus, our results shed further light on the unifying principles shaping the evolution of both the functional (metabolic) as well as the physical interaction network.
Because yeast represents a model organism with comprehensive experimental and annotation data available for both protein-protein interactions and metabolic reaction pathways, we focused our investigations on Saccharomyces cerevisiae.
Protein Interaction Networks (PIN)
To study protein interaction networks (PINs) from a global perspective as well as by focusing on enzymatic proteins alone, we generated three different network graphs describing protein-protein interactions. The raw, unfiltered PIN (rPIN) was constructed by extracting physical interactions reported in the protein interaction databases DIP, version 20060402, and BioGRID, version 2.0.21. Based on available gene ontology (GO) annotation information, proteins involved in processes related to protein translation, DNA transcription and associated regulatory processes, such as transcription or translation factors, as well as proteins involved in the assembly of chromatin structures were labeled as 'DNA-related'. Proteins involved in degradation and related regulatory proteins were labeled as 'degradation-related', and protein kinases, phosphatases and related regulators were labeled as 'kinase-phosphatase-related'. Additionally, we defined a set of proteins as 'other non-metabolic proteins'. This set comprised proteins assigned to unspecific functions and processes as judged by their available GO annotation, such as binding to unfolded proteins, protein targeting, protein transport, protein tagging as well as post transcriptional modifications other than phosphorylation (proteins involved in phosphorylation were labeled as 'kinase-phosphatase-related'). Proteins assigned to any of the above sets and their associated physical interactions were removed from the rPIN to generate a second PIN, the filtered protein-protein interaction graph (fPIN). Thus, the fPIN was obtained from the rPIN by removing all interactions of proteins involved in functions other than metabolism, as judged by their GO annotation. The fPIN also included proteins with currently unknown function. A third PIN including only enzymes, as judged by an assigned EC number, was also generated and designated as the ePIN.
Interactions with inconsistent localization annotation according to the available GO Cellular Component annotation, i.e. interactions between proteins located in different sub-cellular compartments, were discarded from the fPIN and ePIN as well. In the case of membrane-embedded proteins, interactions between proteins localized in different but neighboring compartments were retained.
GO annotations for yeast genes were obtained from the SGD database. The evidence codes for the gene ontology were not considered.
The GO-annotation information used to sub-set the protein data is available in the Additional File 4.
Metabolic Interaction Networks (MINs)
Metabolic Interaction Networks (MINs) are represented in this study by Enzyme Interaction Networks (EINs) focusing on metabolic reactions as well as a Compound Interaction Network (CIN) establishing links between metabolites directly.
The metabolic reaction lists from KEGG, YeastCyc, and the set of metabolic reactions obtained from a whole-genome metabolic reconstruction approach, in the following referred to as the Förster-Set, were merged and used to reconstruct the Enzyme Interaction Network (EIN). The corresponding network graph is a representation of EC numbers and associated reactions and their metabolic interactions. Two nodes are considered connected if they share at least one common substrate or product. Ubiquitously occurring molecules, so-called currency metabolites, such as H+, NH3, H2O, CO2, and metal ions as well as coenzymes and co-substrates such as CoA, NADH+, FAD, SAM have been excluded from the analysis. In total, 51 metabolites were excluded (see Additional File 4).
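The connectivity rule described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code used in the study: the reaction table, the short currency list (a stand-in for the 51 excluded metabolites) and the function name `build_ein` are all hypothetical.

```python
from itertools import combinations

# Hypothetical toy reaction list: EC number -> substrates and products.
# Entries are illustrative only and are not taken from the study's data.
REACTIONS = {
    "2.7.1.1": {"glucose", "ATP", "glucose-6-phosphate", "ADP"},
    "5.3.1.9": {"glucose-6-phosphate", "fructose-6-phosphate"},
    "2.7.1.11": {"fructose-6-phosphate", "ATP", "fructose-1,6-bisphosphate", "ADP"},
    "1.1.1.1": {"ethanol", "NAD+", "acetaldehyde", "NADH", "H+"},
}

# Small stand-in for the list of excluded currency metabolites and cofactors.
CURRENCY = {"ATP", "ADP", "NAD+", "NADH", "H+", "H2O", "CO2", "NH3", "CoA"}


def build_ein(reactions, currency):
    """Return undirected edges between EC numbers that share a metabolite.

    Metabolites listed in `currency` are ignored when testing for overlap.
    """
    main = {ec: mets - currency for ec, mets in reactions.items()}
    edges = set()
    for a, b in combinations(sorted(main), 2):
        if main[a] & main[b]:  # at least one shared substrate or product
            edges.add((a, b))
    return edges


EIN_EDGES = build_ein(REACTIONS, CURRENCY)
EIN_EDGES_WITH_CURRENCY = build_ein(REACTIONS, set())
```

In this toy example, keeping the currency metabolites adds an edge between the two kinases solely because both use ATP, which illustrates why such metabolites are excluded before linking enzymes.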
The applied connectivity conditions to generate the EIN may produce links that are theoretically possible, but that have not been experimentally verified yet. To reflect the available biological knowledge, we also generated a metabolic network from curated pathway maps, the mapEIN. For the reconstruction of the mapEIN, the relations between enzymes were extracted directly from the xml-description files of the pathway maps from the KEGG database. Two nodes in this graph are connected if both are associated with at least one common metabolite node in a map.
The EINs reflect relationships between enzymes or reactions, respectively. Alternatively, a metabolic network can be reconstructed considering metabolites themselves as nodes. Such a network, the compound (metabolite) interaction network (CIN), was constructed utilizing the reaction lists from KEGG. Two metabolites are considered connected if they form a main substrate-product pair of a reaction as annotated in KEGG. As for the EINs, currency metabolites, co-enzymes and co-substrates have been discarded from consideration. The YeastCyc and Förster-Set data were not considered for the construction of the CIN as neither source differentiates between main and side substrates or products.
Topological properties of networks
To characterize global as well as local properties of the molecular interaction networks analyzed in this study, we computed several well established graph-theoretic network parameters.
where d(i, j) is the distance (shortest path) between nodes i and j, N is the total number of nodes, E defines the set of considered node pairs and |E| is their total number. Distances between unconnected node pairs were not considered.
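As a minimal sketch of the characteristic path length as described by these definitions (the mean shortest-path distance over all connected node pairs, with unconnected pairs ignored), the following Python code uses a breadth-first search on an unweighted, undirected graph. The adjacency-dictionary representation and the function names are assumptions made for illustration.

```python
from collections import deque


def bfs_distances(adj, source):
    """Shortest-path distances (in edges) from `source` to all reachable nodes."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in adj[node]:
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def characteristic_length(adj):
    """Mean shortest-path distance over all connected node pairs.

    Each unordered pair is visited twice (once from each end), which leaves
    the mean unchanged. Unconnected pairs never appear and are thus ignored.
    """
    total, pairs = 0, 0
    for source in adj:
        for target, d in bfs_distances(adj, source).items():
            if target != source:
                total += d
                pairs += 1
    return total / pairs if pairs else float("nan")
```

For a graph consisting of the path a-b-c and a separate edge d-e, the connected pairs have distances 1, 1, 2 and 1, so the function returns 1.25.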
where k is the degree of nodes, i.e. the number of links associated with a node, N is the total number of nodes, and N_k is the number of nodes of degree k. The directionality of links was not considered. Biological networks were shown to follow scale-freeness according to a power law degree distribution with P(k) ≈ k^(-γ), where γ is the scale-free exponent [26–30], which was estimated by the slope of the linear regression line of degree distributions in log-log diagrams.
where A denotes the adjacency matrix with elements set to 1 in case of an established link between nodes and zero otherwise; k_i is the degree of node i for which c is computed; i, p, and r are indexes of all nodes in the network with k > 1.
where r_d is the assortativity, j and k are the degrees of nodes at the ends of the i-th edge within the set of considered node pairs E, and |E| is their total number, as notated for Eq. 1.
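The estimation of the scale-free exponent from a log-log diagram can be illustrated as follows. This sketch assumes P(k) = N_k / N, an ordinary least-squares fit to the unbinned distribution, and base-10 logarithms; the function names are hypothetical, and the study's exact fitting details (for example binning) are not specified here.

```python
import math
from collections import Counter


def degree_distribution(degrees):
    """Return {k: N_k / N} for all degrees k > 0 observed in `degrees`."""
    n = len(degrees)
    counts = Counter(degrees)
    return {k: counts[k] / n for k in sorted(counts) if k > 0}


def loglog_slope(pk):
    """Least-squares slope of log10 P(k) against log10 k.

    For a power law P(k) ~ k^(-gamma), the slope estimates -gamma.
    """
    xs = [math.log10(k) for k in pk]
    ys = [math.log10(p) for p in pk.values()]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx
```

For a synthetic degree list in which N_k is exactly proportional to k^(-2), for example 64 nodes of degree 1, 16 of degree 2, 4 of degree 4 and 1 of degree 8, the fitted slope is -2.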
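A minimal sketch of a cluster coefficient consistent with these definitions is given below, assuming the common form in which the number of links among the neighbors of node i is divided by the maximum possible number, k_i(k_i - 1)/2, and the mean is taken over nodes with k > 1. The adjacency-dictionary representation and the function names are illustrative assumptions.

```python
def local_clustering(adj, node):
    """Fraction of possible links among the neighbors of `node` that exist.

    Returns None for nodes with fewer than two neighbors, for which the
    coefficient is undefined.
    """
    neighbors = adj[node]
    k = len(neighbors)
    if k < 2:
        return None
    # Count each neighbor pair (p, r) once; node labels must be orderable.
    links = sum(1 for p in neighbors for r in neighbors if p < r and r in adj[p])
    return 2 * links / (k * (k - 1))


def mean_clustering(adj):
    """Mean local cluster coefficient over all nodes with degree k > 1."""
    values = [c for c in (local_clustering(adj, n) for n in adj) if c is not None]
    return sum(values) / len(values) if values else float("nan")
```

For a triangle a-b-c with an additional node d attached to c, the coefficients are 1 for a and b and 1/3 for c, while d is skipped, giving a mean of 7/9.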
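The degree correlation can be sketched in Python under the assumption that r_d follows the edge-based form given by Newman (Assortative mixing in networks), i.e. the Pearson correlation of the degrees found at the two ends of each edge. The edge-list representation and the function name are illustrative.

```python
def assortativity(edges):
    """Degree correlation coefficient of an undirected, simple edge list.

    For each edge, j and k are the degrees of its two end nodes; sums run
    over all edges and are normalized by the number of edges.
    """
    degree = {}
    for u, v in edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1

    m = len(edges)
    mean_product = sum(degree[u] * degree[v] for u, v in edges) / m
    mean_degree = sum(0.5 * (degree[u] + degree[v]) for u, v in edges) / m
    mean_square = sum(0.5 * (degree[u] ** 2 + degree[v] ** 2) for u, v in edges) / m

    denominator = mean_square - mean_degree ** 2
    if denominator == 0:
        return float("nan")  # all edge ends have the same degree
    return (mean_product - mean_degree ** 2) / denominator
```

A star graph, in which one hub is linked to several degree-one nodes, is the extreme one-to-many case and yields r_d = -1; a path of four nodes yields -0.5.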
Correlation of metabolic and protein interaction networks
The PINs were related to the EIN and mapEIN via protein-EC-number relations; i.e. proteins (enzymes) were identified in both network types and, subsequently, their pairwise distance was computed in both network types. EC-number annotations were taken from KEGG, YeastCyc, SGD and ExPASy. The relation of PINs to the CIN followed from indirect protein-EC-number-metabolite mappings according to the annotation information in KEGG. Nodes were considered equivalent in both network types if, for a metabolite (node in the CIN), the corresponding protein (node in the PIN) was identified via its EC-number annotation and its list of main metabolites associated with the reaction catalyzed by the enzyme.
For nodes with representation in both networks, the respective shortest distances were correlated. The distribution of distances within the PINs and MINs was evaluated by consideration of all such node couples, resulting in abundance matrices. The two dimensions of the abundance matrix are the respective distances in the PINs and MINs, and the elements contain the observed counts for the respective distance pairs. Note that proteins may be assigned to more than one EC number and can be represented multiple times in the EINs. Likewise, unique EC numbers may be assigned to multiple proteins. The EC numbers may comprise multiple metabolites as well. All such possible relations between the PIN and MINs were considered.
where n is the number of times a particular distance pair d_PIN and d_MIN was observed (n_observed) or obtained in random networks (n_rand), brackets indicate mean values, and d_PIN > 0 and d_MIN > 0 (see next paragraph).
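The abundance matrix described above can be sketched as a counter over distance pairs. The input format (dictionaries mapping unordered node pairs to shortest-path distances) and the function name are assumptions made for illustration; the mapping of proteins to EC numbers and metabolites is taken as already done.

```python
from collections import Counter


def abundance_matrix(d_pin, d_min):
    """Count how often each (d_PIN, d_MIN) distance pair occurs.

    `d_pin` and `d_min` map unordered node pairs (frozensets) to their
    shortest-path distance in the respective network. Pairs missing from
    either network, or with zero distance, are skipped.
    """
    counts = Counter()
    for pair, dist_pin in d_pin.items():
        dist_min = d_min.get(pair)
        if dist_min is not None and dist_pin > 0 and dist_min > 0:
            counts[(dist_pin, dist_min)] += 1
    return counts
```

A sparse counter is used instead of a dense two-dimensional array so that no maximum distance needs to be fixed in advance; it can be converted to a matrix for plotting once the largest observed distances are known.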
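A z-score of this kind can be sketched as follows, assuming the standard form in which the difference between the observed count and the mean count in random networks is divided by the standard deviation of the random counts. The use of the population standard deviation and the function name are assumptions; the generation of the random networks is not covered here, and their counts are taken as input.

```python
from statistics import mean, pstdev


def z_score(n_observed, n_rand):
    """Z-score of an observed count relative to counts from random networks.

    `n_rand` holds the counts of the same (d_PIN, d_MIN) distance pair in
    each randomized network.
    """
    spread = pstdev(n_rand)
    if spread == 0:
        return float("nan")  # undefined when all random counts are identical
    return (n_observed - mean(n_rand)) / spread
```

A positive value indicates that a distance pair occurs more often in the real networks than expected from the randomized ones, and a negative value indicates that it occurs less often.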
Treatment of multi-enzyme complexes
If subunits belonging to the same multi-enzyme complex carried identical EC numbers, their distance was considered zero and their network relationship was not analyzed further in the correlation analysis as the minimum distance included in the analyses is one. If they carried different EC numbers, their distances were computed as for any other enzyme pair given the available data.
The centrality of nodes
Notation as for Eq. 1.
Correlation of PINs and metabolic flux rates
For correlating PINs and metabolic flux rates, we used flux rate data from a large-scale 13C-flux analysis from Blank and colleagues. In this approach, flux rates of enzymes of the global metabolic network of yeast strain iFF708 were estimated by flux balance analysis. In particular, we used data of flux rates measured in yeast growing in a glucose-containing medium, resulting in flux data for 747 unique reactions catalyzed by 672 enzymes. The enzymes were divided into groups according to their flux rates relative to a glucose uptake set to 100 (arbitrary flux units): greater than 50, between 10 and 50, 0.1 to 1, 1.0E-4 to 0.1, and 0 to 1.0E-4. While the flux rates were divided according to a logarithmic scale, the range 1.0E-4 to 0.1 was chosen to yield similar numbers of enzymes in all bins.
Metabolic pathways and associated proteins used in this study were taken from the SGD database. For the fatty acid synthesis pathway, malic enzyme was assumed as a source of NADPH and malate dehydrogenase as a source of acetyl-CoA, and both were added to the pathway.
The authors wish to thank Alisdair Fernie and Zoran Nikoloski for helpful discussions.
1. Srere PA: Complexes of sequential metabolic enzymes. Annu Rev Biochem. 1987, 56: 89-124.
2. Mathews CK: The Cell Bag of Enzymes or Network of Channels. J Bacteriol. 1993, 175 (20): 6377-6381.
3. Spivey HO, Ovadi J: Substrate channeling. Methods. 1999, 19 (2): 306-321.
4. Ovadi J, Srere PA: Macromolecular compartmentation and channeling. Int Rev Cytol. 2000, 192: 255-280.
5. Srere PA: Macromolecular interactions: tracing the roots. Trends Biochem Sci. 2000, 25 (3): 150-153.
6. Giege P, Heazlewood JL, Roessner-Tunali U, Millar AH, Fernie AR, Leaver CJ, Sweetlove LJ: Enzymes of glycolysis are functionally associated with the mitochondrion in Arabidopsis cells. Plant Cell. 2003, 15 (9): 2140-2151.
7. Easterby JS: A generalized theory of the transition time for sequential enzyme reactions. Biochem J. 1981, 199 (1): 155-161.
8. Westerhoff HV, Welch GR: Enzyme organization and the direction of metabolic flow: physicochemical considerations. Curr Top Cell Regul. 1992, 33: 361-390.
9. Rudolph J, Stubbe J: Investigation of the mechanism of phosphoribosylamine transfer from glutamine phosphoribosylpyrophosphate amidotransferase to glycinamide ribonucleotide synthetase. Biochemistry. 1995, 34 (7): 2241-2250.
10. Ushiroyama T, Fukushima T, Styre JD, Spivey HO: Substrate channeling of NADH in mitochondrial redox processes. Curr Top Cell Regul. 1992, 33: 291-307.
11. Ovadi J, Huang Y, Spivey HO: Binding of malate dehydrogenase and NADH channelling to complex I. J Mol Recognit. 1994, 7 (4): 265-272.
12. Dewar MJ, Storch DM: Alternative view of enzyme reactions. Proc Natl Acad Sci USA. 1985, 82 (8): 2225-2229.
13. Wakil SJ, Stoops JK, Joshi VC: Fatty acid synthesis and its regulation. Annu Rev Biochem. 1983, 52: 537-579.
14. Batke J: Channeling of glycolytic intermediates by temporary, stationary bi-enzyme complexes is probable in vivo. Trends Biochem Sci. 1989, 14 (12): 481-482.
15. Keleti T, Ovadi J: Control of metabolism by dynamic macromolecular interactions. Curr Top Cell Regul. 1988, 29: 1-33.
16. Ovadi J, Keleti T: Kinetic evidence for interaction between aldolase and D-glyceraldehyde-3-phosphate dehydrogenase. Eur J Biochem. 1978, 85 (1): 157-161.
17. Vertessy B, Ovadi J: A simple approach to detect active-site-directed enzyme-enzyme interactions. The aldolase/glycerol-phosphate-dehydrogenase enzyme system. Eur J Biochem. 1987, 164 (3): 655-659.
18. Cornish-Bowden A, Cardenas ML: Channelling can affect concentrations of metabolic intermediates at constant net flux: artefact or reality?. Eur J Biochem. 1993, 213 (1): 87-92.
19. Pettersson G: No convincing evidence is available for metabolite channelling between enzymes forming dynamic complexes. J Theor Biol. 1991, 152 (1): 65-69.
20. Wu XM, Gutfreund H, Lakatos S, Chock PB: Substrate channeling in glycolysis: a phantom phenomenon. Proc Natl Acad Sci USA. 1991, 88 (2): 497-501.
21. Ro DK, Douglas CJ: Reconstitution of the entry point of plant phenylpropanoid metabolism in yeast (Saccharomyces cerevisiae): implications for control of metabolic flux into the phenylpropanoid pathway. J Biol Chem. 2004, 279 (4): 2600-2607.
22. Degenring D, Rohl M, Uhrmacher AM: Discrete event, multi-level simulation of metabolite channeling. Biosystems. 2004, 75 (1–3): 29-41.
23. Kholodenko BN, Westerhoff HV, Schwaber J, Cascante M: Engineering a living cell to desired metabolite concentrations and fluxes: pathways with multifunctional enzymes. Metab Eng. 2000, 2 (1): 1-13.
24. Huthmacher C, Gille C, Holzhutter HG: Computational analysis of protein-protein interactions in metabolic networks of Escherichia coli and yeast. Genome Inform. 2007, 18: 162-172.
25. Huthmacher C, Gille C, Holzhutter HG: A computational analysis of protein interactions in metabolic networks reveals novel enzyme pairs potentially involved in metabolic channeling. J Theor Biol. 2008, 252 (3): 456-464.
26. Barabasi AL, Albert R: Emergence of scaling in random networks. Science. 1999, 286 (5439): 509-512.
27. Albert R, Barabási A-L: Statistical mechanics of complex networks. Reviews of Modern Physics. 2002, 74 (1): 47-
28. Albert R: Scale-free networks in cell biology. J Cell Sci. 2005, 118 (Pt 21): 4947-4957.
29. Jeong H, Tombor B, Albert R, Oltvai ZN, Barabasi AL: The large-scale organization of metabolic networks. Nature. 2000, 407 (6804): 651-654.
30. Wagner A, Fell DA: The small world inside large metabolic networks. Proc Biol Sci. 2001, 268 (1478): 1803-1810.
31. Almaas E: Biological impacts and context of network theory. J Exp Biol. 2007, 210 (Pt 9): 1548-1558.
32. Ge H, Liu Z, Church GM, Vidal M: Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae. Nat Genet. 2001, 29 (4): 482-486.
33. Kemmeren P, van Berkum NL, Vilo J, Bijma T, Donders R, Brazma A, Holstege FC: Protein interaction verification and functional annotation by integrated analysis of genome-scale data. Mol Cell. 2002, 9 (5): 1133-1143.
34. Deane CM, Salwinski L, Xenarios I, Eisenberg D: Protein interactions: two methods for assessment of the reliability of high throughput observations. Mol Cell Proteomics. 2002, 1 (5): 349-356.
35. Goldberg DS, Roth FP: Assessing experimentally derived interactions in a small world. Proc Natl Acad Sci USA. 2003, 100 (8): 4372-4376.
36. Kelley R, Ideker T: Systematic interpretation of genetic interactions using protein networks. Nat Biotechnol. 2005, 23 (5): 561-566.
37. Rhodes DR, Tomlins SA, Varambally S, Mahavisno V, Barrette T, Kalyana-Sundaram S, Ghosh D, Pandey A, Chinnaiyan AM: Probabilistic model of the human protein-protein interaction network. Nat Biotechnol. 2005, 23 (8): 951-959.
38. Ramani AK, Li Z, Hart GT, Carlson MW, Boutz DR, Marcotte EM: A map of human protein interactions derived from co-expression of human mRNAs and their orthologs. Mol Syst Biol. 2008, 4: 180-
39. Lee I, Date SV, Adai AT, Marcotte EM: A probabilistic functional network of yeast genes. Science. 2004, 306 (5701): 1555-1558.
40. Macdonald P, Almaas E, Barabasi A: Minimum spanning trees on weighted scale-free networks. Europhys Lett. 2005, 72: 308-314.
41. Giot L, Bader JS, Brouwer C, Chaudhuri A, Kuang B, Li Y, Hao YL, Ooi CE, Godwin B, Vitols E: A protein interaction map of Drosophila melanogaster. Science. 2003, 302 (5651): 1727-1736.
42. Amaral LA, Scala A, Barthelemy M, Stanley HE: Classes of small-world networks. Proc Natl Acad Sci USA. 2000, 97 (21): 11149-11152.
43. Stumpf M, Ingram P, Nouvel I, Wiuf C: Statistical Model Selection Methods Applied to Biological Networks. Transactions on Computational Systems Biology III. 2005, 65-77.
44. Stumpf MP, Wiuf C, May RM: Subnets of scale-free networks are not scale-free: sampling properties of networks. Proc Natl Acad Sci USA. 2005, 102 (12): 4221-4224.
45. Arita M: The metabolic world of Escherichia coli is not small. Proc Natl Acad Sci USA. 2004, 101 (6): 1543-1547.
46. Forster J, Famili I, Fu P, Palsson BO, Nielsen J: Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. Genome Res. 2003, 13 (2): 244-253.
47. Blank LM, Kuepfer L, Sauer U: Large-scale 13C-flux analysis reveals mechanistic principles of metabolic network robustness to null mutations in yeast. Genome Biol. 2005, 6 (6): R49-
48. Krogan NJ, Peng WT, Cagney G, Robinson MD, Haw R, Zhong G, Guo X, Zhang X, Canadien V, Richards DP: High-definition macromolecular composition of yeast RNA-processing complexes. Mol Cell. 2004, 13 (2): 225-239.
49. Gavin AC, Bosche M, Krause R, Grandi P, Marzioch M, Bauer A, Schultz J, Rick JM, Michon AM, Cruciat CM: Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature. 2002, 415 (6868): 141-147.
50. Newman ME: The structure and function of complex networks. SIAM REVIEW. 2003, 45: 167-256.
51. Chung F, Lu L, Dewey TG, Galas DJ: Duplication models for biological networks. J Comput Biol. 2003, 10 (5): 677-687.
52. Kotera M, Hattori M, Oh M, Yamamoto R, Komeno T, Yabuzaki J, Tonomura K, Goto S, Kanehisa M: RPAIR: a reactant-pair database representing chemical changes in enzymatic reactions. Genome Inform. 2004, 15: P062-
53. Kotera M, Okuno Y, Hattori M, Goto S, Kanehisa M: Computational assignment of the EC numbers for genomic-scale analysis of enzymatic reactions. J Am Chem Soc. 2004, 126 (50): 16487-16498.
54. Bhan A, Galas DJ, Dewey TG: A duplication growth model of gene expression networks. Bioinformatics. 2002, 18 (11): 1486-1493.
55. Han JD, Dupuy D, Bertin N, Cusick ME, Vidal M: Effect of sampling on topology predictions of protein-protein interaction networks. Nat Biotechnol. 2005, 23 (7): 839-844.
56. Sprinzak E, Sattath S, Margalit H: How Reliable are Experimental Protein-Protein Interaction Data?. Journal of Molecular Biology. 2003, 327 (5): 919-923.
57. von Mering C, Krause R, Snel B, Cornell M, Oliver SG, Fields S, Bork P: Comparative assessment of large-scale data sets of protein-protein interactions. Nature. 2002, 417 (6887): 399-403.
58. Xenarios I, Rice DW, Salwinski L, Baron MK, Marcotte EM, Eisenberg D: DIP: the database of interacting proteins. Nucleic Acids Res. 2000, 28 (1): 289-291.
59. Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006, 34 (Database issue): D535-539.
60. Cherry JM, Adler C, Ball C, Chervitz SA, Dwight SS, Hester ET, Jia Y, Juvik G, Roe T, Schroeder M: SGD: Saccharomyces Genome Database. Nucleic Acids Res. 1998, 26 (1): 73-79.
61. Kanehisa M, Goto S, Kawashima S, Nakaya A: The KEGG databases at GenomeNet. Nucleic Acids Res. 2002, 30 (1): 42-46.
62. Caspi R, Foerster H, Fulcher CA, Hopkinson R, Ingraham J, Kaipa P, Krummenacker M, Paley S, Pick J, Rhee SY: MetaCyc: a multiorganism database of metabolic pathways and enzymes. Nucleic Acids Res. 2006, 34 (Database issue): D511-516.
63. Watts DJ, Strogatz SH: Collective dynamics of 'small-world' networks. Nature. 1998, 393 (6684): 440-442.
64. Pastor-Satorras R, Vazquez A, Vespignani A: Dynamical and correlation properties of the internet. Phys Rev Lett. 2001, 87 (25): 258701-
65. Newman ME: Assortative mixing in networks. Phys Rev Lett. 2002, 89 (20): 208701-
66. Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, Bairoch A: ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003, 31 (13): 3784-3788.
67. Newman ME: Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Phys Rev E Stat Nonlin Soft Matter Phys. 2001, 64 (1 Pt 2): 016132-
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.