The human metabolic reconstruction Recon 1 directs hypotheses of novel human metabolic functions
© Rolfsson et al; licensee BioMed Central Ltd. 2011
Received: 29 March 2011
Accepted: 1 October 2011
Published: 1 October 2011
Metabolic network reconstructions formalize our knowledge of metabolism. Gaps in these networks pinpoint regions of metabolism where biological components and functions are "missing." At the same time, a major challenge in the post genomic era involves characterisation of missing biological components to complete genome annotation.
We used the human metabolic network reconstruction RECON 1 and established constraint-based modelling tools to uncover novel functions associated with human metabolism. Flux variability analysis identified 175 gaps in RECON 1 in the form of blocked reactions. These gaps were unevenly distributed within metabolic pathways but primarily found in the cytosol and often caused by compounds whose metabolic fate, rather than production, is unknown. Using a published algorithm, we computed gap-filling solutions comprised of non-organism specific metabolic reactions capable of bridging the identified gaps. These candidate solutions were found to be dependent upon the reaction environment of the blocked reaction. Importantly, we showed that automatically generated solutions could produce biologically realistic hypotheses of novel human metabolic reactions such as of the fate of iduronic acid following glycan degradation and of N-acetylglutamate in amino acid metabolism.
The results demonstrate how metabolic models can be utilised to direct hypotheses of novel metabolic functions in human metabolism; a process that we find is heavily reliant upon manual curation and biochemical insight. The effectiveness of a systems approach for novel biochemical pathway discovery in mammals is demonstrated and steps required to tailor future gap filling algorithms to mammalian metabolic networks are proposed.
An in silico model of a genome-scale metabolic network reconstruction is based upon a biochemically, genetically and genomically (BiGG) structured knowledge base [1, 2]. It is subject to research that, in many cases, entails predicting an organism's phenotypic response to gene deletions and/or environmental perturbations in silico. These properties have resulted in widespread applications of metabolic models in microbial bioengineering, contextualisation of high-throughput data, and biochemical pathway discovery [2, 3]. While the number of microbial genome-scale metabolic networks has increased exponentially over the past 10 years , fewer have been reconstructed for higher eukaryotes as their inherent complexity results in larger and more complex models which are harder to experimentally validate [5, 6]. To date, only mouse [7–9], bovine , and human [11–13] genome-scale metabolic networks have been reconstructed. These latter ones have successfully been applied to systems driven eukaryotic metabolic research. For example, the human genome-scale metabolic network RECON 1 has been used to reveal transcriptional regulatory signatures of type 2 diabetes , to create tissue-specific models , to predict drug-off target effects , and to simulate cell specific metabolic changes upon pathogen infection . Their potential to discover novel metabolic functions has however not been demonstrated.
Multiple computational algorithms have been devised to address gaps in metabolic network models in an automated manner . These methods utilise metabolic network analysis, such as flux variability analysis (FVA) , in order to identify network gaps, alongside comparative genomics/metabolomics to suggest candidate metabolic reactions capable of restoring flux through the blocked reaction and/or dead-end metabolite. Experimental data, such as organism growth profiles [28, 29] or metabolic flux data  can be integrated with metabolic models in order to highlight model gaps, although this is not always a requirement . One such algorithm is the SMILEY algorithm, which has been successfully used to uncover novel metabolic functions required to explain discrepancies between the observed and model predicted growth phenotypes of E. coli on various substrates .
Because of the difference between humans and microorganisms in terms of the number of related organisms for which biological, biochemical, and genetic information exists, it was not known whether an automated approach to gap-filling of RECON 1 could yield biologically plausible hypotheses. Here we investigated the potential of RECON 1 for discovery of novel reactions involved in human metabolism. We used FVA to identify dead-end metabolites and blocked reactions in RECON 1 and SMILEY to propose reactions capable of restoring flux through the identified dead-end metabolites. We then characterised the metabolic pathway distribution of identified gaps and their solution types in order to get an idea of how gaps are distributed within RECON 1 and in what manner these gaps are resolved by SMILEY. Finally we validated the automatically generated metabolic reaction hypotheses by manual literature review in order to assess the biological relevance of the proposed solutions with the goal of identifying suitable experimental targets.
In this study we first identified blocked reactions and dead-end metabolites occurring in RECON 1 using FVA . We then employed SMILEY  to compute reactions that could be added to the RECON 1 (S) from universal reaction databases (U, X) to enable flux through a blocked reaction (Figure 1b). The matrix U was compiled from an extensive list of known metabolic reactions obtained from the KEGG database  while the matrix X contained transport reactions in and out of the system for every metabolite contained within S and U. SMILEY therefore proposed what reactions, irrespective of organism, needed to be added to RECON 1 in order to fill a network gap. If none were identified, SMILEY suggested transport of the dead-end metabolite into or out of the cell. Up to twenty solutions were suggested for each knowledge gap and each solution could be composed of multiple resolving reactions. Note that the number of solutions returned by SMILEY is user defined. Inspired by Satish Kumar et al.  we split the computed solutions into three categories based upon whether the complete solution involved a reversal of the directionality of the blocked reaction, addition of novel reaction(s), or addition of a transport reaction, defined as category I-III type solutions respectively (Figure 1c). The next four sections deal with the characterisation and metabolic pathway distribution of the gaps that we addressed in RECON 1 while the remainder of the results chapter reports analysis of SMILEY solutions and specific case studies.
Gap analysis basis
Sub-cellular distribution of gaps
The sub-cellular distribution of the dead-end metabolites and blocked reactions showed that the majority were found in the cytosol with the remainder distributed within the various cellular compartments, most notably in the lysosome, mitochondria, and peroxisome (Figure 2b). This observation agrees with the distribution of dead-end metabolites found in the eukaryotic metabolic reconstruction iND750 for Saccharomyces cerevisiae [20, 32]. Because some of the dead-end metabolites were responsible for multiple blocked reactions, which themselves could be associated with both types of dead-end metabolite, there was not a direct relationship between the number of dead-end metabolites and the number of blocked reactions within cellular compartments. In addition, blocked reactions can take part in metabolic subsystems/reaction cascades that span more than one cellular compartment. For example, in the mitochondria we found 24 blocked reactions participating in six distinct reaction cascades and ten dead-end metabolites. However, only seven of the dead-ends were responsible for the 24 blocked reactions while the remaining three metabolites caused blocked reactions in the cytosol. Gap distributions can be better understood by looking at the distribution of blocked reaction cascades caused by the dead-end metabolites. These data suggest that the number of knowledge gaps in cellular compartments other than the cytosol is around two to five knowledge gaps per compartment (Figure 2).
Metabolic pathway distribution of gaps
The metabolic pathway distribution of the blocked reactions is shown in Figure 2C and reflects their sub-cellular distribution. All of the metabolic pathways shown had blocked reactions in the cytosol. Most of the blocked reactions were observed in amino acid metabolism with the remainder distributed within the metabolic pathways shown. Of those involved in amino acid metabolism, nearly half were part of multiple reaction cascades in tryptophan metabolism (Additional file 1). These reactions were blocked due to root no-consumption heterocyclic derivatives of tryptophan, such as anthranilate, kynurenic acid, 5-methoxy-indole acetate, and more, suggesting missing information on the metabolic fate of these compounds. Apart from knowledge gaps in tryptophan metabolism, blocked reactions were also identified in metabolic subsystems associated with eleven of the twenty most common amino acids. These gaps were due to single blocked reactions or reaction cascades implying that current information concerning the metabolism of these amino acids is fairly complete.
Multiple blocked reactions were also observed in glycan biosynthesis, the metabolism of cofactors/vitamins and lipid metabolism. As opposed to those observed in amino acid metabolism, these blocked reactions were involved in relatively few reaction cascades. Many of the blocked reactions in glycan metabolism, for example, were part of two reaction cascades involved in the degradation of heparan sulfate and dermatan sulfate in the lysosome, respectively. The root no-consumption metabolite glucose-1, 3-mannose and derivatives thereof were also the cause of several blocked reactions in glycan biosynthesis. Many blocked reactions involved in the metabolism of cofactors/vitamins took place in the mitochondria due to just two root no-consumption metabolites causing gaps in the biosynthesis of ubiquinone and vitamin D. Similarly, multiple reactions in lipid metabolism were blocked in glycerophospholipid biosynthesis in the cytosol due to the root no-consumption metabolite plasmalogen in the cytosol. In total, 32% of the knowledge gaps investigated were associated with two or more reactions. Finding a solution to the dead-end metabolites in these reaction cascades could therefore result in the connection of multiple blocked reactions back into the metabolic network. The majority of blocked reactions were not, however, part of reaction cascades (Additional file 2).
Knowledge status of reactions causing gaps
We investigated whether the gaps addressed in this study were correlated with a lack of information available for each blocked reaction by assessing their confidence scores. Each reaction in RECON 1 has an assigned confidence score that allows the experimental evidence underlying the reaction to be quickly assessed . Figure 2D shows the distribution of confidence scores of the reactions investigated. Roughly two thirds of the blocked reactions were supported by biochemical or physiological evidence, which is similar to what is observed for all reactions contained in RECON 1 . This distribution implies that the addressed knowledge gaps are not simply due to ill-defined metabolic reactions. Rather, the fate of the participating metabolites within the metabolic network or how they contribute to human metabolism is not known. In order to suggest plausible hypotheses of how this might take place, we investigated whether the blocked reactions could be circumvented or connected back into the RECON 1 using reactions found in the KEGG database  in an automated manner by running the SMILEY algorithm .
Solutions to a blocked reaction are dependent upon the robustness of its metabolic network
SMILEY solutions involve few resolving reactions
Inspection of the SMILEY output made it clear that assessing each of the 1335 SMILEY solutions represented a time consuming task. Our focus was to assess whether SMILEY could generate biologically relevant hypotheses of missing reactions in human metabolism rather than produce a detailed list of missing enzyme functionalities. We therefore decided to focus on the SMILEY solutions containing the least number of resolving reactions for each of the 175 blocked reactions hereafter referred to as S1 solutions.
Mindful that we might overlook plausible solutions for blocked reactions having multiple SMILEY solutions by focusing entirely on S1 solutions, we investigated the categories of the alternative solutions. We found that, when available, alternative solutions were either of the same solution category as the S1 solution or were category III transport solutions (Additional file 4). This result indicated that the S1 solutions directly reflect the solution categories available for each blocked reaction although the resolving reactions comprising the solutions were different.
Blocked reactions have different S1 solutions depending on metabolic origin
Roughly one third of all the blocked reactions were resolved with category I solutions (Figure 4). Nearly half of these applied to blocked reactions involved in amino acid metabolism (Figure 4B), many of which were part of tryptophan and lysine metabolism in the cytosol. With respect to their metabolic origin, up to 70% of the total number of blocked reactions originating from a particular metabolic pathway had category I solutions (Additional file 5). Notably, none of the blocked reactions involved in glycan biosynthesis and metabolism had category I solutions. Instead, these reactions were primarily resolved by category III transport solutions. These applied to 90% of the blocked reactions involved in glycan biosynthesis and metabolism, more specifically to those in chondroitin sulphate and heparan sulphate degradation in the lysosome and N-glycan biosynthesis in the golgi apparatus. Apart from glycan metabolism, category III solutions applied to multiple blocked reactions involved in the metabolism of cofactors and vitamins and in lipid metabolism. Few blocked reactions within amino acid metabolism and the biosynthesis of secondary metabolites had category III solutions. Category II solutions were the least common solution. With respect to their metabolic origin, 10-50% of the blocked reactions had this solution type (Additional file 5). Many of these applied to blocked reactions taking part in various metabolic subsystems of amino acid metabolism such as tyrosine metabolism and urea cycle metabolism. Multiple blocked reactions in carbohydrate metabolism and fatty acid metabolism also had category II solutions.
Validation of SMILEY solutions
The investigation of large complex systems on a global scale makes it impractical and maybe even impossible to know details about all involved metabolites, genes, and proteins. At the same time, a high level of knowledge about metabolic subsystems and or enzyme activities is necessary in order to come up with hypotheses of particular metabolic fates and novel reactions. In this study we used a systems biology approach to characterise and fill gaps in human metabolism. The key results include i) many dead-end metabolites affect reaction cascades, ii) computationally predicted solutions require thorough manual curation and biochemical insight, and iii) four biological plausible hypotheses were identified. This work highlights that finding gene candidates for metabolic functions in the human genome is not a trivial issue and the extensive manual effort to curate the computational predictions of candidate reactions highlight the overall quality and quantity of data included in Recon 1.
We characterised knowledge gaps of human metabolism, represented by blocked reactions and dead-end metabolites identified in RECON 1 , to which solutions, in the form of non-organism specific metabolic reactions, could be found using the computer algorithm SMILEY  (Figure 1). We identified 175 blocked reactions, 70% of which had high confidence scores, and observed that while they were unevenly distributed within human metabolic pathways, most were found in the cytosol (Figure 2). Furthermore, we found that they arose due to different dead-end metabolite types that were in some cases responsible for up to 14 blocked reactions. These properties are likely to affect how trivial it will be to address these knowledge gaps experimentally and suggest that the impact of resolving these gaps, both in terms of novel metabolic discovery and their influence on RECON 1, will be different. For example, determining the fate of a metabolite, which results in multiple blocked reactions, will have a different impact on the biological accuracy of RECON 1 than resolving a single blocked reaction. Nevertheless, a single blocked reaction could be of great interest as a candidate target for novel metabolic discovery as its components could represent a drug target and resolving the gap could have unforeseen effects on network robustness, i.e., human metabolism. Also, assaying cytosolic reactions will be more straightforward than determining the function of compartmentalised reactions. Subsequently, which knowledge gaps are chosen for experimental research is ultimately a human decision depending on research goals, biological novelty factors, ease of experimental validation, and underlying evidence of the knowledge gap's validity.
We highlighted four examples of missing knowledge in human metabolism (Figures 5, 6, 7 and 8) that resulted in biologically plausible hypotheses using a combined algorithmic and manual approach. The hypotheses were strengthened with published experimental data. In the case of iduronic acid (Figure 6), a major constituent of glycosamine glycans, we argued for a hypothesis that an extracellular transport reaction needs to be added to RECON 1. Although no direct evidence for such transport could be identified in the human genome, the existence of iduronic acid in human urine (M. Fuller, personal communication) suggests that a transporter may be a biologically plausible solution. Further evidence is that the build-up of iduronic acid in the lysosome has been linked to lysosomal storage disorder caused by defects in the sialic acid lysosomal transporter . Similarly, defects in α-L-iduronidase (22.214.171.124), the exo-glycohydrolase that cleaves iduronic acid off the non-reducing end of dermatan sulfate and heparan sulfate are known to cause a different type of lysosomal storage disorder, called mucopolysacharideosis I . Despite its apparent involvement in disease, the metabolic fate of iduronic acid is unknown. The present work highlights knowledge gaps in human metabolic processes, such as the fate iduronic acid, which in the context of investigating lysosomal storage disorders due to protein deficiencies, have not been relevant but are now required to generate a complete picture of human metabolism. Our gap filling examples showed that algorithms, such as SMILEY, can be used to direct hypotheses of novel functions in human metabolism. Nevertheless, a semi-automated approach was required to assist with the identification of plausible gap filling candidates for experimental verification.
Multiple gap finding and gap filling algorithms exist, including GapFind/GapFill  and GrowMatch , and the use of alternative algorithms will undoubtedly increase the number of possible hypotheses as they employ different heuristics and data sources (e.g., universal databases). This work does not provide a comprehensive list of possible gap-filling reaction solutions but rather assesses the use of (semi)-automated computational approaches for identifying and completing missing functions in human metabolism on a large-scale. We found that computational tools, such as SMILEY, do not necessarily suggest biologically plausible gap filling hypotheses. The generated hypotheses need to be evaluated in a manual, time-consuming manner, similar to the gap filing process employed during the reconstruction approach [4, 26]. The search for novel functions is therefore only semi-automated. Automated algorithms could however be trained, based on manual effort, to prioritize or exclude certain types of solutions. In addition, approaches could be developed that incorporate methods to build hypotheses of genes associated with orphan reactions [56–60], which SMILEY does not directly do.
Identification of genes associated with biological plausible hypotheses as suggested by SMILEY was a major challenge. Relatively few knowledge gaps were resolved using known metabolic functions (Figure 3), and the solutions required detailed literature review such that homology, of what were often prokaryotic genes/proteins to possible human counterparts, could be assessed. In light of our results, we believe that existing automatic gap filling approaches for uncovering gene function will be of limited use for mammals. This limitation arises from a lack of phylogenetic information, which is extensively explored for annotating microbial genomes [58, 61]. Although various homology databases exist for mammalian genomes covering up to seventy mammalian species , the majority of phenotypic, genetic, and biochemical studies have been performed using mice, and to a lesser extent, human cells. Information derived from these databases therefore originates from few organisms making them less useful for annotation purposes. Furthermore, co-expression analysis is used in microbes to determine genes with related function [63–65] and could serve as a strategy for gene finding in the human genome. However, analysis of regions of correlated transcription (RCT) in human and mouse identified both related and unrelated genes being co-expressed . The majorities of RCT were not found in both human and mouse, which the authors explained with i) missing definition of homology and/or synteny, ii) no conserved pattern, and/or iii) physiological differences between human and mice. This example highlights the challenges associated with finding novel gene functions in the human genome using established methods from the bacterial world. Novel approaches may include the use of protein-protein interaction data [67–69], tissue-specific information [70, 71] and disease information  combined with gap filling algorithms. In particular, the latter work  observed a high degree of correlation between known co-occurring (co-morbid) diseases in patients and flux-coupling of the reactions that are perturbed in association with each of the disease states. Flux coupled reaction sets , or perfectly coupled reaction sets (Co-sets) , have been calculated in genome-scale metabolic models. Co-sets are often along linear pathways . Thus, a low co-morbidity of two metabolically linked diseases would indicate a missing link along a Co-set, which would break the flux coupling by creating a pathway split. Similarly, single nucleotide polymorphisms have been mapped onto metabolic networks  and may be used for identifying missing functions in human metabolism.
Ubiquitous unknowns e.g. genes with unknown function and orphan enzymes belonging to orthologous families, have been identified as top targets for functional elucidation in terms of biological knowledge payoff as these are ancient in origin and therefore likely to be involved in essential metabolic processes [22, 25, 76]. We believe that combining a metabolic network approach with knowledge of ubiquitous unknowns could also represent an ideal method for organism specific novel function identification.
The results presented here show that RECON 1 allows the identification of specific metabolic pathways and reactions for which knowledge is lacking; thereby focusing the search for unknown metabolic functions by putting them into context with previously gathered metabolic information. Following identification and characterisation of RECON 1 network gaps, we showed that gap solution hypotheses can be generated automatically and successfully but require detailed, time consuming manual investigation in order to validate their biological plausibility. In this manner we have derived multiple hypotheses, which we intend to use to direct our knowledge-driven approach towards novel metabolic discoveries.
Pre-processing to gap identification
RECON 1 was obtained from Duarte et al. . It contains 356 dead-end metabolites causing knowledge gaps in eight cellular compartments. The dead-end metabolites in the compartmentalised model are not unique, as the same metabolite can cause blocked reactions in two or more cellular compartments. In order to address this issue, RECON 1 was decompartmentalised, by placing all intracellular compartment reactions in the cytosol and removing duplicates. Extra-organism located reactions were kept. Therefore, the decompartmentalised network accounts for these two compartments. Subsequently, the number of dead-end metabolites was reduced to 145, as they were unique. Note that this modification affected the following gap analysis in that compartment specific gaps were not considered for gap filling; e.g., a reaction present in mitochondria but missing in cytosol would not result in a gap in the decompartmentalised network. All subsequent analysis was performed using the decompartmentalised version of RECON 1 (Recon_1_decomp).
As a next step, all blocked reactions present in Recon_1_decomp were identified using flux variability analysis (FVA)  as reactions unable to carry flux defined by |Vmax, i | ≤ 10-5 mmol/gdw/hr and |Vmin, i |≤ 10-5 mmol/gdw/hr for all i reactions in the network. All exchange reactions were unconstrained permitting free uptake and secretion of respective metabolites. A total of 285 reactions were identified.
The SMILEY algorithm has been described previously by Reed et al.  and implemented in the COBRA toolbox v2.0 (Schellenberger et al, submitted). We downloaded the KEGG  Ligand database (as of 1.10.2009), deemed U. Furthermore, we constructed a transport matrix, X, by defining a transport reaction from cytosol to extra-organism and an exchange reaction for each metabolite occurring Recon_1_decomp (S) and U (Figure 1). Matrix U and X served as reaction source for the SMILEY algorithm. Prior to the calculation, metabolites from S were matched to U. Note that not all metabolites in S have a known KEGG ID and that subsequent solutions are sensitive to missing KEGG IDs, meaning that some possible resolving solution may have been missed in our simulation due to this shortcoming. Current work has focused on adding more metabolite identifiers to RECON 1 (Thiele et al, in preparation).
In the next step, each blocked reaction vb, i was chosen as objective function, requiring the SMILEY algorithm to find reactions in U and/or X to be added to S such that |vb, i| could carry a flux greater than 10-5 mmol/gdw/hr. SMILEY is designed to find the shortest possible solution consistent with this requirement . For each blocked reaction, we computed the 20 shortest SMILEY solutions. Note that the number of solutions computed is user defined and in some cases 20 distinct solutions may not exist. For 175 out of 285 blocked reactions at least one SMILEY solution could be found.
Gap analysis and validation of SMILEY output
SMILEY generated a total of 1335 solutions for the 175 blocked reactions. We filtered the solutions such that: i) only category II solutions with the least number of resolving reactions were investigated, and ii) if no category II solution was found, the solution involving the least number of resolving reactions was investigated. In cases where a blocked reaction had multiple solutions with the same number of resolving reactions, a random solution was picked. These criteria generated the S1 SMILEY solution output that is reported in the results.
Biochemical and genetic information concerning each blocked reaction identified by FVA was obtained from the Bigg database http://bigg.ucsd.edu/. The reaction specific information allowed blocked reactions to be grouped depending on their metabolic pathway, subsystem, and/or other reaction specific features described within the Bigg database. Similarly, biochemical and genetic information concerning resolving reactions proposed by SMILEY was obtained from the KEGG database http://www.genome.jp/kegg/, which allowed organisation of the resolving reaction output. For resolving reactions where no human gene or protein information was directly available from Bigg or KEGG, blast homology searches of genes encoding the resolving reaction activity were performed against the human sequence databases on the NCBI website http://blast.ncbi.nlm.nih.gov/Blast.cgi and using the STRING database . The localisation of dead-end metabolites in human biofluids was obtained from the human metabolome project http://www.hmdb.ca/. Resolving reactions were investigated individually by literature review in order to verify the biological relevance of the proposed SMILEY solutions and generate plausible hypotheses for gap filling of RECON 1. Reaction directionalities were compared with experimentally reported reaction directionalities in the Brenda database  and by quantitative assignment of reaction directionality using the von Bertalanffy 1.0 algorithm  an extension available freely as part of the openCOBRA project . The details of these calculations will be published in a separate manuscript. Briefly, experimentally determined or computed standard metabolite Gibbs energy transformed to cellular compartmental conditions with respect to in vivo pH (pH = 5.5-8.0), temperature (37°C), ionic strength (0.25 M) and electrical potential (-150 - 30 mV) was used to predict the upper and lower bounds on standard transformed reaction Gibbs energy. The upper and lower bounds are dependent upon in vivo metabolite concentration ranges which were set to 10-7 - 10-2 M. When the transformed reaction Gibbs energy range spans zero the reaction is predicted to be quantitatively reversible. Spontaneous reactions proposed by SMILEY were assessed in a similar manner as enzyme catalysed reactions. All gap filling hypotheses can be found in Additional files 6 and 7. Reaction and sub-cellular compartment abbreviations are as described in the BiGG database.
We thank Jeffrey D. Orth for critical reading of the manuscript, Dr Maria Fuller for her discussion concerning iduronic acid and Hulda Haraldsdottir and Dr. Ronan M.T. Fleming for their inputs on thermodynamic constraints on metabolic reaction directionality. This work was supported by the European Research Council grant proposal no. 232816. IT was supported, in part, by a Marie Curie International Reintegration Grant awarded (N° 249261) within the 7th European Community Framework Program.
- Reed JL, Famili I, Thiele I, Palsson BO: Towards multidimensional genome annotation. Nat Rev Genet. 2006, 7: 130-141. 10.1038/nrg1769.View ArticlePubMed
- Feist AM, Herrgård MJ, Thiele I, Reed JL, Palsson BØ: Reconstruction of biochemical networks in microorganisms. Nat Rev Microbiol. 2009, 7: 129-143.PubMed CentralView ArticlePubMed
- Oberhardt MA, Palsson BO, Papin JA: Applications of genome-scale metabolic reconstructions. Mol Syst Biol. 2009, 5:
- Thiele I, Palsson BØ: A protocol for generating a high-quality genome-scale metabolic reconstruction. Nat Protoc. 2010, 5: 93-121.PubMed CentralView ArticlePubMed
- Chang RL, Ghamsari L, Manichaikul A, Hom EFY, Balaji S, Fu W, Shen Y, Hao T, Palsson BØ, Salehi-Ashtiani K, Papin JA: Metabolic network reconstruction of Chlamydomonas offers insight into light-driven algal metabolism. Mol Syst Biol. 2011, 7: 518-PubMed CentralView ArticlePubMed
- Manichaikul A, Ghamsari L, Hom EFY, Lin C, Murray RR, Chang RL, Balaji S, Hao T, Shen Y, Chavali AK, Thiele I, Yang X, Fan C, Mello E, Hill DE, Vidal M, Salehi-Ashtiani K, Papin JA: Metabolic network analysis integrated with transcript verification for sequenced genomes. Nat Methods. 2009, 6: 589-592. 10.1038/nmeth.1348.PubMed CentralView ArticlePubMed
- Sheikh K, Förster J, Nielsen LK: Modeling hybridoma cell metabolism using a generic genome-scale metabolic model of Mus musculus. Biotechnol Prog. 2005, 21: 112-121.View ArticlePubMed
- Quek L-E, Nielsen LK: On the reconstruction of the Mus musculus genome-scale metabolic network model. Genome Inform. 2008, 21: 89-100.PubMed
- Sigurdsson MI, Jamshidi N, Steingrimsson E, Thiele I, Palsson BO: A detailed genome-wide reconstruction of mouse metabolism based on human Recon 1. BMC Syst Biol. 2010, 4: 140-10.1186/1752-0509-4-140.PubMed CentralView ArticlePubMed
- Seo S, Lewin HA: Reconstruction of metabolic pathways for the cattle genome. BMC Syst Biol. 2009, 3: 33-10.1186/1752-0509-3-33.PubMed CentralView ArticlePubMed
- Duarte NC, Becker SA, Jamshidi N, Thiele I, Mo ML, Vo TD, Srivas R, Palsson BØ: Global reconstruction of the human metabolic network based on genomic and bibliomic data. Proc Natl Acad Sci USA. 2007, 104: 1777-1782. 10.1073/pnas.0610772104.PubMed CentralView ArticlePubMed
- Ma H, Sorokin A, Mazein A, Selkov A, Selkov E, Demin O, Goryanin I: The Edinburgh human metabolic network reconstruction and its functional analysis. Mol Syst Biol. 2007, 3: 135-PubMed CentralView ArticlePubMed
- Gille C, Bolling C, Hoppe A, Bulik S, Hoffmann S, Hubner K, Karlstadt A, Ganeshan R, Konig M, Rother K, Weidlich M, Behre J, Holzhutter H-G: HepatoNet1: a comprehensive metabolic reconstruction of the human hepatocyte for the analysis of liver physiology. Mol Syst Biol. 2010, 6:
- Zelezniak A, Pers TH, Soares S, Patti ME, Patil KR: Metabolic network topology reveals transcriptional regulatory signatures of type 2 diabetes. PLoS Comput Biol. 2010, 6: e1000729-10.1371/journal.pcbi.1000729.PubMed CentralView ArticlePubMed
- Shlomi T, Cabili MN, Herrgård MJ, Palsson BØ, Ruppin E: Network-based prediction of human tissue-specific metabolism. Nat Biotechnol. 2008, 26: 1003-1010. 10.1038/nbt.1487.View ArticlePubMed
- Chang RL, Xie L, Xie L, Bourne PE, Palsson BØ: Drug off-target effects predicted using structural analysis in the context of a metabolic network model. PLoS Comput Biol. 2010, 6: e1000938-10.1371/journal.pcbi.1000938.PubMed CentralView ArticlePubMed
- Bordbar A, Lewis NE, Schellenberger J, Palsson BO, Jamshidi N: Insight into human alveolar macrophage and M. tuberculosis interactions via metabolic reconstructions. Mol Syst Biol. 2010, 6:
- Schellenberger J, Park J, Conrad T, Palsson B: BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions. BMC Bioinformatics. 2010, 11: 213-10.1186/1471-2105-11-213.PubMed CentralView ArticlePubMed
- Orth JD, Thiele I, Palsson BØ: What is flux balance analysis?. Nat Biotechnol. 2010, 28: 245-248. 10.1038/nbt.1614.PubMed CentralView ArticlePubMed
- Satish Kumar V, Dasika MS, Maranas CD: Optimization based automated curation of metabolic reconstructions. BMC Bioinformatics. 2007, 8: 212-10.1186/1471-2105-8-212.PubMed CentralView ArticlePubMed
- Nakahigashi K, Toya Y, Ishii N, Soga T, Hasegawa M, Watanabe H, Takai Y, Honma M, Mori H, Tomita M: Systematic phenome analysis of Escherichia coli multiple-knockout mutants reveals hidden reactions in central carbon metabolism. Mol Syst Biol. 2009, 5: 306-PubMed CentralView ArticlePubMed
- Karp PD: Call for an enzyme genomics initiative. Genome Biol. 2004, 5: 401-10.1186/gb-2004-5-8-401.PubMed CentralView ArticlePubMed
- Lespinet O, Labedan B: Orphan enzymes?. Science. 2005, 307: 42-View ArticlePubMed
- Chen L, Vitkup D: Distribution of orphan metabolic activities. Trends Biotechnol. 2007, 25: 343-348. 10.1016/j.tibtech.2007.06.001.View ArticlePubMed
- Hanson AD, Pribat A, Waller JC, de Crécy-Lagard V: "Unknown" proteins and "orphan" enzymes: the missing half of the engineering parts list--and how to find it. Biochem J. 2010, 425: 1-11. 10.1042/BJ20091328.View Article
- Orth JD, Palsson BØ: Systematizing the generation of missing metabolic knowledge. Biotechnol Bioeng. 2010, 107: 403-412. 10.1002/bit.22844.PubMed CentralView ArticlePubMed
- Mahadevan R, Schilling CH: The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab Eng. 2003, 5: 264-276. 10.1016/j.ymben.2003.09.002.View ArticlePubMed
- Reed JL, Patel TR, Chen KH, Joyce AR, Applebee MK, Herring CD, Bui OT, Knight EM, Fong SS, Palsson BO: Systems approach to refining genome annotation. Proc Natl Acad Sci USA. 2006, 103: 17480-17484. 10.1073/pnas.0603364103.PubMed CentralView ArticlePubMed
- Kumar VS, Maranas CD: GrowMatch: an automated method for reconciling in silico/in vivo growth predictions. PLoS Comput Biol. 2009, 5: e1000308-10.1371/journal.pcbi.1000308.PubMed CentralView ArticlePubMed
- Fong SS, Nanchen A, Palsson BO, Sauer U: Latent pathway activation and increased pathway capacity enable Escherichia coli adaptation to loss of key metabolic enzymes. J Biol Chem. 2006, 281: 8024-8033. 10.1074/jbc.M510016200.View ArticlePubMed
- Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 1999, 27: 29-34. 10.1093/nar/27.1.29.PubMed CentralView ArticlePubMed
- Duarte NC, Herrgård MJ, Palsson BØ: Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Res. 2004, 14: 1298-1309. 10.1101/gr.2250904.PubMed CentralView ArticlePubMed
- Fleming RMT, Thiele I: von Bertalanffy 1.0: a COBRA toolbox extension to thermodynamically constrain metabolic models. Bioinformatics. 2011, 27: 142-143. 10.1093/bioinformatics/btq607.PubMed CentralView ArticlePubMed
- Schomburg I, Chang A, Schomburg D: BRENDA, enzyme data and metabolic information. Nucleic Acids Res. 2002, 30: 47-49. 10.1093/nar/30.1.47.PubMed CentralView ArticlePubMed
- Hoppe A, Hoffmann S, Holzhütter H-G: Including metabolite concentrations into flux balance analysis: thermodynamic realizability as a constraint on flux distributions in metabolic networks. BMC Syst Biol. 2007, 1: 23-10.1186/1752-0509-1-23.PubMed CentralView ArticlePubMed
- Kim JS, Kim WY, Rho HW, Park JW, Park BH, Han MK, Kim UH, Kim HR: Purification and characterization of adenosine diphosphate ribose pyrophosphatase from human erythrocytes. Int J Biochem Cell Biol. 1998, 30: 629-638. 10.1016/S1357-2725(97)00142-8.View ArticlePubMed
- Gasmi L, Cartwright JL, McLennan AG: Cloning, expression and characterization of YSA1H, a human adenosine 5'-diphosphosugar pyrophosphatase possessing a MutT motif. Biochem J. 1999, 344 (Pt 2): 331-337.PubMed CentralView ArticlePubMed
- Yang H, Slupska MM, Wei YF, Tai JH, Luther WM, Xia YR, Shih DM, Chiang JH, Baikalov C, Fitz-Gibbon S, Phan IT, Conrad A, Miller JH: Cloning and characterization of a new member of the Nudix hydrolases from human and mouse. J Biol Chem. 2000, 275: 8844-8853. 10.1074/jbc.275.12.8844.View ArticlePubMed
- Herman RH: Mannose metabolism. I. Am J Clin Nutr. 1971, 24: 488-498.PubMed
- Ning B, Elbein AD: Cloning, expression and characterization of the pig liver GDP-mannose pyrophosphorylase. Evidence that GDP-mannose and GDP-Glc pyrophosphorylases are different proteins. Eur J Biochem. 2000, 267: 6866-6874. 10.1046/j.1432-1033.2000.01781.x.View ArticlePubMed
- Wishart DS, et al, et al.: HMDB: the Human Metabolome Database. Nucleic Acids Res. 2007, 35: D521-D526. 10.1093/nar/gkl923.PubMed CentralView ArticlePubMed
- Hoppe W, Rauch U, Kresse H: Degradation of endocytosed dermatan sulfate proteoglycan in human fibroblasts. J Biol Chem. 1988, 263: 5926-5932.PubMed
- Fuller M, Meikle PJ, Hopwood JJ: Glycosaminoglycan degradation fragments in mucopolysaccharidosis I. Glycobiology. 2004, 14: 443-450. 10.1093/glycob/cwh049.View ArticlePubMed
- Scott HS, Anson DS, Orsborn AM, Nelson PV, Clements PR, Morris CP, Hopwood JJ: Human alpha-L-iduronidase: cDNA isolation and expression. Proc Natl Acad Sci USA. 1991, 88: 9695-9699. 10.1073/pnas.88.21.9695.PubMed CentralView ArticlePubMed
- Havelaar AC, Mancini GM, Beerens CE, Souren RM, Verheijen FW: Purification of the lysosomal sialic acid transporter. Functional characteristics of a monocarboxylate transporter. J Biol Chem. 1998, 273: 34568-34574. 10.1074/jbc.273.51.34568.View ArticlePubMed
- Mancini GM, de Jonge HR, Galjaard H, Verheijen FW: Characterization of a proton-driven carrier for sialic acid in the lysosomal membrane. Evidence for a group-specific transport system for acidic monosaccharides. J Biol Chem. 1989, 264: 15247-15254.PubMed
- Toida T, Qiu G, Matsunaga T, Sagehashi Y, Imanari T: Gas Chromatography-Mass Spectrometric Determinations of Iduronic and Glucuronic Acids in Glycosaminoglycans after Reduction of Carboxylic Group Using Sodium Borodeuteride. Anal Sci. 1992, 8: 799-804. 10.2116/analsci.8.799.View Article
- Elbein AD, Freeze H: Glycosylation Precursors. Essentials of Glycobiology. 2009, Cold Spring Harbor: Cold Spring Harbor Laboratory Press, 2
- Bout A, Franse MM, Collins J, Blonden L, Tager JM, Benne R: Characterization of the gene encoding human peroxisomal 3-oxoacyl-CoA thiolase (ACAA). No large DNA rearrangement in a thiolase-deficient patient. Biochim Biophys Acta. 1991, 1090: 43-51.View ArticlePubMed
- Wanders RJ, Vreken P, Ferdinandusse S, Jansen GA, Waterham HR, van Roermund CW, Van Grunsven EG: Peroxisomal fatty acid alpha- and beta-oxidation in humans: enzymology, peroxisomal metabolite transporters and peroxisomal diseases. Biochem Soc Trans. 2001, 29: 250-267. 10.1042/BST0290250.View ArticlePubMed
- Cunin R, Glansdorff N, Piérard A, Stalon V: Biosynthesis and metabolism of arginine in bacteria. Microbiol Rev. 1986, 50: 314-352.PubMed CentralPubMed
- Caldovic L, Tuchman M: N-acetylglutamate and its changing role through evolution. Biochem J. 2003, 372: 279-290. 10.1042/BJ20030002.PubMed CentralView ArticlePubMed
- Ohura T, Kominami E, Tada K, Katunuma N: Crystallization and properties of human liver ornithine aminotransferase. J Biochem. 1982, 92: 1785-1792.PubMed
- Gafan C, Wilson J, Berger LC, Berger BJ: Characterization of the ornithine aminotransferase from Plasmodium falciparum. Mol Biochem Parasitol. 2001, 118: 1-10. 10.1016/S0166-6851(01)00357-7.View ArticlePubMed
- Mancini GM, Beerens CE, Aula PP, Verheijen FW: Sialic acid storage diseases. A multiple lysosomal transport defect for acidic monosaccharides. J Clin Invest. 1991, 87: 1329-1335. 10.1172/JCI115136.PubMed CentralView ArticlePubMed
- Kharchenko P, Vitkup D, Church GM: Filling gaps in a metabolic network using expression information. Bioinformatics. 2004, 20 (Suppl 1): i178-185. 10.1093/bioinformatics/bth930.View ArticlePubMed
- Kharchenko P, Chen L, Freund Y, Vitkup D, Church GM: Identifying metabolic enzymes with multiple types of association evidence. BMC Bioinformatics. 2006, 7: 177-10.1186/1471-2105-7-177.PubMed CentralView ArticlePubMed
- Hsiao T-L, Revelles O, Chen L, Sauer U, Vitkup D: Automatic policing of biochemical annotations using genomic correlations. Nat Chem Biol. 2010, 6: 34-40.PubMed CentralView ArticlePubMed
- Lee I, Ambaru B, Thakkar P, Marcotte EM, Rhee SY: Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana. Nat Biotechnol. 2010, 28: 149-156. 10.1038/nbt.1603.PubMed CentralView ArticlePubMed
- Henry CS, DeJongh M, Best AA, Frybarger PM, Linsay B, Stevens RL: High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat Biotechnol. 2010, 28: 977-982. 10.1038/nbt.1672.View ArticlePubMed
- Overbeek R, et al, et al.: The subsystems approach to genome annotation and its use in the project to annotate 1000 genomes. Nucleic Acids Res. 2005, 33: 5691-5702. 10.1093/nar/gki866.PubMed CentralView ArticlePubMed
- Bult CJ, Eppig JT, Kadin JA, Richardson JE, Blake JA: The Mouse Genome Database (MGD): mouse biology and model systems. Nucleic Acids Res. 2008, 36: D724-728.PubMed CentralView ArticlePubMed
- Conners SB, Montero CI, Comfort DA, Shockley KR, Johnson MR, Chhabra SR, Kelly RM: An expression-driven approach to the prediction of carbohydrate transport and utilization regulons in the hyperthermophilic bacterium Thermotoga maritima. J Bacteriol. 2005, 187: 7267-7282. 10.1128/JB.187.21.7267-7282.2005.PubMed CentralView ArticlePubMed
- Chhabra SR, Shockley KR, Conners SB, Scott KL, Wolfinger RD, Kelly RM: Carbohydrate-induced differential gene expression patterns in the hyperthermophilic bacterium Thermotoga maritima. J Biol Chem. 2003, 278: 7540-7552. 10.1074/jbc.M211748200.View ArticlePubMed
- Chen L, Vitkup D: Predicting genes for orphan metabolic activities using phylogenetic profiles. Genome Biol. 2006, 7: R17-10.1186/gb-2006-7-2-r17.PubMed CentralView ArticlePubMed
- Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, Cooke MP, Walker JR, Hogenesch JB: A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci USA. 2004, 101: 6062-6067. 10.1073/pnas.0400782101.PubMed CentralView ArticlePubMed
- Pagel P, Kovac S, Oesterheld M, Brauner B, Dunger-Kaltenbach I, Frishman G, Montrone C, Mark P, Stümpflen V, Mewes H-W, Ruepp A, Frishman D: The MIPS mammalian protein-protein interaction database. Bioinformatics. 2005, 21: 832-834. 10.1093/bioinformatics/bti115.View ArticlePubMed
- Lynn DJ, Chan C, Naseer M, Yau M, Lo R, Sribnaia A, Ring G, Que J, Wee K, Winsor GL, Laird MR, Breuer K, Foroushani AK, Brinkman FSL, Hancock REW: Curating the innate immunity interactome. BMC Syst Biol. 2010, 4: 117-10.1186/1752-0509-4-117.PubMed CentralView ArticlePubMed
- Bossi A, Lehner B: Tissue specificity and the human protein interaction network. Mol Syst Biol. 2009, 5: 260-PubMed CentralView ArticlePubMed
- Shlomi T, Cabili MN, Herrgård MJ, Palsson BØ, Ruppin E: Network-based prediction of human tissue-specific metabolism. Nat Biotechnol. 2008, 26: 1003-1010. 10.1038/nbt.1487.View ArticlePubMed
- Yang R, Su B: Characterization and comparison of the tissue-related modules in human and mouse. PLoS ONE. 2010, 5: e11730-10.1371/journal.pone.0011730.PubMed CentralView ArticlePubMed
- Park J, Lee D-S, Christakis NA, Barabási A-L: The impact of cellular networks on disease comorbidity. Mol Syst Biol. 2009, 5: 262-PubMed CentralView ArticlePubMed
- Burgard AP, Nikolaev EV, Schilling CH, Maranas CD: Flux coupling analysis of genome-scale metabolic network reconstructions. Genome Res. 2004, 14: 301-312. 10.1101/gr.1926504.PubMed CentralView ArticlePubMed
- Thiele I, Price ND, Vo TD, Palsson BØ: Candidate metabolic network states in human mitochondria. Impact of diabetes, ischemia, and diet. J Biol Chem. 2005, 280: 11683-11695. 10.1074/jbc.M409072200.View ArticlePubMed
- Jamshidi N, Palsson BØ: Systems biology of SNPs. Mol Syst Biol. 2006, 2: 38-PubMed CentralView ArticlePubMed
- Galperin MY, Koonin EV: "Conserved hypothetical" proteins: prioritization of targets for experimental study. Nucleic Acids Res. 2004, 32: 5452-5463. 10.1093/nar/gkh885.PubMed CentralView ArticlePubMed
- Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, Doerks T, Julien P, Roth A, Simonovic M, Bork P, von Mering C: STRING 8--a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res. 2009, 37: D412-416. 10.1093/nar/gkn760.PubMed CentralView ArticlePubMed
- Becker SA, Feist AM, Mo ML, Hannum G, Palsson BØ, Herrgard MJ: Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox. Nat Protoc. 2007, 2: 727-738. 10.1038/nprot.2007.99.View ArticlePubMed
- Mannaerts GP, van Veldhoven PP: Functions and organization of peroxisomal beta-oxidation. Ann NY Acad Sci. 1996, 804: 99-115. 10.1111/j.1749-6632.1996.tb18611.x.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.