Proteomics-based metabolic modeling and characterization of the cellulolytic bacterium Thermobifida fusca
© Vanee et al.; licensee BioMed Central 2014
Received: 24 April 2014
Accepted: 14 July 2014
Published: 13 August 2014
Thermobifida fusca is a cellulolytic bacterium with potential to be used as a platform organism for sustainable industrial production of biofuels, pharmaceutical ingredients and other bioprocesses due to its capability of potential to convert plant biomass to value-added chemicals. To best develop T. fusca as a bioprocess organism, it is important to understand its native cellular processes. In the current study, we characterize the metabolic network of T. fusca through reconstruction of a genome-scale metabolic model and proteomics data. The overall goal of this study was to use multiple metabolic models generated by different methods and comparison to experimental data to gain a high-confidence understanding of the T. fusca metabolic network.
We report the generation of three versions of a metabolic model of Thermobifida fusca sp. XY developed using three different approaches (automated, semi-automated, and proteomics-derived). The model closest to in vivo growth was the proteomics-derived model that consists of 975 reactions involving 1382 metabolites and account for 316 EC numbers (296 genes). The model was optimized for biomass production with the optimal flux of 0.48 doublings per hour when grown on cellobiose with a substrate uptake rate of 0.25 mmole/h. In vivo activity of the DXP pathway for terpenoid biosynthesis was also confirmed using real-time PCR.
i Tfu296 provides a platform to understand and explore the metabolic capabilities of the actinomycete T. fusca for the potential use in bioprocess industries for the production of biofuel and pharmaceutical ingredients. By comparing different model reconstruction methods, the use of high-throughput proteomics data as a starting point proved to be the most accurate to in vivo growth.
With ongoing research in genomics, metagenomics, and bioprospecting, the breadth of novel and interesting biochemistry continues to grow. One of the current challenges associated with the large amount of data and resources available is to conduct detailed analyses to curate information to translate raw data into knowledge that provides functional insight. One computational method that facilitates metabolic analysis and dovetails well with genomic and biochemical information is genome-scale constraint-based modeling. While constraint-based models have benefits of being easily scalable and providing gene-protein-reaction level specificity, there are a number of limitations. One of the most fundamental problems revolves around the fact that metabolic networks are underdetermined and thus, there exist alternative flux states with different pathway usage that produce indistinguishable cellular phenotypes. This is an underlying problem with constraint-based models that impacts multiple facets of these models including the initial reconstruction (association of specific reactions with annotated genes) to producing simulation predictions (presence of alternate optimal solutions). In this study, we consider the metabolically under-characterized actinobacterium, Thermobifida fusca, and utilize three different methods to gain a better understanding of its metabolic network and identify preferred methodologies for network characterization.
Within the actinomycetes, Thermobifida fusca (aerobic, thermophilic, gram-positive) is known for its high temperature and pH stability as well as highly expressed cellulolytic system. The cellulolytic system is comprised of three endocellulases (Cel9B, Cel6A and Cel5A), two exocellulases (Cel6B and Cel48A) and a processive cellulase (Cel9A) . Numerous studies in the past have reported on various facets of the ability of T. fusca to degrade lignocellulosic biomass. Due to the high efficiency with which T. fusca can process lignocellulosic materials, efforts have been made to clone several individual cellulase genes into Streptomyces lividians, Streptomyces albus, Bacillus subtilis and Escherichia coli,. The cloned enzymes were isolated in good concentrations but failed to show similar level of cellulolytic activity as is found in T. fusca. This may be due to the complexity of cellulose degradation systems that are not defined by a few genes but is an intertwined network of various enzymes ,. Hence, it is suggested that the best way to fully utilize the cellulolytic capabilities of T. fusca may be to develop T. fusca rather than trying to move its cellulases into other systems by heterologous expression.
The ability to produce chemicals of industrial importance using inexpensive lignocellulosic biomass has been a recent focus for microbial systems. For T. fusca, the sequencing of its genome by Department of Energy (DOE) in 2005 sets up a milestone towards understanding this industrially applicable microbe . Besides proving an excellent host microbe for biofuel production , it also showed success towards utilization of untreated (without any preprocessing) lignocellulosic material. This is a promising development toward making use of the cellulolytic capabilities of this microbe to reduce the complex multi-step bioprocess to a CBP .
Different approaches have been taken in the past in trying to develop a consolidated bioprocess. One approach is to utilize the cellulolytic capabilities to well-established model organisms, such as E. coli and S. cerevisiae. This approach seeks to allow for direct use of lignocellulosic biomass as a starting point, but leverages the knowledge and tools available for well-characterized organisms. The alternative approach is to characterize and develop poorly-characterized cellulolytic organisms. Thus, high levels of cellulose processivity can be achieved, but the challenge is to develop the knowledge and tools to a sufficient level that metabolism can be designed and altered in a directed fashion.
Significant milestones for T. fusca research and characterization
Zhang et al.
Int. J Syst Bact. 1998 Apr
Kukolya et al.
Int. J Sys Evol Micr, 2002 Jul
JGI Finished Genome, 2005
Plant biomass degradation study and analysis of enzymatic system
Wilson et al.
Chem Rec. 2004
Biochem. 2006 Nov
Lykidis et al.
J. Bacteriol. 2007 Mar
First Genetic Modification
Deng & Fong
Appl. Environ. Microbiol 2010 Apr
Producing biofuel from untreated biomass
Deng & Fong
Metab Eng. 2011 Sept
With the availability of genomic sequences, it has become possible to use genome annotation and biochemical information to reconstruct cellular metabolic networks . These models can be used for simulating the living state of bacteria, if operated under the defined constraints and boundary conditions. There are various algorithms such as FBA - flux balance analysis ,, MOMA - minimization of metabolic adjustments , ROOM - regulatory on-off minimization  and MCA - metabolic control analysis that are currently used for the purpose of running computational simulations of these models . In the current study, we used FBA, which is based on linear programming, to simulate and optimize T. fusca model developed in this study for biomass production (growth).
T. fusca model and FBA
where, Z is the flux through objective function (biomass production and product optimization), S: stoichiometry of the reactions represented as matrix, v is reaction flux vector, a i and b i are the constraints placed on the flux v i of the reaction i.
Even after compiling biochemical information and gap-filling a model, there often are discrepancies between the computational model results and in vivo states that are difficult to identify using only computational approaches. An additional level of model curation can be achieved by integrating high-throughput experimental data with the framework of a computational model to put “content in context” . This data integration can be done using multi-scale high throughput experimental data such as transcriptomics, proteomics and metabolomics. This step reconciles in silico predictions with experimental results and thereby helps enhance the characterization of the cellular activity.
Genome-scale metabolic models have been created for many prokaryotic microbes and a variety of applications . Several of these models have been able to incorporate experimental data to more closely match cellular processes. Once the model closely resembles a biological system, it can be optimized for defined objective function. This objective function may range from production of biomass to production of a chemical target. Following the in silico optimization of yields the computational design may eventually be replicated for applications in industry, therapeutics or health-related predictions.
In this study, three different approaches for generating constraint-based models were used and analyzed for their ability to accurately predict cellular growth when given an input substrate uptake rate. The three model versions were 1) designed using Model SEED , 2) an in-house semi-automated reconstruction based on organism specific annotation and reaction information from KEGG ,, and 3) a proteomics-based model from 2D proteomic experiment of T. fusca grown on cellobiose media. A comparative analysis between the different models was conducted to develop the most experimentally accurate model of T. fusca and to provide comparison of different model generation methods. Detailed analysis of metabolic function of T. fusca is also included to provide perspective on potential industrial applications (eg: in biofuel and natural product) for development of consolidated bioprocesses using T. fusca. Here we will be discussing the mevalonate and non-mevalonate pathways for terpenoid biosynthesis. These pathways are utilized for the production of isoprenoid precursor (isopentenyl pyrophosphate and dimethylalyl pyrophosphate) compounds that have applications in pharmaceutical, nutraceutical and perfume industries.
Results and discussion
Metabolic reconstruction: summary and model statistics
Three different draft metabolic models were constructed for Thermobifida fusca. The three models varied based upon what was used as the starting point for generating the initial reaction list for the model (Model SEED, KEGG, proteomics data). In all cases, after initial generation of a reaction list, all reactions were associated with KEGG IDs to standardize comparisons between models.
Tfu_v1: The taxonomy number of T. fusca was used to generate a draft model from Model SEED (Tfu_v1) ,. The Model SEED output was then converted to KEGG compound identifiers before running gap analysis and FBA. The Model SEED-derived Tfu_v1 model contained 1302 reactions, 1213 metabolites, and 618 EC numbers. Gap analysis added 146 reactions that primarily include a variety of exchange reactions (reactions that denote the direct uptake/secretion of the respective metabolite from or to the extra cellular media). When used with unconstrained carbon input flux (cellobiose uptake of 1000 mmoles/gDW/h) the Tfu_v1 model calculated a growth rate of 24.25 doublings/h. However, when the experimentally determined substrate uptake rate of cellobiose (0.25 mmoles/gDW/h) was applied to this model it failed to arrive at a viable solution (warning of infeasible solution). This infeasibility was crosschecked and verified by using the Model SEED FBA runs. The model failed to perform under any media formulations (glucose and cellobiose) attempted using the Model SEED simulation platform.
Tfu_v2: The second version of a T. fusca model (Tfu_v2) was created using an in-house semi-automated reconstruction method as defined in our past publications ,. This model consists of 1002 reactions involving 584 EC numbers and accounting for 1105 metabolites. Eight out of 48 reactions added in the gap analysis were non-exchange reactions. Applying the experimentally determined constraint of substrate uptake rate for cellobiose as 0.25 mmoles/gDW/h, the optimal biomass growth was 72.84 doublings/h. This growth rate was compared to experimental growth rate of 0.43 doublings/h.
Summary of individual pathway contrasts between three versions of model: The reference frames selected here is KEGG
Reactions in pathway
Citrate Cycle (TCA Cycle)
Pentose Phosphate Pathway
Pentose and Glucuronate Intrconv.
Fructose and Mannose Metabolism
Starch and Sucrose Metabolism
Amino Sugar and Nucleotide Metabolism
Glyoxylate and Dicarboxylate Metabolism
C- 5 Branched Diabasic Acid Metabolism
Inositol Phosphate Metabolism
Amino Acid Metabolism
Ala, Asp and Glu Metabolism
Gly, Ser and Thr Metabolism
Cys and Met Metabolism
Val, Leu and Ile Degradation
Val, Leu and Ile Biosynthesis
Arg and Pro Metabolism
Phe, Tyr and Trp Biosynthesis
Tfu_v3: proteomics model reaction distribution
Based upon the initial model contents and growth rate simulation results, the the Tfu_v3 appears to most closely predict experimental results. Thus, further detailed computational analyses of T. fusca are based on the Tfu_v3 model. The Tfu_v3 model is designated as iTfu296 (represents 296 annotated genes).
It was observed that in the carbohydrate-amino acid network connection the majority of reactions (11 out of 23) branch in and out of alpha – ketoglutarate. The acetyl CoA node was more centralized by connecting the amino acids and fatty acid pathways with 8 significantly high flux reactions. The pathway specific details are also illustrated in the Table 2. It was observed that when contrasted with the reactions specific to the biosynthesis or degradation of amino acids, the overall analysis suggests the presence of all but one amino acid subsystem, cysteine and methionine metabolism.
Comparison of “model refinement with data” Vs “reconstruction from data”
In the past, several attempts have been made to use computational metabolic models as a scaffold for experimental data integration -. Experimental data integration not only serves as a means of testing model predictions, but can also be used to help refine the model solution space to provide simulations that more closely match in vivo flux states. In this study, we used two methods to combine metabolic models and experimental data to understand and characterize the metabolic network of T. fusca. For the first approach, an MILP algorithm (analogous to Shlomi et al. ) was used to integrate the proteomics dataset to the model Tfu_v2 (autobuild model). This algorithm aims at optimizing the agreement between the experimental data and the in silico model . In this context, the experimental information is used to assign a present or absent call to re-channelize the flux distribution of the network as explained by Gowen et al. .
Parallel to this, a second approach originating directly from an experimental proteomic data set was used to generate an independent model, Tfu_v3. Currently existing methods for constraint-based model reconstruction primarily depend on the bioinformatic information such as genomics data, biochemical data and models of related microorganisms at the initial phase of model building. The approach taken for construction of Tfu_v3 relies on the in vivo experimental evidence of the proteins as the starting point of model building. While these two methods of model construction used the same bioinformatics and experimental data, subtle differences in the order of process steps and algorithms, as shown in Figure 5, used to construct the two models resulted in vastly different functional consequences.
Besides this, another very interesting and significant difference between Tfu_v2 and Tfu_v3 was observed in the function of the TCA cycle. The reaction using pyruvate to make oxaloacetate was not found in the autobuilt Tfu_v2 version whereas the proteomics version clearly shows its presence (EC 184.108.40.206, Gene ID: Tfu_2557, Tfu_1530, Tfu_0947, Tfu_1228). In addition, most of the amino acid pathways were fully or partially incomplete in Tfu_v2. Thus, the Tfu_v2 model had artificially high fluxes through transport reactions to uptake external nutrients to satisfy simulation requirements for optimal biomass production. The Tfu_v3 model showed most of the pathways significantly complete except cysteine and methionine metabolism. Besides cysteine and methionine metabolism, phenylalanine metabolism was sparsely populated, but active flux through the reaction involved in the interconversion of phenylalanine to phenylpyruvate suggested making a present call for the entire phenylalanine pathway. These functional differences indicate some of the potential danger associated with over-reliance on genome annotation (that may contain numerous errors especially in under-characterized organisms).
Applicability of the model: biofuels and pharmaceutical precursors
The subsystem-based analysis of central metabolism and experimental foundation suggests the closer association of Tfu_v3 to the in vivo biochemical system of T. fusca. While the Tfu_v3 model is the most accurate of the three models constructed in this study, there are numerous areas of metabolism that are not well-characterized. For comparison, even the most update model of E. coli has an account of only for 30% of the gene products in the model . Likewise, this model being the first ever T. fusca metabolic network also opens huge scope of pathway-focused review and improvement. Once completely functional these models provide a ground for hypothesizing a target for the further study.
T. fusca is a potentially interesting organism for biochemical production of sustainable fuels or industrial chemicals. In these areas, two pathways of particular interest are butanol and secondary metabolite biosynthesis (e.g. terpenoids). The Tfu_v2 (ModelSEED autobuilt) model incorporates most of the reactions present in the butanoate metabolism however, no active flux was observed through most of them. Tfu_v3 based on experimental dataset confirms approximately 50% of these reactions but also predicts no active flux through these pathways. Thus, while production of butanol through butanoate metabolism appears possible in terms of biochemical capabilities, it remains to be explored and demonstrated experimentally. For comparison purposes, in 2011 a mutant strain T. fusca B6 was designed and constructed with heterologous expression of a bifunctional alcohol dehydrogenase (adhE2) for production of 1-propanol . Engineered production of 1-propanol in T. fusca provides first-step experimental evidence that T. fusca may be usable for the production of fuels directly from lignocellulosic raw materials.
One consideration for FBA simulations of both butanol/butanoate and terpenoids is that these are secondary pathways and utilization of these pathways are not explicitly included in the biomass objective for growth maximization simulations. Thus, it is not necessarily surprising that growth simulations with no genetic designs incorporated may not show flux through secondary pathways. However, due to the potential for secondary metabolism in actinomycetes, we considered studying the terpenoid backbone (TBB) pathway in more detail.
Experimental validation: expression analysis TBB pathway genes
T. fusca genes associated with the DXP pathway for terpenoid biosynthesis
Three different methodologies were applied to create metabolic reconstructions for T. fusca . The proteomics-based model (Tfu_v3) named i Tfu296 was found to mimic the biological growth conditions most closely. It was observed that when cellobiose uptake rate was constraint as 0.25 mmoles/gDW/h gave the growth rate of 0.49 doublings/h. This was comparable to the experimental growth rate of 0.43 doublings/h. This model was built using a novel scheme for model reconstruction based on high throughput proteomic data at the initial model building phase.
Genomic datasets are the most standard high throughput data currently available, but it is always a concern to what extent the genomics information is really transcribed and translated into the functional role inside the cell. Out of 3195 genes annotated in the T. fusca genome published in 2005, only 3117 translated into protein-coding genes and only 1757 were associated with predicted functions ,. This discrepancy between genomic and proteomic information may be due to environmental or evolutionary selection processes. Accurate proteomic data provides more information about the functional activities of the cell, as confirmed by the fact that our proteomics-based model agreed with experimental observations better than models built based on genomic information alone.
With the advent of standard genomics information, genome-scale metabolic models have become a widely used approach to gain a systems level understanding of metabolic processes and function . Every organism-specific model that is built needs to pass through multiple levels of curation and validation based on genome annotation, experimental evidence and (or) biochemical literature study. Knowing that omics-data can be used to help identify in vivo function, we applied an approach model building based on high throughput proteomic data. These models contain the scope of integration of genomics, transcriptomics, proteomics, metabolomics and phenomics data. For the current study, proteomics data has been used to establish a significantly reliable starting point for the metabolic model reconstruction. This version of model Tfu_v3 is based on functional building blocks that are more closely associated with the phenotypic characteristics when compared to genomics data in the hierarchy.
In a larger context, the modeling approach aims at establishing links between the molecular and cellular functions. However, it is still hard to find complete agreement between the “biology - biochemistry” and “network models - omics data”. It is reported that the most updated E. coli model only associates to 30% of gene products in the model and 1/3 of the gene products are not functionally annotated . Even with this limitation, model-based systems analysis can be useful for developing hypothesis and target for the focused study. In this case, we hypothesize the presence of an active terpenoid backbone pathway, which will be the focus of our follow up studies using experimental and analytical methods.
Nevertheless, this promising approach suffers with a major limitation to date – a lack of a standardized reaction database to build and analyze metabolic network. Due to inconsistency in the labeling of the metabolites (eg: citrate is almost chemically equivalent to citric acid; 2-hydroxy-1,2,3-propanetricarboxylic acid; 2-hydroxytricarballylic acid and have the same compound identifier on KEGG) it is difficult to assemble a non-redundant reaction database with standard nomenclature. Current systems biology experts are in the quest of cleaning and populating available database with minimal redundancy in the hope of exhaustive coverage of cellular biochemical reactions. Some of the examples are MetRxn and MetaCyc.
Computational analysis demonstrated the theoretical feasibility of producing terpenoids in T. fusca, however, no existing experimental evidence had previously supported or demonstrated this capability. Analysis of mRNA transcripts showed in vivo activity of the DXP pathway in T. fusca providing evidence that T. fusca may be capable of direct cellulose-to-terpenoid biosynthesis. Isoprene is the monomeric unit for the huge family of terpenoids, thus hold importance in pharmaceutical industry, perfumes, incense, flavoring, spices, and varnishes.
T. fusca was found to be a biofuel producing strain after the genetic modification protocol for this strain was established by Deng & Fong. With this systems level characterization of secondary metabolites, it can be suggested as a highly useful, robust and inexpensive strain for industrial application. However, this opens an arena for the scale up and optimization study to successfully launch this strain in industrially significant microbes.
Thermobifida fusca ATCC BAA-629 was grown in Hagerdahl medium containing 1.0% cellobiose. Experiments were conducted in Erlenmeyer flasks where 50 mL pre-cultures of T. fusca YX were grown at 55°C and 250 rpm for 24 hours in a 500 mL Erlenmeyer flask. Growth cultures for testing were inoculated using 5% of the pre-culture and grown at 55°C and 250 rpm for 42–48 hours.
Metabolic network reconstruction
Overall model construction steps illustrated in Figure 3 are as follows:
Tfu_v1: SEED. The autobuilt draft model was made in Model SEED , and used as the draft network. The *.xml file downloaded was converted into in-house MetModel format to run the FBA simulations with MetModel software.
Tfu_v3: Proteomics. The draft model was constructed using the output of a 2-dimensional LC-MS analysis performed at the Manitoba Centre for Proteomics & Systems Biology (University of Manitoba, Winnipeg, Canada). Cellobiose grown T. fusca sample was subjected to FASP lysis/digestion procedure (J.R. Wisnewski et al. Nature Methods 2009. 6(5). 359–362) followed by 2D-HPLC-MS/MS acquisition using TripleTOF 5600 mass spectrometer (ABSciex, Mississauga, ON) . Thirteen pairwise-concatenated fractions in the first dimension were analyzed over a 1-hour HPLC-MS/MS session, each. This collection yielded 276,129 MS/MS spectra that were interrogated using an in-house GPU-based search engine  and yielded identification of 126,471 peptides (16,598 non-redundant) spanning 2101 proteins. This represents approximately 68% total proteomic coverage. Protein identification expectation values were computed using a Bayes’ theorem application of its member peptide expectation values, following the design by Beavis and Fenyo for X!tandem . Over 1700 proteins have expectation values of log(e) < −10 (a one in ten-billion probability of random miss-assignment).
Linear programming for flux balance analysis
In-house python scripts were used to run the FBA simulations using the linear programming algorithm as shown in the introduction.
Objective function: biomass equation
Gap analysis and model comparison
The draft model consists of the list of reactions however there are patches in the network that obstruct continuous flow of flux through the pathway. These links are filled in by using the reaction databank and suggesting the list of reactions required to complete the network. FBA-GAP is used to suggest the connection nodes/reactions that are missing . The use and application of this framework have been described in past by Roberts et al., Gowen et al. and Vanee et al. ,,. FBA-GAP takes a draft model and biomass reaction and uses distances in the reaction network and mathematical optimization to produce a list of metabolites that are necessary for biomass production but cannot be produced or consumed by the cell. Reactions producing and consuming this list of metabolites are obtained from a reference database. These potentially gap-filling reactions were manually checked for relevant evidence such as associated proteins/enzymes characterized or genome annotations,. On detecting the specific evidence these reaction additions to the model wer accepted. The process is repeated until a positive biomass flux value is obtained. In this way, only high-confidence reactions are used to complete the reconstruction.
Data integrations and model validation for Tfu_v2
The mixed integer linear programming algorithm (MILP) published in 2008 by Shlomi et al.  was used for integration of proteomics data to the Tfu_v2 version of model. This algorithm was re-written in python by Gowen et al.  to include in our MetModel package.
Characterization of TBB pathway
Reactions associated with Terpenoids backbone biosynthesis pathway that were added to the model Tfu_v3 for the simulation of Secondary metabolites pathway central hub
[c]: C00024 -- > C00332
[c]: C00332 + C00024 -- > C00356
[c]: C00356 -- > C00418
[c]: C00418 -- > C01107
[c]: C01107 -- > C01143
[c]: C01143 -- > C00129
[c]: C00129 -- > C00235
#Non mevalonate pathway
DOXP synthase (Dxs)
[c]: C00118 + C00022 -- > C11437
DOXP reductase (Dxr)
[c]: C11437 -- > C11434
MEP synthase (IspD)
[c]: C11434 -- > C11435
CDP-ME kinase (IspE)
[c]: C11435 -- > C11436
CDP-MEP synthase (IspF)
[c]: C11436 -- > C11453
HMB-PP synthase (IspG)
[c]: C11453 -- > C11811
HMB-PP reductase (IspH)
[c]: C11811 -- > C00129
HMB-PP reductase (IspH)
[c]: C11811 -- > C00129
[c]: C00129 -- > C00235
Expression analysis of TBB pathway genes
T. fusca strain YX grown on Cellobiose media to till the early log phase with dry cell weight of 2.005 mg/mL was used for isolation of RNA using the Qiagen RNA Protect and Qiagen RNAeasy kit. The total RNA was sent to Nucleic Acid Research Facility (Virginia Commonwealth University) for RT-PCR. Tfu_2950 was selected as housekeeping gene to measure the relative expression levels.
This work was partially supported by Genome Canada (MGCB2 project). The funding agency was not involved in any aspect of the study (design, execution, writing, publication submission).
- Wilson DB: Studies of Thermobifida fusca plant cell wall degrading enzymes. Chem Rec. 2004, 4: 72-82. 10.1002/tcr.20002.View ArticlePubMedGoogle Scholar
- Ghangas GS, Wilson DB: Cloning of the thermomonospora fusca endoglucanase E2 Gene in streptomyces lividans: affinity purification and functional domains of the cloned gene product. Appl Environ Microbiol. 1988, 54: 2521-2526.PubMed CentralPubMedGoogle Scholar
- Irwin DC, Zhang S, Wilson DB: Cloning, expression and characterization of a family 48 exocellulase, Cel48A, from Thermobifida fusca. Eur J Biochem. 2000, 267: 4988-4997. 10.1046/j.1432-1327.2000.01546.x.View ArticlePubMedGoogle Scholar
- Spiridonov NA, Wilson DB: Regulation of biosynthesis of individual cellulases in Thermomonospora fusca. J Bacteriol. 1998, 180: 3529-3532.PubMed CentralPubMedGoogle Scholar
- Kukolya J, Nagy I, Laday M, Toth E, Oravecz O, Marialigeti K, Hornok L: Thermobifida cellulolytica sp. nov., a novel lignocellulose-decomposing actinomycete. Int J Syst Evol Microbiol. 2002, 52: 1193-1199. 10.1099/ijs.0.01925-0.PubMedGoogle Scholar
- Lee J, Postmaster A, Soon HP, Keast D, Carson KC: Siderophore production by actinomycetes isolates from two soil sites in Western Australia. Biometals. 2012, 25: 285-296. 10.1007/s10534-011-9503-9.View ArticlePubMedGoogle Scholar
- Takahashi S, Toyoda A, Sekiyama Y, Takagi H, Nogawa T, Uramoto M, Suzuki R, Koshino H, Kumano T, Panthee S, Dairi T, Ishikawa J, Ikeda H, Sakaki Y, Osada H: Reveromycin A biosynthesis uses RevG and RevJ for stereospecific spiroacetal formation. Nat Chem Biol. 2011, 7: 461-468. 10.1038/nchembio.583.View ArticlePubMedGoogle Scholar
- Niraula NP, Kim SH, Sohng JK, Kim ES: Biotechnological doxorubicin production: pathway and regulation engineering of strains for enhanced production. Appl Microbiol Biotechnol. 2010, 87: 1187-1194. 10.1007/s00253-010-2675-3.View ArticlePubMedGoogle Scholar
- Cane DE, Ikeda H: Exploration and mining of the bacterial terpenome. Acc Chem Res. 2012, 45: 463-472. 10.1021/ar200198d.PubMed CentralView ArticlePubMedGoogle Scholar
- Citron CA, Gleitzmann J, Laurenzano G, Pukall R, Dickschat JS: Terpenoids are widespread in actinomycetes: a correlation of secondary metabolism and genome data. Chembiochem. 2012, 13: 202-214. 10.1002/cbic.201100641.View ArticlePubMedGoogle Scholar
- Lykidis A, Mavromatis K, Ivanova N, Anderson I, Land M, DiBartolo G, Martinez M, Lapidus A, Lucas S, Copeland A, Richardson P, Wilson DB, Kyrpides N: Genome sequence and analysis of the soil cellulolytic actinomycete Thermobifida fusca YX. J Bacteriol. 2007, 189: 2477-2486. 10.1128/JB.01899-06.PubMed CentralView ArticlePubMedGoogle Scholar
- Deng Y, Fong SS: Metabolic engineering of Thermobifida fusca for direct aerobic bioconversion of untreated lignocellulosic biomass to 1-propanol. Metab Eng. 2011, 13: 570-577. 10.1016/j.ymben.2011.06.007.View ArticlePubMedGoogle Scholar
- Carere CR, Sparling R, Cicek N, Levin DB: Third generation biofuels via direct cellulose fermentation. Int J Mol Sci. 2008, 9: 1342-1360. 10.3390/ijms9071342.PubMed CentralView ArticlePubMedGoogle Scholar
- Edwards JS, Covert M, Palsson B: Metabolic modelling of microbes: the flux-balance approach. Environ Microbiol. 2002, 4: 133-140. 10.1046/j.1462-2920.2002.00282.x.View ArticlePubMedGoogle Scholar
- Orth JD, Thiele I, Palsson BO: What is flux balance analysis?. Nat Biotechnol. 2010, 28: 245-248. 10.1038/nbt.1614.PubMed CentralView ArticlePubMedGoogle Scholar
- Varma A, Palsson BO: Metabolic flux balancing: basic concepts, scientific and practical use. Nat Biotechnol. 1994, 12: 994-998. 10.1038/nbt1094-994.View ArticleGoogle Scholar
- Segre D, Vitkup D, Church GM: Analysis of optimality in natural and perturbed metabolic networks. Proc Natl Acad Sci U S A. 2002, 99: 15112-15117. 10.1073/pnas.232349399.PubMed CentralView ArticlePubMedGoogle Scholar
- Shlomi T, Berkman O, Ruppin E: Regulatory on/off minimization of metabolic flux changes after genetic perturbations. Proc Natl Acad Sci U S A. 2005, 102: 7695-7700. 10.1073/pnas.0406346102.PubMed CentralView ArticlePubMedGoogle Scholar
- Rapoport TA, Heinrich R, Jacobasch G, Rapoport S: A linear steady-state treatment of enzymatic chains. A mathematical model of glycolysis of human erythrocytes. Eur J Biochem. 1974, 42: 107-120. 10.1111/j.1432-1033.1974.tb03320.x.View ArticlePubMedGoogle Scholar
- Durot M, Bourguignon P-Y, Schachter V: Genome-scale models of bacterial metabolism: reconstruction and applications. FEMS Microbiol Rev. 2009, 33: 164-190. 10.1111/j.1574-6976.2008.00146.x.PubMed CentralView ArticlePubMedGoogle Scholar
- Kanehisa M, Goto S: KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28: 27-30. 10.1093/nar/28.1.27.PubMed CentralView ArticlePubMedGoogle Scholar
- Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006, 34: D354-D357. 10.1093/nar/gkj102.PubMed CentralView ArticlePubMedGoogle Scholar
- Schellenberger J, Park JO, Conrad TC, Palsson BØ: BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions. BMC Bioinformatics. 2010, 11: 213-10.1186/1471-2105-11-213.PubMed CentralView ArticlePubMedGoogle Scholar
- Thorleifsson SG, Thiele I: rBioNet: A COBRA toolbox extension for reconstructing high-quality biochemical networks. Bioinformatics. 2011, 27: 2009-2010. 10.1093/bioinformatics/btr308.View ArticlePubMedGoogle Scholar
- UniProt C: Update on activities at the Universal Protein Resource (UniProt) in 2013. Nucleic Acids Res. 2013, 41: D43-D47. 10.1093/nar/gks1068.View ArticleGoogle Scholar
- Chagoyen M, Pazos F: MBRole: enrichment analysis of metabolomic data. Bioinformatics. 2011, 27: 730-731. 10.1093/bioinformatics/btr001.View ArticlePubMedGoogle Scholar
- Roberts SB, Gowen CM, Brooks JP, Fong SS: Genome-scale metabolic analysis of Clostridium thermocellum for bioethanol production.BMC Syst Biol 2010, 4:31-0509-0504-0531.,Google Scholar
- Roberts SB, Robichaux JL, Chavali AK, Manque PA, Lee V, Lara AM, Papin JA, Buck GA: Proteomic and network analysis characterize stage-specific metabolism in Trypanosoma cruzi.BMC Syst Biol 2009, 3:52-0509-0503-0552.,Google Scholar
- Vanee N, Roberts SB, Fong SS, Manque P, Buck GA: A genome-scale metabolic model of Cryptosporidium hominis. Chem Biodivers. 2010, 7: 1026-1039. 10.1002/cbdv.200900323.View ArticlePubMedGoogle Scholar
- Brooks JP, Burns WP, Fong SS, Gowen CM, Roberts SB: Gap detection for genome-scale constraint-based models. Adv Bioinformatics. 2012, 2012: 323472-10.1155/2012/323472.PubMed CentralView ArticlePubMedGoogle Scholar
- Joyce AR, Palsson BÃ: Toward Whole Cell Modeling And Simulation: Comprehensive Functional Genomics Through The Constraint-Based Approach. Prog Drug Res. 2007, 64: 267-309.Google Scholar
- Burgard AP, Pharkya P, Maranas CD: Optknock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization. Biotechnol Bioeng. 2003, 84: 647-657. 10.1002/bit.10803.View ArticlePubMedGoogle Scholar
- Ranganathan S, Suthers PF, Maranas CD: OptForce: an optimization procedure for identifying all genetic manipulations leading to targeted overproductions. PLoS Comput Biol. 2010, 6: e1000744-10.1371/journal.pcbi.1000744.PubMed CentralView ArticlePubMedGoogle Scholar
- Yang L, Cluett WR, Mahadevan R: EMILiO: a fast algorithm for genome-scale strain design. Metab Eng. 2011, 13: 272-281. 10.1016/j.ymben.2011.03.002.View ArticlePubMedGoogle Scholar
- Palsson BÃ: Systems Biology: Properties Of Reconstructed Networks. 2007, Cambridge University Press, New YorkGoogle Scholar
- Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crecy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V: The subsystems approach to genome annotation and its use in the project to annotate 1000 genomes. Nucleic Acids Res. 2005, 33: 5691-5702. 10.1093/nar/gki866.PubMed CentralView ArticlePubMedGoogle Scholar
- Wilson DB, Kostylev M: Cellulase processivity. Meth Mol Biol. 2012, 908: 93-99.Google Scholar
- Henry CS, DeJongh M, Best AA, Frybarger PM, Linsay B, Stevens RL: High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat Biotechnol. 2010, 28: 977-982. 10.1038/nbt.1672.View ArticlePubMedGoogle Scholar
- Chandrasekaran S, Price ND: Probabilistic integrative modeling of genome-scale metabolic and regulatory networks in Escherichia coli and Mycobacterium tuberculosis. Proc Natl Acad Sci U S A. 2010, 107: 17845-17850. 10.1073/pnas.1005139107.PubMed CentralView ArticlePubMedGoogle Scholar
- Colijn C, Brandes A, Zucker J, Lun DS, Weiner B, Farhat MR, Cheng TY, Moody DB, Murray M, Galagan JE: Interpreting expression data with metabolic flux models: predicting Mycobacterium tuberculosis mycolic acid production. PLoS Comput Biol. 2009, 5: e1000489-10.1371/journal.pcbi.1000489.PubMed CentralView ArticlePubMedGoogle Scholar
- Lerman JA, Hyduke DR, Latif H, Portnoy VA, Lewis NE, Orth JD, Schrimpe-Rutledge AC, Smith RD, Adkins JN, Zengler K, Palsson BO: In silico method for modelling metabolism and gene product expression at genome scale. Nat Commun. 2012, 3: 929-10.1038/ncomms1928.View ArticlePubMedGoogle Scholar
- Shlomi T, Cabili MN, Herrgard MJ, Palsson BÃ, Ruppin E: Network-based prediction of human tissue-specific metabolism. Nat Biotechnol. 2008, 26: 1003-1010. 10.1038/nbt.1487.View ArticlePubMedGoogle Scholar
- Gowen CM, Fong SS: Genome-scale metabolic model integrated with RNAseq data to identify metabolic states of Clostridium thermocellum. Biotechnol J. 2010, 5: 759-767. 10.1002/biot.201000084.View ArticlePubMedGoogle Scholar
- Orth JD, Conrad TM, Na J, Lerman JA, Nam H, Feist AM, Palsson BO: A comprehensive genome-scale reconstruction of Escherichia coli metabolism–2011. Mol Syst Biol. 2011, 7: 535-10.1038/msb.2011.65.PubMed CentralView ArticlePubMedGoogle Scholar
- Markowitz VM, Chen IM, Palaniappan K, Chu K, Szeto E, Grechkin Y, Ratner A, Jacob B, Huang J, Williams P, Huntemann M, Anderson I, Mavromatis K, Ivanova NN, Kyrpides NC: IMG: the Integrated Microbial Genomes database and comparative analysis system. Nucleic Acids Res. 2012, 40: D115-D122. 10.1093/nar/gkr1044.PubMed CentralView ArticlePubMedGoogle Scholar
- Hyduke DR, Lewis NE, Palsson BO: Analysis of omics data with genome-scale models of metabolism. Mol Biosyst. 2013, 9: 167-174. 10.1039/c2mb25453k.PubMed CentralView ArticlePubMedGoogle Scholar
- Dwivedi RC, Spicer V, Harder M, Antonovici M, Ens W, Standing KG, Wilkins JA, Krokhin OV: Practical implementation of 2D HPLC scheme with accurate peptide retention prediction in both dimensions for high-throughput bottom-up proteomics. Anal Chem. 2008, 15;80 (18): 7036-42. 10.1021/ac800984n.View ArticleGoogle Scholar
- McQueen P, Krokhin O: Optimal selection of 2D reversed-phase-reversed-phase HPLC separation techniques in bottom-up proteomics. Expert Rev Proteomics. 2012, 9 (2): 125-8. 10.1586/epr.12.8.View ArticlePubMedGoogle Scholar
- Craig R, Beavis RC: TANDEM: matching proteins with tandem mass spectra. Bioinformatics. 2004, 12;20 (9): 1466-7. 10.1093/bioinformatics/bth092.View ArticleGoogle Scholar
- Feist AM, Herrgard MJ, Thiele I, Reed JL, Palsson BO: Reconstruction of biochemical networks in microorganisms. Nat Rev Microbiol. 2009, 7: 129-143. 10.1038/nrmicro1949.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.