In order to reduce time and efforts to develop microbial strains with better capability of producing desired bioproducts, genome-scale metabolic simulations have proven useful in identifying gene knockout and amplification targets. Constraints-based flux analysis has successfully been employed for such simulation, but is limited in its ability to properly describe the complex nature of biological systems. Gene knockout simulations are relatively straightforward to implement, simply by constraining the flux values of the target reaction to zero, but the identification of reliable gene amplification targets is rather difficult. Here, we report a new algorithm which incorporates physiological data into a model to improve the model’s prediction capabilities and to capitalize on the relationships between genes and metabolic fluxes.
We developed an algorithm, flux variability scanning based on enforced objective flux (FVSEOF) with grouping reaction (GR) constraints, in an effort to identify gene amplification targets by considering reactions that co-carry flux values based on physiological omics data via “GR constraints”. This method scans changes in the variabilities of metabolic fluxes in response to an artificially enforced objective flux of product formation. The gene amplification targets predicted using this method were validated by comparing the predicted effects with the previous experimental results obtained for the production of shikimic acid and putrescine in Escherichia coli. Moreover, new gene amplification targets for further enhancing putrescine production were validated through experiments involving the overexpression of each identified targeted gene under condition-controlled batch cultivation.
FVSEOF with GR constraints allows identification of gene amplification targets for metabolic engineering of microbial strains in order to enhance the production of desired bioproducts. The algorithm was validated through the experiments on the enhanced production of putrescine in E. coli, in addition to the comparison with the previously reported experimental data. The FVSEOF strategy with GR constraints will be generally useful for developing industrially important microbial strains having enhanced capabilities of producing chemicals of interest.
Flux variability scanning based on enforced objective fluxGrouping reaction constraintsPutrescineEscherichia coli
One of the most ambitious goals in metabolic engineering is the design of biological systems based on in silico predictions using mathematical models. The advent of high-throughput technologies and the completion of genome sequencing for many organisms have led to an explosion of systems-wide biological data [1, 2]. Genome-scale stoichiometric models of the increasing number of microorganisms and mammalian cells have been developed at the moment [3, 4]. Some of such models have been used to identify gene knockout targets for the efficient production of important industrial chemicals, including amino acids [5, 6] and chemicals that are conventionally derived from petroleum [7–9]; other such models have been used to identify drug targets in pathogens [10–12]. In modeling and simulation approaches, target reactions whose knockout is predicted to overproduce the chemical of interest can be easily tested experimentally by deleting the corresponding genes in the microbial host.
Increasing the expression levels of the relevant genes has also been successfully employed for the overproduction of target chemicals [13, 14]. To avoid unnecessarily massive experiments to be performed, several computational algorithms have been devised in an effort to reveal the relationship between metabolic reactions and the biological properties of interest [15–27]; however, the identification of gene amplification targets is more complicated than the identification of gene knockout targets; hence, correlations among the genes, mRNAs, transcriptional or translational regulations, proteins, and metabolic fluxes must be carefully examined. Genome-scale metabolic models that rely on constraints-based flux analysis without additional physiological information are limited in their ability to describe the complex nature of biological systems, particularly biological phenomena beyond metabolism. Several systematic methods have been developed to overcome such limitations: flux variability analysis (FVA) [17, 19–21], flux coupling analysis [16–18], flux sensitivity analysis , flux response analysis , OptReg , genetic design through local search , OptForce , and flux scanning based on enforced objective flux (FSEOF) . In particular, FSEOF is a method that first scans and searches for variations in the metabolic fluxes in response to the enforced fluxes directed towards a target product. Reactions were then selected as amplification targets, the flux values of which increased in accordance with the enforced fluxes toward the production of a target chemical. This method was experimentally validated by identifying amplification targets that improved the production of lycopene in Escherichia coli. These approaches demonstrated that incorporating physiological constraints during the model simulation are critical to identifying trustworthy gene amplification targets, but much improvement is still needed [24, 28]. One of the major problems is the existence of a too large flux solution space in optimization problems.
In this study, in order to systematically handle the large flux solution spaces, as also revealed in the implementation of FSEOF , we considered functionally grouped reactions that simultaneously carry fluxes based on unique features of microbial genomes. Considering such functionally grouped reactions helps reducing the number of and selecting multiple solutions existing for each optimal objective value, enabling to identify more reliable gene amplification targets when combined with FSEOF. Grouped reactions were previously revealed by genomic context and flux-converging pattern analyses as promising constraints . Genomic context analysis interrogates conserved neighborhood, gene fusion, and co-occurrence using a STRING database with the goal of suggesting groups of reaction fluxes that are most likely correlated in their on/off activities [28, 29]. Flux-converging pattern analysis further limits the range of possible flux values in a metabolic reaction by examining the number of carbon atoms in metabolites that participate in the reactions and the converging patterns of fluxes from a carbon source (see Methods and Figure 1) . Consequently, flux balance analysis (FBA) with constraints controlling simultaneous on/off activity (Con/off) and the flux scale (Cscale) of the metabolic reactions accurately predicted flux distributions in gene knockout mutant strains .
Based on these analyses, the grouping reaction (GR) constraints that constrain reactions to co-carry fluxes altogether regardless of the condition were incorporated into the E. coli genome-scale metabolic model. The model then facilitated the scanning of changes in the variability among metabolic fluxes using FVA in response to the enforced enhancement of the fluxes toward a target chemical. This newly developed method, called flux variability scanning based on enforced objective flux (FVSEOF) with GR constraints, was employed in this study to identify gene amplification targets for the production of target chemicals. FVSEOF with GR constraints was first validated based on amplification targets reported for the production of shikimic acid and putrescine in E. coli, and then further validated by actually engineering E. coli for the enhanced production of putrescine based on new amplification targets.
E. coli genome-scale metabolic model
EcoMBEL979 was used throughout this study , which is a slightly modified version of the genome-scale E. coli metabolic network model, iJR904 . EcoMBEL979 contains 814 metabolites (144 extracellular metabolites and 670 intermediates) and 979 metabolic reactions, along with a biomass equation derived from the E. coli biomass composition .
Constraints-based flux analysis
The stoichiometric relationships among the metabolites and the reactions of the E. coli genome-scale metabolic model were balanced under the pseudo-steady state assumption. The balanced reaction model was almost always underdetermined in calculations of the flux distribution due to insufficient measurements of the extracellular fluxes. Thus, the unknown fluxes within the metabolic reaction network were calculated by linear programming-based optimization using an objective function that maximized the growth rate, subject to constraints pertaining to mass conservation and reaction thermodynamics , This optimization problem can be mathematically formulated as follows:
where Sij represents the stoichiometric coefficient for metabolite i in reaction jνj is the flux of reaction jJ is the set of all reactions, and bi is the net transport flux of metabolite i. If this metabolite is an intermediate, bi is equal to zero. αj and βj are the lower and upper bounds of the flux of reaction j, respectively. Herein, the flux of any irreversible reaction is considered to be positive; a negative flux indicates the reverse direction of a reaction.
Grouping reaction (GR) constraints based on the genomic context and flux-converging pattern analyses
The algorithm introduced in this study, FVSEOF with GR constraints, starts with formulation of GR constraints, which are based on the genomic context and flux-converging pattern analyses (Figure 1). Briefly, genomic context and flux-converging pattern analyses aim at grouping functionally related reactions. Such functionally related reactions were constrained to be on or off simultaneously (Figure 1) . First, reactions were grouped using STRING database that performs genomic context analysis, including conserved neighborhood, gene fusion, and co-occurrence [28, 29]. Simultaneous on/off constraint (Con/off) can be described as follows:
where y( v1) and y( v2) indicate binary variables (on or off) of a certain reaction 1 and 2, respectively.
Each reaction is then given a CxJy index, determined by flux-converging pattern analysis. Cx and Jy denote the total number of carbon atoms in metabolites that participate in each reaction and the number of passing flux-converging metabolites, respectively. Here, it should be noted that cofactors were not considered because the flux scales are controlled by the carbon number of primary metabolites, not cofactors, according to 13C-based flux analysis . For Jy, the flux-converging metabolites indicate metabolites at which two pathways split by another metabolite converge. Jy has four types, including JAJBJC, and JD, depending on the characteristics of flux-converging metabolites. Subscript of Jy denotes the passing number of flux-converging metabolites, counting zero, one, two, or three times for the flux coming from a carbon source. In some cases, the subscript E is placed next to the subscripts of A, B, C, or D to indicate the fluxes derived from pyruvate, which causes more complex changes in flux distributions. The values of CxJy for each reaction were assigned based on possible flux routes reaching from glucose, and are partitioned by a slash. Based on this analysis, another constraint Cscale, indicating the flux scale of a reaction, can be given to the metabolic reactions. First, terms used to describe the flux scale of the reaction are as follows:
where indicates the carbon number involved in a reaction j the number of the passing of the flux through the flux-converging metabolite near reaction j, and NC,Rj the total number of carbon of primary metabolites without cofactors in reaction j.
If reaction 1 and 2 were predicted to be in the same functional unit according to the genomic context analysis, and their and are equivalent, Cscale is applied to these two reactions, which is defined as follows:
where vn1 and vn2 are the normalized flux of reaction 1 and 2, obtained by dividing each reaction flux by the carbon source uptake rate, such as glucose. δ is the constant defining the flux level of reactions in this functional unit; the value of δ is recommended as 0.3.
Flux variability scanning based on enforced objective flux (FVSEOF) with grouping reaction (GR) constraints
Once grouping reaction constraints are defined, FVSEOF with GR constraints is subsequently performed as follows (Figure 2). First, the initial or theoretical minimum () and theoretical maximum () of the target product formation rates were calculated; these were implemented by setting the objective function as minimizing and maximizing the target product formation rate using constraints-based flux analysis with GR constraints. This can be formulated as follows:
Min/Max = or
where indicates the initial or minimal point of the flux value constrained for the target bioproduct, while indicates the maximal flux value for the bioproduct. li and ui are the lower and upper bound for the net transport flux of metabolite i, respectively, and is the carbon source uptake rate.
Second, the cell growth rate, Z( vbiomass), was maximized while gradually increasing the target product formation rate from its initial (or minimal) flux value to its near theoretical maximum: () . The is an additional constraint provided during this stage of the constraints-based flux analysis; it starts with the initial value plus one nth of the difference between the and , and is increased to a value adjacent to in k steps.
Third, FVA was carried out with GR constraints by maximizing or minimizing the fluxes of all intracellular reactions, Z( vintracellular reaction), with additional constraints: the enforced production rate of the target bioproduct, which varied from its initial to maximum values in 10 steps, and 95% optimal cell growth rate, vbiomass = 0.95 · Z( vbiomass)opt, for each step. The attainable flux ranges of intracellular reactions for each step were subsequently subjected to the targeting criteria introduced in the following section.
FVSEOF with GR constraints was calculated using mixed integer nonlinear programming with the DICOPT solver, subject to the constraints including GR constraints, mass conservation and reaction thermodynamics.
Flux bias, its slope and flux capacity as targeting criteria
Flux bias (Vavg), its slope (qslope) and flux capacity (lsol) were employed as targeting criteria for the initial set of gene amplification targets predicted from FVSEOF with GR constraints (Figure 2 and 3). Among them, Vavg and lsol were determined as follows in order to effectively investigate the changes of flux variabilities for genetic perturbations :
The and indicate the maximal and minimal flux values for a reaction under the given condition. The lsol indicates the difference between the maximal and minimal flux values for a reaction. qslope was calculated using linear regression of the flux values for a reaction towards the gradually maximized product formation rate.
Bacterial strains and plasmids
The E. coli strains used in this study are listed in the Additional file 1. The XQ52 strain, a putrescine producer, was used as a base strain . E. coli TOP10 was used for gene cloning studies. The plasmid p15SpeC containing a strong tac promoter was used as an expression vector. The plasmid p15SpeC was constructed from the pTac15K plasmid by cloning the speC gene (encoding ornithine decarboxylase in the putrescine biosynthetic pathway) into the site between the EcoRI and SacI restriction enzyme sites of pTac15K. The plasmid contained a kanamycin resistance selective marker. Cells were grown in Luria–Bertani (LB) broth or on plates containing appropriate antibiotics at 37°C for the construction of strains and plasmids. Antibiotics were added at following concentrations: 50 μg/mL ampicillin, 25 μg/mL kanamycin, and 35 μg/mL chloramphenicol.
The plasmids used in this study are listed in the Additional file 1. Polymerase chain reaction (PCR) primers for the gene cloning studies conducted here are listed in the Additional file 2. Pfu DNA polymerase was purchased from Solgent (Daejeon, Korea). Restriction enzymes and T4 DNA ligase were obtained from New England Biolabs (Ipswich, MA) and Roche (Mannheim, Germany), respectively. The genomic DNA of E. coli W3110 was amplified to overexpress the target genes using the Pfu polymerase and PCR primers (Additional file 2). The PCR product was then digested with SacI and XbaI, and ligated into p15SpeC at the same restriction sites downstream of the tac promoter .
Batch cultivation was conducted at 37°C in a 6.6 L jar fermentor (Bioflo 3000; New Brunswick Scientific Co., Edison, NJ) containing 2 L R/2 medium supplemented with 10 g/L glucose and 3 g/L (NH4)2SO4. The R/2 medium (pH 6.8) contained (per liter): 2 g (NH4)2HPO4, 6.75 g KH2PO4, 0.85 g citric acid, and 0.7 g MgSO4·7H2O. In addition, 5 mL/L of a trace metal stock solution  was added. The trace metal solution contained per liter of 5 M HCl: 10 g FeSO4·7H2O, 2.25 g ZnSO4·7H2O, 1 g CuSO4·5H2O, 0.5 g MnSO4·5H2O, 0.23 g Na2B4O7·10H2O, 2 g CaCl2·2H2O, and 0.1 g (NH4)6Mo7O24. One milliliter of the overnight culture was transferred into a 300 mL Erlenmeyer flask containing 50 mL of the R/2 medium at 37°C and spun at 220 rpm in a shaking incubator (JEIOTech. Co. SI-900R). After obtaining an initial OD600 of 0.3, the seed cultures (200 mL) were introduced into the bioreactor for batch cultivation. The culture pH was maintained at 6.8 by the addition of 6 M KOH. The dissolved oxygen concentration was maintained at 20% air saturation by automatically adjusting the agitation speed. Under the comparable batch culture conditions, the single gene-overexpressing strains based on the E. coli XQ52 strain harboring p15SpeC, denoted as XQ52 (p15SpeC), with each target gene were tested by flask cultivation in duplicate using R/2 medium supplemented with 10 g/L glucose at 37 °C.
Cell growth was estimated by measuring the optical density at 600 nm (OD600) using an Ultrospec 3000 spectrophotometer (Amersham Biosciences, Uppsala, Sweden). Glucose concentrations were measured using a glucose analyzer (model 2700 STAT; Yellow Springs Instrument, Yellow Springs, OH, USA). The concentrations of glucose and organic acids were determined by high-performance liquid chromatography (ProStar 210; Varian, Palo Alto, CA) equipped with UV/visible light (ProStar 320; Varian, Palo Alto, CA) and refractive index (Shodex RI-71, Tokyo, Japan) detectors. A MetaCarb 87H column (300 by 7.8 mm; Varian) was eluted isocratically with 0.01 NH2SO4 at 60°C at a flow rate of 0.4 mL/min.
The putrescine concentration was determined by derivatizing putrescine with o-phthaldialdehyde (OPA; Sigma, St. Louis, MO), and the o-phthaldialdehyde derivative was detected by high-performance liquid chromatography (1100 Series HPLC, Agilent Technologies, Palo Alto, CA) with UV detection, as described previously . The OPA derivatization reagent was prepared as described previously [34, 36, 37]. Following the addition of the OPA reagent, the mixture was filtered through a 0.2 mm PVDF syringe filter (Whatman, Maidstone, UK), and the filtrate was immediately injected into the HPLC. A SUPELCO C18 column (cat# 504955; 5μm, 150 mm x 4.6 mm) was operated at 25°C with a 0.8 mL/min mobile phase flow rate. The mobile phase consisted of solution A (55% methanol in 0.1 M sodium acetate, pH 7.2) and solution B (methanol). The following gradient was applied (values given in vol%): 1–6 min, 100% A; 6–10 min, linear gradient of B from 0% to 30%; 10–15 min, linear gradient of B from 30% to 50%; 15–19 min, linear gradient of B from 50% to 100%; 19–23 min, 100% B; 23–25 min, linear gradient of B from 100% to 30%; 25–28 min, linear gradient of B from 30% to 0% . The derivatized putrescine was detected at a wavelength of 230 nm using a variable wavelength detector (G1314A, Agilent Technologies).
Results and discussion
FVSEOF with GR constraints
Functionally related reactions can be grouped by genomic context and flux-converging pattern analyses . Several reactions appeared to be related with one another based on genomic context analysis of conserved neighborhoods, gene fusions, and co-occurrence [28, 29]. Flux-converging pattern analysis narrows the range of plausible flux values for metabolic reactions by examining the number of carbon atoms in metabolites that participate in reactions and the converging patterns of fluxes from a carbon source . By controlling the simultaneous on/off activity (Con/off) and flux scale (Cscale) of the metabolic reactions, based on FBA with GR constraints, the flux distributions in gene knockout mutants were accurately predicted . In this study, the GR constraints were further applied to reactions related to the biosynthesis of a target chemical to improve the model accuracy (Figure 1 and the Additional file 3).
FVSEOF with GR constraints was implemented as follows (Figure 2). First, the theoretical minimal and maximal flux values for the target product formation were calculated using constraints-based flux analysis by minimizing and maximizing the target product formation rate with GR constraints. Second, again with GR constraints, the cell growth rate was maximized while gradually increasing the constraint value for the target product formation rate (our objective function which is artificially enforced) from a minimal to a theoretical maximum, as calculated from the first step. Finally, FVA was conducted by maximizing and minimizing the fluxes of all intracellular reactions under additional constraints, including GR constraints, the enforced production rate of the target chemical varied from a minimal to a maximal value, and a 95% optimal growth rate constraint for each step. The attainable flux ranges for the intracellular reactions were calculated under the imposed constraints for each of the three steps.
Criteria for selecting gene amplification targets
Initial simulation results for FVSEOF with GR constraints were filtered based on rational criteria in an effort to select only the most effective amplification targets. The most important criterion was to identify gene amplification targets, the fluxes of which increased with the flux directed toward the target chemical. This procedure was implemented with quantitative values of qslopeVavg, and lsol (Figures 2 and 3). The flux bias (Vavg) and flux capacity (lsol) indicate an average value for the maximal and minimal flux values and the length of the attainable flux ranges for a reaction, respectively . Finally, the gene amplification candidates were evaluated by calculating the slope (qslope) of the Vavg flux for each metabolic reaction using linear regression analysis (Figure 2). Changes in the patterns of the reaction fluxes in response to incrementally increasing fluxes toward a target product were categorized into nine types based on combinations of positive and negative changes in Vavg and lsol in order to facilitate the identification of amplification targets (Figure 3). Types 1, 2, and 3 displayed positive correlations with the amplification candidates for the production of a target chemical; types 4, 5, and 6 displayed negative correlations with the amplification candidates. Finally, types 7, 8, and 9 displayed no clear correlations with the production of a target chemical, based on Vavg (Figure 3). The reaction sets that were positively correlated (type 1, 2, and 3) were initially selected as amplification candidates.
Reaction sets belonging to type 1, 2 and 3, showing positive correlations with the enforced fluxes toward a target chemical, can then be further divided into strongly and weakly positive reactions (Figure 3 and Additional file 4). This step also allows narrowing down the candidates of gene amplification targets. The strongly positive reactions display a continuously increasing Vavg and a positive qslope in response to the enhanced production of a target chemical, whereas weakly positive reactions show the same pattern, except for the presence of a partially negative qslope (Additional file 4). Certainly, strongly positive reactions deserve primary attention as potential gene amplification targets.
The potential gene amplification candidates were prioritized by considering the lsol value, which indicates the length between the maximal and minimal flux values of a metabolic reaction. Among the reactions that were positively correlated with the desired product, reactions with smaller values of lsol received higher priorities because these reactions were more likely to display the predicted flux values than reactions with larger values of lsol. A final list of gene amplification targets obtained from the above procedure was then selected based on biological knowledge.
Implementation of FVSEOF with GR constraints for shikimic acid production in E. coli
FVSEOF with GR constraints was employed to identify gene amplification targets for the enhanced production of an important aromatic chemical shikimic acid in E. coli (Figure 2). Shikimic acid is a key metabolic intermediate in the aromatic amino acid biosynthetic pathway. Shikimic acid and its derivatives are industrially important starting compounds for the production of several chemicals, such as phenols, herbicides, antibacterial agents, and the neuramidase inhibitor Tamiflu used for the treatment of influenza infections [38, 39]. FVSEOF with GR constraints predicted that 11 reaction fluxes in the glycolysis (glk and pps), pentose phosphate pathway ( rpi, talAB, and tktAB), and the shikimic acid biosynthetic pathway ( aroBaroDaroEaroFaroG, and aroH) were potential amplification targets. The amplification of aroBaroDaroEaroFaroGaroHtalABtktAglk, and pps genes [38–46], which are the amplification targets predicted by FVSEOF with GR constraints, was previously reported to enhance the production of shikimic acid. The previous FSEOF method without FVA and GR constraints could not identify pps gene as an amplification target. FSEOF results without FVA and GR constraints did not show notable fluxes among metabolic reactions controlled by the pps gene in response to the enforced shikimic acid production rate; however, the FVSEOF method with GR constraints correctly predicted the pps gene as one of the amplification targets beneficial for the accumulation of phosphoenolpyruvate, an important precursor for the production of shikimic acid from pyruvate. This consistency partly demonstrated the power of utilizing FVA and GR constraints for predicting reliable amplification targets by FSEOF. In practice, the overexpression of phosphoenolpyruvate synthase encoded by the pps gene also increased the yield of precursors for the production of shikimic acid [43, 45]. Thus, this strategy enabled the successful identification of gene amplification targets for the enhanced production of a primary metabolite, shikimic acid, in E. coli, in accordance with previous literature reports.
Implementation of FVSEOF with GR constraints for enhanced putrescine production in E. coli
The general applicability of FVSEOF with GR constraints was examined by applying the method to putrescine production in E. coli. Putrescine (1,4-diaminobutane) is an important industrial precursor for the synthesis of polymers, pharmaceuticals, surfactants, and certain additives . We confirmed the validity of the newly predicted gene amplification targets by comparison with the genes engineered in the previously reported putrescine-producing E. coli XQ52 (p15SpeC) strain . FVSEOF with GR constraints predicted potential gene amplification targets among the reactions involved in glycolysis (enopgm, gapA, fbaAB, tpiA, pgk, pykAF, and glk), TCA cycle ( icdacnAacnB, and gltA), putrescine biosynthesis ( gdhAargAargB, argC, argD, argE, speC, and speF), and other pathways ( ackA and ppc). The predicted amplification targets ( argB, argC, argD, argE, speC, and speF) involved in the putrescine biosynthetic pathway were consistent with the mutations introduced in the E. coli XQ52 (p15SpeC) strain, as described in the previous report .
The genes predicted to be relevant to the putrescine biosynthetic pathway were expected based on the pathway knowledge, and were intuitively obvious; hence, we focused on the effects associated with amplifying the predicted gene targets involved in other metabolic pathways in order to more rigorously validate FVSEOF with GR constraints. Accordingly, each of the predicted amplification targets was examined one by one by amplifying the gene dosage in the E. coli XQ52 (p15SpeC) strain (see Methods). Among these genes, the glk, acnA, acnB, ackA, and ppc gene, five out of the sixteen target genes, were found to attain increased putrescine yield when they were individually amplified in the E. coli XQ52 (p15SpeC) strain (Additional file 5). These strains, initially examined using flask cultivation, were further validated by batch cultivation at 37°C under aerobic condition (Figure 4 and see the Additional files 1 and 2). The recombinant E. coli XQ52 (p15SpeC) strain additionally expressing the glk, acnA, acnB, ackA, and ppc genes resulted in the production of 2.23, 1.90, 1.89, 2.04, and 2.06 g/L of putrescine, respectively, which are 20.5 % more than that (1.68 g/L) produced by the control strain on average (Figure 4). The yields of putrescine obtained with these strains were 0.223, 0.190, 0.189, 0.204, and 0.206 g putrescine per g glucose, which are again higher than that (0.168 g putrescine per g glucose) obtained with the control strain. Thus, FVSEOF with GR constraints could be successfully used to identify non-obvious gene amplification targets that enhance the production of putrescine in E. coli.
Other gene amplification targets identified by FVSEOF with GR constraints, which did not affect putrescine production in flask cultivation, also deserve discussion. These false-positive hits are most likely involved in biological processes that were not accurately captured in the genome-scale metabolic model. These ineffective genes, including enopgmgapAfbaABtpiApgk, and pykAF genes in glycolysis, and icd and gltA genes in TCA cycle might have been associated with transcriptional and translational regulations because the direct correlation between gene expressions and the metabolic fluxes was not observed. The fact that some of obvious gene amplification targets, such as icd gene responsible for biosynthesis of α-ketoglutarate, seem to be resistant to gene manipulations indicates that other biological variables may affect the effects of the gene amplifications. Potential variables include the plasmid copy number, gene dosage, optimal gene expression, and the gene expression method, either plasmid-based overexpression or chromosomal integration . Although we improved the accuracy of the predicted gene targets by imposing GR constraints, these factors should be carefully considered in any implementation of the FVSEOF method with GR constraints .
FVSEOF with GR constraints, which is an upgraded version of the FSEOF method, allows for the in silico identification of fluxes to be amplified for the enhanced production of target products. This method was conducted through the analysis of trends in reaction flux variability in response to varying the flux of target chemical production from initial to maximal flux values under GR constraints. The confidence with which amplification targets are identified may be increased by incorporating physiological data. This approach involves grouping functionally related reactions based on their genomic context and flux-converging pattern analyses. The interaction data may be obtained easily from public databases, and subjected to GR constraints. FVA was also performed to overcome the problems associated with multiple solutions for an optimal objective value. FVSEOF with GR constraints was shown to suggest successful metabolic engineering strategies (in particular, gene amplification) for the production of shikimic acid and putrescine in E. coli. In conclusion, the strategy reported here should be generally useful for developing industrial strains that display enhanced production of a target chemical.
Flux balance analysis
Flux scanning based on enforced objective flux
Flux variability analysis
Flux variability scanning based on enforced objective flux
This work was supported by the Technology Development Program to Solve Climate Changes (systems metabolic engineering for biorefineries) from the Ministry of Education, Science and Technology (MEST) through the National Research Foundation of Korea (NRF-2012-C1AAA001-2012M1A2A2026556).
Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 program), Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, Korea Advanced Institute of Science and Technology (KAIST)
BioInformatics Research Center, KAIST
Department of Bio and Brain Engineering and BioProcess Engineering Research Center, KAIST
Lee SY, Lee DY, Kim TY: Systems biotechnology for strain improvement.Trends Biotechnol 2005, 23:349–358.PubMedView Article
Joyce AR, Palsson BO: The model organism as a system: integrating 'omics' data sets.Nat Rev Mol Cell Biol 2006, 7:198–210.PubMedView Article
Kim HU, Kim TY, Lee SY: Metabolic flux analysis and metabolic engineering of microorganisms.Mol Biosyst 2008, 4:113–120.PubMedView Article
Kim TY, Sohn SB, Kim YB, Kim WJ, Lee SY: Recent advances in reconstruction and applications of genome-scale metabolic models.Curr Opin Biotechnol 2011, 23:617–623.PubMedView Article
Lee KH, Park JH, Kim TY, Kim HU, Lee SY: Systems metabolic engineering of Escherichia coli for L-threonine production.Mol Syst Biol 2007, 3:149.PubMedView Article
Park JH, Lee KH, Kim TY, Lee SY: Metabolic engineering of Escherichia coli for the production of L-valine based on transcriptome analysis and in silico gene knockout simulation.Proc Natl Acad Sci U S A 2007, 104:7797–7802.PubMedView Article
Lee SJ, Lee DY, Kim TY, Kim BH, Lee J, Lee SY: Metabolic engineering of Escherichia coli for enhanced production of succinic acid, based on genome comparison and in silico gene knockout simulation.Appl Environ Microbiol 2005, 71:7880–7887.PubMedView Article
Bro C, Regenberg B, Forster J, Nielsen J: In silico aided metabolic engineering of Saccharomyces cerevisiae for improved bioethanol production.Metab Eng 2006, 8:102–111.PubMedView Article
Yim H, Haselbeck R, Niu W, Pujol-Baxley C, Burgard A, Boldt J, Khandurina J, Trawick JD, Osterhout RE, Stephen R, et al.: Metabolic engineering of Escherichia coli for direct production of 1,4-butanediol.Nat Chem Biol 2011, 7:445–452.PubMedView Article
Jamshidi N, Palsson BO: Investigating the metabolic capabilities of Mycobacterium tuberculosis H37Rv using the in silico strain iNJ661 and proposing alternative drug targets.BMC Syst Biol 2007, 1:26.PubMedView Article
Kim HU, Kim TY, Lee SY: Genome-scale metabolic network analysis and drug targeting of multi-drug resistant pathogen Acinetobacter baumannii AYE.Mol Biosyst 2010, 6:339–348.PubMedView Article
Kim HU, Kim SY, Jeong H, Kim TY, Kim JJ, Choy HE, Yi KY, Rhee JH, Lee SY: Integrative genome-scale metabolic analysis of Vibrio vulnificus for drug targeting and discovery.Mol Syst Biol 2011, 7:460.PubMed
Jensen PR, Hammer K: The sequence of spacers between the consensus sequences modulates the strength of prokaryotic promoters.Appl Environ Microbiol 1998, 64:82–87.PubMed
Koffas MA, Jung GY, Stephanopoulos G: Engineering metabolism and product formation in Corynebacterium glutamicum by coordinated gene overexpression.Metab Eng 2003, 5:32–41.PubMedView Article
Delgado J, Liao JC: Inverse flux analysis for reduction of acetate excretion in Escherichia coli.Biotechnol Prog 1997, 13:361–367.PubMedView Article
Burgard AP, Nikolaev EV, Schilling CH, Maranas CD: Flux coupling analysis of genome-scale metabolic network reconstructions.Genome Res 2004, 14:301–312.PubMedView Article
Puchalka J, Oberhardt MA, Godinho M, Bielecka A, Regenhardt D, Timmis KN, Papin JA: Martins dos Santos VA: Genome-scale reconstruction and analysis of the Pseudomonas putida KT2440 metabolic network facilitates applications in biotechnology.PLoS Comput Biol 2008, 4:e1000210.PubMedView Article
Bundy JG, Papp B, Harmston R, Browne RA, Clayson EM, Burton N, Reece RJ, Oliver SG, Brindle KM: Evaluation of predicted network modules in yeast metabolism using NMR-based metabolite profiling.Genome Res 2007, 17:510–519.PubMedView Article
Khannapho C, Zhao H, Bonde BK, Kierzek AM, Avignone-Rossa CA, Bushell ME: Selection of objective function in genome scale flux balance analysis for process feed development in antibiotic production.Metab Eng 2008, 10:227–233.PubMedView Article
Bushell ME, Sequeira SI, Khannapho C, Zhao H, Chater KF, Butler MJ, Kierzek AM, Avignone-Rossa CA: The use of genome scale metabolic flux variability analysis for process feed formulation based on an investigation of the effects of the zwf mutation on antibiotic production in Streptomyces coelicolor.Enzyme Microb Technol 2006, 39:1347–1353.View Article
Mahadevan R, Schilling CH: The effects of alternate optimal solutions in constraint-based genome-scale metabolic models.Metab Eng 2003, 5:264–276.PubMedView Article
Pharkya P, Maranas CD: An optimization framework for identifying reaction activation/inhibition or elimination candidates for overproduction in microbial systems.Metab Eng 2006, 8:1–13.PubMedView Article
Choi HS, Lee SY, Kim TY, Woo HM: In silico identification of gene amplification targets for improvement of lycopene production.Appl Environ Microbiol 2010, 76:3097–3105.PubMedView Article
Kim J, Reed JL: OptORF: Optimal metabolic and regulatory perturbations for metabolic engineering of microbial strains.BMC Syst Biol 2010, 4:53.PubMedView Article
Lun DS, Rockwell G, Guido NJ, Baym M, Kelner JA, Berger B, Galagan JE, Church GM: Large-scale identification of genetic design strategies using local search.Mol Syst Biol 2009, 5:296.PubMedView Article
Jung YK, Kim TY, Park SJ, Lee SY: Metabolic engineering of Escherichia coli for the production of polylactic acid and its copolymers.Biotechnol Bioeng 2010, 105:161–171.PubMedView Article
Ranganathan S, Suthers PF, Maranas CD: OptForce: an optimization procedure for identifying all genetic manipulations leading to targeted overproductions.PLoS Comp Biol 2010, 6:e1000744.View Article
Park JM, Kim TY, Lee SY: Prediction of metabolic fluxes by incorporating genomic context and flux-converging pattern analyses.Proc Natl Acad Sci U S A 2010, 107:14931–14936.PubMedView Article
Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, Doerks T, Julien P, Roth A, Simonovic M, et al.: STRING 8–a global view on proteins and their functional interactions in 630 organisms.Nucleic Acids Res 2009, 37:D412–416.PubMedView Article
Lee SY, Woo HM, Lee D-Y, Choi HS, Kim TY, Yun H: Systems-level analysis of genome-scale microbial metabolisms under the integrated software environment.Biotechnol Bioproc Eng 2005, 10:425–431.View Article
Reed JL, Vo TD, Schilling CH, Palsson BO: An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR).Genome Biol 2003, 4:R54.PubMedView Article
Neidhardt FC, Umbarger HE: Chemical composition ofEscherichia coli. In Escherichia coli and Salmonella: cellular and molecular biology. 2nd edition. Edited by: Neidhardt FC, Curtiss R. ASM Press, Washington, D.C; 1996:13–16.
Park JM, Kim TY, Lee SY: Constraints-based genome-scale metabolic simulation for systems metabolic engineering.Biotechnol Adv 2009, 27:979–988.PubMedView Article
Qian ZG, Xia XX, Lee SY: Metabolic engineering of Escherichia coli for the production of putrescine: a four carbon diamine.Biotechnol Bioeng 2009, 104:651–662.PubMed
Lee SY, Chang HN: High cell density cultivation of Escherichia coli W using sucrose as a carbon source.Biotechnol Lett 1993, 15:971–974.View Article
Onal A: A review: Current analytical methods for the determination of biogenic amines in foods.Food Chem 2007, 103:1475–1486.View Article
Yildirim HK, Uren A, Yucel U: Evaluation of biogenic amines in organic and non-organic wines by HPLC OPA derivatization.Food Technol Biotechnol 2007, 45:62–68.
Kramer M, Bongaerts J, Bovenberg R, Kremer S, Muller U, Orf S, Wubbolts M, Raeven L: Metabolic engineering for microbial production of shikimic acid.Metab Eng 2003, 5:277–283.PubMedView Article
Johansson L, Lindskog A, Silfversparre G, Cimander C, Nielsen KF, Linder G: Shikimic acid production by a modified strain of E. coli (W3110.shik1) under phosphate-limited and carbon-limited conditions.Biotechnol Bioeng 2005, 92:541–552.PubMedView Article
Knop DR, Draths KM, Chandran SS, Barker JL, von Daeniken R, Weber W, Frost JW: Hydroaromatic equilibration during biosynthesis of shikimic acid.J Am Chem Soc 2001, 123:10173–10182.PubMedView Article
Knaggs AR: The biosynthesis of shikimate metabolites.Nat Prod Rep 2003, 20:119–136.PubMedView Article
Chandran SS, Yi J, Draths KM, von Daeniken R, Weber W, Frost JW: Phosphoenolpyruvate availability and the biosynthesis of shikimic acid.Biotechnol Prog 2003, 19:808–814.PubMedView Article
Escalante A, Calderon R, Valdivia A, de Anda R, Hernandez G, Ramirez OT, Gosset G, Bolivar F: Metabolic engineering for the production of shikimic acid in an evolved Escherichia coli strain lacking the phosphoenolpyruvate: carbohydrate phosphotransferase system.Microb Cell Fact 2010, 9:21.PubMedView Article
Lu JL, Liao JC: Metabolic engineering and control analysis for production of aromatics: Role of transaldolase.Biotechnol Bioeng 1997, 53:132–138.PubMedView Article
Patnaik R, Liao JC: Engineering of Escherichia coli central metabolism for aromatic metabolite production with near theoretical yield.Appl Environ Microbiol 1994, 60:3903–3908.PubMed
Yi J, Draths KM, Li K, Frost JW: Altered glucose transport and shikimate pathway product yields in E. coli.Biotechnol Prog 2003, 19:1450–1459.PubMedView Article
Keasling JD: Gene-expression tools for the metabolic engineering of bacteria.Trends Biotechnol 1999, 17:452–460.PubMedView Article
Thiele I, Jamshidi N, Fleming RM, Palsson BO: Genome-scale reconstruction of Escherichia coli's transcriptional and translational machinery: a knowledge base, its mathematical formulation, and its functional characterization.PLoS Comput Biol 2009, 5:e1000312.PubMedView Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.