 Research article
 Open Access
 Published:
Modelling overflow metabolism in Escherichia coli with flux balance analysis incorporating differential proteomic efficiencies of energy pathways
BMC Systems Biology volume 13, Article number: 3 (2019)
Abstract
Background
The formation of acetate by fastgrowing Escherichia coli (E. coli) is a commonly observed phenomenon, often referred to as overflow metabolism. Among various studies that have been carried over decades, a recent work (Basan, M. et al. Nature 528, 99–104, 2015) suggested and validated that it is the differential proteomic efficiencies in energy biogenesis between fermentation and respiration that lead to the production of acetate at rapid growth conditions, as the consequence of optimally allocating the limited proteomic resource. In the current work, we attempt to incorporate this newly developed proteome allocation theory into flux balance analysis (FBA) to capture quantitatively the extent of overflow metabolism in different E. coli strains.
Results
A concise constraint was introduced into a FBAbased model with three proteomic cost parameters to represent constrained allocation of proteome over two energy (respiration and fermentation) pathways and biomass synthesis. Linear relationships were shown to exist between the three proteomic cost parameters. Tests with three different strains revealed that the proteomic cost of fermentation was consistently lower than that of respiration. A slowgrowing strain appeared to have a higher proteomic cost for biomass synthesis than fastgrowing strains. Different assumed levels of carbon flowing into pentose phosphate pathway affected the absolute value of model parameters, but had no qualitative impact on the comparative proteomic costs. For the prediction of biomass yield, significant errors that occurred for one of the tested strains (ML308) were rectified by adjusting the cellular energy demand according to literature data.
Conclusions
With the aid of a concise proteome allocation constraint, our FBAbased model is able to quantitatively predict the onset and extent of the overflow metabolism in various E. coli strains. Such prediction is enabled by three linearlycorrelated (as opposed to uniquely determinable) proteomic cost parameters. The linear relationships between these parameters, when determined using data from cell culturing experiments, render biologically meaningful comparative proteomic costs between fermentation and respiration pathways and between the biomass synthesis sectors of slow and fastgrowing species. Simultaneous prediction of acetate production and biomass yield in the overflow region requires the use of reliable cellular energy demand data.
Background
The formation of acidic byproducts, predominantly acetate, when Escherichia coli (E. coli) grows under aerobicglucose conditions is a commonly observed phenomenon, which has been extensively studied over decades [1,2,3,4,5]. Lee reviewed 19 studies of recombinant E. coli where acetate was accumulated in fedbatch systems [6]. It has been reported that the portion of glucose converted into acetate can be as high as 15% [7], representing a seemingly huge waste of feedstock. The accumulation of acetate in the culture medium appears to be a major limiting factor for achieving high cell density [8], which is particularly severe in the growth of recombinant strains [9]. Acetate also impairs the microbial production of recombinant proteins [1] and drug precursors [9]. These complications of acetate in bioreactors thus call for elucidation of acetatepertinent metabolic processes. A similar phenomenon has been observed in tumour cells (Warburg effect) [10,11,12]. The associated mathematical models for explaining the Warburg effect have recently been reviewed [13].
Traditionally, the aerobic formation of acetate has been referred to as overflow metabolism: the excess glucose saturates or inhibits the tricarboxylic acid (TCA) cycle, which subsequently forces the cell to modulate the redundant carbon to the acetate pathway [3, 14]. However, the study by Molenaar et al. suggested that the overflow metabolism as shown in the growth phenotype is probably a result of the global allocation of cellular resources, where the enzyme efficiency and the pathway yield were both taken into account to obtain the optimal growth strategies subject to different growth conditions [15]. Later in 2015, Basan et al. proposed and validated that the overflow metabolism in E. coli originates from the global physiological proteome allocation for rapid growth [16]. In particular, the proteomic efficiency of energy biogenesis through aerobic fermentation was found to be higher than that of respiration; this difference in proteomic efficiency between fermentation and respiration appears to play a central role in dictating the degree of overflow metabolism in E. coli.
Given the importance of the overflow metabolism, several phenomenological models were developed to depict this effect [5, 8, 17, 18], where the prediction of acetate excretion was dictated by a combination of (i) the constraints on oxygen and carbon supply and (ii) cellular mass and energy balance. Later, models that adopt conventional regulatory mechanisms of acetate metabolism [14], such as oxygen limitation, carbon source availability and tight regulation of cofactor pools were evaluated [19], with an attempt to explain the metabolic shifts from a fully aerobic mode to the aerobic acetate fermentation (overflow). More recently, constraintbased metabolic models [20] were established to analyse the optimal cellular growth strategy, incorporating principles of (i) limitation in the cellular resource on the maximal attainable growth rate, such as the maximum cytoplasmic density adopted by FBAwMC [21, 22] and the finite amount of resource to be allocated between metabolic network and ribosomes, as applied in RBA [23,24,25], (ii) metabolic regulation based on enzyme kinetic information, such as mechanistically detailed descriptions of gene expression and the synthesis of functional macromolecules used in MEModel [26] and (iii) membrane occupancyderived competition between glucose transporters and respiration chain (an extension of MEModel) [27]. The major target of these models is to predict the maximum cellular growth rate. Predictions were validated quantitatively by the experimental data, while the overflow metabolism in fastgrowing phase was mostly captured in a qualitative way. In addition, it was pointed out [16, 28] that cell volumes were empirically found to vary widely with virtually constant densities across different growth conditions [29], which suggests that the cytoplasmic densitybased constraint might not be fully justified.
Inspired by a recent experimental work studying the proteomic cost of the core metabolic pathways of E. coli [16], a model named constrained allocation flux balance analysis (CAFBA) [28] managed to predict the rates of acetate production in the overflow metabolism for different E. coli strains, with good quantitative agreement with experimental data. However, the proteomic costs adopted in CAFBA were applied to individual metabolic reactions, without focusing on the exploration of the critical role played by specific metabolic modules such as energy biogenesis pathways.
In this work, we attempt to depict the overflow metabolism in various E. coli strains with quantitative accuracy, i.e. predicting aerobic steadystate rates of acetate production at different growth rates and validating the model with experimental data in literature. In particular, we adopt a concise proteome allocation constraint as identified by Basan et al. [16], referred to as the Proteome Allocation Theory (PAT) in this work. The PAT suggests that the choice of energy biogenesis pathways under different growth conditions results from the discrepancy of proteomic efficiencies between fermentation and respiration. E.coli cells tend to use the more proteinefficient fermentation pathway to generate energy in order to accommodate the high proteomic demand in biosynthesis under rapid growth. The key concepts of PAT are fully embedded and realised in our model. With a parsimonious, PATbased metabolic model capable of accurately capturing the overflow metabolism, we further analyse the interdependency between pathwaylevel proteomic cost parameters, the disparity in these parameters between different E. coli strains, and the impact of cellular energy demand on the accuracy of the coprediction of the overflow metabolism and the biomass yield on substrate.
Methods
Formulation of the PAT constraint
Following Basan et al. [16], the fractions of three proteome sectors in the entire proteome (i.e. the total protein content) of the cell sum to unity:
where ϕ_{f} and ϕ_{r} are the fractions of the fermentation and respirationaffiliated enzymes, respectively, which enable the fluxes for energy generation; ϕ_{BM} represents the fraction of the remaining part of the proteome enabling other cellular activities, broadly referred to as the sector of biomass synthesis [16, 28].
More specifically, ϕ_{f} represents the mass abundance of the enzymes that carry fermentation fluxes involved in glycolysis (glucose to acetylCoA), oxidative phosphorylation and acetate synthesis pathways (phosphotransacetylase and acetate kinase). ϕ_{r} comprises all the enzymes that catalyse the respirationassociated reactions in glycolysis, tricarboxylic acid (TCA) cycle and oxidative phosphorylation system. Same as in Basan et al. [16], in this work we adopt the linear dependences assumed in Hui et al. [30] to relate ϕ_{f} and ϕ_{r} with the fermentation and respiration fluxes respectively,
where v_{f} (v_{r}) is the fermentation (respiration) pathway flux, which in this work is represented by the enzymatic reaction “acetate kinase ACKr” (“2oxogluterate dehydrogenase AKGDH”); w_{f} (w_{r}) is the pathwaylevel proteomic cost, denoting the proteome fraction required per unit fermentation (respiration) flux.
On the biomass synthesis sector, ϕ_{BM} corresponds to the remaining part of proteome that is not covered by the fermentation and respiration sectors, including ribosomal proteins and anabolic enzymes (the major part, referred to as biomass synthesis), catabolic enzymes and cellular maintenance proteins. Motivated by the observed linear dependency between growth rate and proteome fraction for biomass synthesis [30,31,32], the following linear relationship is assumed:
where bλ is the growth rateassociated component with λ being the specific growth rate and the constant b quantifying the proteome fraction required per unit growth rate. In Basan et al. [16], ϕ_{0} was considered as a growth rate independent constant.
Combining Eqs. (1)–(3), we have
Equation (4) implies that the sum of the three proteomic cost terms on the lefthand side remains constant. However, when the growth rate (and hence the fermentation and respiration fluxes) becomes very low, it is difficult to envisage numerically how this sum could still remain at a constant level. In fact, in Basan et al. [16] (see its Supplementary Information), it was acknowledged that at growth rates lower than that corresponding to the onset of the overflow phenomenon, the proteome sectors would no longer be constrained by the equality indicated by Eq. (4). This suggests that across the entire range of possible growth rates, ϕ_{0} is unlikely a growth rate independent constant: it may remain at a constant (and minimum) level in the overflow region where the proteomic resource is stretched, but become growthrate dependent (and larger) at lower growth rates outside the overflow region, i.e. ϕ_{0, min} ≤ ϕ_{0} ≤ 1, where ϕ_{0, min} is a true constant. Defining ϕ_{max} ≡ 1 − ϕ_{0, min}, Eq. (4) then becomes
In Vazquez and Oltvai (2016) [33], ϕ_{0} was also interpreted as a variable instead of a constant, with a (nonzero) minimum value. For simplicity, both sides of Eq. (5) is divided by ϕ_{max}, leading to the final form of the proteome constraint adopted in this work, referred to as PAT constraint from this point on:
where \( {w}_f^{\ast}\equiv {w}_f/{\phi}_{max} \), \( {w}_r^{\ast}\equiv {w}_r/{\phi}_{max} \) and b^{∗} ≡ b/ϕ_{max}. \( {w}_f^{\ast } \), \( {w}_r^{\ast } \) and b^{∗} are referred to as the proteomic cost parameters.
Predicting the acetate flux
Flux balance analysis (FBA) [20] is used to determine the optimal flux distribution under different growth condition, with a set of constraints:
maxf_{obj}, subject to
where f_{obj} is the assumed cellular objective. We specified minimizing substrate uptake as the objective function because in this study the commonly used objective ‘growth rate’ was used as the model input (with acetate production as the model output). S is the stoichiometric matrix defined by the metabolic model; v is a column vector comprising the reactions/fluxes described in the metabolic network; v^{L} and v^{U} represent the lower and upper limits of the reactions, respectively. The inequality constraint (iii) is same as Eq. (6) introduced earlier.
The prediction of the extent of overflow metabolism (rate of acetate production) requires of the parameter values of \( {w}_f^{\ast } \), \( {w}_r^{\ast } \) and b^{∗} in the third constraint (PAT constraint) of Eq. (7). We show in the next section that these three parameters cannot be uniquely determined by the experimentally measured growth rateacetate production profile alone. A set of values for these parameters was randomly chosen from mathematically equivalent sets (see Additional file 1: Table S1). The PATbased FBA was run at different growth rates under aerobicglucose conditions. FBA was carried out using the core E. coli metabolic model [34], referred to as the core model in the rest of the paper. The optimal flux distribution was solved via COBRA toolbox [35] in MATLAB (R2016a). LP solution was determined by Gurobi 6.0. Detailed model descriptions such as uptake bounds and flux regulations are given in Additional file 2, sections 1 and 2.
Interdependency of proteomic cost parameters
A linear relationship was previously shown to hold between the fermentation (or respiration) flux and steady state growth rates in the overflow region [16]:
where, as introduced earlier, v_{f} is the fermentation flux (referred to as “acetate line” in [16]); v_{r} is the respiration flux. k_{f} (k_{r}) and v_{f, 0} (v_{r, 0}) are constants representing the slope and intercept of the fermentation (respiration) line. Substituting Eq. (8) and Eq. (9) into Eq. (6), with the equal sign held for the overflow condition:
Equation (10) holds for any growth rate (λ) in the overflow region, which requires
Equations (11) and (12) indicate that (i) there is a linear relationship between the fermentation and respiration proteomic cost parameters \( {w}_r^{\ast } \) and \( {w}_f^{\ast } \), and (ii) the third growthrate dependent proteomic cost parameter b^{∗} is a linear combination of \( {w}_r^{\ast } \) and \( {w}_f^{\ast } \), thus b^{∗} also possesses a linear relationship with \( {w}_f^{\ast } \) (or \( {w}_r^{\ast } \)).
If experimental data exist that allow for both the fermentation line and the respiration line to be plotted (such as the steady state growth rate – acetate excretion and growth rate – CO_{2} revolution data given in [16]), their slopes and intercepts, appearing in Eqs. (11) and (12), can be obtained. However, the three proteomic cost parameters cannot be uniquely determined by the two equations, although specific values of similar parameters have previously been derived from measured cellular protein compositions [16].
In this work, k_{f} and v_{f, 0} were directly determined from the experimentally measured growth rateacetate excretion profile (data sources are shown in Fig. 1). To our knowledge, no directly experimental data were available for the rate of intracellular respiration. Alternatively we took the growth rateacetate profile as the input of FBA (setting the objective function to the minimisation of glucose uptake) to estimate the respiration flux at each data point, which was subsequently used to determine k_{r} and v_{r, 0}. Flux variability analysis (FVA) was conducted which confirmed that all the relevant fluxes used in the model were uniquely determined. After obtaining k_{f} (k_{r}) and v_{f, 0} (v_{r, 0}), a set of values of \( {w}_f^{\ast } \), \( {w}_r^{\ast } \) and b^{∗} can be determined by arbitrarily specifying the value for one of the parameters. In this work, we took \( {w}_f^{\ast } \) to be specified, in a range of [0, 0.11] for MG1655 and [0, 0.07] for ML308 and NCM3722. This thus allows us to present the parameter estimation results in the form of \( {w}_f^{\ast } \)\( {w}_r^{\ast } \) and \( {w}_f^{\ast } \)b^{∗} plots. Simulation results presented in this paper were obtained with a randomly chosen value of \( {w}_f^{\ast } \) within the ranges mentioned above and the correspondingly determined values of \( {w}_r^{\ast } \) and b^{∗}. Note that different values chosen for \( {w}_f^{\ast } \) yielded identical simulation results (Additional file 2: Figures S9S17).
Adjustment of cellular energy demand
We observed discrepancy in biomass yield between model predictions and the experimental data, especially for ML308 (Fig. 5), and hypothesised the reason being the inaccuracy of cellular energy demand assumed by the core model when applied to this particular species growing under nonoverflow and overflow conditions. To test this hypothesis, we collected the growth data in the overflow region for ML308 (Table 7 in [3]) and found that the reported steady state ATP production rate was lower than what the core model suggested. Therefore, we decided to remodel the cellular energy demand by subtracting the surplus portion, reducing it to the strainspecific values reported in the literature (Additional file 1: Table S3).
The original cellular energy demand embedded in the core model is quantified by
where r_{ATP, nominal} is the overall ATP consumption rate (equivalent to the ATP production rate at steady state); subscript “nominal” indicates the default specification of the core model; v_{GLNS} and v_{PFK} are the fluxes of the enzymatic reactions glutamine synthetase and phosphofructokinase (one mole ATP is required per mole flux of each reaction); (ATPM + σλ) denotes the maintenance energy required for nonmetabolic processes, where ATPM corresponds to the nongrowthassociated maintenance (NGAM) and σ to the growthassociated maintenance (GAM) [36].
The adjusted cellular energy demand is formulated as
where S(λ) is the offset energy, i.e. the amount of energy overpredicted by the core model. The mathematical analysis of the growth data of ML308 suggested that in the overflow region, the offset energy is linearly related with the growth rate (R^{2} = 0.9998):
where k and c are constants. Substituting Eqs. (13) and (15) into Eq. (14), we have
For simplicity, we define M ≡ ATPM − c and N ≡ σ − k; M and N are referred to as the (adjusted) maintenance parameters. The adjustment of the cellular energy demand in our FBA is achieved by manipulating the maintenance energy, more specifically, through the maintenance parameters. Note that in the case where growth energetic data is not available (i.e. for MG1655 and NCM3722), such adjustment was not possible, therefore the default maintenance parameter values were used (see Table 1).
Alternative pathways in central metabolism
The model constructed in this work considers only the central metabolism of E. coli as detailed in the E. coli core model. The energy biogenesis pathways in the model consist of glycolysis (the EMP pathway), the TCA cycle, the acetate pathway (PTAACKA) and the terminal oxidative phosphorylation system. However we noted the existence of alternative pathways in the central carbon metabolism, which include the EntnerDoudoraff (ED) pathway, the pentose phosphate (PP) pathway and the more recently explored PEPglyoxylate cycle [37].
ED pathway
The ED pathway was found to be three to fivefold more proteinefficient than the EMP pathway to achieve the same glycolytic flux [38], which provides a clear rationale for the utilisation of the ED pathway in a number of bacteria, e.g. Sinorhizobium meliloti, Rhodobacter sphaeroides, Zymomonas mobilis, and Paracoccus versutus [39]. However, Flamholz et al. acknowledged that E. coli, which is capable of using both the ED and the EMP pathways, tends to use the latter. Flux measurements also suggest that the usage of the ED pathway by E.coli K12 is minimal: only about 2% of glucose catabolism proceeds by means of the ED pathway in batch cultures [40] and about 6% in miniscale chemostats [41]. Furthermore, the activity of the ED pathway was detected only under slow to mildgrowing conditions [40, 41]. To our knowledge, no activity of ED pathway in E. coli has been reported under fastgrowing scenarios.
PEPglyoxylate cycle
As for the PEPglyoxylate cycle, similar to the ED pathway, its usage was identified to be significant only under slowgrowing conditions. Not even a trace activity of the PEPglyoxylate cycle was found in wildtype batch cultures or more rapidly growing chemostats [37]. Furthermore, the flux comparison between the aceApckA knockout strain and the sucC knockout strain [16] verifies that compared to the TCA cycle, the alternative PEPglyoxylate cycle plays a less significant role in glucoselimited fastgrowing cultures of E. coli.
Based on above literature evidences, this work, focusing on the overflow metabolism that occurs at fastgrowing cultures of E.coli with relatively sufficient substrate availability, has taken the assumption that the use of alternative ED pathway and PEPglyoxylate cycle is negligible compared to the glycolysis (i.e. EMP pathway) and the TCA cycle.
PP pathway
The PP pathway, on the other hand, can function as a significant alternative to the upper part of glycolysis for carbon catabolism in E. coli. Previous studies showed that the carbon flow through the PP pathway could reach 20–35% of the total carbon intake and can vary with different growth rates [40, 41] hence neither a constant portion of carbon is diverted into PP pathway nor this portion of carbon flux negligible. The uncertainty embedded in the PP pathway flux motivates us to study the impact of different portion of substrate carbon allocated between the upper part of the EMP pathway and the PP pathway on the proteomic cost parameters and the model predictions. More details can be found in the Additional file 2, section 3.
We define the PP pathway ratio (PPP%) as the portion of substrate carbon directing to PP pathway to the total carbon intake:
where 6phosphogluconolactonase (PGL) is chosen to represent the PP pathway flux as it is a major and also the beginning enzymatic reaction in the pentose phosphate shunt; EX_glc(e) is the exchange reaction denoting glucose uptake rate. In our simulation, PPP% was controlled by setting the upper bound of the portion of carbon that is directed into the PP pathway, denoted as uPPP%, with the aid of an auxiliary term DM_PPP_RATIO:
Results
E.coli MG1655, NCM3722 and ML308 have been selected as the model strains in this work and are referred to as MG, NCM and ML respectively from this point on. In this section, the simulated acetate excretion pattern is presented against experimental data to demonstrate the accuracy of the model prediction. Subsequently we elucidate the linear interdependency of the proteomic cost parameters. In particular, we reveal the similarities and differences between the three E. coli strains. With respect to the PP pathway ratio (PPP%), previous studies on the slowgrowth strain MG show that the portion of carbon that goes into the PP pathway can be approximately 20% of the total carbon intake [40, 41]. In this work, we set the upper bound of the carbon flowing into PP pathway (uPPP%) to 25, 35 and 40% to investigate the potential effect of the change in PPP% on proteomic cost parameters and model prediction (more justification is provided in Additional file 2, section 3). On cellular energy demand, we refer to the original energy demand specified in the core model [34] as nominal energy demand, and present first the set of results which were generated on this basis. Subsequently, we show how an adjusted energy demand (particularly applied to ML, referred to as MLnew) affects the patterns of the estimated proteomic cost parameters and the accuracy of biomass yield prediction.
Model prediction of overflow metabolism with nominal energy demand
Here, the accuracy of the predicted acetate excretion rate is compared with experimental data. Variation in the proteomic cost parameters with the changed carbon level diverted into the PP pathway is also presented.
Model prediction of acetate production
Figure 1 shows that model prediction of the pattern of acetate excretion is in good agreement with the experimental observations for three different E. coli strains. The onset of the production of acetate is concomitant with the drop in the respiratory flux, indicating a switch between fullyrespiration to respirationfermentation mode. As the growth rate further increases, the acetate flux becomes dominant while the extent of respiration is gradually diminishing. It is worth noting that zero acetate production was commonly observed at low growth rates of different strains [3, 16, 41, 42]; To emphasise the (strainspecific) acetate production pattern, we only collect the data with nonzero acetate production. For all the strains, data involving growth rates lower than those presented in Fig. 1 are associated with nondetectable acetate excretion, hence are not shown here.
Linear relationships between proteomic cost parameters
When the nominal energy demand is adopted (indicated by “nom”), the change of uPPP% leads to insignificant changes to the \( {w}_r^{\ast }{w}_f^{\ast } \) line for each strain (Fig. 2). Between different strains, MG and NCM share nearly identical lines. The lines of MLnom deviate from those of the former two, but not significantly (although this closeness will be altered with the adjusted energy demand, see Fig. 2 MLnew and the section below). In any case, \( {w}_r^{\ast } \) is clearly higher than the corresponding \( {w}_f^{\ast } \), implying that respiration has a higher (lower) proteomic cost (efficiency) than fermentation for energy production, which is consistent with what was derived from protein abundances data for comparable parameters in [16].
To inspect the insignificant disparity in the \( {w}_r^{\ast }{w}_f^{\ast } \) lines when all strains use the nominal energy demand, Eq. (12) is rearranged to
The slope and intercept of the \( {w}_r^{\ast }{w}_f^{\ast } \) line are dictated by \( \frac{v_{f,0}}{v_{r,0}} \) and \( \frac{1}{v_{r,0}} \), respectively. v_{f, 0} can be determined directly by the experimental measurement of acetate production. v_{r, 0}, on the other hand, is a result of the combination of (measured) rates of acetate production and the mass and energy balance structure of the metabolic model.
For a specific strain, v_{f, 0} only depends on the pattern of acetate excretion, not affected by assumed level of uPPP%. Therefore, the impact of uPPP% on the \( {w}_r^{\ast }{w}_f^{\ast } \) line is through affecting the value of v_{r, 0}, which turns out to be rather moderate. Between different strains, the ratio of v_{f, 0} and v_{r, 0} and the value of v_{r, 0} are nearly identical between MG and NCM, regardless of the level of uPPP% adopted, resulting in the very much overlapped pattern of the \( {w}_r^{\ast }{w}_f^{\ast } \) relationship between MG and NCM. For ML, the value of the slope is slightly smaller than MG and NCM, while the intercept is about 25% larger (as shown in Additional file 1: Table S2). Figure 2 also suggests that the proteomic cost (efficiency) of respiration pathways for ML is higher (lower) than that for MG and NCM, regardless the modification in the energy demand.
Compared to the \( {w}_r^{\ast }{w}_f^{\ast } \) relationship, that of \( {b}^{\ast }{w}_f^{\ast } \) appears to be affected by the level of uPPP% more visibly (Fig. 3). Between different species, the difference is also more pronounced, and closeness is present between the two rapidgrowth strains NCM and ML (as presented in Additional file 2: Figure S1).
For a specific strain, the increase of uPPP% gradually moves the \( {b}^{\ast }{w}_f^{\ast } \) line to the right (yellow arrow, Fig. 3), corresponding to an increase in b^{∗} (blue arrow, Fig. 3). This trend can be explained by inspecting a rearrangement of Eq. (11):
Equation (20) suggests that the shift of the \( {b}^{\ast }{w}_f^{\ast } \) line results from the change in the respiratory flux (note the intercept, \( \frac{k_r}{v_{r,0}} \)). In E. coli, the PP pathway and the TCA cycle are two major sources for the production of NADPH [40]. At a given growth rate, the amount of NADPH needed for cell growth is fixed based on the mass and energy balance. As uPPP% increases, more carbon is predicted to enter the PP pathway. In the model simulation, an increase in the amount of NADPH produced via PP pathway would force a drop of the flux into the TCA cycle in order to maintain the constant total production rate of NADPH. The reduction in the TCA flux in turn manifests in a lower v_{r}. In the overflow region, Eq. (6) is bounded by the equal sign. As shown earlier, the level of uPPP% has a negligible impact (when nominal energy demand is adopted) on the \( {w}_r^{\ast }{w}_f^{\ast } \) line. Also recall that the relation between the rate of acetate excretion and steady state growth rate is fixed by the experimentally measured growth data. With all the other quantities (\( {w}_f^{\ast } \), v_{f}, \( {w}_r^{\ast } \) and λ) fixed in Eq. (6), the drop in v_{r} due to the increase in uPPP% will necessarily be accompanied by an increase in b^{∗}.
Between different strains, b^{∗} varies significantly. In particular, b^{∗} for MG is remarkably larger than that of NCM and ML (as presented in Additional file 2: Figure S1). This disparity can again be explained by Eq. (6). For a certain value of \( {w}_f^{\ast } \), \( {w}_r^{\ast } \) is rather similar among different strains (with nominal energy demand) as shown by Fig. 2. In the overflow region, the respiration flux v_{r} of MG is much smaller than the others (see Fig. 1), which thus leads to a lower value of the \( {w}_r^{\ast }{v}_r \) term for MG than NCM and ML. As the value of the \( {w}_f^{\ast }{v}_f \) term (for any selected value of \( {w}_r^{\ast } \)) is similar between these strains, due to their similarity in the relationship between \( {w}_r^{\ast } \) and \( {w}_f^{\ast } \), the value of the remaining term on the lefthand side of Eq. (6), b^{∗}λ, must be higher for MG than for the other two strains. On the other hand, in the overflow region and at a same acetate excretion rate v_{f}, the growth rate of MG has been shown to be much lower than that of NCM and ML. Now, a higher value of b^{∗}λ coupled with a lower value of λ will undoubtedly lead to a higher value of b^{∗}for MG, compared to the other two strains.
The above mathematical explanation in fact coincides with the known biological fact that the inverse of b^{∗} is proportional to the rate of protein synthesis [31]: the slower the rate of protein synthesis, the higher the value of b^{∗}. Thus for the slowgrowing strain MG, it is expected to have a higher value of b^{∗}compared to the fastgrowing strains NCM and ML.
Predicted evolution of PP pathway flux
The results presented above show rather moderate impact of the upper limit of PP pathway ratio (uPPP%) on the linear interdependency of the proteomic cost parameters. With an interest in the FBA solution of the flux distribution in PP pathway (at different growth rates), simulation results were recorded for three strains with uPPP%set to 35%; other uPPP% levels displayed a similar trend (as presented in Additional file 2: Figures S4 and S5). Flux variability analysis (FVA) [43] was performed to confirm that the trend of PPP% presented here was unique.
In general, PPP% gradually increases with the growth rate. Two turning points can be observed, which divide the whole curve into three distinct phases (Fig. 4a). A close inspection of the model simulation revealed that the variation of the predicted PP pathway ratio was corelated particularly with three fluxes, namely NAD transhydrogenase (NADTRHD), transketolase (TKT2) and NADP transhydrogenase (THD2).
In phase I, only NADTRHD is active, with zero fluxes for both TKT2 and THD2. The enzymatic reaction NADTRHD functions to convert NADPH into NADH. Thus in phase I, it is likely that the amount of NADPH produced exceeds the required amount for biosynthesis; NADTRHD is thus activated to consume the surplus NADPH.
In phase II, an on/off swap occurs between NADTRHD and TKT2 while THD2 still remains silent. We infer that in this phase, NADPH produced satisfies the demand, but the amount of carbon flowing into the PP pathway surpasses the rate of the carbon withdrawal (for the synthesis of biomass precursors). Therefore, TKT2 is activated to direct the extra amount of fourcarbon and fivecarbon compounds back to the glycolysis.
In phase III, THD2 is finally switched on and becomes significantly active in the highgrowthrates region. TKT2 increases progressively while NADTRHD remains silent. It is presumed that in this phase, as the growth rate becomes higher, more NADPH is required for biomass synthesis. NADP transhydrogenases (THD2) is activated to produce NADPH needed in rapid growth. The surplus carbon flux in the PP pathway, which might result from the high glucose uptake rate at a high growth rate, is directed back to glycolysis via TKT2.
It would be desirable to verify the theoretical prediction of the evolution of PPP% with experimental measurements, which unfortunately have not been widely reported in the literature. Nevertheless, Fig. 4b shows a comparison with one set of experimental observations available [42], which suggests a good degree of qualitative similarity.
Adjusting cellular energy demand improves the prediction of biomass yield
Although combining the PAT constraint with the core model succeeded in predicting the rates of acetate production, the accuracy in biomass yield varied and was especially unsatisfactory for ML strain (Fig. 5). A similar deficiency in yield prediction was also reported in [28]. Focusing on the yield, two features can be observed: (i) in the overflow region, for a fixed growth rate (associated with an acetate excretion rate) the biomass yield for ML is higher than MG and NCM; and (ii) the rate of the drop in yield (i.e. the slope) of ML is sharper than the other two strains.
Intuitively, feature (i) suggests that in ML, the amount of energy required per unit mass of biomass formation should be less than NCM or MG. Therefore we collected the growth data of ML and remodelled the cellular energy demand (see Methods).
It is worth noting that for ML, the negative value of M (Table 1) clearly indicates a constrained applicable range of the maintenance parameters, i.e. valid only within the overflow region. As growth rate decreases, if M stays unchanged, the overall energy consumption (Eq. (16)) will drop to a negative value, which is clearly not biologically feasible. This then implies a certain degree of nonlinearity in the global relationship between (total or maintenance) energy requirement and growth rate. Such proposition was previously referred to as “varied nongrowthassociated maintenance” [3]. Nonlinearity in energy consumption manifesting before and after the onset of the overflow metabolism has also been observed and discussed in a recent work [44].
Model prediction of biomass yield with adjusted energy demand
We first reestimated the set of proteomic cost parameters for ML with the adjusted energy demand (see Table 1 and Methods). Applying updated values of \( {w}_f^{\ast } \), \( {w}_r^{\ast } \) and b^{∗} together with the adjusted maintenance energy, our model is now able to effectively capture the unique trend of biomass yield for ML, without any compromise in the accuracy of predicting acetate excretion (Fig. 6). Simulation results for ML with adjusted energy demand are referred to as “MLnew”.
It is worth noting that, our model also succeeds in matching the elevated reduction in the yield of ML in the overflow region as the growth rate increases. This captured trend appears to originate from the low energy demand of ML. Approximately, the yield reduction rate can be considered as being proportional to the ratio of the increase in the acetate excretion (ac_{2} − ac_{1}) and the increase in glucose uptake rate (glc_{2} − glc_{1}), while the growth rate rises from λ_{1} to λ_{2}:
NCM and ML exhibit similar acetate excretion rates, hence a similar value in “ac_{2} − ac_{1}”. However, the energy demand per unit growth of ML is much lower than NCM, which means that with a similar increase in acetate production, the increase in substrate intake (i.e. glc_{2} − glc_{1}) for ML will be lower than NCM to achieve a given increment in the growth rate. According to Eq. (21), the yield reduction rate of ML will thus be higher than NCM.
Impact of the adjusted energy demand on \( {w}_r^{\ast }{w}_f^{\ast } \) and \( {b}^{\ast }{w}_f^{\ast } \) relationships
To investigate the impact of the change in cellular energy demand on the linear relationships of \( {w}_f^{\ast } \), \( {w}_r^{\ast } \) and b^{∗} of MLnew, we recalculated constants k_{f}, v_{f, 0}, k_{r} and v_{r, 0} at different uPPP% values (25, 35 and 40%) to update the linear equations describing \( {w}_r^{\ast }{w}_f^{\ast } \) line and \( {b}^{\ast }{w}_f^{\ast } \) (Eqs. (11) and (12)). The resulting \( {w}_r^{\ast }{w}_f^{\ast } \) lines for MLnew are plotted in Fig. 2, together with the results obtained earlier for MG/NCM/ML with nominal energy demand.
The switch to the adjusted energy demand makes the \( {w}_r^{\ast } \) value for MLnew much higher than that of MLnom, the latter being rather close to those of the MGnom and NCMnom. This implies that the adjustment of the energy demand of ML leads to an enlarged gap in the proteomic efficiency between respiration and fermentation.
The similarity among MG/NCM−/MLnom has already been discussed in the previous section. Here we mainly focus on the discrepancy with MLnew. We found that both the slope and intercept of \( {w}_r^{\ast }{w}_f^{\ast } \) line for MLnew are about three times larger than MLnom (as shown in Additional file 1: Table S4). The dramatic changes in the slope and intercept of MLnew predominantly result from the reduction in the respiratory flux v_{r} when applying the adjusted energy demand (see Fig. 7 and Additional file 2: Figure S3).
The link between the drop in v_{r} and the increase in \( {w}_r^{\ast } \) has been discussed in the previous section. The results presented herein indicate that it is the energy demand that plays a major role in distinguishing the \( {w}_r^{\ast }{w}_f^{\ast } \) relationship between different strains, not the uPPP% or the acetate excretion pattern.
Applying the adjusted energy demand also has an impact on the relationship between b^{∗} and \( {w}_f^{\ast } \). As shown in Fig. 8, the \( {b}^{\ast }{w}_f^{\ast } \) lines are significantly rightshifted when the model is changed from MLnom to MLnew (i.e. red lines are located in a much right area than yellow lines). Given the identical pattern of acetate excretion between MLnom and MLnew (as both predicted the same set of experimental data), the amount of energy produced through fermentation remains unchanged. For MLnew, as the energy demand per unit of growth is much lower than that of the nominal strain, the respiratory flux v_{r} must decrease significantly to avoid energy overproduction, as confirmed in Fig. 7. Although the value of \( {w}_r^{\ast } \) for MLnew is higher than that for MLnom (for a given value of \( {w}_f^{\ast } \), Fig. 2), the value of the product \( {w}_r^{\ast }{v}_r \) for MLnew still becomes lower (as the increase in \( {w}_r^{\ast } \) is not able to compensate for the sharp drop in v_{r}). With no change in \( {w}_f^{\ast }{v}_f \) between MLnew and MLnom (for a given \( {w}_f^{\ast } \)), Eq. (6) again dictates b^{∗} to become higher for MLnew than MLnom, hence the rightshifting of the \( {b}^{\ast }{w}_f^{\ast } \) lines.
In the case of MLnom, it was shown earlier in Fig. 3 that the increase in uPPP% would lead to a reduction of v_{r}, which in turn would lead to an increase in b^{∗} or rightshifting of the \( {b}^{\ast }{w}_f^{\ast } \) line. Now for MLnew, an enlarged gap between the \( {b}^{\ast }{w}_f^{\ast } \) lines at different uPPP% levels is observed compared to the case of MLnom. This implies that the effect of v_{r} reduction due to the increase in uPPP% is more pronounced with the adjusted energy demand.
Discussion
Comparison with relevant models
In the study by Basan et al. [16], which has been the basis of the PAT constraint formulated in our model, the application of a constraint on proteome fractions (similar to Eq. (4)) with parameters derived from measured protein abundances was able to accurately predict the patterns of acetate excretion for E. coli under different growth conditions, when coupled with a simple energy balance equation. In this work, we have embedded the PAT into the core metabolic model of E. coli, taking the advantage of the latter in offering more rigorous modelling of intracellular mass and energy balances. Furthermore, the constraintbased metabolic model allows prediction of detailed metabolic fluxes as opposed to merely acetate production, which could provide more insights about metabolic pathways in connection with the overflow metabolism and pave the way for investigating acetate excretion in junction with possible manipulations of the metabolic network.
In addition, the respiratory flux in Basan’s work was associated with the carbon dioxide produced in respiration, termed J_{CO2, r}, whose value was deduced by subtracting the fermentationdependent CO_{2} and the growthdependent CO_{2} from the total CO_{2} production. As such, J_{CO2, r} could not directly correspond to a specific flux in the metabolic network. In our model, the respiratory flux directly refers to a specific flux within the TCA cycle (AKGDH), which appears to be a convenient choice when the PAT is embedded into FBA. Using a constraintbased metabolic model that includes the TCA cycle with a reasonable level of detail, the respiration flux can be directly resolved via FBA, without the need for multistep calculation along with different levels of assumptions and uncertainties.
Another important comparison we would like to make is with the recently developed model – CAFBA. It was mentioned in the Background section that unlike CAFBA which considers the proteomic cost of every individual reaction in metabolic network, the PAT constraint in our model follows the treatment of Basan et al.’s work and quantifies the proteomic costs at the pathway level. This simplification allows the model to explicitly incorporate the differential proteomic efficiencies of fermentation and respiration that are proven (in both Basan et al. and this work) to govern the flux split between the two pathways. This simplicity comes at the cost of the limited utility of our model: it is intended to be used only for predicting the interplay between the excretion of acetate (or other fermentation products) and the growth rate during overflow metabolism, not other effects of stressed resource allocation.
The proteome allocation constraint in CAFBA includes a Csector (via a term expressed as w_{c}v_{c}) representing the proteome requirement for uptaking carbon source, which is not explicitly considered in this work. We have ignored this sector as the carbon overflow occurs only at the high growth rate region, where w_{c} (proteomic cost of the Csector) approaches zero at high substrate uptake rates, as shown in CAFBA [28]. In this region, the low value of w_{c} makes the Csector negligible compared with other proteome sectors. At low growth rate region, the value of w_{c} becomes significant, however no acetate is excreted in this region, where the equality relationship in the PAT constraint in our work becomes inactive so that the significance of the Csector becomes irrelevant.
As for the prediction of biomass yield, CAFBA noticed the difficulty in predicting the biomass yield of ML308. In this work, we have found that it is the cellular energy demand that significantly affects the FBA prediction of biomass yield. After replacing the default energy demand with data reported specifically for ML308 strain, our model was able to produce an accurate prediction. Therefore, we consider that it is important to carry out necessary adjustment to the cellular energy demand when applying such a constraintbased modelling approach to specific strains.
Parameterisation of the proteome allocation constraint
The modelling approach proposed in this work can be considered as “halfway” between the coarsegrained proteome allocation model of Basan et al. [16] and the FBA models that incorporate reactionlevel resource allocation constraints such as CAFBA [28] and FBAwMC [21]. In CAFBA, the proteome constraint involves ~ 1000 proteomic cost parameters (w_{i}) for a genomescale model. Similarly in FBAwMC, a large number of crowing coefficients need to be specified. In both cases, the existence of numerous cost parameters originates from associating the resource cost with individual reactions. These parameters conceptually have a clear biological meaning and in principle can be determined experimentally by e.g. proteome measurements or extensive enzyme assays. However, in practice, it has appeared to be difficult to reliably obtain precise values for all the parameters, especially for different strains growing at different growth rates or conditions. In fact, instead of pursuing the exact values for all the individual parameters, CAFBA focused on applying the average value of the proteome fraction invested per unit flux, termed as 〈w〉, to capture the key flux pattern, along with evaluating the impact of possible heterogeneous values of the proteome parameter w_{i} on the model prediction. Similarly, FBAwMC [21, 22] also appears to encounter a certain degree of “randomness” of its crowding coefficients due to the unknown enzyme kinetics and/or turnover numbers. Subsequently, molecularcrowdingbased modelling normally treats this “randomness” as noise, where the crowding coefficients are chosen randomly from a distribution of crowding coefficients [45] or the majority of the crowding coefficients are estimated from a limited number of known enzyme turnover (k_{cat}) values [11].
In this work, we have intended to formulate a constraint with a greatly reduced number of proteomic cost parameters, while still capturing the essence of constrained cellular resource allocation. This is achieved by formulating the proteome allocation constraint at the pathway (as opposed to reaction) level. The proposal is the concise Eq. (6) (\( {w}_f^{\ast }{v}_f+{w}_r^{\ast }{v}_f+{b}^{\ast}\lambda \le 1 \)), involving only three proteomic cost parameters (representing proteomic efficiencies of fermentation, respiration and biomass synthesis pathways). In principle, these parameters can be obtained through the direct measurement of protein abundances, following an approach similar to that adopted by Basan et al.’s work [16]. However, in the current study, we attempted to parameterise this constraint using widely available growth data from cell culturing experiments, in particular growth rate and acetate production rate. It should be noted that cell culturing experiments often yield relatively simple data sets with measurements of a few process variables. It is infeasible to use such data sets to determine a large number of proteomic cost parameters encountered in a proteome constraint expressed at the individual reaction level. Even with the pathwaylevel constraint adopted in this work, our results show that cell growth and acetate production measurements (alone) cannot uniquely determine the three parameters, but two linear relationships between these parameters can be derived (Eqs. (11) and (12)).
Furthermore, our model shows that it is the two linear relationships (but not the absolute values) of the proteomic cost parameters that allow an accurate prediction of the overflow metabolism. We thus speculate that for a FBAbased model, the ability of capturing the overflow behaviour is rendered by (i) an extra constraint representing the constrained proteomic resources and (ii) certain relations or relative magnitudes of the proteomic cost parameters embedded in the proteome constraint. In reality, the proteomic efficiencies of the metabolic pathways may vary (within a certain range), at different points in time or between cells in a population which often exhibits heterogeneity [28]. However, as long as the specific relations or relative magnitudes of these efficiencies are maintained, one can expect that the overflow behaviour will emerge.
Applicability of the linear formulation of the proteomic cost
As indicated in the earlier section, our formulation of the proteomic cost (Eqs. (2a) and (2b)) reflects the observed linear dependency between proteome fraction and growth rate [30,31,32]. Combining this linear dependency with the assumption that the flux processed by a proteome sector i is proportional to the growth ratedependent component of the associated proteome fraction [30], we have related ϕ_{i} linearly with the flux it carries. A similar model is also adopted in CAFBA [28], where the linear proteomeflux relation is derived on the assumption that the substrate concentration is proportional to the flux.
Note that our model is intended specifically for predicting the steadystate overflow metabolism in E. coli under glucoselimited conditions. In some other circumstances, observations not conforming to this relatively simple model have been reported. For example, Goel and his coworkers found hardly any changes in protein levels in anaerobic slowgrowing Lactococcus lactis chemostats, when the cell shifted from a high yield metabolic mode to a low yield metabolic mode with an increased growth rate [46]. In this case, although the metabolic shift in L. lactis is similar to the overflow metabolism observed in E. coli, proteome allocation did not seem to accompany the changes in metabolic fluxes. In a study on yeast’s transient transcript, enzyme and metabolite responses under metabolic perturbation, it is revealed that the reaction rates are jointly regulated by enzyme capacity and metabolite concentration due to the cell’s tendency in sacrificing the local metabolite homeostasis to maintain fluxes and global metabolite homeostasis upon enzyme perturbation [47]. Another yeastbased study also suggests that changes in individual flux are predominantly regulated by the levels of metabolites, not enzymes [48].
The abovementioned experimental observations suggest that the linear proteomeflux relationship modelled in this work might not be applicable to those circumstances. We hypothesise that this might be at least partially due to the differences between E. coli (the target organism of our model) and the organisms with which those observations were made. Besides, our model has been developed to describe the relationship between (i) the observed steadystate global proteome configuration and (ii) the growth rate or the corresponding flux, drawing on evidences and hypothesises from several previous studies [16, 28, 30, 32]. Such a model, being global and coarsegrained, is not intended to represent delicate regulatory mechanisms responsible for the transient metabolic changes to maintain cellular homeostasis under perturbations, and might not be suitable for revealing the local regulatory insights on the key factors dictating the individual reaction rates.
Conclusions
With three different E. coli strains, we have evaluated a new model that integrates a previously proposed proteome allocation theory (PAT) into the constraintbased modelling approach – flux balance analysis (FBA), which predicts the distribution of carbon fluxes between fermentation and respiration due to the differential proteomic efficiencies of the two energy biogenesis pathways. Using a simple proteome allocation constraint, our model allows the accurate prediction of acetate production at different steady state growth rates during overflow conditions (with sufficient oxygen and glucose). The model involves three pathwaylevel proteomic cost parameters linearly interrelated by two equations, which is the consequence of (i) the assumed linear dependency of proteomic costs and the growth rate and (ii) the experimentally observed linear correlation between the fermentation or respiration flux and the growth rate. The nonunique optimal values of the three parameters, or the two linear relationships between them, could be obtained by fitting the model to experimentally measured acetate excretion rates at specific growth rates.
The linear relationships between the parameters were shown to be affected, in varying degrees, by (i) the acetate excretion pattern, (ii) the assumed upper limit of the substrate carbon diverting into PP pathway and (iii) the cellular energy demand. The proteomic cost of the fermentation pathway was estimated always to a lower value than that of the respiration pathway, i.e. \( {w}_f^{\ast }<{w}_r^{\ast } \). The proteomic cost of the biomass synthesis sector was estimated to be higher in a slowgrowing strain that excretes acetate at a lower growth rate, in comparison with the other two fastgrowing strains, i.e. \( {b}_{\mathrm{MG}}^{\ast }>{b}_{\mathrm{NCM}/\mathrm{ML}}^{\ast } \). The estimated values of \( {w}_f^{\ast } \) and b^{∗}both meet qualitatively the expectation from a biological point of view. Furthermore, the relationship between the proteomic efficiencies of fermentation and respiration, i.e. the \( {w}_f^{\ast }{w}_r^{\ast } \) line, was shown to change between different strains most significantly with the cellular energy demand rather than with the pattern of acetate excretion. This \( {w}_f^{\ast }{w}_r^{\ast } \) relationship remained relatively stable when the upper bound of the portion of substrate carbon flowing into PP pathway (uPPP%) varied in the modelling studies. On the other hand, the increase of uPPP% was shown to lead to a visible increase in the estimated proteomic cost of the biomass synthesis sector b^{∗}, which mathematically results from a reduction in the predicted respiration flux.
Finally, and as a general point for constraintbased models, cellular energy demand appeared to have a major impact on the predicted biomass yield; tuning the default energy demand with strainspecific data was shown to be critical in making simultaneously accurate predictions of biomass yield and overflow metabolism.
Overall, this work demonstrates the potential of combining a detailed metabolic model with a coarsegrained, pathwaylevel resource allocation constraint in producing quantitatively accurate predictions of the overflow phenomenon in E coli; similar modelling approaches that feature this type of combination may prove suitable also for other applications.
Abbreviations
 FBA:

Flux balance analysis
 GAM:

Growthassociated maintenance
 NADTRHD:

NAD transhydrogenase
 NGAM:

Nongrowthassociated maintenance
 PAT:

Proteome allocation theory
 PPP%:

Pentose phosphate pathway ratio
 THD2:

NAD(P) transhydrogenase
 TKT2:

Transketolase
 uPPP%:

Upper bound of the pentose phosphate pathway ratio
References
 1.
Eiteman MA, Altman E. Overcoming acetate in Escherichia coli recombinant protein fermentations. Trends Biotechnol. 2006;24:530–6.
 2.
Farmer WR, Liao JC. Reduction of aerobic acetate production by Escherichia coli. Appl Environ Microbiol. 1997;63:3205–10.
 3.
Holms H. Flux analysis and control of the central metabolic pathways in Escherichia coli. FEMS Microbiol Rev. 1996;19:85–116.
 4.
Pan JG, Rhee JS, Lebeault JM. Physiological constraints in increasing biomass concentration of Escherichiacoli B in fedbatch culture. Biotechnol Lett. 1987;9:89–94.
 5.
Xu B, Jahic M, Enfors SO. Modeling of overflow metabolism in batch and fedbatch cultures of Escherichia coli. Biotechnol Prog. 1999;15:81–90.
 6.
Lee SY. High celldensity culture of Escherichia coli. Trends Biotechnol. 1996;14:98–105.
 7.
Holmes WH. The central metabolic pathways of Escherichia coli: relationship between flux and control at a branch point, efficiency of conversion to biomass, and excretion of acetate. Curr Top Cell Regul. 1986;28:69–105.
 8.
Luli GW, Strohl WR. Comparison of growth, acetate production, and acetate inhibition of Escherichia coli strains in batch and fedbatch fermentations. Appl Environ Microbiol. 1990;56:1004–11.
 9.
Zhou K, Qiao K, Edgar S, Stephanopoulos G. Distributing a metabolic pathway among a microbial consortium enhances production of natural products. Nat Biotechnol. 2015;33:377–83. https://doi.org/10.1038/nbt.3095.
 10.
Schulz TJ, Thierbach R, Voigt A, Drewes G, Mietzner B, Steinberg P, et al. Induction of oxidative metabolism by mitochondrial frataxin inhibits cancer growth Otto Warburg revisited. J Biol Chem. 2006;281:977–81.
 11.
Shlomi T, Benyamini T, Gottlieb E, Sharan R, Ruppin E. Genomescale metabolic modeling elucidates the role of proliferative adaptation in causing the Warburg effect. PLoS Comput Biol. 2011;7. https://doi.org/10.1371/journal.pcbi.1002018.
 12.
Vazquez A, Liu J, Zhou Y, Oltvai ZN. Catabolic efficiency of aerobic glycolysis: the Warburg effect revisited. BMC Syst Biol. 2010;4. https://doi.org/10.1186/17520509458.
 13.
Schuster S, Boley D, Möller P, Stark H, Kaleta C. Mathematical models for explaining the Warburg effect: a review focussed on ATP and biomass production. Biochem Soc Trans. 2015;43:1187–94. https://doi.org/10.1042/BST20150153.
 14.
Wolfe AJ. The acetate switch. Microbiol Mol Biol Rev. 2005;69:12–50. https://doi.org/10.1128/MMBR.69.1.1250.2005.
 15.
Molenaar D, van Berlo R, de Ridder D, Teusink B. Shifts in growth strategies reflect tradeoffs in cellular economics. Mol Syst Biol. 2009;5. https://doi.org/10.1038/msb.2009.82.
 16.
Basan M, Hui S, Okano H, Zhang Z, Shen Y, Williamson JR, et al. Overflow metabolism in Escherichia coli results from efficient proteome allocation. Nature. 2015;528:99–104. https://doi.org/10.1038/nature15765.
 17.
Enjalbert B, Millard P, Dinclaux M, Portais JC, Létisse F. Acetate fluxes in Escherichia coli are determined by the thermodynamic control of the PtaAckA pathway. Sci Rep. 2017;7:42135. https://doi.org/10.1038/srep42135.
 18.
Anane E, López CDC, Neubauer P, Cruz Bournazou MN. Modelling overflow metabolism in Escherichia coli by acetate cycling. Biochem Eng J. 2017;125:23–30. https://doi.org/10.1016/j.bej.2017.05.013.
 19.
Goel A, Wortel MT, Molenaar D, Teusink B. Metabolic shifts: a fitness perspective for microbial cell factories. Biotechnol Lett. 2012;34:2147–60.
 20.
Orth JD, Thiele I, Palsson BØ. What is flux balance analysis? Nat Biotechnol. 2010;28:245–8. https://doi.org/10.1038/nbt.1614.
 21.
Beg QK, Vazquez A, Ernst J, de Menezes MA, BarJoseph Z, Barabasi AL, et al. Intracellular crowding defines the mode and sequence of substrate uptake by Escherichia coli and constrains its metabolic activity. P Natl Acad Sci USA. 2007;104. https://doi.org/10.1073/pnas.0609845104.
 22.
Vazquez A, Beg QK, Demenezes MA, Ernst J, BarJoseph Z, Barabasi AL, et al. Impact of the solvent capacity constraint on E. coli metabolism. BMC Syst Biol. 2008;2. https://doi.org/10.1186/1752050927.
 23.
Goelzer A, Fromion V, Scorletti G. Cell design in bacteria as a convex optimization problem. Automatica. 2011;47:1210–8.
 24.
Goelzer A, Fromion V. Bacterial growth rate reflects a bottleneck in resource allocation. Biochim Biophys Acta (BBA)General Subj. 2011;1810:978–88.
 25.
Goelzer A, Fromion V. Resource allocation in living organisms. Biochem Soc Trans. 2017. https://doi.org/10.1042/BST20160436.
 26.
O’Brien EJ, Lerman JA, Chang RL, Hyduke DR, Palsson BO. Genomescale models of metabolism and gene expression extend and refine growth phenotype prediction. Mol Syst Biol. 2014;9:693. https://doi.org/10.1038/msb.2013.52.
 27.
Zhuang K, Vemuri GN, Mahadevan R. Economics of membrane occupancy and respirofermentation. Mol Syst Biol. 2011;7. https://doi.org/10.1038/msb.2011.34.
 28.
Mori M, Hwa T, Martin OC, De Martino A, Marinari E. Constrained allocation flux balance analysis. PLoS Comput Biol. 2016;12(6):e1004913.
 29.
Woldringh CL, Binnerts JS, Mans A. Variation in Escherichia coli buoyant density measured in Percoll gradients. J Bacteriol. 1981;148:58–63.
 30.
Hui S, Silverman JM, Chen SS, Erickson DW, Basan M, Wang J, et al. Quantitative proteomic analysis reveals a simple strategy of global resource allocation in bacteria. Mol Syst Biol. 2015;11. https://doi.org/10.15252/msb.20145697.
 31.
Scott M, Gunderson CW, Mateescu EM, Zhang Z, Hwa T. Interdependence of cell growth and gene expression: origins and consequences. Science (80 ). 2010;330:1099–102. https://doi.org/10.1126/science.1192588.
 32.
You C, Okano H, Hui S, Zhang Z, Kim M, Gunderson CW, et al. Coordination of bacterial proteome with metabolism by cyclic AMP signalling. Nature. 2013;500:301–6. https://doi.org/10.1038/nature12446.
 33.
Vazquez A, Oltvai ZN. Macromolecular crowding explains overflow metabolism in cells. Sci Rep. 2016;6:31007. https://doi.org/10.1038/srep31007.
 34.
Orth JD, Palsson BØ, Fleming RMT. Reconstruction and use of microbial metabolic networks: the Core Escherichia coli metabolic model as an educational guide. EcoSal Plus. 2010;4. https://doi.org/10.1128/ecosalplus.10.2.1.
 35.
Schellenberger J, Que R, Fleming RMT, Thiele I, Orth JD, Feist AM, et al. Quantitative prediction of cellular metabolism with constraintbased models: the COBRA toolbox v2. 0. Nat Protoc. 2011;6:1290.
 36.
Feist AM, Henry CS, Reed JL, Krummenacker M, Joyce AR, Karp PD, et al. A genomescale metabolic reconstruction for Escherichia coli K12 MG1655 that accounts for 1260 ORFs and thermodynamic information. Mol Syst Biol. 2007;3. https://doi.org/10.1038/msb4100155.
 37.
Fischer E, Sauer U. A novel metabolic cycle catalyzes glucose oxidation and anaplerosis in hungry Escherichia coli. J Biol Chem. 2003;278:46446–51.
 38.
Flamholz A, Noor E, BarEven A, Liebermeister W, Milo R. Glycolytic strategy as a tradeoff between energy yield and protein cost. Proc Natl Acad Sci. 2013;110:10039–44.
 39.
Fuhrer T, Fischer E, Sauer U. Experimental identification and quantification of glucose metabolism in seven bacterial species. J Bacteriol. 2005;187. https://doi.org/10.1128/JB.187.5.15811590.2005.
 40.
Sauer U, Canonaco F, Heri S, Perrenoud A, Fischer E. The soluble and membranebound transhydrogenases UdhA and PntAB have divergent functions in NADPH metabolism of Escherichia coli. J Biol Chem. 2004;279:6613–9.
 41.
Nanchen A, Schicker A, Sauer U. Nonlinear dependency of intracellular fluxes on growth rate in miniaturized continuous cultures of Escherichia coli. Appl Environ Microbiol. 2006;72:1164–72.
 42.
Folsom JP, Carlson RP. Physiological, biomass elemental composition and proteomic analyses of Escherichia coli ammoniumlimited chemostat growth, and comparison with ironand glucoselimited chemostat growth. Microbiology. 2015;161:1659–70.
 43.
Becker SA, Feist AM, Mo ML, Hannum G, Palsson BØ, Herrgard MJ. Quantitative prediction of cellular metabolism with constraintbased models: the COBRA toolbox. Nat Protoc. 2007;2. https://doi.org/10.1038/nprot.2007.99.
 44.
Kayser A, Weber J, Hecht V, Rinas U. Metabolic flux analysis of Escherichia coli in glucoselimited continuous culture. I. Growthratedependent metabolic efficiency at steady state. Microbiology. 2005;151:693–706 http://mic.microbiologyresearch.org/content/journal/micro/10.1099/mic.0.274810.
 45.
van Hoek MJ, Merks RM. Redox balance is key to explaining full vs. partial switching to lowyield metabolism. BMC Syst Biol. 2012;6:22. https://doi.org/10.1186/17520509622.
 46.
Goel A, Eckhardt TH, Puri P, Jong A, dos Santos F, Giera M, et al. Protein costs do not explain evolution of metabolic strategies and regulation of ribosomal content: does protein investment explain an anaerobic bacterial Crabtree effect? Mol Microbiol. 2015;97:77–92.
 47.
Fendt SM, Buescher JM, Rudroff F, Picotti P, Zamboni N, Sauer U. Tradeoff between enzyme and metabolite efficiency maintains metabolic homeostasis upon perturbations in enzyme capacity. Mol Syst Biol. 2010;6:356.
 48.
Hackett SR, Zanotelli VRT, Xu W, Goya J, Park JO, Perlman DH, et al. Systemslevel analysis of mechanisms regulating yeast metabolic flux. Science (80 ). 2016;354:aaf2786.
Acknowledgements
We would like to thank the two anonymous reviewers for their constructive suggestions and comments on the original manuscript.
Funding
HZ is supported by the China Scholarship Council through a PhD scholarship. The funding body played no role in the design, analysis, data interpretation or manuscript writing of this study.
Availability of data and materials
All model equations, parameter values are included in the main text or in the Supporting Information (Additional files 1 and 2). The MATLAB code for running simulations and the generated datasets in this study are available upon request from the corresponding author.
Author information
Affiliations
Contributions
HZ performed the modelling work, analysed and interpreted the results, and wrote the manuscript. AY designed and supervised the study, analysed and interpreted the results, and revised the manuscript. Both authors have read and approved the manuscript.
Corresponding author
Correspondence to Aidong Yang.
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional files
Additional file 1:
Supplementary Tables S1S6. (XLSX 26 kb)
Additional file 2:
Supplementary text and Supplementary Figures S1S17. (DOCX 229 kb)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Zeng, H., Yang, A. Modelling overflow metabolism in Escherichia coli with flux balance analysis incorporating differential proteomic efficiencies of energy pathways. BMC Syst Biol 13, 3 (2019) doi:10.1186/s1291801806774
Received
Accepted
Published
DOI
Keywords
 Flux balance analysis
 Proteomic efficiency
 Overflow metabolism
 Acetate production
 Biomass yield
 Escherichia coli