- Methodology article
- Open Access
- Published:

# Exploring metabolism flexibility in complex organisms through quantitative study of precursor sets for system outputs

*BMC Systems Biology*
**volume 8**, Article number: 8 (2014)

## Abstract

### Background

When studying metabolism at the organ level, a major challenge is to understand the matter exchanges between the input and output components of the system. For example, in nutrition, biochemical models have been developed to study the metabolism of the mammary gland in relation to the synthesis of milk components. These models were designed to account for the quantitative constraints observed on inputs and outputs of the system. In these models, a compatible flux distribution is first selected. Alternatively, an infinite family of compatible set of flux rates may have to be studied when the constraints raised by observations are insufficient to identify a single flux distribution. The precursors of output nutrients are traced back with analyses similar to the computation of yield rates. However, the computation of the quantitative contributions of precursors may lack precision, mainly because some precursors are involved in the composition of several nutrients and because some metabolites are cycled in loops.

### Results

We formally modeled the quantitative allocation of input nutrients among the branches of the metabolic network (AIO). It corresponds to yield information which, if standardized across all the outputs of the system, allows a precise quantitative understanding of their precursors. By solving nonlinear optimization problems, we introduced a method to study the variability of AIO coefficients when parsing the space of flux distributions that are compatible with both model stoichiometry and experimental data. Applied to a model of the metabolism of the mammary gland, our method made it possible to distinguish the effects of different nutritional treatments, although it cannot be proved that the mammary gland optimizes a specific linear combination of flux variables, including those based on energy. Altogether, our study indicated that the mammary gland possesses considerable metabolic flexibility.

### Conclusion

Our method enables to study the variability of a metabolic network with respect to efficiency (i.e. yield rates). It allows a quantitative comparison of the respective contributions of precursors to the production of a set of nutrients by a metabolic network, regardless of the choice of the flux distribution within the different branches of the network.

## Background

When studying metabolism, it is important to elucidate how fluxes are distributed among the different pathways of the metabolic network with respect to the available quantitative information about the system behavior. Several methods can be used to address this issue. The first approach consists of building a mechanistic description of transformations and identifying the regulations involved in the system. Continuous dynamical models are often used for this purpose, especially when time-series responses to different treatments are available to infer the dynamics of the network. Static approaches such as Petri net can also identify qualitative distributions of fluxes in a metabolic network [1], and their stochastic extensions can even take into account stoichiometric and kinetic information [2]. With a complementary approach, one can study a system at the functional level, based on the study of fluxes at steady states. This is the purpose of the Flux Balance Analysis (FBA) framework, which has evolved considerably over the past decades [3, 4]. With an FBA, the identification of external regulations is not necessary because it is assumed that the global behavior of the system can be modeled by optimizing linear combinations of selected fluxes (i.e. the *objective function*). Roughly, the methods developed in this field aim to explore a convex space of plausible flux distributions and to study the *extreme* flux distributions obtained when optimizing a linear objective function. It allows checking whether an extreme flux distribution is consistent with experimental data and to predict new experimental observations [5–8]. Nonetheless, the consistency of the solutions obtained by FBA depends on the quality of the constraints integrated in the model. To overcome this limitation, several extensions have been proposed in the literature. These extensions can be broken down into two parts. The first incorporates additional biological knowledge, such as reaction thermodynamics [9] or multioptimization [10]. The second is based on the use of FBA to globally analyze large-scale metabolic networks. For example, in flux variability analysis [11], the minimum and the maximum flux for each reaction in the network are computed under some (sub-)optimal conditions.

In this paper, our main purpose is to extend this framework to study the variability of a metabolic network at the level of *efficiencies* instead of *fluxes*. Indeed, when studying a metabolism at the organ level, a major challenge consists in comparing the efficiency, or yield rates, of two metabolic situations, i.e. the response to various input patterns. A typical example of such studies are those concerning animal nutrition, which aim to predict the quality and quantity of animal production (meat, fat, milk, etc.) in response to breeding factors. In this field, the energy and protein conversion efficiencies are derived from the study of the flux distribution of input nutrients between the different branches of the metabolic network [12]. More generally, although its definition depends on the field of application, the concept of efficiency is often linked to energy, mass growth and protein conversion [13–15]. However, the computing of efficiencies is prone to difficulties and errors, since it requires computing the quantities precursors sets which are required to explain the measured composition of outputs, although some of the internal products may be recycled within cycles [16]. With this goal, we defined the *allocation of an input towards an output* (with the abbreviation AIO) to be the proportion of a matter component (such as carbon) in a given input flux that is recovered in the selected output flux. Our definition is based on the choice of a material component, such as carbon or nitrogen, which allows a comparison of the contributions of input metabolites to the composition of output products. It can be seen as a yield rate, which is uniform among all the outputs of the system, allowing a precise understanding of the precursors used. As a first methodological contribution, we prove that AIO can be uniquely described and computed, even in the case when there are metabolic system cycles, by a matrix whose coefficients are nonlinear functions of the flux variables. Introducing nonlinearity in the definition of AIO cannot be avoided because of the presence of these cycles.

Studying the variability of AIO within a complete space of plausible flux distributions requires the solving of nonlinear optimization problems which are underdetermined in tangible applications. As a second methodological contribution, we have proposed efficient algorithms to compute lower and upper bounds for AIOs over the family of flux distributions which are compatible with both the system’s stoichiometry and the experimental datasets, regardless of the choice of a flux distribution for the internal branches of the network. An important aspect is that when the metabolic network is provided with input-output data, the complete space of plausible distributions appears to have a relatively small size, and can therefore be studied with our method. Our framework is depicted in Figure 1.

Our main example of application is related to milk production. In this context, several models have been introduced, in relation with the aforementioned classification of models. One class of small-size dynamical mechanistic models predicts the blood flow and input nutrients of the metabolic system (i.e. the mammary gland) [17, 18] or, alternatively, the nutrients produced by the metabolic system (in terms of milk composition) [19, 20]. Another class of models predicts the distribution (i.e. partitioning) of fluxes over the pathways from the input and output nutrients of the system [21, 22]. In the latter family, both dynamic and static approaches exist. Indeed, numerical models based on mass-action equations were initially proposed to describe the fluxes to and from individual metabolites, with different levels of description [21, 22]. To that goal, optimizers were used to determine a reasonable set of parameter estimates for the dynamical model of the system. Although such a set of parameter estimates is not unique. With a totally different approach, in another study, the main biochemical pathways in the different metabolisms were integrated in a generic stoichiometric model called the *metabolic spreadsheet*[15, 23]. Its structure was based on a restricted number of intermediary metabolites called carbon-chain *pivots*, which correspond to important cross-over points between metabolic pathways. This allowed the computation of the flux rates for all reactions, constrained by a general rule stating that there is no accumulation of intermediary metabolite of cofactors. A study of manual calculations performed with the metabolic spreadsheet showed that it works, in practice, by maximizing an objective function (ATP production) in a convex solution space [24]. Therefore, this model can be considered as an application of the Flux Balance Analysis framework [25].

Nonetheless, in this field of study, there has been little discussion on the impact of the choice of a single model among several (possibly infinitely many) reasonable models [14]. To investigate this issue in a more automatic way, we compared the aforementioned conventional models to the convex space of plausible flux distributions associated with steady states of a model of mammary gland metabolism, as computed in FBA. We checked the consistency of extreme flux distributions of nutrients with experimental data of the mammary gland and milk production, including the contribution of nutrient input-output and isotope balance studies [26, 27]. Based on our AIO computation framework, our analysis highlighted that the metabolic behavior of the mammary gland cannot be modeled by maximizing ATP production or by optimizing a linear combination of flux variables of the model of the mammary gland. In other words, although an infinite number of flux distributions are compatible with the data, none are extreme within the space of feasible models. Selecting any of these nonoptimal flux distributions to predict the system behavior appears difficult without additional experimentation.

To gain better understanding of the system response regardless of the choice of a flux distribution for the internal branches of the network, we applied our method to estimate the variability of AIO coefficients in our model and compared the effects of two different diets on mammary gland metabolism. Our results suggest that the bounds of AIO are sufficient to distinguish the effects of different nutritional treatments without selecting a flux distribution for the internal reactions of the metabolic network by any method - optimization of a linear combination of fluxes or a residual score. Overall, the complete study suggests considerable flexibility in mammary gland metabolism. It provides a view of the functioning of the system although its internal processes still cannot be clarified because of limitations on experimentation on large animals such as ruminants.

## Results

We first investigated the set of flux distributions that are compatible with the stoichiometry of our mammary gland model (depicted in Figure 2), without taking datasets into account. The model exhibited a large variability, since thousands of extreme pathways could be identified.

We then successively computed the set of flux distributions compatible with the model and the real datasets of lactation metabolism in dairy cows. The datasets are given in Table 1. They include a control diet (Ctrl), a diet related to an increased protein supply through casein infusion into the duodenum (CN), and a complementary dataset (HB) previously used in a mechanistic model of the mammary metabolism [21]. For the three datasets, the set of plausible flux distributions – solutions to a linear system detailed in Eq. (1) – was an unbounded convex cone of dimension 5. In all cases, the solution spaces shared the same set of five independent variables. This suggests that there are five independent levels of variability within the system: peptide hydrolysis (*R*
_{64}), NADPH oxidation (*R*
_{19}), OAA →PYR (*R*
_{14}), OAA →G3P (*R*
_{15}) and G3P →G6P (*R*
_{8}). Therefore, by including this dataset the model became a fairly small and constrained network. Nevertheless, it was not uniquely determined since there were still several degrees of freedom.

### Investigating the relevance of the optimization strategies for mammary metabolism

The balance between the ATP generated by the system and the ATP used by the system (including the ATP cost of milk component synthesis) was computed for the three datasets (Ctrl), (CN) and (HB). The results are detailed in Table 2. In this table, the FBA approach based on optimization of the ATP balance is compared to the natural functioning of the system.

First, we considered the manual computation of fluxes with a tool named “metabolic spreadsheet” [15, 23]. We applied the rules of no accumulation on any intermediary metabolites and cofactors with a utilization rejecting all the cycles. As expected, since some cycles are ATP-consuming, both flux distributions were equivalent in both approaches (Table 2). The ATP balances were respectively 3081, 2045 and 6628 mmol/h/half udder (i.e. 318 mol/d/udder) in the (Ctrl), (CN) and (HB) datasets.

According to this model, the ATP generated is estimated to be 6500 mmol/h/half udder (i.e. 312 mol/d/udder) while 2125 mmol/h/half udder (i.e. 102 mol/d/udder) were estimated to be used for milk component synthesis [31]. Therefore, the ATP-balance obtained for this model was slightly smaller than the optimal one obtained for our mammary gland model.

Independently of the approach considered and the model or dataset at hand, the remaining ATP (ATP balance), after use for milk synthesis, appeared to be rather high and nonxconstant. Indeed, as shown in [31], an ATP balance of 1250 mmol/h/half udder (60 mol/d/udder) can be expected to be used for other functions not accounted for in the models, such as maintaining membrane potential and synthesizing nucleic acids. The numerical values obtained here were far from this estimated balance, suggesting that each of these three models have a nonrelevant ATP balance.

In addition, both in the ODE model and the ATP-optimization approach, peptide hydrolysis was obtained at zero, implying an absence of any protein turnover. This contradicts all the observations about this pathway: considerable use of this pathway has been evidenced in several publications, although the peptide hydrolysis rates differed significantly depending on the technique used for the measurements: peptide hydrolysis (*R*
_{64} i.e. mammary protein degradation) spans from 0.25,0.23 to 0.67 of peptide synthesis (*R*
_{63} i.e. total mammary protein synthesis) in (Ctrl), (CN) and (HB), respectively [29, 31].

Overall, we concluded with this analysis that the energy-based optimization function may not allow an appropriate simulation of mammary gland metabolism. This was expected, considering that the system is studied at the complete organ level, involving competing processes which can rarely be modeled with a single linear objective function.

### Exploring all extreme flux distributions in a refined simplex

In order to study the variability within the space of plausible flux distributions and to identify alternative relevant optimization strategies, an additional constraint was placed on the ATP balance of the system. We considered that an ATP-balance of 1250 mmol/h/half udder (60 mol/d/udder) was a relevant measure for this study [31], although we checked that all the results discussed below were still valid when introducing a 10% tolerance on this estimation of ATP-balance. Providing a bound for ATP yielded a new constraint on several variables of the system. For the two datasets (Ctrl) and (CN), the flux G3P →G6P (*R*
_{8}) was no longer an independent variable in the system. The simplex, which was previously unbounded, appeared to be bounded with four independent variables: OAA →PYR (*R*
_{14}), OAA →G3P (*R*
_{15}), NADPH oxidation (*R*
_{19}), peptide hydrolysis (*R*
_{64}).

Applying flux variability tools [32], it appeared that the minimum of each of the fluxes *v*
_{14},*v*
_{15},*v*
_{19},*v*
_{64} was equal to zero for all treatments. The maxima of the fluxes (*v*
_{14},*v*
_{15},*v*
_{19},*v*
_{64}) were (1831,1831,669,305) for the (Crtl) treatment and (795,795,22,133) for the (CN) treatment This suggests that the (Ctrl) treatment generates a more flexible space of plausible flux distributions than the (CN) treatment.

We computed all extreme vertices of the simplex of plausible flux distributions for the two treatments (Ctrl), (CN). The simplex structure of the polyhedron implies that the optimum of any linear combination of metabolic fluxes involved in the model is either uniquely attained for one of these extreme points or attained by all flux distributions positioned on a face of the simplex. To gain insight on the pathways involved in the variability of our model, we also computed the linear combination of fluxes optimized for one of the extreme flux distributions. Eight extreme vertices were found for the (Ctrl) and (CN) datasets. They were obtained when optimizing the same set of linear functions^{a}. In Table 3, the optimal flux distributions for (Ctrl), (CN) are classified according to the activation or inactivation of several pathways in the model. It is worth noting that, according to this table, the two sets of plausible flux distributions associated with (CN) and (Ctrl) have the same topological structure. Indeed, the eight extremal behaviors for (CN) and (Ctrl) have clear combinatorics: first, NADPH oxidation is set either at zero or at its maximal value. Then, a single flux within OAA →PYR (*R*
_{14}), OAA →G3P (*R*
_{15}), G3P →G6P (*R*
_{8}), peptide hydrolysis (*R*
_{64}) is strongly activated whereas the three other remaining fluxes are blocked.

As discussed in a previous paragraph, flux distributions with no peptide hydrolysis cannot be considered as relevant [28, 31]. This suggests the extreme flux distributions A-F in (CN) and (Ctrl) are not biologically relevant. On the contrary, two extreme flux distributions (distributions G and H) are consistent with the stoichiometry of the system for the (CN) and (Ctrl) treatments. These two flux distributions consist in optimizing peptide synthesis and hydrolysis after NADPH oxidation. They corresponded to a ratio between peptide hydrolysis and synthesis \frac{{v}_{64}}{{v}_{63}} of 0.70 in the (Ctrl) treatment and of 0.47 in the (CN) treatment. These ratios are equal or lower to the maximum ratio of 0.67*v*
_{63}[31]. However, in (CN), treatment peptide synthesis was expected to be higher than in Ctrl treatments since in mammals protein synthesis is reported to increase with increasing protein intake (as CN in treatment) [29]. Curiously, on the contrary, in both flux distributions G and H, the total mammary protein synthesis decreased for the (CN) treatment when compared to the (Ctrl) treatment: *v*
_{63} equals 430 or 410 mmol/h/half udder in (Ctrl) and 283 or 282 mmol/h/half udder in (CN).

### Study of quantitative contributions of precursors (AIO) for plausible extreme flux distributions

As a further investigation to check the relevance of distributions G and H in the (CN) and (Ctrl) treatments, we studied the quantitative contributions of precursors of output nutrients. This study was inspired by the usual techniques in the field of nutrition - or any domain concerned with organ studies. These techniques consist in computing yield rates to elucidate how an input nutrient may contribute to the composition of an output product, for instance to clarify what proportion of glucose, acetate or alanine taken up by the mammary gland can be recovered in the milk components (lactose, fatty acids, protein) or oxidized and recovered in CO2 released in blood. To formalize this issue, we first selected carbon as the component according to which the contributions of precursors were to be computed. Then, in order to determine how much carbon introduced into the system through a given input flux can be recovered in the rate of production of an output metabolite, we introduced a precise model for the *allocation of nutrients in measured outputs (AIO)* (see Eq. (3)). As shown in Tables 4 and 5, the nutrient allocation corresponding to both flux distributions G and H in the (CN) and (Ctrl) treatments provides evidence that glucose is the unique precursor of lactose synthesis. This contradicts studies of dairy cows suggesting that glycerol and perhaps amino acids contribute to lactose synthesis [26], and that fifteen percent of lactose carbon could not derive from glucose [27].

This precise analysis of the origin of carbon in lactose synthesis with AIO suggests that extreme flux distributions G and H have to be rejected, although both flux distributions are consistent with flux variability criteria. We concluded that none of the extreme vertices of the set of plausible flux distributions could be considered biologically relevant with respect to the model and data at hand. Notice that the optimization of any linear combination of metabolic fluxes is either reached by these extreme distributions or reached by the infinite number of flux distributions lying in a face of the simplex. This suggests that the functioning of the mammary gland cannot be uniquely modeled by the optimization of any linear combination of metabolic fluxes involved in the current model.

### Discriminate treatments despite mammary gland flexibility: maxima of AIO on the complete polyhedron of flux distributions

Previous studies suggest that the response of the mammary gland cannot be modeled uniquely by the optimization of a linear objective function of fluxes. However, there are several nonoptimal flux distributions that satisfy the literature-based information that we have used so far^{b}. More generally, many flux distributions compatible with the additional constraints, including the condition on carbon precursors for lactose can be shown for both (CN) and (Ctrl) treatments. Topological arguments prove that an infinite set of flux distributions exist. Nonetheless, without an idea about the exact shape of the space of feasible fluxes (because of the nonlinear nature of the condition of carbon precursors), we cannot select a plausible point within this space. In other words, the available knowledge appears insufficient to determine uniquely a flux distribution of nutrients among the different branches of the proposed model.

To understand the functioning of the mammary gland and despite this difficulty, we introduced a method to estimate the variability of nutrient allocation among pathways on a carbon basis (see Eq. (3)) by computing the range (min-max) of AIO coefficients. As these coefficients are nonlinear functions of flux variables, computing these min and max over the complete space of plausible flux distribution requires solving nonlinear optimization problems. Our dedicated algorithm detailed in the method section scaled properly to the real case that we are studying^{c}. Interestingly, with this approach, we did not favor any internal functioning of the system since we parsed all objective functions (linear combinations of flux variables), in order to have a complete description of the space of plausible flux distributions.

The min-max tables for allocations of nutrients in the different pathways provided a clearer view of nutrient utilization within the mammary gland. Unlike the functioning of the extremal distributions (G) and (H) shown in Tables 4 and 5, the contribution of carbon from glucose to lactose carbon was quite variable within the full set of plausible distribution. Indeed, as shown in Tables 6 and 7, glucose was the precursor of 85% (\frac{1751}{886} or \frac{866}{1002}) to 100% of the carbon lactose in the (Ctrl) and (CN) treatments. Therefore, there exist flux distributions such that glucose is not the only precursor of lactose synthesis [26], and these distributions are consistent with quantitative estimations of the ratio of carbon glucose recovered in lactose [27].

More generally, the intervals of distributions of nutrients in different pathways were used to compare the effects of the (Ctrl) and (CN) treatments. Biologically, in comparison to the (Ctrl) treatment, the (CN) treatment was characterized by a lower proportion of glucose (on a carbon basis) which is oxidized in CO2, and a larger ratio used for lactose synthesis. This hypothesis can be sustained since the intervals of distribution of carbon from glucose in CO2 and lactose almost did not overlap in the (Ctrl) and (CN) treatments. More precisely, in the (Ctrl) treatment, from \frac{396}{1422}=27.9\% to \frac{565}{1422}=39.7\% of glucose carbon was oxidized to produce CO2, whereas this ratio ranged from 17.5*%* to 28.9*%* for the (CN) treatment. In addition, in the (Ctrl) treatment, 52.8*%* to 62.3*%* of the glucose carbon was required to produce lactose carbon, whereas 62.2*%* to 72% of glucose was required during the (CN) treatment.

Altogether, this analysis allowed us to discriminate the effects of the different treatments whatever the internal functioning of the system may be: the (CN) treatment (increase in protein supply to cows) was characterized by a lower proportion of glucose oxidized in CO2 than in (Ctrl). It appears to be a suitable strategy to analyze the metabolism flexibility without selecting a precise flux distribution or making any assumption on the internal metabolic fluxes.

As a final study, we used an interior point exploration method to estimate the AIO variability over the boundary of the simplex, rather than the complete space that was explored previously. As both tables appeared to be equal, we concluded that optimized AIO are reached on the boundary of the simplex. However, not all extreme vertices of the simplex are relevant biologically and they do not optimize AIO. This suggests that the flux distributions that optimize an AIO coefficient are placed at the interior of the simplex faces. In the future, it may be interesting to study the biological significance with multi-objective approaches [34] and check whether these can be considered as a “characteristic” point of these faces.

### Impact of long-chain fatty acids oxidation over the model predictions

As a last study, we studied the robustness of our conclusion with respect to changes in the modeling of input (net uptake) of long-chain fatty acids (LCFA), C16 and C18, in the tricarboxylic acid (TCA) cycle. In the study of the (Ctrl) and (CN) treatments, such an input of LCFA was not considered in the system since they are not synthesized by the mammary gland and we hypothesized that they are not oxidized within the mammary gland, based on isotope measurements in studies in fed lactating goats and in nonruminants [35–37]. This hypothesis was sustained by the measured carbon balance in the (Ctrl) and (CN) datasets [28], which did not require increasing the prediction of CO2 by introducing any LCFA in the TCA cycle (see also Table 8). Nonetheless, in other contexts and models, LCFA oxidation was either measured in starvation condition [38] or introduced in order balance carbon consumption and production with respect to CO2 prediction [18, 20–22]. This LCFA oxidation hypothesis was for instance retained in models studying the (HB) dataset [20, 21].

From this literature study, it appears that LCFA oxidation may depend on both the environmental and experimental contexts, and no single model can be favored yet. To study the impact of LCFA oxidation, we successively introduced a ratio of LCFA in the TCA cycle (10%, 20%, 25%), and assumed that 50% of C16 output are synthesized within the mammary gland [21] (see Figure 3). Then we performed a complete study of the (Ctrl), (CN) and (HB) datasets for these hypotheses (see Tables 8 and 9).

For the (Ctrl) treatment, by comparing the predicted and measured CO2 quantities, we concluded that the hypotheses of 20% and 25% of FA oxidized in the TCA cycle were not in agreement with our experimental data: the increase of CO2 prediction was too large both when compared to measured CO2 (see Table 8) and when confronted to the measured increase of 9% of CO2 deriving from long LCFA in lactating goats in an extreme starvation condition [38]. Similarly, the hypothesis of 0% of LCFA oxidation was not consistent with the (HB) treatment [22].

For all remaining compatible pairs of model and datasets, we first studied the ATP maximization hypothesis. In all cases, our results, shown in Table 8, suggest that ATP maximization is not biologically relevant: ATP balance was even larger than the ATP balance of the models where no LCFA was introduced in the TCA cycle (0%).

Then we enumerated extreme flux distributions and studied their biological relevance. As shown in Table 9, for each ratio of LCFA in the TCA cycle, the structure of the simplex of plausible flux distributions for the (Ctrl) and (CN) datasets was similar to that shown in Table 3 (0%). Using similar arguments, no extreme distribution could be considered as biologically relevant. When studying the space of solution associated with the (HB) dataset, a more complex structure appeared, based on 13 extreme flux distributions instead of 8. Explicit (although not unique) linear functions that can be optimized to obtain these extreme distributions are detailed in Table 9
^{d}. More precisely, for the (HB) treatment, we still obtain a division according to the activation of the four fluxes OAA →PYR (*R*
_{14}), OAA →G3P (*R*
_{15}), G3P →G6P (*R*
_{8}), peptide hydrolysis (*R*
_{64}). Nonetheless, an intricate phenomenon appears. Indeed, measurements imply that OAA →G3P (*R*
_{15}) is very constrained. It cannot vanish at the same time as OAA →PYR (*R*
_{14}) or when NADPH oxidation (*R*
_{19}) is at its maximal value. Therefore, for several optimal conditions related to NADPH oxidation (*R*
_{19}), OAA →PYR (*R*
_{14}), G3P →G6P (*R*
_{8}), peptide hydrolysis (*R*
_{64}), we need to decide whether OAA →G3P (*R*
_{15}) is slightly non-zero, or whether OAA →G3P is assumed to vanish although the optimal is not exactly reached for the initial flux considered. Despite this difference of structure between the (HB) dataset and the (Ctrl) and (CN) datasets, our analysis suggested that extreme distributions were not biologically relevant, independently from the ratio of LCFA oxidation or the dataset under study.

Finally, comparing the variability of the AIO coefficients for the (Ctrl) and (CN) treatments when oxidizing 10% of LCFA in the TCA cycle still suggested that the (CN) treatment is characterized by a lower proportion of glucose (on a carbon basis) which is oxidized in CO2, and a larger ratio used for lactose synthesis.

Altogether, this study suggests that the main characteristics of the (Ctrl) and the (CN) treatments are robust and could be elucidated despite lacking information on the precise internal behavior of LCFA oxidation.

## Discussions

### Analyzing the distribution of nutrients in a metabolic network to study the flexibility of a metabolism at the organ level

An important challenge in applicational fields of metabolism studies at the organ level is to understand how the components of inputs are transformed into some expected outputs, under some assumptions about the functioning of a system. To that end, great use is made of comparisons between yield rates describing the allocation of input nutrients within the set of outputs. Nonetheless, to allow a precise comparison of nutrients, these studies require insights on the distribution of matter components across the output of *each* reaction involved in the system. Such information may be provided by several experimental marking techniques (*C* 13 MFA) that make use of carbon isotopes [39]. Nonetheless, their application to mammals is challenging [40].

To elucidate how input nutrients are allocated among the output nutrients of a metabolic system despite experimental limitations, we have introduced novel methods which refine the flux balance analysis of a metabolic system related to an organ of a large animal. Our method can be seen as an extension of Flux Variability Analysis [11], where the emphasize is put on *efficiency* or *yield rate* variability rather than on flux variability.

As an example of application, we have studied the mammary metabolism in ruminants (dairy cows) [15, 23, 24]. Compared to the conventional models available, our stoichiometric model describes very precisely the variations of energy consumptions as ATP, allowing us to investigate some optimization hypotheses related to energy variations.

As a methodological innovation, we introduced a method to estimate the *Allocation of Inputs in Outputs* (AIO), that is, the ratio of transformation of each input nutrient into outputs, provided that we are given a flux distribution balancing the production and consumption of intermediary metabolites. Two AIO computations are possible: one may fix a linear combination of flux variable to optimize, leading to an extreme flux distribution (i.e. extreme pathways) whose allocations (AIO) are precisely computed with our method. Alternatively, we can reason with regard to the possibly infinite complete convex space of plausible flux distributions, in the spirit of FVA analysis, without solving the set of equations. In this case, our method estimates the minima and maxima of each allocation (AIO) coefficient within the complete space.

The latter point is where our main algorithmic innovation lies. Indeed, AIO coefficients are not linear with respect to the flux variables *v*. Therefore, they have no reason to be extremal for flux distributions corresponding to vertices, edges or faces of the simplex, unless the gradient of every component of the AIO is a nonzero function. Therefore, we have introduced two algorithms allowing us to compute the extremal values of AIO coefficients when the flux space is bounded. The fastest algorithm requires the exhibition of an analytic expression of the AIO matrix. If this is not possible, a local-search algorithm can be used. These algorithms can be used to check whether the extremal values are reached for flux distributions lying on the boundary of the simplex of plausible flux distributions.

This approach applied to a stoichiometry model taking ATP variations into account permitted a better understanding of ruminant mammary gland metabolism in comparison to previous studies based on similar models and datasets [22, 24]. For instance, we were able to characterize the differences in the effects of two treatments, such as the quantitative range of the proportion of glucose which is oxidized in CO2, or used for lactose synthesis. This information was derived from the carbon composition of each metabolite, expert knowledge which is easily accessible and is therefore compatible with the experimental limitations regarding mammals.

### Optimization strategies within tissue or organs

Our study allowed us to revisit optimization-based hypotheses on the functioning of the mammary metabolism. We have provided evidence that flux distributions corresponding to an optimal production of energy (ATP) cannot describe appropriately the metabolism of the mammary gland, as would be the case for bacterial metabolisms, which tend to optimize biomass-related functions [5, 41]. The main reason is that the ATP balance is too large and variable among the responses to different treatments, confirming previous observations [31]. As a consequence, the underlying hypotheses used to drive previous studies of mammary metabolism have to be carefully reformulated [15, 21, 23].

As an alternative, we have hypothesized that ATP balance remains nonoptimal and almost constant in response to several treatments, and we have introduced in the model a recent estimation for this quantity [31]. Our goal was to check whether the observed responses to the system could be explained by the optimization of a linear combination of fluxes, that is, an extremal vertex of the simplex of plausible flux distributions. This hypothesis was rejected, first because it led to non-realistic orders of magnitude for some fluxes such as peptide hydrolysis, and second because the precursors of some components such as glucose were not biologically relevant [26, 27]. Notably, it was necessary to trace back the quantitative origin of the nutrients to complete this analysis and reject all extreme flux distributions, illustrating the advantage of the AIO approach.

Therefore, no optimization of a linear combination of flux variables could be found to uniquely describe the metabolism of the mammary gland, a multicellular organ, as an extreme flux distribution of our model. To overcome this limitation of FBA-inspired analyses at the mammalian tissue or organ level, we studied the variability of AIO coefficients by introducing the computation of min-max ranges of AIO. This led to a general overview of the effects of treatments without precluding any steady state internal behavior.

### Benefits of studying the variability of the allocation of input in output (AIO) and future model refinements

The study of the variability of AIO, that is, intervals of allocation of nutrients, made it possible to distinguish the metabolism of the mammary gland in two different nutritional conditions corresponding to an increase in protein supply (Ctrl and CN). Notably, the intervals of allocation of glucose in CO2 and lactose were different in the two nutritional conditions, reflecting the mammary gland’s metabolic flexibility. Nonetheless, several improvements can be made to the model.

First, the simplex of plausible distributions could be reduced by introducing knowledge currently available on the kinetic bounds for enzymatic activities, introduced in previous numerical models [21]. This may for instance prevent some non-constrained behaviors shown in Tables 6 and 7, such as the possible nonzero contribution of acetate and *β*-hydroxybutyrate (BHBA) carbon to lactose. The main issue in this area will be to aggregate the enzymatic knowledge to fit with the format of the reactions in the model, such as the reaction *R*
_{6} which transforms G3P to PYR under the regulation of five different enzymes, together with NAD+ and ATP availability. Therefore, proposing relevant maximal values requires the use of advanced methods for model reductions.

A second improvement of the model is dependent on the production of additional observations to clarify the set of possible behaviors of the system. Particularly, the contribution of acetate and *β*-hydroxybutyrate (BHBA) to lactose shown in Tables 6 and 7 remains questionable in ruminants since it suggests that the carbon of acetyl-COA could leave the TCA cycle of the mitochondria through the OAA → G3P pathway. This could be explained by the fact that the carbon allocation in the model did not take account of the carbon positions. However, one isotope model and one study of gluconeogenesis in man suggested that labeled carbon within glucose came from [2-14*C*]-acetate (through 14C-acetylCoA in TCA) [42, 43]. It remains challenging to obtain such precise information for large ruminants such as dairy cows. Nevertheless, the model could be improved by modeling with additional constraints the few items of knowledge we have on the flux distribution of positioned carbons.

The last improvement of the model consists of including constraints that do not correspond to rule-based metabolic equations. More precisely, in several numerical models, the effects of external fluxes were introduced to predict the response of the mammary gland in a consistent way [14, 17, 18]. Among external fluxes, we noticed the regulatory effect of long-chain fatty acids on the activity of enzymes involved in fatty acid synthesis, although the mechanisms are not clearly understood [44]. Although some extensions of the FBA formalism to dynamic regulation exist [45], the data availability was insufficient to apply the extensions in the model. Therefore, existing regulations of enzyme activities cannot be included directly in the stoichiometric model. To overcome this limitation, we plan to investigate the effects of external fluxes by introducing additional constraints in the model, to check whether the nutrient output in the milk can be predicted from the nutrient input.

## Conclusion

We have introduced a method in the framework of flux-balance analysis of a metabolic network. As a main novelty, our approach allows studying the variability of efficiencies (or yield rates) of a metabolic model provided with input-output measurements. More precisely, our approach allows a quantitative estimation of the minimum and maximum proportions of the carbon quantity of each input nutrient which is recovered in each output component of the system. The main innovation is to propose a method which does not require determining the quantitative distribution of nutrients between the branches of the system. To that end, we have performed a parsing of the space of flux distributions which are compatible with both the model stoichiometry and input-output measurements.

This method was applied to study the response of the mammary gland to several treatments. It allowed us to distinguish two different metabolic responses of the system, corresponding to two nutritional situations and accurately reflecting metabolic flexibility. Overall, our method appears to be configured to study the variability of the yield rates of a metabolic system at the multicellular or organ level without making any hypothesis on the internal behavior of the system.

## Method

### Flux modeling based on Flux Balance Analysis

Metabolic models are described according to the generic framework of Flux Balance Analysis [6, 25, 46]. Metabolites split into three non-intersecting sets. First, *input metabolites* are gathered in set . The rate vector of all input fluxes (with in the form “ →*I*”) is denoted by {v}_{I}\sum {\mathbb{R}}^{p}. Second, *output metabolites* are denoted by . We denote by {v}_{O}\sum {\mathbb{R}}^{q} the rate vector of the corresponding output fluxes (in the form “ *O*→”). Finally, *intermediary metabolites* (or *pivots*) are denoted by , and its cardinality is denoted by *n*. The complete set of metabolites is \mathcal{\mathcal{M}}=\mathcal{I}\cup \mathcal{P}\cup \mathcal{O}=\left\{{m}_{1}\dots {m}_{p+n+q}\right\}.

Let \mathcal{R}=\{{r}_{1}\dots {r}_{t}\} be the set of *t* reactions which produce some metabolites while degrading others. The rate vector of these reactions is denoted by v\sum {\mathbb{R}}^{t}. A reaction *r*
_{
j
} has the form \sum _{m\sum \mathcal{I}\cup \mathcal{P}}{s}_{m,j}m\to \sum _{{m}^{\prime}\sum \mathcal{P}\cup \mathcal{O}}{s}_{m,j}^{\prime}{m}^{\prime} where *s*
_{
m,j
} and {s}_{m,j}^{\prime} are input and output stoichiometric coefficients. The metabolic system is described by the so-called *stoichiometric matrix* *M*=(*a*
_{
i,j
})_{
i≤p+n+q,j≤r
}, where {a}_{i,j}={s}_{{m}_{i},j}^{\prime}-{s}_{{m}_{i},j}.

As a usual assumption, intermediary metabolites cannot accumulate in the cell: the consumption and production flux rates of intermediary metabolites are balanced. A linear constraint is derived on the rate vector v\sum {\mathbb{R}}^{t}. Additional biological knowledge about the distribution of fluxes within the system is modeled by a matrix *B*. Overall, plausible flux distributions *v* satisfy the following equations:

Assuming that reactions are irreversible and fluxes have physical upper-bounds, the set of solutions to Eq.(1) is a convex polyhedron, called *a simplex, of plausible flux distributions*. The simplex can be described by means of its vertices, edges or faces [6, 25, 46], which are related to the optimization of objective functions and have shown their biological significance in several contexts [47]. In the example described below, vertices of the simplex (i.e. *extreme pathways*) were computed with standard linear-based methods in the bound case, and with probabilistic methods in the unbound case. As a refinement of these methods, we used a random sampling of the set of possible linear objective functions to obtain a complete description of the set of extreme pathways. This approach was efficient since the dimension of the space of feasible distribution is quite small, because it is strongly constrained by the input-output vectors *v*
_{
I
} and *v*
_{
O
}.

### Modeling the quantitative contribution of input metabolites to output nutrients: AIO

In studies at the organ level such as in nutrition, the choice of a plausible flux distribution *v* aims to elucidate how an input nutrient may contribute to the composition of an output product. To formalize this issue, we introduce a *component* according to which the allocation of input nutrients will be computed. For dietary applications, carbon is a relevant element since it appears in the composition of all metabolites in the system. Nitrogen is less generic but more relevant when specifically studying nitrogen metabolism. Our issue therefore reads as follows: how much carbon introduced into the system through a given input flux can be recovered in the rate of production of an output metabolite?

We introduce the following formalism. Let c\sum {\mathbb{R}}^{p+n+q} be a vector describing the *component composition* of metabolites (its carbon composition for instance). Assume that the stoichiometry of each reaction in the system satisfies a matter-invariant property with respect to the component, that is: \sum _{{m}_{i}\sum \mathcal{\mathcal{M}}}{a}_{i,j}c\left({m}_{i}\right)=0 for every reaction {r}_{j}\sum \mathcal{R}. Of the total component mass provided as a substrate to a reaction *r*, only a part contributes to the composition of a given product *m*. Let *I* *n*
^{r}(*m*) denote this ratio. Assuming that the system follows a *proportional matter distribution*, a consequence of the mass-invariant property is that \sum _{m\sum \mathcal{\mathcal{M}}}I{n}^{r}\left(m\right)=1 for all r\sum \mathcal{R}.

Let *v* be a plausible flux distribution and a, solution to Eq.(1). Let *m* be an input metabolite, for example glucose. The rate of the flux of component mass brought into the system by *m* equals *C*(*m*)*v*
_{
I
}(*m*). We denote by {x}_{O}[v,m]\sum {\mathbb{R}}^{q} the vector of proportions of input component fluxes recovered in each output flux (for instance, the proportion of carbon from the input glucose appearing in the composition of the CO2 in blood). The coefficients of *x*
_{
O
}[*v*,*m*] are called the *allocation of the metabolite m in output nutrients*.

To determine these ratios, we introduce {x}_{P}[v,m]\sum {\mathbb{R}}^{n} the *vector of ratios of input component fluxes* which are required to produce each intermediary metabolite *m* before its breakdown into other metabolites. The law of matter conservation and the assumption that intermediary metabolites do not accumulate put several constraints on *x*
_{
P
}[*v*,*m*] and *x*
_{
O
}[*v*,*m*], which are detailed in Tables 8 and 9. We deduce that the vector of allocations *x*
_{
O
}[*v*,*m*] is a solution to the equation {D}_{2}\left[v\right]{x}_{I}^{\left(m\right)}=-{D}_{1}\left[v\right]\left(\begin{array}{l}{x}_{P}[v,m]\\ {x}_{O}[v,m]\end{array}\right) for the input parameter vector {x}_{I}^{\left(m\right)}=(0,\dots ,0,C(m\left){v}_{I}\right(m),0,\dots ,0), where *D*
_{1} and *D*
_{2} are defined in Table 10. To elucidate whether this system of matrices determines uniquely *x*
_{
O
}[*v*,*m*], we noticed that *D*
_{1} is a square matrix such that all diagonal coefficients are equal to 1 and the others are nonpositive. This family of matrices is named the *M*-matrix in [48], and it has been shown that such a matrix is invertible. We deduce that *D*
_{1} is invertible so that the vectors of allocations *x*
_{
O
}[*v*,*m*] and *x*
_{
P
}[*v*,*m*] are determined uniquely. With these formulas, when the flux distribution v has been chosen, any algorithm or method to inverse matrix *D*
_{1} can be used.

Overall, the *complete table showing the distribution of nutrient inputs in nutrient outputs* is defined to be the *q*×*p* matrix *A* *I* *O*[*v*], the columns of which are *x*
_{
O
}[*v*,*m*
_{1}], …, *x*
_{
O
}[*v*,*m*
_{
p
}], as follows:

With these formulas at hand, as soon as the flux distribution *v* has be chosen, any algorithm or method to inverse the matrix *D*
_{1} can be used. Altogether, the computation of a complete AIO table takes *O*((*n*+*p*+*q*)^{3}+*n* *p*) operations for a given flux distribution *v*.

### Computing extrema of the AIO coefficients among the complete polyhedron of plausible flux distributions: solving nonlinear optimization problems

Computing the extrema of the AIO coefficients among the convex polyhedron of plausible flux distributions *v*, leads to an optimization problem in a nonlinear context. The problem rewrites in an optimization scheme, leading to 2*q*×*p* optimization problems of the form

These problems are nonlinear programming (NLP) problems [49] since they aim to optimize a nonlinear objective function over a possibly nonbounded simplex search space.

When both the simplex search space was bounded and an analytical expression for *A* *I* *O*[*V*] as a function of all the free variables of the eq.(1) solution space was producible with formal algebra software [50], the Matlab *fmincon* function was used to solve the NLP problem. Among all available optimization routines, the best performance was achieved by the interior point algorithm, which has a polynomial complexity for these problems [51]. Other optimization routines were used to confirm our results, but the computation time was much longer.

When an analytical expression was not available, we implemented an alternative strategy making use of some local search routines together with the fact that *A* *I* *O*(*v*) can be computed for any *v* in a cubic time. First, we compute the Chebyshev ball *B*(*x*
_{0},*r*) of the simplex [52]. Any point *x* of the *n*-dimensional ball is uniquely identified by a vector (*θ*
_{1},…,*θ*
_{
n
}) with *θ*
_{
i
}∈[0,2*π*[ for 1≤*i*<*n* and *θ*
_{
n
}∈[0,*π*[. The intersection *P*(*x*) of the half line [*x*
_{0},*x*) with the simplex of plausible flux distributions is unique if it exists. Conversely, for any point *y* on the boundary of the simplex, there exists a unique *x*∈*B*(*x*
_{0},*r*) such that *y*=*P*(*x*). By extension, we say that (*θ*
_{1},…,*θ*
_{
n
}) is a parameterization of the boundary of the simplex. This parameterization is used to determine a discretization of the simplex boundary. Given an integer *p*>0, every *θ*
_{
i
} is assumed to be of the form 2*π* *j*/*p*, for *j*∈{0,…,*p*-1}, whereas *θ*
_{
n
} rewrites as *π* *j*/*p*. The finite neighborhood of any point *y* on the boundary, denoted by \mathcal{N}\left(y\right) is obtained by modifying each single coordinate of the parameterization (*θ*
_{1},…,2*π* *j*/*p*,…,*θ*
_{
n
}) of *y*: for instance, among the maximal 2*n* neighbors, we find the point with paramerization (*θ*
_{1},…,2*π*(*j*-1)/*p*,…,*θ*
_{
n
}) and (*θ*
_{1},…,2*π*(*j*+1)/*p*,…,*θ*
_{
n
}). We can finally apply a local search algorithm based on a dichotomy principle to improve the discretization level and reach the solution to the optimization problem on the simplex boundary.

The same algorithm is extended to the full space by introducing a supplementary coordinate standing for the distance between *P*(*x*) and the center *x*
_{0} of the Chebyshev ball. This makes it possible to check whether the optimum identified previously lies on the boundary of the simplex.

### Analysis workflow

A web application dedicated to the computation of AIO for metabolic networks is freely available [53]. It enables us to build a stoichiometric model, define its inputs and outputs, solve the associated FBA problem and finally compute the AIO of the selected flux distribution. Offline tools enabling the computation of the extrema of the AIO are also provided in a companion web page^{e}.

The complete workflow of analysis requires the online webpage together with the use of several environments. First, *php* is required for the computation of extremal vertices of the convex polyhedron of plausible flux distributions, including the case when this space is not bounded. Second, *SAGE* is used to performed formal algebra tasks involved in the computation of AIO matrices. Finally, *matlab* is needed to compute the extrema of the AIO coefficients. The main functionalities of the workflow are depicted in Figure 2.

### Mammary gland stoichiometry model

As the basis of the ruminant mammary gland metabolism that is studied in this paper, we used a generic model of mammary metabolism [15, 23]. This generic model contained 54 reactions involving six intermediary metabolites, called carbon-chain *pivots*, at cross-over points between metabolic pathways, chosen to describe the main possible conversions between metabolites: oxaloacetate (*OAA*), *α*-ketoglutarate (*α*-*KG*), pyruvate (*PYR*), acetyl coenzyme A (*AcoA*), glucose (*GLC*) and serine (*SER pivot*). The reactions also take into account the variations in ATP, four cofactors (*NADHc*, *NADHm*, *NADPH* and *FADH2*) and other metabolites (*O2*, *CO2* and *NH3*).

The model was first extended and detailed in order to include the reactions included in other models of mammary metabolism [15, 21, 22, 24]. Reactions for lactose synthesis, milk protein, fatty acids and glycerol-3P of triglycerides were included in the model, as well as four new pivots. Mitochondrial (*mAcoA*) and cytosolic (*cAcoA*) acetyl-CoA were used to distinguish between utilization of *cACoA* for fatty acid (*FA*) synthesis and utilization of *mACoA* in the tricarboxylic acid (*TCA*) cycle. In addition, glucose-6-phosphate (*G6P*) and triose-phosphate (*G3P*) were introduced to account more specifically for the transformation of glucose through the pentose phosphate pathway, lactose synthesis and glycerol-3P utilization. For lipid metabolism, three new intermediary metabolites were introduced (*Glycerol-3P*) and two fatty acid primers (*Primer (C2:0)CoA* and *Primer (C4:0)CoA*). They correspond to the primers of two (from acetate) or four (from *β*-hydroxybutyrate; *BHBA*) carbon-units necessary to initiate fatty acid synthesis before elongation, since in ruminant mammary glands fatty acids are synthesized from acetate or *BHBA* and not glucose. Other additional reactions were included to describe milk synthesis, such as NADPH synthesis through the isocitrate dehydrogenase pathway. The input (net uptake) of long-chain fatty acids was not considered in the system since these are not synthesized by the mammary gland and we hypothesized that they are not oxidized within the mammary gland of lactating animals, as measured in well-fed goat and nonruminant through isotopes [35, 37]. This hypothesis was sustained by carbon balance measurements based on experimental measures of CO2 [28]. This latter hypothesis was relaxed in a second step (see Figure 3) to study how our results are impacted by the introduction of oxidation of long-chain fatty acid within TCA cycle, as modeled in alternative contexts [18, 20–22]. In addition, the action of long-chain fatty acids to regulate enzyme activities involved in C(4:0) … C(16:0) production was taken into account by the fact that C(4:0), … C(16:0) fluxes of synthesis are explicitly given in the dataset.

Protein synthesis was summarized by the number of peptide links synthesized (peptide synthesis) and the ATP required for each peptide synthesis was fixed at 5 in the present model [23]. Similarly, protein degradation was summarized by the number of hydrolyzed peptide links (peptide hydrolysis). The ATP consumption for each peptide hydrolysis was set at a single ATP [54].

Finally, four output reactions were included in the model in order to calculate the overall allocations in carbon (*R*
_{137}), nitrogen (*R*
_{138}), *CO2* (*R*
_{139}) and Serine (*R*
_{135}).

As a last step, this model was restricted to the case of ruminants. Some reactions, such as fatty acid oxidations (*R*
_{20} to *R*
_{23}) and pyruvate synthesis through malic enzyme *R*
_{66}) and some input and output fluxes present in the generic stoichiometric model [23] were not considered to occur in ruminant mammary glands (fructose (*R*
_{25}); glucose production from *G6P* neoglucogenesis (*R*
_{10}), acetone synthesis (*R*
_{69}), *BHBA* synthesis (*R*
_{70}), acetoacetate synthesis (*R*
_{71}), and ureogenesis (*R*
_{50}).

Altogether, this mammary gland model contained 140 reactions, involving eleven intermediary metabolites, four cofactors and ATP, CO2, O2 and NH3. The stoichiometry of the system implies that the balance (production minus consumption) of these four last outputs can be uniquely deduced from the choice of a plausible flux distribution. The full model is shown in Figure 2 and the corresponding stoichiometric model is provided in the supplementary webpage (SBML format).

### Additional information: datasets, inputs, outputs, additional knowledge, component

Two datasets [28, 29] were used, in relation to the response of dairy cow mammary gland metabolism to some increases in the protein supply: a control diet (Ctrl) and increased protein supply through casein infusion into the duodenum (CN). A complementary dataset (HB), previously used in a mechanistic model of the mammary metabolism, was used as a reference response [21]. Datasets are shown in Table 1. All data were converted into mmol/h/half udder.

Fluxes of nutrients taken up on a net basis by the mammary gland were considered as inputs whereas fluxes of nutrients produced in milk, such as lactose, milk protein, milk fatty acid arranged in triglycerides or CO2 produced and released in blood, were considered as outputs. Special attention was paid to amino acid inputs and outputs, milk protein output, fatty acids synthesized within the mammary gland and total triglyceride output according to established calculation rules [21, 30]. Milk proteins were modeled by their number of peptide links (peptide output). The amino acid inputs or outputs corresponded to the balance between their real net uptake minus their contribution to peptide outputs (utilization in milk protein). When the balance of an amino acid was positive, it corresponded to an input flux. When it was negative, it corresponded to an output flux (see Table 1). Two input reactions were also assumed to be inactive in all data sets since the corresponding net uptake were not measured in the datasets (Table 1): *propionate* input (*R*
_{28}) and triglyceride hydrolysis (*R*
_{65}) in *long-chain fatty acids* input.

Eight additional linear constraints were introduced to the fluxes to ensure that the model was relevant biologically [21]. These are detailed in Tables 4 and 5 and allow building matrix *B* appearing in Eq.(1)^{f}. The rate of fatty-acid acylCoA (*C(n:m)-acylCoA*) was fixed to be three times higher than the triglyceride output, assuming that 100% of milk lipids are triglycerides: it included all fatty acid acylation, that is, fatty acid synthesized in the mammary gland and long fatty acid taken up. 50% of fatty acids were assumed to be synthesized from acetate, i.e. from *primer C(2:0)CoA* and the other 50% from BHBA (*primer C(4:0)CoA*), with the exception of *C(4:0) fatty acid*, which was supposed to be synthesized only from BHBA. It was assumed that 30% of NADPH was generated by the isocitrate desydrogenase pathway and 70% of NADPH was generated in the pentose phosphate pathway [21].

*Carbon* was the component chosen to compute and analyze the allocation of nutrient inputs in nutrient outputs. The stoichiometry of the metabolic network satisfies the mass-invariant property according to this component. This required a very precise description of CO2 production and consumption in the model reactions and was validated with external measurements of the CO2 balance provided with the datasets.

## Availability of supporting data

The datasets supporting the results of this article are included within the article (Table 1). The SBML file for the mammary gland model and tools enabling the computation of all results are provided in a companion web page: http://nutritionanalyzer.genouest.org/SupplementaryMaterial.

## Endnotes

^{a} These functions are: *v*
_{8}+*v*
_{19}, *v*
_{8}-*v*
_{19}, *v*
_{14}+*v*
_{19}, *v*
_{14}-*v*
_{19}, *v*
_{15}+*v*
_{19}, *v*
_{15}-*v*
_{19}, *v*
_{64}+*v*
_{19}, *v*
_{64}-*v*
_{19}

^{b} For instance, for the (Crtl) treatment, there is a distribution such that *v*
_{8}<591 and *v*
_{13}<266 are lower than the bounds introduced in [26, 27], while *v*
_{64}/*v*
_{63}=0.62<0.67 and glucose is the precursor for 90.79% of the lactose carbon

^{c} All the computations were performed by using *Matlab* release 2011 on a Xeon E5645 multicore computer.

^{d} These functions are: *v*
_{8}+*v*
_{19}, *v*
_{8}+*v*
_{19}-50 *v*
_{15}, *v*
_{8}-*v*
_{19}-*v*
_{14}, *v*
_{8}-*v*
_{19}-*v*
_{15}, *v*
_{14}+*v*
_{19}, *v*
_{14}+*v*
_{19}-50 *v*
_{15}, *v*
_{14}-*v*
_{19}, *v*
_{15}+*v*
_{19}, *v*
_{15}-*v*
_{19}, *v*
_{64}+*v*
_{19}, *v*
_{64}+*v*
_{19}-50 *v*
_{15}, *v*
_{64}-*v*
_{19}-*v*
_{14}, *v*
_{64}-*v*
_{19}-*v*
_{15}.

^{e}
http://nutritionanalyzer.genouest.org/SupplementaryMaterial. The access to the pre-formatted model and datasets supporting the results is planned to be on request. A valid access is provided by: login : SuppMat; password : tamppuss

^{f} Mathematically, these constraints can be derived as: 0.7*v*
_{82}-0.3*v*
_{58}=0; *v*
_{51}-*v*
_{52}=0; *v*
_{54}-*v*
_{90}=0; *v*
_{55}-*v*
_{91}=0; *v*
_{86}-*v*
_{92}=0; *v*
_{87}-*v*
_{93}=0; *v*
_{88}-*v*
_{94}=0; *v*
_{56}-3*v*
_{62}=0.

## References

Voss K, Heiner M, Koch I:Steady state analysis of metabolic pathways using Petri nets. In Silico Biol. 2003, 3 (3): 367-387.

Hardy S, Robillard PN:Modeling and simulation of molecular biology systems using petri nets: modeling goals of various approaches. J Bioinformatics Comput Biol. 2004, 2 (4): 595-613. 10.1142/S0219720004000752.

Lakshmanan M, Koh G, Chung BKS, Lee D-Y:Software applications for flux balance analysis. Brief Bioinformatics. 2012, published online of Nov 2012,

Orth JD, Thiele I, Palsson BØ:What is flux balance analysis?. Nat Biotechnol. 2010, 28 (3): 245-248. 10.1038/nbt.1614.

Edwards JS, Palsson BØ:The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities. Proc Natl Acad Sci USA. 2000, 97 (10): 5528-5533.

Kauffman KJ, Prakash P, Edwards JS:Advances in flux balance analysis. Curr Opin Biotechnol. 2003, 14 (5): 491-496. 10.1016/j.copbio.2003.08.001.

Schilling CH, Letscher D, Palsson BØ:Theory for the systemic definition of metabolic pathways and their use in interpreting metabolic function from a pathway-oriented perspective. J Theor Biol. 2000, 203 (3): 229-248. 10.1006/jtbi.2000.1073.

Schilling CH, Schuster S, Palsson BO, Heinrich R:Metabolic pathway analysis: basic concepts and scientific applications in the post-genomic era. Biotechnol Prog. 1999, 15 (3): 296-303. 10.1021/bp990048k.

Beard DA, Liang S-D, Qian H:Energy balance for analysis of complex metabolic networks. Biophys J. 2002, 83 (1): 79-86. 10.1016/S0006-3495(02)75150-3.

Oh Y-G, Lee D-Y, Lee SY, Park S:Multiobjective flux balancing using the NISE method for metabolic network analysis. Biotechnol Prog. 2009, 25 (4): 999-1008. 10.1002/btpr.193.

Mahadevan R, Schilling CH:The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab Eng. 2003, 5 (4): 264-276. 10.1016/j.ymben.2003.09.002.

Ghozlane A, Bringaud F, Soueidan H, Dutour I, Jourdan F, Thebault P:Flux Analysis of the Trypanosoma brucei glycolysis based on a multiobjective-criteria bIoinformatic approach. Adv Bioinformatics. 2012, 2012: 159423-

Beurton-Aimar M, Beauvoit B, Monier A, Vallee F, Dieuaide-Noubhani M, Colombie S:Comparison between elementary flux modes analysis and 13C-metabolic fluxes measured in bacterial and plant cells. BMC Syst Biol. 2001, 5: 95-

Hanigan MD, Crompton LA, Reynolds CK, Wray-Cahen D, Lomax MA, France J:An integrative model of amino acid metabolism in the liver of the lactating dairy cow. J Theor Biol. 2004, 228 (2): 271-289. 10.1016/j.jtbi.2004.01.010.

Van-Milgen J, Gondret F, Renaudeau D: The use of nutritional models as a tool in basis research . Progress in Research on Energy and Protein Metabolism. Edited by: Edited by Souffrant WB MetgesCC. Rostock-Warnemünde:EAAP. 2003, 259-263.

Acuña V, Marchetti-Spaccamela A, Sagot M-F, Stougie L:A note on the complexity of finding and enumerating elementary modes. Biosystems. 2010, 99 (3): 210-214. 10.1016/j.biosystems.2009.11.004.

Cant JP, McBride BW:Mathematical analysis of the relationship between blood flow and uptake of nutrients in the mammary glands of a lactating cow. J Dairy Res. 1995, 62 (3): 405-422. 10.1017/S0022029900031113.

Volpe V, Cant JP, Boston RC, Susmel P, Moate P:Development of a dynamic mathematical model for investigating mammary gland metabolism in lactating cows. J Agric Sci. 2010, 148 (01): 31-10.1017/S0021859609990244.

Hanigan MD, Crompton LA, Bequette BJ, Mills JAN, France J:Modelling mammary metabolism in the dairy cow to predict milk constituent yield, with emphasis on amino acid metabolism and milk protein production: model evaluation. J Theor Biol. 2002, 217 (3): 311-330. 10.1006/jtbi.2002.3037.

Hanigan MD, Crompton LA, Metcalf JA, France J:Modelling mammary metabolism in the dairy cow to predict milk constituent yield, with emphasis on amino acid metabolism and milk protein production: model construction. J Theor Biol. 2001, 213 (2): 223-239. 10.1006/jtbi.2001.2417.

Hanigan MD:A mechanistic model of mammary gland metabolism in the lactating cow. + Agric Syst. 1994, 45 (4): 369-419. 10.1016/0308-521X(94)90132-Y.

Waghorn GC, Baldwin RL:Model of metabolite flux within mammary gland of the lactating cow. J Dairy Sci. 1984, 67: 531-544. 10.3168/jds.S0022-0302(84)81336-3.

Van-Milgen J:Modeling biochemical aspects of energy metabolism in mammals. J Nutr. 2002, 132 (10): 3195-3202.

Lemosquet S, Abdou-Arbi O, Siegel A, Guinard-Flament J, Van Milgen J, Bourdon J:A generic stoichiometric model to analyse the metabolic flexibility of the mammary gland in lactating dairy cows. Modelling Nutrient Digestion and Utilization in Farm Animals. Edited by: Edited by Faverdin P SauvantDVan Milgen J. 2010, Wageningen Academic Publishers,

Papin JA, Stelling J, Price ND, Klamt S, Schuster S, Palsson BO:Comparison of network-based pathway analysis methods. Trends Biotechnol. 2004, 22: 400-405. 10.1016/j.tibtech.2004.06.010.

Bequette BJ, Sunny NE, El-Kadi SW, Owens SL:Application of stable isotopes and mass isotopomer distribution analysis to the study of intermediary metabolism of nutrients. J Anim Sci. 2006, 84 (Suppl): E50-E59.

Bickerstaffe R, Annison EF, Linzell JL:The metabolism of glucose, acetate, lipids and amino acids in lactating dairy cows. J Agric Sci. 1974, 82 (01): 71-85. 10.1017/S0021859600050243.

Lemosquet S, Raggio G, Lobley G, Rulquin H, Guinard-Flament J, Lapierre H:Whole-body glucose metabolism and mammary energetic nutrient metabolism in lactating dairy cows receiving digestive infusions of casein and propionic acid. J Dairy Sci. 2009, 92 (12): 6068-6082. 10.3168/jds.2009-2018.

Raggio G, Lemosquet S, Lobley G, Rulquin H, Lapierre H:Effect of casein and propionate supply on mammary protein metabolism in lactating dairy cows. J Dairy Sci. 2006, 89: 4340-4351. 10.3168/jds.S0022-0302(06)72481-X.

Swaisgood HE: Handbook of Milk Composition, volume 1. 1995, San Diego: Academic Press

Hanigan M, France J, Mabjeesh S, Mcnabb WC, Bequette B:High rates of mammary tissue protein turnover in lactating goats are energetically costly. J Nutrit. 2009, 139: 1118-1127.

Pfeiffer T, Sanchez-Valdenebro I, Nuevo JC, Montero F, Schuster S:Metatool: for studying metabolic networks. Bioinformatics. 1999, 15 (3): 251-257. 10.1093/bioinformatics/15.3.251.

Scott RA, Beuman DE, Clark JH:Cellular gluconeogenesis by lactating bovine mammary tissue. J Dairy Sci. 1976, 59 (1): 50-56. 10.3168/jds.S0022-0302(76)84155-0.

Schuetz R, Zamboni N, Zampieri M, Heinemann M, Sauer U:Multidimensional optimality of microbial metabolism. Science. 2012, 336 (6081): 601-604. 10.1126/science.1216882.

Annison EF, Linzell JL, Fazakerley S, Nichols BW:The oxidation and utilization of palmitate, stearate, oleate and acetate by the mammary gland of the fed goat in relation to their overall metabolism, and the role of plasma phospholipids and neutral lipids in milk-fat synthesis. Biochem J. 1967, 102 (3): 637-647.

Shorten PR, Pleasants TB, Upreti GC:A mathematical model for mammary fatty acid synthesis and triglyceride assembly: the role of stearoyl CoA desaturase (SCD). J Dairy Res. 2004, 71 (4): 385-397. 10.1017/S0022029904000354.

Smith GH, Crabtree B, Smith R: Energy Metabolism in the Mammary Gland. 1983, Amsterdam, New York: Elsevier

Annison EF, Linzell JL, West CE:Mammary and whole animal metabolism of glucose and fatty acids in fasting lactating goats. J Physiol (Lond). 1968, 197 (2): 445-459.

Wiechert W:13C Metabolic Flux Analysis. Metab Eng. 2001, 3 (3): 195-206. 10.1006/mben.2001.0187.

Crown SB, Ahn WS, Antoniewicz MR:Rational design of 13C-labeling experiments for metabolic flux analysis in mammalian cells. BMC Syst Biol. 2012, 6: 43-10.1186/1752-0509-6-43.

Fell DA, Small JR:Fat synthesis in adipose tissue. An examination of stoichiometric constraints. Biochem J. 1968, 238: 781-786.

Consoli A, Kennedy F, Miles J, Gerich J:Determination of Krebs cycle metabolic carbon exchange in vivo and its use to estimate the individual contributions of gluconeogenesis and glycogenolysis to overall glucose output in man. J Clin Invest. 1987, 80 (5): 1303-1310. 10.1172/JCI113206.

Katz J:Determination of gluconeogenesis in vivo with 14C-labeled substrates. Am J Physiol. 1985, 248 (4 Pt 2): :R391-399.

Blavy P, Gondret F, Guillou H, Lagarrigue S, Martin PGP, van Milgen J, Radulescu O, Siegel A:A minimal model for hepatic fatty acid balance during fasting: application to {PPAR} alpha-deficient mice. J Theor Biol. 2009, 261 (2): 266-278. 10.1016/j.jtbi.2009.07.025.

Covert MW, Schilling CH, Palsson B:Regulation of gene expression in flux balance models of metabolism. J Theor Biol. 2001, 213 (1): 73-88. 10.1006/jtbi.2001.2405.

Larhlimi A, Bockmayr A: Minimal metabolic behaviors and the reversible metabolic space. 2009, Technical report, Matheon, Berlin, Germany,

Edwards JS, Ibarra RU, Palsson BO:In silico predictions of Escherichia coli metabolic capabilities are consistent with experimental data. Nat Biotechnol. 2001, 19 (2): 125-130. 10.1038/84379.

Adamou O: Study of Some Epidemiological Models by the Methods of Computer Algebra. 2009, PhD thesis, University of Rennes-1 (France) and UAM (Niger),

Bertsekas DP: Nonlinear Programming. 1999, Cambridge: Athena Scientific

Stein WA: Sage Mathematics Software (Version 5.5). 2012, The Sage Development Team,http://wiki.sagemath.org/Publications_using_SAGE,

Byrd RH, Gilbert JC, Nocedal J:A trust region method based on interior point techniques for nonlinear programming. Math Prog. 2000, 89 (1): 149-185. 10.1007/PL00011391.

Botkin ND, Turova-Botkina VL:An algorithm for finding the chebyshev center of a convex polyhedron. Appl Math Optimization. 1994, 29 (2): 211-222. 10.1007/BF01204183.

NutritionAnalyzer, an online tool to explore the flexibility of metabolic models in nutrition studies.http://nutritionanalyzer.genouest.org,

Lobley GE:Energy metabolism reactions in ruminant muscle: responses to age, nutrition and hormonal status. Reprod Nutr Dev. 1990, 30 (1): 13-34. 10.1051/rnd:19900102.

## Acknowledgements

Part of this program was supported by the French National Agency for Research (ANR-10-BLANC- 0218). The authors wish to thank Damien Eveillard for fruitful discussions.

## Author information

### Authors and Affiliations

### Corresponding author

## Additional information

### Competing interests

The authors declare that they have no competing interests.

### Authors’ contributions

OA-A, JB and AS formalized the computation AIO as an optimization issue. OA-A and JB conceived the algorithms to compute AIO. OA-A performed experimentations. OA-A, SL and JV-M built the stoichiometric model. SL prepared experimental datasets and performed the biological interpretation of the results. AS and SL studied the structure on the simplex of plausible flux distributions. All authors contributed to the redaction of the paper. All authors read and approved the final manuscript.

## Electronic supplementary material

### 12918_2013_1281_MOESM1_ESM.zip

Additional file 1:SBML version of the metabolic model. The complete metabolic model of mammary gland metabolism is provided in the free and open standard SBML representation format (Systems Biology Markup Language). (ZIP 2 KB)

## Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

## Rights and permissions

**Open Access**
This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License (
https://creativecommons.org/licenses/by/2.0
), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

## About this article

### Cite this article

Abdou-Arbi, O., Lemosquet, S., Milgen, J.V. *et al.* Exploring metabolism flexibility in complex organisms through quantitative study of precursor sets for system outputs.
*BMC Syst Biol* **8**, 8 (2014). https://doi.org/10.1186/1752-0509-8-8

Received:

Accepted:

Published:

DOI: https://doi.org/10.1186/1752-0509-8-8

### Keywords

- Flux Balance Analysis
- Flux distributions exploration
- Yield variability
- Nutritional model