Global dynamic optimization approach to predict activation in metabolic pathways

de Hijas-Liste, Gundián M; Klipp, Edda; Balsa-Canto, Eva; Banga, Julio R

doi:10.1186/1752-0509-8-1

Research article
Open access
Published: 06 January 2014

Global dynamic optimization approach to predict activation in metabolic pathways

Gundián M de Hijas-Liste¹,
Edda Klipp²,
Eva Balsa-Canto¹ &
…
Julio R Banga¹

BMC Systems Biology volume 8, Article number: 1 (2014) Cite this article

6394 Accesses
38 Citations
2 Altmetric
Metrics details

Abstract

Background

During the last decade, a number of authors have shown that the genetic regulation of metabolic networks may follow optimality principles. Optimal control theory has been succesfully used to compute optimal enzyme profiles considering simple metabolic pathways. However, applying this optimal control framework to more general networks (e.g. branched networks, or networks incorporating enzyme production dynamics) yields problems that are analytically intractable and/or numerically very challenging. Further, these previous studies have only considered a single-objective framework.

Results

In this work we consider a more general multi-objective formulation and we present solutions based on recent developments in global dynamic optimization techniques. We illustrate the performance and capabilities of these techniques considering two sets of problems. First, we consider a set of single-objective examples of increasing complexity taken from the recent literature. We analyze the multimodal character of the associated non linear optimization problems, and we also evaluate different global optimization approaches in terms of numerical robustness, efficiency and scalability. Second, we consider generalized multi-objective formulations for several examples, and we show how this framework results in more biologically meaningful results.

Conclusions

The proposed strategy was used to solve a set of single-objective case studies related to unbranched and branched metabolic networks of different levels of complexity. All problems were successfully solved in reasonable computation times with our global dynamic optimization approach, reaching solutions which were comparable or better than those reported in previous literature. Further, we considered, for the first time, multi-objective formulations, illustrating how activation in metabolic pathways can be explained in terms of the best trade-offs between conflicting objectives. This new methodology can be applied to metabolic networks with arbitrary topologies, non-linear dynamics and constraints.

Background

Optimality principles have been successfully used to describe the design, organization and behavior of biological systems at different levels. Sutherland [1] argues that optimization can in fact play an even major role, allowing biology to move from just describing mechanisms to being able to predict, from first principles, how organisms should be designed. In the context of cell biology, mathematical optimization has been the underlying hypothesis in applications such as flux balance analysis [2], the analysis of activation of metabolic pathways [3–7], model identification (including parameter estimation and optimal experimental design) [8] and optimal stimulation procedures to achieve a desired biological behavior [9–11].

Here we focus on the problem of enzyme activation in metabolic networks, which has received substantial attention in the recent literature. Several authors have shown that the genetic regulation of metabolic networks may follow an optimality principle, such as the minimization of the transition time or the maximization of the production of a given metabolite. In their seminal work, Klipp et al. [4] showed how sequential gene expression appears in unbranched metabolic networks under the hypothesis of minimum transition time. The sequence matched the enzyme order in the pathway. This “just-in-time” activation pattern was experimentally confirmed later for the case of the amino-acid biosynthesis systems of E. coli[5]. These authors also showed that in the arginine, methionine and serine systems, the earlier the enzyme is involved in a pathway, the shorter is the response time and the higher the maximal promoter activity of the corresponding gene. Evidences of temporal distribution of enzymes were also found in the lysine biosynthesis pathway by E. coli[12]. Chechik et al. [13] proposed the notion of activity motif to systematically study the dynamic behavior of metabolic networks. As a case study they considered the transcriptional response of metabolic genes after a sudden change in environmental or nutritional condition in S. cerevisiae. These authors also found that enzymes in a metabolic chain are induced in the same order they are used in the pathway in both directions forward and backward. All these results support the idea that “just-in-time” activation in metabolic pathways is a widespread phenomena.

These works indicate that, starting with a mathematical model of a given metabolic network, it is in principle possible to anticipate activation profiles for specific cellular objectives solving the corresponding dynamic optimization problem. Needless to say, the results obtained must be compared with existing or new experimental measurements. In this dynamic optimization framework, the objective is to compute time varying control profiles, usually enzyme concentrations or the corresponding expression rates, so as to minimize a given cost function, such as the time needed to reach a given amount of product, subject to the system dynamics (the model) and algebraic constraints, for example in the maximum amount of enzyme available.

However solving this class of optimization problems for arbitrary metabolic networks of certain complexity is still a challenge. So far, the literature reports applications in very simple scenarios. Oyarzún et al. [6, 14] suggested the application of classical optimal control theory, giving an analytical proof of sequential activation for unbranched networks for the case the transition time is to be minimized subject to specific constraints. Almost in parallel, Bart et al. [7, 15] also considered the maximum principle of Pontryagin to solve related examples. Although successful for simple unbranched pathways, these works suggest that the use of indirects approaches from optimal control theory, such as the maximum principle of Pontryagin or the use of the Hamilton-Jacobi-Bellman equation, may not be appropriate to deal with general problems (such as those related to branched and/or large networks) due to the complexity of the resulting numerical problems.

It is also important to highlight that all these authors have studied enzyme activation in metabolic networks considering that the metabolism is optimal with respect to a single-objective. We believe that a more realistic approach is to take more than one objective into account. Multi-objective optimization has been successfully applied in several classes of biological problems [16], including metabolic engineering [17, 18], the heat-shock response of bacteria [19], and the allosteric regulation of enzymes [20]. These and other recent works [21–23] illustrate the importance of studying optimality as the trade-off between conflicting objectives, such as economy (cost) and effectiveness (benefit), in order to obtain results with more biological meaning. It should be noted that multi-objective optimization problems do not have a unique solution, but a set of optimal solutions known as the Pareto front. All Pareto solutions are optimal but represent different trade-offs between objectives.

Here we consider, for the first time, a multi-objective formulation to explain activation in metabolic networks. It should be noted that several authors [6, 14, 24] have considered composed objective functions, such us combinations of transition time and enzyme consumption, thus obtaining only one of the possible Pareto solutions instead of a Pareto front. In this work we adopt a general multi-criteria framework and we propose the use of advanced numerical dynamic optimization techniques to study/predict enzyme activation in general pathways. The underlying idea is to combine the control vector parameterization (CVP) approach with adequate global optimization techniques. This new methodology can be applied to metabolic networks with arbitrary topologies, non- linear dynamics and constraints.

We then illustrate the approach with two sets of problems. First, a set of single-objective examples of increasing complexity taken from the literature. The multimodality of these problems is evaluated by means of multi-start local deterministic methods. The need of global optimization methods arises from the non-convexity of the general problem, due to the frequent bi-linear terms for the controls, the non-linear character of the systems dynamics, and the presence of (nonlinear) constraints [25]. As a consequence, the use of standard local optimization methods results in local solutions. To surmount these difficulties, we also present a metaheuristic approach which is compared with several other stochastic global optimization methods. Second, we consider the multi-objective formulation of several problems, illustrating how activation in metabolic pathways can be explained in terms of the best trade-offs between conflicting objectives.

Methods

Problem formulation

The aim of the general multi-objective dynamic optimization problem is to compute the time varying control profiles and the final time (u(t), t _f) so as to minimize (or maximize) a given set of cost functions (J) subject to the system dynamics and possibly algebraic constraints. Mathematically this reads:

min_{u (t), t_{f}} J [x, u]

(1)

Where:

J [x, u] = [\begin{matrix} J_{1} [x, u] \\ J_{2} [x, u] \\ ⋮ \\ J_{n} [x, u] \end{matrix}]

Subject to:

\frac{d x}{dt} = \tilde{Ψ} [x {t}, u {t}, t], x (t_{0}) = x_{0}

(2)

\begin{array}{lcr} h [x (t), u (t)] = 0 \end{array}

(3)

\begin{array}{lcr} g [x (t), u (t)] \leq 0 \end{array}

(4)

\begin{array}{lcr} ph [x (t_{i}), u (t_{i})] = 0 \end{array}

(5)

\begin{array}{lcr} pg [x (t_{i}), u (t_{i})] \leq 0 \end{array}

(6)

\begin{array}{lcr} u^{L} \leq u (t) \leq u^{U} \end{array}

(7)

where:

The costs functional (1) corresponds, for example, to the time needed to reach a given state of the system or the enzyme cost. In a single-objective problem J[x,u]=J ₁[x,u];
x is the vector of state variables, typically metabolite concentrations;
u is the vector of control variables: enzyme concentrations (e) or expression rates (r);
Equation (2) represents the system dynamics (dynamic mathematical model of the network);
Equations (3)–(4) represent equality and inequality path constraints, such as the total amount of enzyme available during the process;
Equations (5)–(6) represent equality and inequality point constraints, i.e. those that must be verified at a given time, for example the amount of metabolite at the end of the process;
Equation (7) corresponds to the upper and lower bounds for the control variables, for instance the minimum and maximum enzyme available through out the process.

Numerical methods

Roughly speaking, numerical methods for the solution of dynamic optimization problems can be classified into two groups: direct and indirect methods. Indirect methods solve the optimization problem using the Pontryagin’s maximum principle. This method is based on the transformation of the original problem using the necessary optimality conditions of Pontryagin. This results in a two or multi-point boundary value problem, in the presence the constraints, to be solved for state and co-state variables.

Direct approaches, such us complete parameterization (CP, [26]), multiple shooting (MS, [27]) or control vector parameterization (CVP, [28]) transform the original problem into a non-linear programming problem (NLP) by means of the discretization and approximation, either of the control variables or both the control and state variables. The main differences among these methods are the number of decision variables, the presence or absence of parameterization constraints and the necessity of using an initial value problem solver. The use of CP or MS results in larger NLP problems with junction constraints to handle the system dynamics, which requires the use of specific optimization methods and may be computationally intensive for large scale dynamic systems. Therefore the CVP method is selected here.

Control vector parametrization

In the CVP approach the time horizon is divided into a number of ρ time intervals, with variable or fixed length. The control variables are then approximated within each interval by means of low order polynomials. With this parametrization the original dynamic optimization problem is transformed into a non-linear programming problem with dynamic and algebraic constraints. The non-linear programming problem obtained must be solved by employing a suitable NLP method and an initial value problem solver.

Nonlinear programming methods

Nonlinear programming methods are basically classified in local and global methods. Local methods are designed to converge to the closest solution to the initial guess. The iterates are computed by means of the cost function value or the gradient and/or the Hessian of the cost function with respect to the decision variables. In addition there are implementations that are able to automatically handle constraints. In the context of dynamic optimization, Sequential Quadratic Programming (SQP) methods, which guarantee feasibility of the solution at convergence, or feasible SQP methods, which guarantee feasibility throughout the optimization, have shown to be efficient in the solution of large-scale constrained NLPs.

Unfortunately, NLPs arising from the application of direct approaches (such as CVP) are frequently multimodal. This is particularly the case in the presence of non-linear or bi-linear dynamic constraints (as in the problems considered in this work) or when variable length elements are used in the CVP approach. Therefore, local optimization techniques may converge to local optima. A multi-start of local methods may help to analyze multimodality of the optimization problem. Interested readers will find illustrative examples in the Additional file 1.

Stochastic global optimization methods are robust alternatives for the search of the global solution. However these methods are well known to be particularly inefficient in the refining of the solutions. In this concern, Balsa-Canto et al. [29], presented the combination of a stochastic global solver with a deterministic local method in a so called sequential hybrid approach for the solution of dynamic optimization problems. That work highlights the role of the tuning of the hybrid, i.e. the selection of the methods and the switching point, in the overall performance of the method. However several examples illustrated the benefits of the combination with respect to the individual solvers in the sense of robustness and efficiency.

Recent works [30, 31] show how the scatter search metaheuristic in combination with local methods outperforms previous sequential hybrid methods in the solution of complex optimization problems. The key property of this approach is that the switching points are automatically selected by the algorithm based on a balance between quality and diversity among the solutions generated in every iteration.

Based on the above, the following methods were selected attending to their previously reported performance:

SRES, the stochastic ranking evolution strategy method [32]. An evolutionary approach that is able to handle constraints by means of the stochastic ranking approach.
DE, differential evolution [33]. It is based on the generation of new solutions by adding the weighted difference between two population vectors to a third vector. The method was complemented with a death penalty function approach to handle constraints.
Sequential hybrids of SRES or DE with different implementations of successive quadratic programming methods.
eSS, scatter search [34] in combination with different implementations of successive quadratic programming methods.

Multi-objective optimization methods

In contrast to the single-objective case, the aim in a multi-objective case is to find the optimal trade-offs between conflicting objectives. The notion of a single optimal solution is replaced by the notion of a Pareto front, i.e. a set of optimal trade-off solutions. All solutions in this set are optimal in the sense that it is not possible to improve one of the objectives without degrading one or more of the others. The weighted sum or the ε−constraint approach transforms the multi-objective problem into a set of single-objectives problems whose solutions result in the Pareto front. However the use of the weighted sum presents a number of drawbacks such as the impossibility to recover non-convex area in the Pareto front or the selection of weigths to obtain a uniform distribution of Pareto solutions. In view of this, the ε−constraint method [18] was selected here. The underlying idea is to solve a set of single-objective problems where one objective (J _p) is to be minimized while the others (J _i) are incorporated as inequality constraints. Mathematically this reads as:

min_{u (t), t_{f}} J_{p} [x, u]

(8)

Subject to:

J_{i} [x, u] \leq ε_{i} i = 1, ..., n and i \neq p

(9)

The Pareto front is obtained modifying the upper-bounds of the constraints. Each of the single-objective problems was solved using control vector parametrization combined with the eSS scatter search solver.

Results and discussion

This work considers the solution of two sets of examples. The first set consists of single-objective problems taken from the literature. All these problems were initially solved with a multi-start of local methods to analyze their possible multimodal nature. Since all problems appeared to be multimodal, a selection of global and hybrid methods were then used to find the global solution for each case study. The details are listed in Additional file 1, showing that scatter search (eSS) offered the best compromise between efficiency and robustness. A second set of problems considered a more general multi-objective formulation. The ε−constraint approach and scatter search (eSS) were subsequently used to solve this second set of problems, enabling us to obtain the optimal trade-offs between different objectives. A detailed analysis of the resulting enzyme activation profiles revealed the “just-in-time” activation is still present in multi-criteria optimality. The examples considered are summarized in Table 1. Results are presented and discussed below.

Table 1 Summary of the pathways considered together with the objectives used for each example

Full size table

Three-step linear pathway with mass action kinetics (LPN3B)

This example was originally formulated by Bartl et al. [7] and solved by means of the maximum principle of Pontryagin. The pathway (Figure 1A) consists of three enzymatic reactions with mass action kinetics, each reaction catalyzed by a specific enzyme (e _i). S ₁ corresponds to the substrate, S ₂ and S ₃ to the intermediate metabolites and S ₄ to the product. Metabolites and enzymes are expressed in concentration units and time in seconds.

In this work we considered the objective of the minimization of the time needed to reach a certain amount of product (Equation (10)), when the substrate remains unalterable (buffered substrate concentration) as represented in matrix N. This objective reflects a biological situation where there is no need to convert all the substrate into the product but a certain amount is essential.

The mathematical statement of the single-objective problem is as follows.

Find e over t∈[ t _o,t _f] to minimize:

J = t_{f}

(10)

Subject to the system dynamics:

\frac{d S}{dt} = N υ

(11)

Where N:

N = [\begin{array}{l} 0 & 0 & 0 \\ 1 & - 1 & 0 \\ 0 & 1 & - 1 \\ 0 & 0 & 1 \end{array}]

And:

υ_{i} = k_{i} \cdot S_{i} \cdot e_{i}

(12)

With the following end-point constraint:

S_{4} (t_{f}) = P (t_{f})

(13)

and path constraint:

\sum_{i = 1}^{3} e_{i} \leq E_{T}

(14)

with: E _T=1M, k _i=1 s ⁻¹ i=2,3,4, S ₁(t ₀)=1M, S _i(t ₀)=0 for i=2,3,4 and P(t _f)=0.9 M.

The above problem was solved with several methods, with the eSS scatter search solver as the best performer. Detailed results, including a discussion on multimodality, are given in Additional file 1: Tables S.6 and S.7.

Figure 1B presents the corresponding metabolite dynamics and optimal profiles of the enzymes for the single-objective optimization problem presented in Equation (10). The solution was obtained under the assumption of no substrate consumption (for example, in a constant culture medium). The first phase of the process is devoted to produce S ₂ as fast as possible, the enzyme is fully activated and since substrate is not consumed a high amount of S ₂ is produced and accumulated. Similarly the second and third phases correspond to a full activation of enzymes e ₂ and e ₃ respectively. The process time is t _f=4.2s. It should be remarked that the optimal solution for this case was achieved by eSS in less than 30 s of computation in a standard PC. Sequential hybrids of SRES and DE achieved similar values but requiring longer computation times.

The optimal solution of the above single-objective problem (see Equation (10)) results in a significant accumulation of intermediates at the end of the process which may be harmful for the cell. Thus, we formulated a generalized problem so as to find the best compromise between the time required to achieve a certain amount of product and the intermediates accumulation. Mathematically the formulation of this multi-objective problem is:

Find e over t∈ [ t _o,t _f] to minimize:

J = [t_{f}, \int_{t_{0}}^{t_{f}} (S_{2} + S_{3}) dt]

(15)

subject to the system dynamics presented in Equation (11), and with the constraints presented in Equations (13)–(14). The obtained Pareto front for the multi-objective problem (Equation (15)) is presented in Figure 1C. It can be observed that for process durations longer than 8 seconds the effect on the intermediate concentration is negligible, which means that no further improvements in the reduction of intermediates concentration can be achieved to satisfy the required amount of product.

To illustrate the differences in enzyme activation profiles for different trade-offs, Figure 1D presents the optimal profiles for three solutions from the Pareto front (P ₁ and P ₃ as extreme points, and P ₂ as a balanced trade-off between objectives). It can be noticed that the first enzyme is always fully activated, and its activation time is reduced when both objectives are considered (P ₂ and P ₃). It can also be noted that an intermediate zone is generated where both enzymes are activated. In this region S ₂ and S ₃ are produced and consumed in order to avoid their accumulation in the pathway. Additionally it is observed that in the final stage the third enzyme is fully activated to obtain the desired amount of product (S ₄).

Four-step linear pathway with Michaelis-Menten kinetics (LPDN4)

This example considers a four step linear pathway with Michaelis-Menten kinetics and was originally considered by Oyarzún et al. [6]. In this case, in order to obtain more realistic results, enzymes are assumed not to become activated instantaneously, but to follow first order kinetics. The pathway (Figure 2A) consists in four enzymatic reactions catalyzed by a specific enzyme (e _i) where S ₁ corresponds to the substrate, S ₂−S ₃ to the intermediate metabolites and S ₄ to the product. The objective of the system is to minimize the enzyme consumption and the time needed to reach a given steady state.

This single-objective problem reads mathematically as follows:

Find r over t∈[ t _o,t _f] to minimize:

J = \int_{t_{0}}^{t_{f}} (1 + \sum_{i = 1}^{n = 4} e_{i}) dt

(16)

Subject to the system dynamics:

\frac{d S}{dt} = N υ

(17)

where the enzyme dynamics is considered to be linear with the expression rate (r) and λ=0.5.

\frac{d e}{dt} = r - λ \cdot e

(18)

with:

N = [\begin{array}{l} - 1 & 0 & 0 & 0 \\ 1 & - 1 & 0 & 0 \\ 0 & 1 & - 1 & 0 \\ 0 & 0 & 1 & - 1 \end{array}]

and:

υ_{i} = \frac{k_{cati} \cdot S_{i}}{K_{M} + S_{i}} \cdot e_{i}

(19)

with the following end point constraints, which describe the given steady state.

e (t_{f}) = e_{SS}

(20)

In addition the following constraints are imposed to limit the amount of enzymes and their rates:

0 \leq r_{i} \leq 1 mM. s^{- 1}

(21)

0 \leq e_{i} \leq E_{t}

(22)

With: E _T=1 m M, k _{c a t 1}=1 s ⁻¹, k _{c a t 2}=2 s ⁻¹, k _{c a t 3}=4 s ⁻¹, k _{c a t 4}=3 s ⁻¹, K _M=1 s ⁻¹, V=0.2 m M/s, S ₁(t ₀)=5mM, S _i(t ₀)=0 for i=2,3,4, e _SS= [0.24 0.26 0.21 0.29] mM for i=1,2,3,4.

Note that in this case the expression rates are computed via dynamic optimization and the optimal enzyme activation profiles are obtained from the Equation (18).

The optimal value J ^∗ = 6.154 was obtained using eSS. Note that this result is in good agreement with the one previously reported (J ^∗ = 6.298) [35]. Figure 2B presents the optimal metabolite and enzyme dynamics for the single-objective case (Equation (16)). As expected, the incorporation of the enzyme dynamics slows down the entire process. The optimal profiles for the expression rates follow a switching pattern that matches the pathway topology, leading to enzyme profiles that follow a sequential activation in agreement with previous results. The enzyme profiles show that for the synthesis of one enzyme the degradation of the previous enzyme is needed. The constraints imposed in the final amount of metabolites require a high accumulation of metabolites in the system and this could be lethal for the system [36].

It is interesting to note that in two previous works [6, 35] a combined objective function including the transition time and the enzyme cost was proposed, obtaining a single trade-off solution. In other words, they obtained one of all the possible Pareto solutions. Here we go a step a further and formulate and solve the multi-objective optimization problem in order to obtain the full Pareto front. Mathematically the formulation of the multi-objective problem is:

Find e over t∈[ t _o,t _f] to minimize:

J = [t_{f}, \int_{t_{0}}^{t_{f}} (\sum_{i = 1}^{n = 4} e_{i}) \cdot dt]

(23)

Subject to Equations (17)–(18) and with the constraints presented in Equations (21)–(22).

Due to the final time constraints used to define the steady state the system keeps several enzymes activated during the process, which implies an extra effort for the system as discussed in literature [7]. In this case end point constraints (Equation (20)) are replaced by a constraint on the final amount of product:

S_{4} (t_{f}) = 0.7

(24)

The Pareto front for the multi-objective case (Equation (23)) is presented in Figure 2C. It can be noticed that for large times (around 15 seconds) the effect on the reduction of enzyme concentration is negligible, which means that the enzyme consumption can not be further reduced to achieve the desired amount of product. In Figure 2D optimal profiles for expression rates and enzyme dynamics are shown in order to check how enzyme activation is affected by the different trade-off. P ₁ corresponds to the minimization of the time and P ₂ and P ₃ to different trade-off between objectives.

We also notice that the optimal profiles for the expression rates follow a switching pattern that matches with the pathway topology, i.e. is reflected in a sequential activation of the enzymes. Before activating one enzyme, the degradation process of the previous one has started (i.e. the cell has only a certain amount of protein available). This situation is more relevant when the enzyme consumption is reduced. Note that when the process time increases the activation time of the enzymes is slightly reduced (this decrease is more relevant for short process times).

Glycolysis inspired network (GBD)

The original problem was considered by Bartl et al. [15]. Results achieved for that formulation are discussed in Additional file 1. Here we propose a more realistic formulation that incorporates the enzyme dynamics. The enzyme dynamics (Equation (27)) are considered to be linear with the expression rate (r _i). The pathway (Figure 3A) consists of four enzymatic reactions with one branch where S ₁ corresponds to the substrate, S ₂−S ₄ to the intermediate metabolites and S ₅ to the product. The hypothesis in this problem is that the pathway activation minimizes the time needed to transform the substrate (S ₁) into a fixed amount of product (S ₅). Note that the substrate S ₁ is not consumed during the process.

The activation profiles for the single-objective case may be acquired by computing r over t∈ [ t _o,t _f] to minimize the cost function.

J = t_{f}

(25)

Subject to the system dynamics:

\begin{array}{lcr} \frac{d S}{dt} = N υ \end{array}

(26)

\begin{array}{lcr} \frac{d e}{dt} = r - λ \cdot e \end{array}

(27)

Where:

υ_{i} = \frac{k_{cati} \cdot S_{i}}{K_{M} + S_{i}} \cdot e_{i}

(28)

With:

N = [\begin{array}{l} 0 & 0 & 0 & 0 \\ 1 & - 1 & 0 & 0 \\ 0 & 1 & - 1 & 0 \\ 0 & 1 & 1 & - 1 \\ 0 & 0 & 0 & 1 \end{array}]

and the following end point constraints:

S_{5} (t_{f}) = P (t_{f})

(29)

and the following path constraint:

\sum_{i = 1}^{4} e_{i} \leq E_{T}

(30)

With E _T=1mM, P(t _f)=0.75mM, k _cati=1 s ⁻¹, K _M=1 s ⁻¹, S ₁(t ₀)=1mM, S _i(t ₀)=e _i(t ₀)=0 for i=2,3,4,5, λ=0.5 s ⁻¹.

The optimal value (J ^∗=9.5) was obtained using scatter search (eSS) in less than 50 s of CPU time. Detailed optimization results are presented in Additional file 1 (Table S.13). The corresponding optimal enzyme activation profiles for the single-objective case (Equation (25)) are shown in Figure 3B. Again the optimal profiles for the expression rate follow a switching pattern that matches with the pathway topology leading to enzyme profiles that follow a sequential activation profile.

The enzyme profiles show that for the synthesis of one enzyme the degradation of the previous is needed, meeting the constraint on the total amount of enzyme (Equation (30)). As in previous cases, there is a high accumulation of metabolites, which could be harmful for the cell. Note that the problem formulation could be modified so to predict the scenario with no accumulation of intermediates. In addition, these calculations were preformed for a fixed λ value, but of course the problem can be easily solved to consider other cases where genes might have different expression rates and proteins might have different degradation rates.

We now consider a multi-objective formulation, extending the objective function and keeping unchanged the rest of the problem. Mathematically, the multi-objective problem is formulated as follows:

Find r over t∈ [ t _o,t _f] to minimize:

J = [t_{f}, \int_{t_{0}}^{t_{f}} (\sum_{i = 1}^{4} e_{i}) \cdot dt]

(31)

The Pareto front is presented in Figure 3C for the objectives in Equation (31). For long process times, no significant improvement in the enzyme cost is achieved. The optimal enzyme profiles corresponding to 3 points in the Pareto front are depicted in Figure 3D. P ₁ corresponds to the single-objective case (time minimization), and P ₂ and P ₃ represent different trade-offs between process time and enzyme cost minimization. The optimal profiles for the expression rate follow a switching pattern that matches with the pathway topology. In P ₁ we observed that for the synthesis of one enzyme the degradation of the previous is needed. In fact, from P ₂ and P ₃ it can be noted that such situation gains relevance as we move in the Pareto front.

One interesting and common situation in branched pathways is that the system could have two different outputs (p.e. produce several amino acids in sensible ratios for protein synthesis), which in practice means that the cell resources are distributed accordingly. The introduction of a second output in the single-output pathway was considered to study the behavior of the system if several products are generated. This new pathway is presented in Figure 4A.

The new multi-objective problem mathematically reads as: find activation profiles by computing r over t∈[ t _o,t _f] to maximize:

J = [S_{5}, S_{6}]

(32)

Subject to the system dynamics presented in Equations (26)–(27), with:

N = [\begin{array}{l} 0 & 0 & 0 & 0 \\ 1 & - 1 & 0 & 0 \\ 0 & 1 & - 1 & - 1 \\ 0 & 1 & 1 & - 1 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 1 \end{array}]

And with the path constraint presented in (30). In this case the final time was considered constant (t _f=15 s). The Pareto front for the problem formulated in Equation (32) is presented in Figure 4B, where point P ₁ and P ₃ stand for single-objective situations were only one product is produced, and P ₂ is a balanced trade-off. In Figure 4C the optimal profiles of the selected Pareto points (P ₁, P ₂ and P ₃) are depicted. In all cases we can see a sequential enzymatic activation. In P ₁ four enzymes are activated to convert the substrate (S ₁) in the product S ₅. In P ₃ only 3 enzymes are activated to obtain the product S ₆. Regarding P ₂, it can be noted that e ₁ and e ₂ are sequentially activated during the early process. Later on, in the intermediate stage, e ₄ and e ₅ are simultaneously activated. And finally, at the end of the process, only e ₄ is active.

Central metabolism of Saccharomyces cerevisiae during diauxic shift (SC)

The diauxic shift is characterized by decreased growth rate and by switching metabolism from glycolysis to aerobic utilization of ethanol under conditions of glucose depletion. This allows the maintenance of the cellular redox potential, NADH/NAD and ATP levels, enabling the cell to survive over longer periods without nutrients, in a so called quiescent state. The idea is to explain this particular behavior by computing the enzymatic activation profile that maximizes the survival time.

This case was considered by Klipp et al. [3, 4] using orthogonal collocation in combination with a genetic algorithm. The pathway (Figure 5A) consists in a simplified model of the central metabolism of Saccharomyces cerevisiae. The system complies six reactions: upper glycolysis, lower glycolysis, ethanol formation, ethanol consumption TCA cycle and respiratory chain. The model has arbitrary units.

Mathematically the single-objective problem is formulated as follows:

Find e over t∈ [ t _o,t _f] to maximize:

J = t_{f}

(33)

Subject to the system dynamics:

\frac{d S}{dt} = N υ

(34)

Where S and N correspond to:

S = {[X_{1}, X_{2}, X_{3}, X_{4}, NADH, ATP, NAD, ADP]}^{T}

(35)

N = [\begin{array}{l} - 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 2 & - 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & - 1 & 1 & - 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & - 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & - 1 & 1 & - 4 & - 1 & - 1 & 0 \\ - 1 & - 2 & 0 & 0 & 0 & 3 & 0 & - 1 \\ 0 & - 1 & 1 & - 1 & 4 & 1 & 1 & 0 \\ 1 & 2 & 0 & 0 & 0 & - 3 & 0 & 1 \end{array}]

And:

\begin{array}{lcl} υ_{1} & = & e_{1} \cdot X_{1} \cdot ATP \\ υ_{2} & = & e_{2} \cdot X_{2} \cdot NAD \cdot ADP \\ υ_{3} & = & e_{3} \cdot X_{3} \cdot NADH \\ υ_{4} & = & e_{4} \cdot X_{4} \cdot NAD \\ υ_{5} & = & e_{5} \cdot X_{3} \cdot NAD \\ υ_{6} & = & e_{6} \cdot NADH \cdot ADP \\ υ_{7} & = & k_{7} \cdot ATP \\ υ_{8} & = & k_{8} \cdot NAD \end{array}

(36)

With a constraint in the total amount of enzyme:

\sum_{i = 1}^{6} e_{i} \leq E_{T}

(37)

And:

\begin{array}{lcr} NADH \geq {NADH}_{c} \end{array}

(38)

\begin{array}{lcr} ATP \geq {ATP}_{c} \end{array}

(39)

With: X ₁(t ₀)=1, X ₂(t ₀)=1, X ₃(t ₀)=1, X ₄(t ₀)=10, N A D H(t ₀)=0.7, N A D(t ₀)=0.3, A T P(t ₀)=0.8, A D P(t ₀)=0.2, NADH _c=0.5, ATP _c=0.7, k ₇=3 and k ₈=0.1.

It should be noted that constraints (38)–(39) indicate that the cells will survive provided the concentrations of NADH and ATP are above some given critical values (NADH _c and ATP _c respectively).

In order to compare our results with the ones reported in the work by Klipp et al. [4] and the available experimental data, two types of enzyme CVP approximations were considered: i) a step-wise approximation with 120 constant elements for each enzyme (i.e. 721 decision variables) and ii) a piecewise linear approximation with 8 variable length linear elements (i.e. 55 decision variables).

The optimal value (J* = 95.8) was again obtained using eSS scatter search. Note that the optimal solution is highly sensitive to the switching times (details are given in Additional file 1). The optimal profiles and the metabolite dynamics for the single-objective problem (Equation (33)) are depicted in Figure 5B, showing a higher activity of the lower part of the glycolysis (e ₂) and in the ethanol formation (e ₃), as expected. During the last part of the process the activity of (e ₄) is increased allowing the formation of NADH. Also there is an increase in the tricarboxylic acid cycle (e ₅) and in the respiratory chain (e ₆) which compensate the decline in supply of NADH and ATP.

In the above single-objective formulation (Equation (33)), we can interpret that the system tries to survive as long as possible without any limitation on the investment of resources used. In the next case, we try to find the best compromise between the survival of the system and the resources used to do that. The aim in this case is to maximize the survival time with a low investment on the enzyme cost. We formulate the optimization problem as a multi-objective problem (with objectives time and the enzyme cost). Mathematically the multi-objective problem is formulated as follows:

Find e over t∈ [ t _o,t _f] to maximize:

J = [t_{f}, - \int_{t_{0}}^{t_{f}} (\sum_{i = 1}^{6} e_{i}) dt]

(40)

The set of solutions for the maximization problem (Equation (40)) is presented in Figure 5C. P ₁ corresponds to the single-objective situation above, while P ₂, P ₃ and O are different trade-offs among the objectives. There, we can observe that the survival time of the system is not significantly reduced by a decrease on the enzyme consumption until the reduction on the enzyme consumption is over 60%. For low enzyme consumptions (under 60%), the survival time is clearly affected and diminishes rapidly.

The reduction on enzyme consumption is reflected as a reduction on the initial value of enzymes and their dynamics during the process. In Figure 5E optimal profiles for e ₅ obtained for four different points of the Pareto front are compared. Besides the reduction on the initial concentration of the enzyme, it can be observed that in P ₁ and O there is a sharp increase on the activity of the enzyme towards then end of the process. This dynamic is significantly reduced in P ₂, and disappears in P ₃, where the enzyme has a constant activation value. Complete optimal profiles are presented in Additional file 1.

We computed, for all the Pareto points, the distance between experimental data (taken from [4]) and model predictions, finding that point “O” in the Pareto corresponds to the best fit. Figure 5D presents the comparison between experimental data and model predictions for this optimal trade-off solution. The numerical profiles are in good agreement with the general tendency of the experimental data. e ₁ and e ₃ slowly decrease the activity whereas e ₄ and e ₅ slowly increase their activity arriving to high activation values. e ₂ keeps an almost constant activation value along the time horizon.

Conclusions

This work proposes the use of multi-objective optimization, combined with an advanced dynamic optimization method, as a systematic way to predict or explain genetic metabolic regulation. The presented methodology can handle very general formulations, including several objectives, arbitrary network topologies and non-linear kinetics, large numbers of control variables (enzymes or activation rates), and general path and point constraints on controls and states (metabolites concentrations).

The methodology is based on the transformation of the original multi-objective dynamic optimization problem into a set of non-linear programming (NLP) problems by means of the control vector parametrization. The NLP problems can be solved using the combination of an initial value problem solver and an efficient global optimizer. The need of global optimization arises from the non-convexity of these problems.

To illustrate this methodology, two sets of problems were considered. A first set of single-objective examples were taken from the literature. The obtained results are comparable or better than those previously published and were obtained with a minimum computational cost, between a few seconds and a few minutes in a standard PC. Our results show clearly that a hybrid metaheuristic based on scatter search was the most robust and efficient solver when dealing with this class of problems.

A second set of multi-objective formulations were considered, for the first time, to achieve more biologically meaningful results. The Pareto front which represents the different trade-offs between objectives was obtained by transforming the multi-criteria problem into a set of single-objective problems, each of them solved by means of the previously mentioned methodology. Interestingly, the optimal activation profiles computed exhibited the “just-in-time” behavior in most cases and for both single and multi-criteria formulations. By definition, all the solutions in a Pareto front are equally optimal. However, if one introduces an additional requirement or constraint (e.g. fit to experimental data), it is then possible to select a single solution from the front which meets such additional criteria (as illustrated with the S. cerevisiae case study).

The methodology presented here opens up new possibilities, especially with respect to the handling of medium and large scale complex networks. It will also enable further research, including experimental validation, in order to asses biological optimality in terms of Pareto-optimal trade-offs.

References

Sutherland WJ: The best solution. Nature. 2005, 435: 569-10.1038/435569a.
Article CAS PubMed Google Scholar
Orth JD, Thiele I, Palsson BO: What is flux balance analysis?. Nat Biotechnol. 2010, 28 (3): 245-248. 10.1038/nbt.1614.
Article PubMed Central CAS PubMed Google Scholar
Klipp E, Holzhütter HG, Heinrich R: Reprogramming of the metabolic system by altering gene expression. Proceedings of the 9th Int. Meeting on BioThermoKinetics: Animating the Cellular Map. 2000, South Africa: Stellenbosch University Press, 65-70.
Google Scholar
Klipp E, Heinrich R, Holzhütter HG: Prediction of temporal gene expression. Metabolic optimization by re-distribution of enzyme activities. Eur J Biochem. 2002, 269: 5406-5413. 10.1046/j.1432-1033.2002.03223.x.
Article CAS PubMed Google Scholar
Zaslaver A, Mayo AE, Rosenberg R, Bashkin P, Sberro H, Tsalyuk M, Surette M, Alon U: Just-in-time transcription program in metabolic pathways. Nat Genet. 2004, 36: 486-491. 10.1038/ng1348.
Article CAS PubMed Google Scholar
Oyarzún DA, Ingalls BP, Middleton RH, Kalamatianos D: Sequential activation of metabolic pathways: a dynamic optimization approach. Bull Math Biol. 2009, 71: 1851-1872. 10.1007/s11538-009-9427-5.
Article PubMed Google Scholar
Bartl M, Li P, Schuster S: Modelling the optimal timing in metabolic pathway activation-use of Pontryagin’s Maximum Principle and role of the golden section. Biosystems. 2010, 101: 67-77. 10.1016/j.biosystems.2010.04.007.
Article CAS PubMed Google Scholar
Banga JR, Balsa-Canto E: Parameter estimation and optimal experimental design. Essays Biochem. 2008, 45: 195-210. 10.1042/BSE0450195.
Article CAS PubMed Google Scholar
Lebiedz D, Maurer H: External optimal control of self-organisation dynamics in a chemotaxis reaction diffusion system. IET Syst Biol. 2004, 2: 222-229.
Article Google Scholar
Salby O, Sager S, Shaik OS, Kummer U, Lebiedz D: Optimal control of self-organized dynamics in cellular signal transduction. Math Comput Model Dynamical Syst. 2007, 13: 487-502. 10.1080/13873950701243969.
Article Google Scholar
Vilas C, Balsa-Canto E, Garcia MSG, Banga JR, Alonso AA: Dynamic optimization of distributed biological systems using robust and efficient numerical techniques. BMC Syst Biol. 2012, 6: 79-10.1186/1752-0509-6-79.
Article PubMed Central PubMed Google Scholar
Ou J, Yamada T, Nagahis K, Hirasawa T, Furusawa C, Yomo T, Shimizu H: Dynamic change in promoter activation during lysine biosynthesis in Escherichia coli cells. Mol Biosyst. 2008, 4: 128-134. 10.1039/b711035a.
Article CAS PubMed Google Scholar
Chechik G, Oh E, Rando O, Weissman J, Regev A, Koller D: Activity motifs reveal principles of timing in transcriptional control of the yeast metabolic network. Nat Biotechnol. 2008, 26: 1251-1259. 10.1038/nbt.1499.
Article PubMed Central CAS PubMed Google Scholar
Oyarzún DA: Optimal control of metabolic networks with saturable enzyme kinetics. IET Syst Biol. 2011, 5 (2): 110-119. 10.1049/iet-syb.2010.0044.
Article Google Scholar
Bartl M, Kötzing M, Kaleta C, Schuster S, Li P: Just-in-time activation of a glycolysis inspired metabolic network - solution with a dynamic optimization approach. Proceedings 55nd International Scientific Colloquium. 2010, Ilmenau (Germany): Universitätsbibliothek Ilmenau
Google Scholar
Handl J, Kell DB, Knowles J: Multiobjective optimization in bioinformatics and computational biology. Computat Biol Bioinformatics, IEEE/ACM Trans. 2007, 4 (2): 279-292.
Article CAS Google Scholar
Sendin JOH, Vera J, Torres NV, Banga JR: Model based optimization of biochemical systems using multiple objectives: a comparison of several solution strategies. Math Comput Model Dynamical Syst. 2006, 12 (5): 469-487. 10.1080/13873950600723442.
Article Google Scholar
Sendin JOH, Exler O, Banga JR: Multi-objective mixed integer strategy for the optimisation of biological networks. IET Syst Biol. 2010, 4 (3): 236-248. 10.1049/iet-syb.2009.0045.
Article CAS PubMed Google Scholar
El-Samad H, Khammash M, Homescu C, Petzold L: Optimal performance of the heat-shock gene regulatory network. Proceedings 16th IFAC World Congress, Volume 16. 2005, Prague, Czech Republic: Elsevier, 2206-2206.
Google Scholar
Higuera C, Villaverde AF, Banga JR, Ross J, Moran F: Multi-criteria optimization of regulation in metabolic networks. PLoS ONE. 2012, 7 (7): e41122-10.1371/journal.pone.0041122.
Article PubMed Central CAS PubMed Google Scholar
Shoval O, Sheftel H, Shinar G, Hart Y, Ramote O, Mayo A, Dekel E, Kavanagh K, Alon U: Evolutionary trade-offs, pareto optimality, and the geometry of phenotype space. Science. 2012, 336 (6085): 1157-1160. 10.1126/science.1217405.
Article CAS PubMed Google Scholar
Noor E, Milo R: Efficiency in evolutionary trade-offs. Science. 2012, 336 (6085): 1114-1115. 10.1126/science.1223193.
Article CAS PubMed Google Scholar
Szekely P, Sheftel H, Mayo A, Alon U: Evolutionary tradeoffs between economy and effectiveness in biological homeostasis systems. PLOS Comput Biol. 2013, 9: e1003163-10.1371/journal.pcbi.1003163.
Article PubMed Central CAS PubMed Google Scholar
Wessely F, Bartl M, Guthke R, Li P, Schuster S, Kaleta C: Optimal regulatory strategies for metabolic pathways in Escherichia coli depending on protein costs. Mol Syst Biol. 2011, 7: 515-528.
Article PubMed Central PubMed Google Scholar
Banga JR, Balsa-Canto E, Moles C, Alonso AA: Dynamic optimization of bioprocesses: efficient and robust numerical strategies. J Biotechnol. 2005, 117: 407-419. 10.1016/j.jbiotec.2005.02.013.
Article CAS PubMed Google Scholar
Biegler LT, Cervantes AM, Wätcher A: Advances in simultaneous strategies for dynamic process optimization. Chem Eng Sci. 2002, 57 (4): 575-593. 10.1016/S0009-2509(01)00376-1.
Article CAS Google Scholar
Bock HG, Plitt KJ: A multiple shooting algorithm for direct solution of optimal control problems. Proceedings 9th IFAC World Congress. 1984, New York: Pergamon Press, 242-247.
Google Scholar
Vassiliadis VS, Sargent RWH, Pantelides CC: Solution of a class of multistage dynamic optimization problems. 1. Problems without path constraints. Ind Eng Chem Res. 1994, 33 (9): 2111-2122. 10.1021/ie00033a014.
Article CAS Google Scholar
Balsa-Canto E, Vassiliadis VS, Banga JR: Dynamic optimization of single- and multi-stage systems using a hybrid stochastic-deterministic method. Ind Eng Chem Res. 2005, 44 (5): 1514-1523. 10.1021/ie0493659.
Article CAS Google Scholar
Egea JA, Rodriguez-Fernandez M, Banga JR, Marti R: Scatter search for chemical and bio-process optimization. J Glob Optimization. 2007, 37 (3): 481-503. 10.1007/s10898-006-9075-3.
Article Google Scholar
Egea JA, Balsa-Canto E, Garcia MG, Banga JR: Dynamic optimization of nonlinear processes with an enhanced scatter search method. Ind Eng Chem Res. 2009, 48 (9): 4388-4401. 10.1021/ie801717t.
Article CAS Google Scholar
Runarsson TP, Yao X: Stochastic ranking for constrained evolutionary optimization. IEEE Trans Evol Comput. 2000, 564: 284-294.
Article Google Scholar
Storn R, Price K: Differential evolution - a simple and efficient heuristic for global optimization over continuous spaces. J Glob Optimization. 1997, 11: 341-359. 10.1023/A:1008202821328.
Article Google Scholar
Egea JA, Martí R, Banga JR: An evolutionary method for complex-process optimization. Comput Oper Res. 2010, 37 (2): 315-324. 10.1016/j.cor.2009.05.003.
Article Google Scholar
Oyarzún DA: A control-theoretic approach to dynamic optimization of metabolic networks. PhD thesis. PhD thesis. National University of Ireland Maynooth, Ireland 2010
Google Scholar
Cascante M, Lloréns M, Meléndez-Hevia E, Puigjaner J, Montero F, Martí E: The metabolic productivity of the cell factory. J Theor Biol. 1996, 153: 317-325.
Article Google Scholar

Download references

Acknowledgements

This research received financial support from the Spanish Ministerio de Economía y Competitividad (and the FEDER) through the project “MultiScales” (DPI2011-28112-C04-03), and from the CSIC intramural project “BioREDES” (PIE-201170E018). Gundián M. de Hijas Liste acknowledges financial support from the MICINN-FPI programme.

Author information

Authors and Affiliations

Bioprocess Engineering Group, Spanish National Research Council, IIM-CSIC, C/Eduardo Cabello 6, 36208, Vigo, Spain
Gundián M de Hijas-Liste, Eva Balsa-Canto & Julio R Banga
Theoretical Biophysics, Humboldt-Universität zu Berlin, Invalidenstr. 42, 10115, Berlin, Germany
Edda Klipp

Authors

Gundián M de Hijas-Liste
View author publications
You can also search for this author in PubMed Google Scholar
Edda Klipp
View author publications
You can also search for this author in PubMed Google Scholar
Eva Balsa-Canto
View author publications
You can also search for this author in PubMed Google Scholar
Julio R Banga
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Eva Balsa-Canto or Julio R Banga.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

GMHL performed the computations. EBC implemented the dynamic optimization code. EBC and JRB contributed to the conception and design of the work. EBC, JRB and EK supervised the project and helped to draft the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material

12918_2013_1260_MOESM1_ESM.pdf

Additional file 1:Further details on the use of global dynamic optimization to predict the activation in metabolic pathways. The Additional file 1 presents a more detailed description of the numerical approaches used in this work as well as a comparative study of the results achieved. (PDF 488 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Authors’ original file for figure 12

Authors’ original file for figure 13

Authors’ original file for figure 14

Authors’ original file for figure 15

Authors’ original file for figure 16

Authors’ original file for figure 17

Authors’ original file for figure 18

Authors’ original file for figure 19

Authors’ original file for figure 20

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

de Hijas-Liste, G.M., Klipp, E., Balsa-Canto, E. et al. Global dynamic optimization approach to predict activation in metabolic pathways. BMC Syst Biol 8, 1 (2014). https://doi.org/10.1186/1752-0509-8-1

Download citation

Received: 04 February 2013
Accepted: 20 December 2013
Published: 06 January 2014
DOI: https://doi.org/10.1186/1752-0509-8-1

Global dynamic optimization approach to predict activation in metabolic pathways

Abstract

Background

Results

Conclusions

Background

Methods

Problem formulation

Numerical methods

Control vector parametrization

Nonlinear programming methods

Multi-objective optimization methods

Results and discussion

Three-step linear pathway with mass action kinetics (LPN3B)

Four-step linear pathway with Michaelis-Menten kinetics (LPDN4)

Glycolysis inspired network (GBD)

Central metabolism of Saccharomyces cerevisiae during diauxic shift (SC)

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Additional information

Competing interests

Authors’ contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Systems Biology

Contact us