Decoding complex biological networks - tracing essential and modulatory parameters in complex and simplified models of the cell cycle
- Olivia Eriksson^{1, 3}Email author,
- Tom Andersson^{1},
- Yishao Zhou^{2} and
- Jesper Tegnér^{3}Email author
DOI: 10.1186/1752-0509-5-123
© Eriksson et al; licensee BioMed Central Ltd. 2011
Received: 24 February 2010
Accepted: 7 August 2011
Published: 7 August 2011
Abstract
Background
One of the most well described cellular processes is the cell cycle, governing cell division. Mathematical models of this gene-protein network are therefore a good test case for assessing to what extent we can dissect the relationship between model parameters and system dynamics. Here we combine two strategies to enable an exploration of parameter space in relation to model output. A simplified, piecewise linear approximation of the original model is combined with a sensitivity analysis of the same system, to obtain and validate analytical expressions describing the dynamical role of different model parameters.
Results
We considered two different output responses to parameter perturbations. One was qualitative and described whether the system was still working, i.e. whether there were oscillations. We call parameters that correspond to such qualitative change in system response essential. The other response pattern was quantitative and measured changes in cell size, corresponding to perturbations of modulatory parameters. Analytical predictions from the simplified model concerning the impact of different parameters were compared to a sensitivity analysis of the original model, thus evaluating the predictions from the simplified model. The comparison showed that the predictions on essential and modulatory parameters were satisfactory for small perturbations, but more discrepancies were seen for larger perturbations. Furthermore, for this particular cell cycle model, we found that most parameters were either essential or modulatory. Essential parameters required large perturbations for identification, whereas modulatory parameters were more easily identified with small perturbations. Finally, we used the simplified model to make predictions on critical combinations of parameter perturbations.
Conclusions
The parameter characterizations of the simplified model are in large consistent with the original model and the simplified model can give predictions on critical combinations of parameter perturbations. We believe that the distinction between essential and modulatory perturbation responses will be of use for sensitivity analysis, and in discussions of robustness and during the model simplification process.
Background
In recent years, numerous studies characterizing static topological properties of molecular networks derived from heterogeneous sources such as metabolic, transcriptomic, transcription factor and protein-protein interaction data have been published [1]. Concurrently, several studies have revealed the dynamics of small-scale molecular networks using innovative applications of reverse-engineering techniques, modeling and computer simulation of kinetic equations [2]. However, observations regarding overall parameter robustness of biological circuits may imply that beyond the apparent model complexity there exists, in a mathematical sense, one or several core systems or operational principles driving the system dynamics that are shielded by the more complex original dynamical equations [3–6]. Such a dynamical core is not necessarily apparent from the connectivity graph or from the original dynamical equations. The existence of such a core has been supported by studies employing an ensemble simulation approach in the analysis of various computational models [7, 8]. Essentially, a large number of different parameter configurations exist that produce the "same" dynamical output from the system, as emphasized in a study by Gutenkunst et al [9], whereby, using a large set of biochemical models, they found a "sloppy" parameter sensitivity spectra to be a universal characteristic of these models.
To advance our understanding of molecular networks, we need to develop methods that permit an analysis of such underlying core dynamics. Research on model reduction and reformulation has been at the forefront within the systems biology community over the last few years (e.g. [4, 10–16]). Several studies have targeted the elaborate original model using different approaches, and have arrived at a more compact system description, which in different ways capture some aspects of the essence of the original model dynamics. Using model reduction tools, the number of degrees of freedom are reduced, while still remaining within the same model formalism as the original model. By reformulating the model, the original model is approximated to another hopefully more transparent modeling formalism, with a structure that better captures the key qualities of the original system. Examples of model reduction [17] include lumping of variables [10, 11], separation of timescales [14] (or, for example, the classical Michaelis-Menten equation describing enzyme kinetics), sensitivity analysis based methods, and methods based on identifiability analysis [13]. Examples of transformation of modeling formalism include boolean approximations [4, 12], hybrid stochastic approximations [18], or the simplified model used throughout this study, a delayed piecewise linear approximation [15] of an ordinary differential model.
The cell cycle is one of the most extensively studied and essential biological regulatory circuits. The differential computational models pioneered and developed by Novak and Tyson [2, 19] describing cell cycle regulation of fission yeast, incorporates and accounts for a large body of experimental data. During the cell cycle, the cell grows, DNA replicates, and the cell divides into two daughter cells. To maintain cell size over several generations, it is essential that there is a controlled balance between cell growth and cell division. The model used in the present study [19] describes the key components of that regulatory mechanism, consisting of proteins and protein complexes that drive the cell cycle in fission yeast. It is detailed in the Methods (equations (3-16)) and is referred to throughout as the NT-model. The NT-model is thus our point of departure for the model simplification process and sensitivity analyses. For a detailed description of the NT-model see [19]. For a more general discussion of cell cycle dynamics see [2].
To examine the dynamical properties of the original NT-model, we performed a sensitivity analysis. Two different types of output responses to parameter perturbations were considered. One was qualitative and described whether the system was still working, i.e. whether there were oscillations (cell divisions) corresponding to parameters that here we call "essential". The other response pattern was quantitative and measured changes in cell size (cell growth), which described "modulatory" effects of parameter perturbations. This is primarily a methodological distinction, i.e. response effects related to parameter perturbations. In principle, parameters could be both essential and modulatory, depending on the perturbations in question. In practice, when conducting sensitivity analyses, the key question is which parameters affect the system output, how, and under what conditions. Here, we extend the scope of the sensitivity analysis to evaluate the correspondence between perturbation effects in the original model and analytical results of the simplified model, focusing on two types of possible perturbation effects - essential and modulatory changes - anticipating that the simplified model should be able to account for both of these situations. The analysis includes three key steps: (1) perturbations of the parameters of the original NT-model; (2) a corresponding recalibration of the parameters of the simplified model; and (3) systematic comparison of the perturbation effects on the orginal and simplified models (Figure 1). The perturbation effects of the original model are investigated by numerical simulations whereas the corresponding effects of the simplified model are retrieved by evaluation of analytical expressions.
Sensitivity analysis refers to a broad range of mathematical and statistical analysis methods that serve to evaluate the relative merits of a set of parameters in controlling system dynamics. The aim is to relate variation (uncertainty) in the system output to variation (uncertainty) in the system input. The analysis method chosen depends on the goal. Campolongo et al [20] distinguish between three types of analysis: (1) factor screening; (2) local sensitivity analysis; and (3) global analysis. The first type refers to an analysis targeting the effects of parameter variation one at a time, i.e. variation in system output is related to parameters one at a time. It is a more or less qualitative assessment of the relevant factors in system dynamics. By contrast, local and global sensitivity analyses imply detailed quantitative assessments of relative effects of the isolated factors of concern. Local analysis makes use of differential calculus, and stability analysis of equilibrium points. Global sensitivity analysis targets response effects in the whole parameter space, as well as interaction effects between parameters. It relies to a larger extent on statistical methods. Our purpose in this study was to clarify the relevance of parameters in cell cycle dynamics. As a starting point, we performed a screening study to investigate the relative merits of single parameters in cell cycle regulation.
Results
Our analysis is based on a set, , of parameter perturbations of the original NT-model, consisting of 470 single parameter changes. This set was used to analyse the relationship between parameter values and model output (cell size and cycle time) through two different tracks, see Figure 1. During the first track, the sensitivity analysis, numerical simulations of the NT-model were conducted for all perturbations in , and scores of sensitivity were constructed for each model parameter. During track two, all perturbations of were translated to corresponding perturbations of the hybrid model and, instead of simulations, mathematical conditions were evaluated in order to predict the sensitivity to the perturbations, and thereby construct scores of sensitivity. Two measures of sensitivity were used in both tracks, targeting essential and modulatory effects of perturbations, respectively. Finally, the sensitivity scores from the two tracks were compared in order to i) validate the predictions from the hybrid model, and ii) investigate the role of essential and modulatory parameters and perturbations. The results are divided into the following sections: (I) sensitivity analysis of original model; (II) analytical predictions from the hybrid model; and (III) comparison between the analytical predictions and the sensitivity analysis. Finally, (IV) the hybrid model is used for more complicated predictions.
I Sensitivity analysis of the original model identifying essential and modulatory perturbation effects
The parameters of the original model, k 1,..., μ = p^{ org } (equation (16)), were perturbed one at a time, starting from a default set of parameter values corresponding to the wild-type cell (equation (16)). A perturbation consists of a change of the parameter value by a factor ps_{ j } (the relative Perturbation Size), where ps_{ j } ∈ ps = (10^{-1}, 10^{-0.8}, 10^{-0.6}, 10^{-0.4}, 10^{-0.2}, 10^{0}, 10^{0.2}, 10^{0.4}, 10^{0.6}, 10^{0.8}, 10^{1}). Each of the 47 parameters, , i ∈ 1,..., 47, was thus perturbed 11 times given the new value , j ∈ 1,..., 11, where is the default value of parameter . For each perturbed simulation, the output trajectory was examined for i) whether there were consistent oscillations or not, ii) cycle time t_{ CT }and iii) final (before cell division) cell mass M_{ end }.
We next defined measures of sensitivity summarizing how sensitive the system was to perturbations of the single parameter . Two measures were defined, one based on the cycle time of the oscillations, denoted , and the other based on final cell mass, . Since the cycle time was, in almost all cases, either constant t_{ CT } = 138.6 or non-existing (quantized cycle times or no oscillations), we decided to consider only whether we had proper cell cycle oscillations or not in the definition of . Proper oscillations were defined as trajectories that still oscillated after 4000 min, and had a narrow distribution of cycle times (to remove quantized behaviour). We therefore ended up with one discrete-qualitative measure and one continuous-quantitative . In the calculation of , only simulations with proper oscillations were considered. We denoted perturbations with a large effect on S^{ osc } essential, and with a large effect on modulatory.
The sensitivity analysis was carried out using two different ranges of parameter perturbations, narrow range perturbations close to the default parameter value, and wide range perturbations both close, but also further away. For narrow range perturbations, and were calculated using only the two perturbations closest to the default value, i.e. ps_{ narrow } = (10^{-0.2}, 10^{0}, 10^{0.2}), while for wide range, all perturbation sizes were used, ps_{ wide } = ps.
Essential parameters
Essential perturbation effects - comparison between the hybrid model predictions and the original model results
Essential Parameter Effects | |||||
---|---|---|---|---|---|
Narrow Range | Wide Range | ||||
Parameter | Original | Hybrid | Parameter | Original | Hybrid |
0 | 0 | k _{10} | 6 | 1 | |
0 | 0 | k _{9} | 5 | 1 | |
0 | 0 | k _{6} | 4 | 4 | |
0 | 0 | 4 | 4 | ||
J _{i 25} | 0 | 0 | k _{14} | 3 | 0 * |
J _{a 25} | 0 | 0 | k _{13} | 3 | 0 * |
V _{i 25} | 0 | 0 | k _{8} | 3 | 3 |
V _{a 25} | 0 | 0 | k _{7} | 3 | 4 |
J _{ iwee } | 0 | 0 | J _{5} | 3 | 4 |
J _{ awee } | 0 | 0 | 2 | 0 | |
V _{ iwee } | 0 | 0 | 1 | 1 | |
V _{ awee } | 0 | 0 | J _{7} | 1 | 0 |
J _{16} | 0 | 0 * | 0 | 0 | |
J _{15} | 0 | 0 * | 0 | 0 | |
0 | 0 * | 0 | 0 | ||
0 | 0 * | J _{i 25} | 0 | 0 | |
k _{15} | 0 | 0 * | J _{a 25} | 0 | 0 |
k _{14} | 0 | 0 * | V _{i 25} | 0 | 0 |
k _{13} | 0 | 0 * | V _{a 25} | 0 | 0 |
k _{ diss } | 0 | 0 * | J _{ iwee } | 0 | 0 |
0 | 0 * | J _{ awee } | 0 | 0 | |
0 | 0 * | V _{ iwee } | 0 | 0 | |
k _{12} | 0 | 0 * | V _{ awee } | 0 | 0 |
k _{11} | 0 | 0 * | J _{16} | 0 | 0 * |
J _{10} | 0 | 0 | J _{15} | 0 | 0 * |
J _{9} | 0 | 0 | 0 | 0 * | |
k _{10} | 0 | 0 | 0 | 0 * | |
k _{9} | 0 | 0 | k _{15} | 0 | 0 * |
J _{8} | 0 | 0 | k _{ diss } | 0 | 0 * |
J _{7} | 0 | 0 | 0 | 0 * | |
k _{8} | 0 | 0 | 0 | 0 * | |
k _{7} | 0 | 0 | k _{12} | 0 | 0 * |
J _{5} | 0 | 0 | k _{11} | 0 | 0 * |
k _{6} | 0 | 0 | J _{10} | 0 | 0 |
0 | 0 | J _{9} | 0 | 0 | |
0 | 0 | J _{8} | 0 | 0 | |
J _{4} | 0 | 0 | 0 | 0 | |
k _{4} | 0 | 0 | J _{4} | 0 | 0 |
0 | 0 | k _{4} | 0 | 0 | |
J _{3} | 0 | 0 | 0 | 0 | |
0 | 0 | J _{3} | 0 | 0 | |
0 | 0 | 0 | 0 | ||
0 | 0 | 0 | 0 | ||
0 | 0 | 0 | 0 | ||
0 | 0 | 0 | 0 | ||
k _{1} | 0 | 0 | k _{1} | 0 | 0 |
Modulatory parameters
Modulatory perturbation effects - comparison between the hybrid model predictions and the original model results
Modulatory Parameter Effects | |||||
---|---|---|---|---|---|
Narrow Range | Wide Range | ||||
Parameter | Original | Hybrid | Parameter | Original | Hybrid |
k _{1} | 0.637 | 0.593 | k _{1} | 3.98 | 3.84 |
0.533 | 0.507 | 2.35 | 2.17 | ||
V _{i 25} | 0.523 | 0.25 | 1.77 | 2.35 | |
V _{a 25} | 0.521 | 0.249 | V _{i 25} | 1.27 | 0.764 |
0.361 | 0.502 | V _{a 25} | 1.27 | 0.762 | |
0.225 | 0.31 | k _{15} | 0.966 | 0. * | |
J _{i 25} | 0.222 | 0.29 | 0.965 | 0. * | |
0.168 | 0.106 | 0.948 | 0.0102 | ||
0.0367 | 0.0165 | 0.859 | 0.259 | ||
V _{ iwee } | 0.0356 | 0.237 | k _{13} | 0.83 | 0. * |
k _{13} | 0.0352 | 0. * | k _{14} | 0.823 | 0. * |
k _{14} | 0.0345 | 0. * | 0.801 | 1.08 | |
V _{ awee } | 0.0343 | 0.234 | J _{i 25} | 0.786 | 1.01 |
k _{11} | 0.0307 | 0. * | 0.555 | 0.408 | |
J _{4} | 0.0259 | 0.0148 | V _{ iwee } | 0.304 | 0.65 |
0.0237 | 0.0148 | V _{ awee } | 0.303 | 0.649 | |
0.0212 | 0. * | k _{9} | 0.207 | 0.396 | |
k _{9} | 0.0212 | 0.0191 | k _{11} | 0.202 | 0. * |
0.02 | 1.64 × 10^{-7} | J _{4} | 0.157 | 0.0941 | |
k _{7} | 0.0181 | 0.000916 | 0.149 | 0.094 | |
0.0142 | 0.000213 | 0.0817 | 0. * | ||
0.0142 | 0.000237 | 0.065 | 0.00137 | ||
0.0135 | 0.00256 | k _{7} | 0.0491 | 0.00615 | |
k _{6} | 0.0118 | 0.000217 | 0.0421 | 0.000484 | |
0.00821 | 0. * | J _{ awee } | 0.0392 | 0.097 | |
J _{ awee } | 0.00655 | 0.017 | J _{a 25} | 0.0366 | 0.0435 |
k _{4} | 0.00624 | 0.0134 | k _{4} | 0.0354 | 0.0632 |
J _{a 25} | 0.00574 | 0.00586 | 0.029 | 0. * | |
k _{8} | 0.00574 | 0.000655 | 0.0238 | 0. * | |
k _{10} | 0.00548 | 0.0189 | k _{10} | 0.022 | 0.396 |
J _{5} | 0.00291 | 0.000132 | k _{8} | 0.0127 | 0.00918 |
k _{15} | 0.00282 | 0. * | 0.0102 | 3.04 × 10^{-7} | |
0.00232 | 0. * | J _{5} | 0.00917 | 0.0000567 | |
0.000926 | 0. * | k _{6} | 0.00828 | 0.000456 | |
0.000595 | 0.00173 | 0.00567 | 0.0331 | ||
0.000429 | 5.63 × 10^{-9} | J _{15} | 0.00219 | 0. * | |
J _{15} | 0.000331 | 0. * | 0.00189 | 3.15 × 10^{-8} | |
k _{12} | 0.000263 | 0. * | k _{12} | 0.00144 | 0. * |
J _{ iwee } | 0.000166 | 0.00304 | J _{3} | 0.000874 | 0.000998 |
J _{10} | 0.000166 | 0.0007 | J _{9} | 0.000724 | 0.00169 |
J _{8} | 0.000166 | 0.000233 | J _{ iwee } | 0.000723 | 0.00895 |
J _{3} | 0.000166 | 0.00016 | J _{10} | 0.000694 | 0.00437 |
J _{16} | 0. | 0. * | J _{8} | 0.000306 | 0.00149 |
k _{ diss } | 0. | 0. * | J _{16} | 0.000271 | 0. * |
J _{9} | 0. | 0.00027 | k _{ diss } | 0.000207 | 0. * |
J _{7} | 0. | 1.72 × 10^{-7} | J _{7} | 0.0000497 | 1.1 × 10^{-6} |
With wide range perturbations (Table 2 original model), the top scoring parameters also include and k_{15}, which are related to TF (a transcription factor) activity, and related to the activity of the protein Ste9. This is most likely a result of the G1/S checkpoint not functioning properly in the presence of these large parameter perturbations further away from the default parameter set, and being active at a point where it should not. In the coming analysis, we denote parameters with as modulatory and with as non-modulatory. This, a bit arbitrarily chosen, limit corresponds to a relative average change of cell size of 5%.
IIA Analytical predictions from the hybrid model on essential and modulatory perturbation effects
In [15] we introduced a scheme by which we could simplify the NT-model. Dynamical switching modules were identified and replaced by time lagged step-functions, resulting in the hybrid DPL-model. This model approximation process is summarized in the Methods. The hybrid model (equations (17-19)) consists of four linear systems, defined by the matrices A _{ kl }, k, l ∈ {1, 2}, where the linear system at each time-point t is determined by, so called, switching rules (equation (18)). Figure 2 displays a numerical simulation of the DPL-system, indicating the different linear systems. A switch between systems is initiated when the output y(t) = [MPF](t), corresponding to the concentration of the protein complex MPF, crosses a threshold θ_{25/wee}or θ_{ slp/ste }. Note that when θ_{25/wee}is passed, the switch is immediate, whereas, when θ_{ slp/ste }is passed, the switch is delayed with τ minutes. The switching rule can therefore be considered as equipped with a memory, and we have denoted the model delayed piecewise linear. An analysis of the DPL-model [15] showed that each separate linear system, defined by the system matrix A _{ kl }, was stable, having an asymptotically stable fixed point ( , with real negative eigenvalues). Combining the four linear systems with the switching rules revealed, however, that some of the fixed points could never be reached without passing a switching threshold (and thereby a change of linear systems). Thus, these fixed points were no longer stable. The relationship between the location of the fixed points and the switching thresholds, therefore, determines the dynamical behaviour of the system. This relationship between fixed point and switching thresholds corresponds to analytical expressions including the parameters of the hybrid model, and allows us to present two types of important parameter/output relationships. The first type refers to necessary constraints (NC), which must be satisfied in order for the model to generate proper oscillations. Therefore NC identifies essential parameter changes. The second type is a function describing the size of the cell, or Size Function (SF). SF describes how cell size is influenced by hybrid model parameters, and therefore SF identifies modulatory parameter changes and their effects on cell size. When using these expressions in order to find out whether a parameter change is modulatory, essential or both, we always have the default parameter set as the point of departure for the change. With another set of default parameter values, the result could of course change. Note that NC is a qualitative test - either a parameter change violates the conditions (and the change is considered to be essential), or it does not; whereas SF is quantitative, giving numerical predictions on cell size.
Analytical results identifying essential parameter changes - necessary constraints NC
Two constraints, that are necessary in order to have cell cycle oscillations, were retrieved in the analysis of the hybrid model [15], and are further extended in the Methods section (equations (24,25)). These correspond to inequalities, including hybrid model parameters, that define regions of parameter values for which there cannot be proper oscillations (note that the necessary conditions only state that if not satisfied, there will not be oscillations, but not the opposite, i.e. they do not assure oscillations, which agrees with the definition of "essential" in the sensitivity analysis, i.e. anhilation of oscillation due to parameter perturbations). A change of hybrid parameter τ...μ = p^{ hyb } (equation (20)), that violates these constraints is considered as essential.
Analytical results identifying modulatory parameter changes - the size function SF
The second type of parameter/output relationship, the function SF (equations (21,22), Methods), describes how the final size of the cell (i.e. just before cell division), is influenced by hybrid model parameters. SF corresponds to the formation of a concentration threshold. When the concentration of the protein complex MPF passes this threshold (under normal circumstances), cell division is initiated. In cell cycle physiology, it corresponds to the G2/M checkpoint [2].
IIB Translating the analytical results from the hybrid model to the original model
To be able to compare the hybrid and original models (i.e. compare their respective parameter characterizations) we needed to translate the analytical result of the hybrid model to the original model. For this translation, we first defined a mapping from original model parameters to hybrid model parameters. We then used this mapping to backtrack the analytical results of the hybrid model to the original model parameters.
Mapping original model parameters onto hybrid model parameters
There is no simple algebraic relationship that could be used to define mappings between original and hybrid model parameters parameters, , i = 1...n_{ org } and , m = 1...n_{ hyb } , i.e. there is no analytical function for all m that can be used to relate the original and hybrid models. Instead, we constructed mappings numerically for different specific values of p^{ org } , namely those values corresponding to the perturbations used in the sensitivity analysis. First, each original parameter was perturbed n_{ pert } = 11 times according to the relative perturbation sizes, ps, described earlier, i.e. the parameter was given the value , j = 1...n_{ pert } (note here that single index denotes a parameter, whereas a double index denotes a specific parameter value). Next, the effect from the perturbation was translated to the hybrid parameters p^{ hyb } . The translation was done using a recalibration process, adapting the hybrid model to the original perturbed one through re-performing the hybrid approximation process and measuring the effect of the perturbation on for all m = 1...n_{ hyb } (Methods). This means that we, for each perturbation of the original model , j = 1...n_{ pert } , i = 1,..., n_{ org } , obtained a new set of hybrid mobel parameter values, denoted by .
Backtracking the analytical result from the hybrid model to the original model
After recalibration of the hybrid model, we translated the analytical result in terms of modulatory and essential effects, by backtracking the analytical conditions (NC and SF) to the original model. This means that for each perturbation of an original model parameter , we constructed a new hybrid model and examined the analytical expressions determining essential and modulatory effects. From this, we finally characterized the original parameter in terms of essential and/or modulatory. To sum up, the essential and modulatory effects of the original parameter perturbations are measured in two main ways: with respect to the original model (track 1), and with respect to analytical constraints and implications after recalibration of the hybrid model (track 2).
Bactracking the essential conditions
To backtrack the necessary constraints (NC, corresponding to essential parameter changes), to original parameter characterizations, we counted the total number (out of n_{ pert } ) of perturbations of which have a mapped effect such that the necessary constraints (equations (24,25)) are violated. From this we calculated a sensitivity score similar to the one from the sensitivity analysis, but based on the hybrid model, and therefore denoted . Thus, , where if perturbation j of parameter (mapped to the hybrid model) does not violate constraints (24,25) and otherwise, j ∈ 1,..., n_{ pert } .
Backtracking the modulatory conditions
To backtrack the function determining cell size (SF, corresponding to modulatory parameter changes) to original parameter characterizations, we calculated the average mapped effect that the n perturbations of (which do not violate the necessary constraints (24,25)) should have on final cell size, as calculated by SF. By this we get a calculated sensitivity score similar to the one from the sensitivity analysis, but based on the hybrid model and therefore denoted . Thus, , where is the, from SF (equations (21,22)), calculated final cell size after the j'th perturbation of parameter (mapped to the hybrid model), where j' corresponds to those j for which , and is the by SF calculated final cell size using the default parameter set (20).
There are two hybrid parameters that are not considered in the correspondence analysis. The first one is μ, which is related to the growth of the cell, i.e. the increase in cell mass M. Since the hybrid model is based on the assumption that the increase in M is very slow compared to other variables, perturbations of μ are not considered. The other parameter is τ, which has not been included in the analysis for practical reasons (Methods). Not all original parameter perturbations have a mapping to a hybrid parameter (data not shown). This is partly a reflection of the fact that parts of the original system are inactive during the experimental context that the hybrid model seeks to reproduce, and has therefore not been included in the hybrid model.
III Correspondence analysis of hybrid and original parameter characterizations
Once we had conducted the sensitivity analysis of the original model (I), and retrieved the corresponding predictions from the hybrid model (II), we were now in a position to compare the results (Figure 1). The parameters were divided into essential or non-essential (Table 1) and modulatory or non-modulatory (Table 2), with respect to the model in question: original or hybrid. We considered both narrow range and wide range perturbations. A parameter was characterized as modulatory if (or ) and as essential if (or ), and the parameter characterizations of the original and hybrid models were compared.
Sensitivity and specificity of hybrid model predictions
Prediction | sensitivity | specificity |
---|---|---|
modulatory-narrow range | 100% | 95% |
modulatory-wide range | 60% | 96% |
essential-narrow range | - | 100% |
essential-wide range | 67% | 100% |
In the hybrid model, modulatory parameters are defined by the size function SF, which corresponds, as earlier described, to the G2/M checkpoint of the cell cycle. The proteins behind this checkpoint are those that are most important to regulate the size of the cell in this context. If we did not know this and were to identify the mechanism regulating cell size, from this model comparison, we can see that we would be better of by the use of narrow perturbations rather than wide ones. If wide range perturbations are used, other parameters, which are not involved in this checkpoint, are also incorrectly identified, and therefore confusing the picture.
We can conclude that the correspondence between models is better near the default values of the model parameters (narrow range), and becomes worse further from this point. At the same time, it seems impossible to identify essential parameters without using perturbations further away from the default parameter setting. Finally, if we use narrow range perturbations to identify modulatory parameters and wide range perturbations to identify essential parameters, then we note that most parameters are either essential or modulatory for this set of perturbations.
Mis-predictions
There are two main types of mis-predictions at wide range perturbations. First, parts of the original model, for which there is no counterpart in the hybrid model (parameters indicated with * in Table 1 and Table 2), and that are not active at narrow range perturbations, have in fact an essential or modulatory effect at wide range perturbations. This indicates that the hybrid model performs quite well for predictions locally around the point in parameter space where the hybrid model was constructed. Further away from this point, the predictions are still quite good for those parts of the original system that were included in the hybrid model. The hybrid model cannot, however, capture what will happen to parts that were not included. Second, there are some mis-predictions that are most likely due to the approximation and discrepancy between the hybrid and original models. One such prediction was the parameters k_{9} and k_{10}, which are found to be more essential in the original model than in the hybrid model. k_{9} and k_{10} are important actors to obtain quantized output behaviour. Such behaviour cannot be mimicked by the hybrid model; rather, the hybrid model predicts that the system will have proper oscillations for some of the perturbations, which gives quantized oscillations. Also, the fact that we have not included the hybrid model parameter τ in the analysis might affect predictions.
Finally, we have a problem with mappings to the hybrid model parameter θ_{25/wee}. This parameter is different compared to the other hybrid parameters since it corresponds to the switching threshold of two step functions, s_{25} and s_{ wee } (Methods), when their respective switching thresholds are at the same level i.e. θ_{25/wee}= θ_{25} = θ_{ wee } when θ_{25} = θ_{ wee } . To be able to map a perturbation of the original model to θ_{25/wee}∈ p^{ hyb } that keeps this relationship (θ_{25} = θ_{ wee } ), and thus the structure of the hybrid model, at least two original parameters have to be perturbed simultaneously. However, the single parameter perturbations used in this study, only correspond to a change in either θ_{25} or θ_{ wee } . To obtain a numerical value for each mapping despite this, we assume that θ_{25/wee}= min(θ_{25},θ_{ wee } ), when the threshold is passed from below, and that θ_{25/wee}= max(θ_{25}, θ_{ wee } ), when the threshold is passed from above. Presuming that as soon as either θ_{25} or θ_{ wee } is passed, the other threshold will also soon be passed.
IV Combinatorial predictions from the hybrid model
After investigating the correspondance between the original and hybrid model concerning single parameter perturbations of the original model, we used the hybrid model to identify combinations of parameters aimed at a specific critical effect.
Essential combinations of parameters
The hybrid parameter θ_{25/wee}was shown to have the largest effect (Table 4), using these perturbations. Simulataneously, none of the original parameters mapped to θ_{25} or θ_{ wee } were found to be essential (data not shown). This emphasizes that there are important (essential) features in the original model that are robust towards single parameter perturbations. We described earlier that θ_{25/wee}∈ p^{ hyb } can only be mapped to from combinations of original parameters if the structure of the hybrid model is to be retained (i.e. θ_{25} = θ_{ wee } ). We therefore expect a critical effect on the original system by changing such a combination. A pairwise mapping to θ_{25/wee}∈ p^{ hyb } can be achieved by simultaneous changes in V_{i 25}∈ p^{ org } and V_{ awee } ∈ p^{ org } or V_{a 25}∈ p^{ org } and V_{ iwee }∈ p^{ org } . By adding the parameter s ∈ p^{ org } such that V_{i 25}→ sV_{i 25}, V_{ awee } → sV_{ awee } , and the parameter t ∈ p^{ org } such that V_{a 25}→ tV_{a 25}and V_{ iwee }→ tV_{ iwee } , then a perturbation of s ∈ p^{ org } or t ∈ p^{ org } in the original model corresponds to changing θ_{25/wee}∈ p^{ hyb } . Using the same perturbation scheme as before (wide range) we obtained the following sensitivity scores for the combined perturbations and , which can be compared to the score of S^{ osc } = 0 for V_{i 25}, V_{ awee } , V_{a 25}and V_{ iwee }, respectively. This shows that by combining perturbations of two, by themselves, robust parameters, we can obtain a critical effect.
Modulatory combinations of parameters
where all constants are hybrid model parameters. Mathematically, this corresponds to a bifurcation point at which the system moves from a stable fixed point to a limit cycle. Under circumstances corresponding to a normal cell, this expression can be used to approximate final cell mass before cell division.
Using this expression, we can compute quantitative predictions of cell size, e.g. if k_{1} is reduced by 50%, the cell size is expected to be twice as large. This was tested in the original model by setting k_{1} = 0.015 and running a numerical simulation. Final cell mass increased from 2.0 to 4.0. If, together with halving k_{1} we also double θ_{25/wee} and h_{ wee } , cell size should increase by about 2 × 2 × 2 = 8 times. Testing this in the original model by setting k_{1} = 0.015, V_{i 25}= V_{ awee } = 0.5 (corresponding to doubling the hybrid model parameter θ_{25/wee} ), (doubling h_{ wee } ), final cell size increased from 2.0 to 14.4. We can therefore predict multiplicative effects by using the hybrid model.
Summary of results
In this case study of the cell cycle, by comparing the sensitivity analysis of the original model with analytical predictions from the hybrid model, we found that i) predictions for essential and modulatory parameters from the hybrid model were satisfactory for small perturbations, but more discrepancies were seen for larger perturbations; ii) small perturbations were of more use when trying to identify important modulatory mechanisms compared to larger ones whereas, iii) larger perturbations were needed in order to identify essential parameters. Furthermore, iv) most parameters were either essential or modulatory using this set of perturbations. Finally, v) the most sensitive feature of the system (the threshold corresponding to θ_{25/wee}) was robust towards single parameter perturbations.
Discussion and Conclusions
Any biological process is essentially an interaction between different elements in time and space. Given experimental data and prior knowledge we may be able to formulate such interactions in mathematical terms. As a rule, such models suffer from uncertainty in the structure, both from a lack of representation of essential elements and interactions, as well as from erroneous model representation of the phenomena under investigation [21, 22]. Moreover, parameters are uncertain in the sense that several different combinations are consistent with a given set of data. When dealing with a specific model, such as the NT-model for the cell-cycle [19], we are in a unique position because a large body of experimental work has already been encoded in mathematical models by the pioneering work of Novak and Tyson [23]. Furthermore, this specific NT-model has been analyzed using both bifurcation [19] and mathematical piecewise linear approximation techniques [15]. Bifurcation analysis indicates the dynamical mechanisms behind the transitions within the cell-cycle, while the mathematical simplification analysis provides proof of the existence of a limit cycle, yields explicit analytical expressions describing how model parameters affect cell size, and also provides conditions necessary for oscillations. However, it is generally very difficult to succeed with a thorough mathematical analysis of a given computational model, and this becomes even harder for larger models. Numerical bifurcation analysis is also a technique that is most efficient for smaller models. For larger models, we are left performing numerical simulations to explore the sensitivity of the model, and thereby increase our understanding of what parts of the model may be important for different aspects of the system's behaviour [24]. For these reasons, we have here interpolated between the results obtained from a sensitivity analysis of the original model, and the outcome from a previous model simplification [15], with the rational being that such an analysis may be instructive for assessing what we can and cannot discover using a sensitivity analysis. Such understanding may prove useful for those cases where an explicit mathematical simplification is not feasible.
Our analysis of the simplified model resulted in the formulation of two types of mathematical constraints - one identifying parameter perturbations with a qualitative effect on model output (NC), and the other describing perturbations with a quantitative effect (SF). From the corresponding sensitivity analysis of the original NT-model, we could also here identify perturbations that had a qualitative effect (removing oscillations), or quantitative effect (changing cell size). Using the same set of parameter perturbations and sensitivity measures in both the hybrid and original models, we could compare our analytical predictions to the sensitivity analysis. Based on our definition on modulatory and essential parameters, both methods characterized the parameters more or less equally for narrow range perturbations; for wide range perturbations there was a discrepancy.
This combined analysis demonstrated that the parameters were, in most cases, characterized as either essential or modulatory using this set of perturbations, where essential parameters were only identified using wide perturbations. This means that there are parameters that do not have any effect at small perturbations, not even modulatory, but which do have a critical (essential) effect at large perturbations. They are therefore very important to the system even though this would not be obvious from a local sensitivity analysis. Modulatory parameters, as defined by the size function (SF) of the hybrid model, were best identified by small perturbations. When larger perturbations were applied, parameters from other parts of the system also showed up, confusing the picture. SF describes the parameter relationship of the G2/M checkpoint - the proteins behind this checkpoint are those that are most important in cell size regulation. If we did not know this and were to try to identify the mechanism regulating cell size, we would be better to use small perturbations. Essential parameters, on the other hand, could not be found using small perturbations alone; rather larger perturbations had to be used. This is not surprising considering that the biological system has to be robust in the default parameter setting, and the corresponding point in parameter space must therefore be located far away from the region where there are, for example, no oscillations. Using the hybrid model for more complex predictions and comparing these to the sensitivity analysis, we found that the threshold corresponding to the hybrid parameter θ_{25/wee}, which has an essential effect (and modulatory) in the hybrid model, were robust (using S^{ osc } ) towards single parameter perturbations; however it had an essential effect with a targeted combined perturbation. We also demonstrated the possibility to find combinations of parameters with a multiplicative effect on cell size.
In the case of single parameter perturbations, it is possible to numerically simulate the behaviour of a large model to all possible parameter perturbations. With perturbations of more than one parameter at a time, the combinatorial explosion has the effect that all parameter combinations cannot be tested. A simplified model therefore has the potential to inform us about which parameter combinations that are important to test. Our study pin-points the difference between parameters that have a quantitative, modulatory impact on model output, where the perturbation size is correlated with the change in model output, and parameters that are essential to the system in a qualitative manner.
Our investigation has several similarities to a recent study of Radulescu et al [14] where they suggest the use of a hierarchy of reduced models to describe complex biological networks. Radulescu also map critical parameters identified from the reduced model, back to the original model. However, there are also marked differences. In Radulescu et al, model reductions are by and large faithful in both structure and components to the original model, letting kinetics guide structural decomposition and parameter elimination, keeping the most important components and not introducing new ones. In the process, critical monomials and thereby critical parameters are identified. This is not the approach taken here. There are no simple and direct analytical relationships between the original and simplified models. Instead we look at different perturbations in the original model and recalibrate the simplified model from those in order to backtrack findings in the simplified model. By using a new model formalism, in this case, a piecewise linear with delay, we can point to other important features of the original model. In this case that the system seems to switch between different linear systems and that the location of the fixed points and the thresholds are most important for the dynamics. We also find that a very sensitive feature of the original model, the threshold described earlier, is robust towards single parameter perturbations.
A simplified model is an approximation of a more complicated system, and there will be instances when it does not work. Its predictions must therefore be confirmed in the full model context. However, a discrepancy between the simplified and original model is of interest in itself, since it can point out an approximation that has failed. Since approximations are a way to learn about the essentials of a system, this gives information on neglected essential features. To combine model simplification with sensitivity analysis is a way to investigate the structure of complex biological networks. We believe that this approach is applicable to systems other than the cell cycle, even though the precise set up used in this study may not be suitable. Sensitivity analysis has to be carefully performed, since it is almost impossible to investigate the full parameter space for any model with more than a few parameters. Here we performed a first step screening study, modifying one parameter at a time. The next logical step would be to consider a more global sensitivity analysis by investigating combinations of parameters, which was beyond the scope of this study. In this comparison between the hybrid model and the original model, we have considered all simulations with a quantized behaviour as not working properly, and therefore disqualified them from the analysis. This is partly because a normal cell does not display this type of behaviour, but also because the hybrid model does not mimic the behaviour of the original model in a few of these quantized instances.
Methods
The original NT-model
The hybrid, DPL-model
This default parameter set was retrieved by calibrating the hybrid model to the unperturbed original NT-model, as described below. It differs slightly from the default parameter set of [15].
Parameter perturbations
Each simulation was run in 4000 minutes using the numerical integration program XPP and integration method QualRk. We defined one cell division cycle as the trajectory between two local minima in the mass M variable. The length of each such cycle, the cycle time, t_{ CT } , was recorded. If the trajectory still oscillated after 4000 minutes and the distribution of t_{ CT } was narrow (within 1 min from average t_{ CT } ), the perturbation was defined as having proper oscillations.
Size Function
In [19] and [15] the dynamics of the cell cycle are described in terms of bifurcation analysis, presuming that the size of the cell (cell mass M) changes much slower than the other variables and therefore can be viewed as a constant parameter in the analysis of the dynamics. From a bifurcation diagram (e.g. Figure six in [15]) using M as bifurcation parameter and [MPF] as bifurcation variable, can be found that, during normal circumstances, critical for the size of the cell is the point of bifurcation when the system moves from a stable fixed point to a limit cycle. In [15] we gave an analytical expression for when this bifurcation takes place.
Presuming that for small perturbations this relationship will remain, we use equation (22) to predict the effect of perturbations in the hybrid and (after translation) the original model.
Necessary conditions
where we use the notation of equations (17-19), is the slow eigenvector of A_{12}, going through the fixed point and M_{ end } is given from equation (22). Note that M_{ end } was calculated from M_{ C } , i.e. the point where Cx_{11} passes θ_{25/wee}as described above.
which are used as necessary conditions throughout this study.
Summary of the model simplification process
- 1.
The steady-state input/output behaviour of subsets of variables of the original model were graphed.
- 2.
Variables with minor effect on the dynamical behaviour were removed.
- 3.
Remaining variables were lumped into switching modules SM_{ s } s ∈ 1,..., 3, such that one variable was functioning as input to the module, a combination of variables as output, and the steady-state input/output behaviour were in the form of a monotonic function with a sigmoid kind of shape. The input/output function is denoted, , where is a subset of original model parameters; those parameters that exist in the ODE equations defining SM_{ s } .
- 4.
The sigmoid functions were approximated by step functions , where , and θ_{ s } corresponds to the switching threshold of the step function, h_{ s } (high) to the maximum level and l_{ s } (low) to the minimum level (θ_{ s } , h_{ s } , and l_{ s } are in this study determined from three points of as described in the calibration below).
- 5.
The switching modules SM_{ s } were each approximated by a step function and a time delay τ_{ s } [15]. The time delay was often τ_{ s } = 0.
Note that we here only consider the small DPL-model in [15].
Mapping (calibration) procedure
In order to construct mappings between p^{ org } and p^{ hyb } , all parameters of p^{ org } were perturbed n_{ pert } = 11 times according to ps (defined in Results), and for each perturbation a calibration of the hybrid model was performed. The hybrid model corresponding to the j th, j ∈ 1,...,11 perturbation of parameter is denoted .
The calibration of the step functions was done from three points of the sigmoid functions , s ∈ 1,...,3, corresponding to the threshold, the high, and low level of the step-functions. The threshold level was defined as the point (denoted θ^{ SM } ) where the derivative of the curve was the largest (smallest), and the high and low level were measured at two points (denoted h^{ SM } and l^{ SM } ) of the curve corresponding to or , depending on whether the sigmoid was increasing or decreasing; corresponds to θ^{ SM } using the default parameter set. The hybrid model calibration procedure for mapping between original parameter perturbations and hybrid parameters can be described in pseudo-code as
for each , i ∈ 1,..., n_{ org }do
for each perturbation size ps_{ j } ∈ ps, j ∈ 1,..., n_{ pert }do
for each switching module SM_{ s } s ∈ 1,..., 3 do
use and , as values for parameters θ_{ s } , h_{ s } , ;
end for
end for
end for.
The hybrid parameter τ were not included in the comparison between the two models of practical reasons; since this parameter can not easily be mapped via as the other ones. From this process we retrieved n_{ org } × n_{ pert } = 47 × 11 sets of hybrid model parameters , j ∈ 1,..., n_{ pert } .
To translate the notation above to the hybrid model parameters of equation (20) let s ∈ 1,..., 3 = {25, wee, slp/ste}, except for θ_{25} and θ_{ wee } . During the model simplification process we noted that (when using default parameters) and this was used as a further simplification step by merging θ_{25} and θ_{ wee } into one parameter θ_{25/wee}. Therefore, only perturbations of the original model which keep the property are consistent with the hybrid model. These are all pairwise perturbations.
Declarations
Acknowledgements
We thank Drs David Gomez-Cabrero and Hanna Hardin for critical reading and discussions. This work was financed by the Swedish Foundation for Strategic Research (OE, TA) and the Swedish Research Council (JT).
Authors’ Affiliations
References
- Barabasi AL, Oltvai ZN: Network biology: understanding the cell's functional organization. Nat Rev Genet. 2004, 5: 101-113. 10.1038/nrg1272View ArticlePubMedGoogle Scholar
- Tyson JJ, Chen AK, Novak B: Network dynamics and cell physiology. Nat Rev Mol Cell Biol. 2001, 2: 908-916. 10.1038/35103078View ArticlePubMedGoogle Scholar
- Eldar A, Dorfman R, Weiss D, Ashe H, Shilo B, Barkai N: Robustness of the BMP morphogen gradient in Drosophila embryonic patterning. Nature. 2002, 419: 304-308. 10.1038/nature01061View ArticlePubMedGoogle Scholar
- Albert R, Othmer HG: The topology of the regulatory interactions predicts the expression pattern of the Drosophila segment polarity genes. J Theor Biol. 2003, 223: 1-18. 10.1016/S0022-5193(03)00035-3View ArticlePubMedGoogle Scholar
- Bornholdt S: Less is more in Modeling Large Genetic Networks. Science. 2005, 310: 449-451. 10.1126/science.1119959View ArticlePubMedGoogle Scholar
- Tyson JJ, Chen JC, Novak B: Sniffers, buffers, toggles and blinkers: dynamics of regulatory and signaling pathways in the cell. Curr Opin Cell Biol. 2003, 15: 221-231. 10.1016/S0955-0674(03)00017-6View ArticlePubMedGoogle Scholar
- Kuepfer L, Peter M, Sauer U, Stelling J: Ensemble modeling for analysis of cell signaling dynamics. Nat Biotechnol. 2007, 25: 1001-1006. 10.1038/nbt1330View ArticlePubMedGoogle Scholar
- Prinz AA, Bucher D, Marder E: Similar network activity from disparate circuit parameters. Nat Neurosci. 2004, 7: 1345-1352. 10.1038/nn1352View ArticlePubMedGoogle Scholar
- Gutenkunst RN, Waterfall JJ, Casey FP, Brown KS, Myers CR, Sethna JP: Universally Sloppy Parameter Sensitivities in Systems Biology Models. PLoS Comput Biol. 2007, 3: 1871-1877. 10.1371/journal.pcbi.0030189View ArticlePubMedGoogle Scholar
- Saez-Rodriguez J, Kremling A, Conzelmann H, Bettenbrock K, Gilles ED: Modular analysis of signal transduction networks. IEEE Contr Syst Mag. 2004, 24: 35-52. 10.1109/MCS.2004.1316652.View ArticleGoogle Scholar
- Sorensen ME, DeWeerth SP: An algorithmic method for reducing conductance-based neuron models. Biol Cybern. 2006, 95: 185-192. 10.1007/s00422-006-0076-6View ArticlePubMedGoogle Scholar
- Davidich M, Bornholdt S: The transition from differential equations to Boolean networks: a case study in simplifying a regulatory network model. J Theor Biol. 2008, 255: 269-277. 10.1016/j.jtbi.2008.07.020View ArticlePubMedGoogle Scholar
- Schmidt H, Madsen MF, Danø S, Cedersund G: Complexity reduction of biochemical rate expressions. Bioinformatics. 2008, 24 (6): 848-854. 10.1093/bioinformatics/btn035View ArticlePubMedGoogle Scholar
- Radulescu O, Gorban AN, Zinovyev A, Lilienbaum A: Robust simplifications of multiscale biochemical networks. BMC Syst Biol. 2008, 2 (86):Google Scholar
- Eriksson O, Zhou Y Band B, Björkegren J, Tegnér J: Deconstructing the core dynamics from a complex time-lagged regulatory biological circuit. IET Syst Biol. 2009, 3: 113-129. 10.1049/iet-syb.2007.0028View ArticlePubMedGoogle Scholar
- Feret J, Danos V, Krivine J, Harmer R, Fontana W: Internal coarse-graining of molecular systems. Proc Natl Acad Sci USA. 2009, 106: 6453-8. 10.1073/pnas.0809908106PubMed CentralView ArticlePubMedGoogle Scholar
- Okino MS, Mavrovouniotis ML: Simplification of Mathematical Models of Chemical Reaction Systems. Chem Rev. 1998, 98: 391-408. 10.1021/cr950223lView ArticlePubMedGoogle Scholar
- Crudu A, Debussche A, Radulescu O: Hybrid stochastic simplifications for multiscale gene networks. BMC Syst Biol. 2009, 3 (89): :-Google Scholar
- Novak B, Pataki Z, Ciliberto A, Tyson JJ: Mathematical model of the cell division cycle of fission yeast. Chaos. 2001, 11: 277-286. 10.1063/1.1345725View ArticlePubMedGoogle Scholar
- Campolongo F, Saltelli A, Sorensen T, Tarantolo S: Hitchhiker's Guide to Sensitivity Analysis. Sensitivity Analysis. Edited by: Saltelli A, Chan K, Scott EM. 2000, Chichester, UK: John Wiley & Sons LtdGoogle Scholar
- Tegnér J, Compte A, Auffray C, An G, Cedersund G, Clermont G, Gutkin B, Oltvai Z, Stephan K, Thomas R, Villoslada P: Computational disease modeling - fact or fiction?. BMC Syst Biol. 2009, 3 (56):Google Scholar
- Clermont G, Auffray C, Moreau Y, Rocke D, Dalevi D, Dubhashi D, Marshall D, Raasch P, Dehne F, Provero P, Tegner J, Aronow B, Langston M, M B: Bridging the gap between systems biology and medicine. Genome Med. 2009, 1 (9): 88- 10.1186/gm88PubMed CentralView ArticlePubMedGoogle Scholar
- Tyson JJ, Csikasz-Nagy A, Novak B: The dynamics of cell cycle regulation. BioEssays. 2002, 24: 1095-1109. 10.1002/bies.10191View ArticlePubMedGoogle Scholar
- Hlavacek W: How to deal with large models?. Mol Syst Biol. 2009, 5 (240):Google Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.