 Software
 Open Access
 Published:
CBFA: phenotype prediction integrating metabolic models with constraints derived from experimental data
BMC Systems Biology volume 8, Article number: 123 (2014)
Abstract
Background
Flux analysis methods lie at the core of Metabolic Engineering (ME), providing methods for phenotype simulation that allow the determination of flux distributions under different conditions. Although many constraintbased modeling software tools have been developed and published, none provides a free userfriendly application that makes available the full portfolio of flux analysis methods.
Results
This work presents Constraintbased Flux Analysis (CBFA), an opensource software application for flux analysis in metabolic models that implements several methods for phenotype prediction, allowing users to define constraints associated with measured fluxes and/or flux ratios, together with environmental conditions (e.g. media) and reaction/gene knockouts. CBFA identifies the set of applicable methods based on the constraints defined from user inputs, encompassing algebraic and constraintbased simulation methods. The integration of CBFA within the OptFlux framework for ME enables the utilization of different model formats and standards and the integration with complementary methods for phenotype simulation and visualization of results.
Conclusions
A generalpurpose and flexible application is proposed that is independent of the origin of the constraints defined for a given simulation. The aim is to provide a simple to use software tool focused on the application of several flux prediction methods.
Background
Over the past 20–30 years, there has been a substantial increase in the production of materials through the use of cell factories. Metabolic engineering (ME) [1] deals with the analysis and design of metabolic systems towards particular goals, such as increasing the production of useful compounds. The field has received increased attention, due to the growing adoption of industrial biotechnology for producing pharmaceuticals, food ingredients and fine chemicals, among others [2]. The emergence of highthroughput tools for genome sequencing [3], gene expression, protein and metabolome analysis [4], has enabled a better understanding of cell metabolism and, consequently, empowered rational strain development approaches. However, a deeper understanding of the physiological state of cells can be obtained from metabolic flux profiles [5] and, therefore, the measurement of metabolic fluxes and the understanding of their control within metabolic systems are at the core of ME.
Current biochemical knowledge and the information collected from the annotation of genome sequencing projects allowed the development of genomescale metabolic models (GSMMs), supporting the simulation of the metabolic phenotypes. Although the process is only semiautomatic, a large number of GSMMs have been reconstructed for different organisms [6].
Through the use of a metabolic model, taking into account stoichiometry, reaction reversibility, and quasisteadystate assumptions, linear constraints over the values of intracellular fluxes can be established. Environmental conditions (e.g. media) are added in the form of constraints over uptake fluxes, while reaction deletions can be defined setting the respective fluxes to zero. However, in the context of genomescale models, the majority of these systems are underdetermined [7], meaning that the number of constraints is not enough to algebraically solve the system for unknown fluxes. Thus, simulation approaches based on linear/quadratic programming (LP/QP) optimization methods are used to calculate flux values. This is the case with the wellknown Flux Balance Analysis (FBA) method [8], where an optimization problem is formulated to optimize a defined objective function, typically the maximization of growth rate as defined by an artificial biomass flux [9].
However, the configuration of a proper objective function is not straightforward [10]. Moreover, LP based approaches can have multiple solutions, i.e. different flux distributions that satisfy all the constraints and have the same optimal value for the objective function [11]. Additional limitations arise from the reliance on balances for cofactors, such as NADH and NADPH [12] and from the presence of reversible reactions and futile cycles [13]. Some of these problems can be attenuated by experimental measurements or previous knowledge of certain metabolic fluxes, but these are rarely able to turn the system into a determined one, to allow the unique determination of all fluxes in the model.
Metabolic flux analysis (MFA) methods are, thus, based on the use of metabolic models and experimental data (fluxomics). When available, data of measured fluxes (both exchange and internal fluxes) and known flux ratios [14],[15] can be added to the metabolic model in the form of additional constraints, reducing the solution space, eventually leading to a determined (or overdetermined) system. These data are usually obtained from the measurements of exchange fluxes, such as the rates of formation and consumption of compounds or from more sophisticated procedures that include feeding of labelled substrates [5].
Several software applications have been put forward providing methods for flux quantification based on experimental data [16]. These typically work over data from ^{13}C labelled substrate experiments [17][20].
Numerous tools for generalpurpose constraintbased analysis have been developed in the last few years. In Additional file 1: Table S1 in supplementary material, the main features of these tools are summarized. All these tools offer the possibility to perform FBA, however few provide flexible and easytouse tools for defining additional constraints as those coming from measured or otherwise known fluxes or flux ratios. For example, in this GSMMs’ era, it is important not only to provide users with the capacity to compute flux distributions from experimental data for determined systems, but also to evaluate the impact of the measurements for systems that, although remaining underdetermined, see their solution space reduced. Moreover, it is important to be able to combine different types of experimental data or other available knowledge, such as flux measurements or ratios for intracellular fluxes with exchange fluxes or to easily perform these calculations for both wildtype and mutant strains.
Few tools exploit the various derivations of FBA that enable the application of tailored approaches to specific problems. Although COBRA [19] is the tool that contains the most relevant features, it requires the commercial software Matlab. CellNetAnalyser [21], the successor of FluxAnalyzer [16], provides key features for MFA, such as the classification of models according to determinacy and redundancy. However, it is also built over Matlab and is itself a commercial tool, although it allows an academic license. A few software tools, like MetaFluxNet and SBRT [22], execute matrix operations to calculate flux distributions without the application of optimization methods. Also, importing metabolic flux ratios to constrain MFA methods is a feature not supported by existing tools, although SBRT and COBRA accept the introduction of linear combinations of reactions to constrain the systems.
In this work, CBFA – Constraintbased Flux Analysis, a generalpurpose and flexible application is proposed that is independent of the type of data available and their processing. CBFA is able to work with regular constraintbased models, without information about atom transitions between precursors and products in the model reactions (as required by other tools). Independence from carbon atom mapping facilitates user interaction, since this kind of information is not easily compiled for genomescale models, though efforts to accomplish such task have been reported [23],[24]. In CBFA, input data are not fluxomics measurements generated by analytical techniques, such as Mass Spectrometry (MS) or Nuclear Magnetic Resonance (NMR), rather, input data are known or measured metabolic flux ratios, which can be easily used to reflect the portions of flux that generate a certain metabolite originating from different pathways [14],[15] and that have already been used in the context of genomescale models [25]. Other data, such as known or measured exchange or internal fluxes can also be directly inputted. The aim is to provide a simple to use software tool focused on the straightforward application of several stateoftheart flux analysis methods and the simulation of the behaviour of wild type or mutant strains. The development of tools to generate these ratios from fluxomics data (GCMS) is addressed in a complementary software, whose publication is currently under preparation.
The main features of this tool are the following:

Opensource – it allows all users to use the tool freely and invites contribution of other researchers by providing the source code to the community;

Userfriendly – facilitates its use by users with no/little background in modelling/programming;

Simple – needs only a few steps to perform flux analysis methods and obtain flux distributions, simplifying the use of experimental information in the simulations;

Modular – as it is incorporated in the Optflux software [26], it follows a plugin based architecture, facilitating the addition of specific features;

Compatible with standards –compatibility with the Systems Biology Markup Language (SBML) [27], the Mathematical Markup Language (MathML) [28] and several layout standards for visualization (SBGNML, XGMML, SBML Layout) rendering interoperability with other tools easier.
In its current version, CBFA accommodates several tools and algorithms that have been developed for the manipulation of metabolic models:

methods for phenotype simulation, such as FBA and its variant Parsimonious Enzyme Usage FBA (pFBA) [29];

a method to minimize the error between calculated and measured fluxes, through a quadratic programming formulation;

methods for system characterization:

○ through the calculation of tight bounds of fluxes in the model;

○ through the analysis of the null space of the stoichiometric matrix.


a method for robustness analysis of a configured objective function;

methods to calculate fluxes when the nonmeasured fluxes can be calculated from the input constraints, and exploiting redundancies when additional data are available;

a suitable model visualization tool to facilitate the interpretation of results.
Additional file 1: Table S1 highlights some of the features of this tool, when compared to some of the applications mentioned above. It also lists, in a different tab, the novel features of this tool in the framework of the OptFlux application.
The main concepts used in the development and its main functionalities are presented in the next sections.
Implementation
The tool is fully implemented in the Java language, and thus available for all platforms. The only nonJava parts consist of the linear/quadratic programming solvers, where several interfaces are implemented such as the GNU Linear Programming Kit[30] and IBM ILOG CPLEX.
In order to explore several features for model handling and visualization, and to enable the interaction with different tools, CBFA was built as a new plugin for Optflux. Optflux is an opensource and modular ME software platform with a plugin based architecture. Optflux and CBFA follow the modelviewcontroller (MVC) design pattern, incorporating three welldefined concepts: operations, datatypes and datatype views. This design pattern allows the combination and reuse of different units of work and facilitates the continuous development of new features.
CBFA is a userfriendly software, with a graphical user interface (GUI) layer to allow users to call and visualize the software features and results, where the presentation layer is well separated from the business and application layers. Thus, all its features can be used by other software platforms through a welldefined programming interface without the dependency on the GUI input interactions.
Results and discussion
An overview of the overall workflow implemented in CBFA is provided in Figure 1, where the different inputs, the generated constraints, the system types, the supported methods and their outputs are shown. In this section, the main steps and alternatives offered by the pipeline to execute the flux analysis methods are briefly explained. The detailed description of the implemented methods and complete mathematical formulations are provided as supplementary material (Additional file 2), also available on the project’s website. Additionally, a Beginner’s tutorial with illustrations for all the steps needed to perform the software tasks is available to help firsttime users (Additional file 3 and online documentation on the site).
System configuration
Starting with a metabolic model that can be loaded in different formats (e.g. SBML or CSV files), the user can configure inputs to flux analysis methods, including environmental and genetic conditions (gene/reaction deletions), measured or otherwise known fluxes and/or flux ratios given as expressions in the form:
where κ_{i}, κ_{j} ∈ R and τ ∈ R\{0} are userdefined real numbers and v_{i} and v_{j} are fluxes. This allows the definition of constraints of different origins, e.g. coming from calculations over experimental data, for example from ^{13}C experiments, or deriving from existing knowledge about a biological system. Note that the specific calculations to reach flux ratios from experimental data in different formats are not included in this software.
Given this information, degrees of freedom of the system are calculated from the properties of the original stoichiometric matrix of the model, the defined knockouts, the experimental values defined for some fluxes, and the provided flux ratios. The accurate number of degrees of freedom is obtained by the difference between the number of unknown variables (fluxes) of the system and the number of linearly independent equations of the system. However, since the purpose of this work is mainly to perform flux analysis tasks upon GSMMs, the typical configurations will lead to large numbers of degrees of freedom. It is not relevant to calculate the exact number of degrees of freedom if this is too high. Therefore, the costly algebraic operations that are performed towards the exact calculation of the degrees of freedom are not always made, rather an approximation is firstly calculated, given by the difference between the unknowns and the equations of the system, without taking into account the linear dependency between them. If this approximation gets close to a threshold, the proper number of degrees of freedom is calculated and presented to the user.
To reduce the degrees of freedom, the user can add new flux measurements, gene/reaction deletions or equality flux ratios. If the type of system changes, the set of available methods is updated accordingly.
Available methods for Flux analysis
Two approaches are used to determine flux distributions: algebraic or optimization based methods (linear or quadratic programming). Depending on the degrees of freedom of the system, different methods can be applied (Figure 1). If the system is either determined or overdetermined it can be algebraically solved and, therefore, fluxes can be calculated. In the latter case, the additional information can be used to calculate fluxes by regular or weighted least squares fitting.
However, with genomescale models, systems are more commonly underdetermined, even when experimental information is added. Thus, optimization methods are used to obtain flux values, by defining an objective function, while respecting the defined constraints. FBA and pFBA can be selected, maximizing a given objective function (e.g. the biomass flux). An alternative is to use quadratic programming to minimize the difference between the values of a subset of measured fluxes and their calculated values. The resulting formulation is mathematically similar to the minimization of metabolic adjustment (MOMA) [31].
The null space approach can be used to calculate a unique solution to the non measured fluxes, when the system is either determined or overdetermined. It can also be used when the system is underdetermined, but in this case returning the admissible flux space represented by the generating base vectors. These vectors can be used to generate valid flux distributions.
In order to characterize the system, other approaches can also be used like the computation of tight bounds of the system under certain conditions. Here, the lower and the upper admissible bounds for all fluxes, which have neither been measured nor have associated knockouts, are obtained through the application of an optimizationbased method that maximizes/minimizes the flux of interest under the same set of constraints. This method is an approach based on Flux Variability Analysis (FVA) [11] where no constraints are defined for the objective function.
Different solutions leading to the same optimal objective value can be achieved by the methods based on LP problems. To characterize the alternative solutions that satisfy the imposed set of constraints and have the same objective value, FVA is again used, this time to calculate the admissible ranges of flux values under optimality [11].
To analyze the changes of the objective function in response to variations of a specific flux, a method for robustness analysis is also available. For a selected set of fluxes, their values in a prior optimization for the given objective are retrieved. After that, each flux is varied in different percentages of its prior value and the effects of these changes on the objective function can be studied [32].
Furthermore, all the aforementioned methods can be used to investigate how a set of knockouts is reflected on the flux distribution. This is accomplished by the configuration of knockouts as additional constraints to the system. These knockouts can be inserted as reactions or as genes by using the genereaction rules of the model (when these are available).
The results and configurations used by the methods described above can be exported to a PDF file, including the model details, the method that was used, all the constraints that were selected, and the result of the simulation in terms of objective function and net conversions. Moreover, it reports the bounds that were actually used in the simulations when constraints overlap with each other.
User interaction
The aim of CBFA is to provide the community with a tool to perform flux analysis tasks that is simple, intuitive and with high usability. The interaction with this software is based on three main concepts:

Datatypes: Encapsulate the different types of objects of the application, holding relevant data such as models, flux measurements, flux ratios, simulation results, etc. The application can keep several instances (objects) of the same datatype.

Views: To check the contents of a datatype, panel views are defined that present to the user the most relevant information that it encompasses. Since a single datatype might contain different types of information, it can have multiple views shown as different panels (tabs).

Operations: All the functionalities are invoked in the form of operations, which are units with welldefined sets of inputs and outputs (datatypes). Whenever an operation is called, a dialog is presented to the user to define the set of input objects. The most common scenario is to create a new instance of a certain datatype (e.g. the result of a simulation) or to modify existing instances (e.g. add flux measurements to an existing set).
These concepts are used to build the application layer and to construct graphical user interfaces, which intermediate between the user and all the core methods implemented. In Figure 2, the architecture of the application is illustrated showing the main datatypes, views and operations, as well as the relationship with the core classes utilized in the implementation. A complementary view is given by Figure 3, which provides snapshots of some views and interfaces of the operations invoked when performing simulations and constraints configuration, showing a typical workflow when working with CBFA.
A metabolic model is loaded defining a working project, which is the root datatype in the hierarchical organization of the clipboard (lefthand side panel). The visualization of a selected object is presented on the right side. The structure and content of the visualization depends on the specific datatype, and on the object’s specific information. The hierarchical tree of the clipboard keeps all the results of the operations that are performed during the software utilization, namely datatypes for environmental conditions, flux measurements, flux ratios, and simulation results. Results are clustered in the clipboard according to the method used.
All the operations are performed on the scope of a certain project that keeps a single instance of a metabolic model. The other datatypes, such as flux measurements, can have multiple instances, under the same project, and the user can perform different tasks using alternative instances. Moreover, in the same running application, it is possible to have more than one project in the clipboard.
Operations are made available either on the top menu bar of the application or in a popup menu that is triggered when a right click event occurs upon a clipboard item. In the last case, only the operations that have the selected type as one of its input argument types are shown.
Case studies
To illustrate some of the main features of CBFA, two case studies were considered. First, the yeast Saccharomyces cerevisiae was investigated through the use of a simplified metabolic model for its growth [33], while in the second one, a genomescale model of the microorganism Escherichia coli[34] was used to perform flux analysis tasks based on optimization methods. In the first case, the aim was to show how the implemented methods for (over)determined systems could be exploited. The model encompasses 45 reactions and 49 metabolites. The study was performed to analyse the phenotype response obtained by performing a deletion of the fumarate hydratase 1 reaction, through a knockout of the fum1 gene [35], under aerobic conditions with glucose as the carbon source.
Without adding constraints to the model, the system has 9 degrees of freedom. After setting the fum1 knockout, and adding the uptake and excretion rates given in [35], the configured system remained underdetermined. So, some flux ratios, such as the fraction of phosphoenolpyruvate originating from oxaloacetate and the fraction of oxaloacetate deriving from pyruvate [35], were added. In this way, the system was made determined. Thus, it was possible to calculate the intracellular fluxes (see the full results in the Additional file 4) through algebraic methods.
In a second case study, to demonstrate how optimizationbased methods can be used to investigate flux distributions in GSMMs, an analysis was made using the iJR904 model for Escherichia coli MG1655 [34]. This model contains 932 reactions, (including a biomass equation), 618 internal metabolites and 143 external metabolites (with respective drain fluxes) are present in the model. The original model has 332 degrees of freedom, and, therefore, the system was underdetermined and appropriate for the utilization of optimizationbased methods, such as the ones based on linear programming approaches. In this case, the high number of degrees of freedom precludes the use of methods for determined or overdetermined systems, as the amount of extra constraints to be added is very large.
To configure the experimental inputs to the analysis, the work from [36] was considered. Here, to study the influence of the inputs, the metabolic flux distribution was examined by incrementally adding constraints to the system. Analyses were made under aerobic conditions with glucose as the carbon source. The first step was to perform an FBAbased method. By analysing the flux distribution, the absolute values for the reactions R_ADK1 and R_ADK3 were considered to be too high. R_ADK1 was also occurring on the reverse direction, making a cycle with R_ADK3. After performing an FVA analysis, the two reactions had as lower and upper bounds the two extremes of their domains. Therefore, a pFBAbased method was performed, to minimize the total sum of the flux absolute values and remove this futile cycle. It was possible to confirm that the values for these two reactions changed in comparison to the FBA simulation, such that R_ADK1 changed its direction and R_ADK3 was set to zero. For the simulations performed in this first step, the only constraint that has been added was an environmental condition to set the glucose uptake rate to 11 mmol/gDW/hr. The objective was to maximize growth and the obtained value for the biomass flux was 0.97 hr^{−1}.
In order to approximate the distribution according to [36], a second step was done, adding external flux measurements for O_{2}, CO_{2} and acetate, which resulted in a decrease of the biomass flux to 0.81 hr^{−1}. However, there were still some differences in the internal fluxes with respect to the published values. Therefore, some flux ratios were added and changes were detected in the internal flux distributions, such as the flux of phosphoenolpyruvate carboxylase and ATP:oxaloacetate carboxylyase. The detailed results (flux distributions) are given in Additional file 5, together with some notes on the conditions.
As an illustration of the previous results, in Figure 4, the distribution of the central carbon metabolism fluxes in both case studies is illustrated by overlaying fluxes over the network topology. This figure also serves as an example of the visualization capabilities of the tool.
Conclusions
Although several software tools and applications have been developed and published to perform constrainedbased flux analysis, currently none provides the full set of methods to perform flux analysis in a userfriendly way, while many are commercial or based on commercial systems such as Matlab. Thus, the software application described in this work is complementary to existing ones, since it provides a free, simple and quick way to perform flux analysis, with an extensive portfolio of methods, without all the complexity and specificity associated with experimental data processing. Therefore, it is possible to emulate the use of experimental data, by simply setting values (or intervals) for flux measurements, or to establish flux ratios that may reflect the over/ under expression of certain genes, or to translate the flux activity of a given pathway, for instance, obtained from ^{13}C labelled substrate experiments.
Availability and requirements
The software is made available, together with other resources, in the home page given below.
More details:
Software name: CBFA plugin for Optflux
Project home page: http://www.optflux.org
Methods details and application tutorial: http://www.optflux.org/cbfa
Operating system(s): Platform independent
Programming languages: Java
Other requirements: Java JRE 1.7.x (for Mac OS users the installation of JDK 1.7 is recommended), GLPK
License: GNUGPL, version 3
Additional files
Abbreviations
 CBFA:

Constraintbased Flux analysis
 FBA:

Flux balance analysis
 FVA:

Flux variability analysis
 GLPK:

GNU Linear Programming Kit
 GSMMs:

Genomescale metabolic models
 GUI:

Graphical user interface
 LP:

Linear programming
 ME:

Metabolic engineering
 MathML:

Mathematical markup language
 MFA:

Metabolic Flux analysis
 MOMA:

Minimization of metabolic adjustment
 MVC:

Model view controller
 pFBA:

Parsimonious enzyme usage Flux balance analysis
 QP:

Quadratic programming
 SBML:

Systems biology markup language
References
 1.
Nielsen J: Metabolic engineering: techniques for analysis of targets for genetic manipulations. Biotechnol Bioeng. 1998, 58: 125132. 10.1002/(SICI)10970290(19980420)58:2/3<125::AIDBIT3>3.0.CO;2N.
 2.
Future Prospects for Industrial Biotechnology. 2011, OECD Publishing, Paris, France
 3.
Wiechert W: 13C metabolic flux analysis. Metab Eng. 2001, 3: 195206. 10.1006/mben.2001.0187.
 4.
Alm EJ, Huang KH, Price MN, Koche RP, Keller K, Dubchak IL, Arkin AP: The MicrobesOnline Web site for comparative genomics. Genome Res. 2005, 15: 10151022. 10.1101/gr.3844805.
 5.
Sauer U: Metabolic networks in motion: 13Cbased flux analysis. Mol Syst Biol. 2006, 2: 6210.1038/msb4100109.
 6.
Henry CS, DeJongh M, Best AA, Frybarger PM, Linsay B, Stevens RL: Highthroughput generation, optimization and analysis of genomescale metabolic models. Nat Biotechnol. 2010, 28: 977982. 10.1038/nbt.1672.
 7.
Varma A, Palsson BO: Metabolic flux balancing: basic concepts, scientific and practical use. Nat Biotechnol. 1994, 12: 994998. 10.1038/nbt1094994.
 8.
Papoutsakis ET: Equations and calculations for fermentations of butyric acid bacteria. Biotechnol Bioeng. 1984, 26: 174187. 10.1002/bit.260260210.
 9.
Edwards JS, Ibarra RU, Palsson BO: In silico predictions of Escherichia coli metabolic capabilities are consistent with experimental data. Nat Biotechnol. 2001, 19: 125130. 10.1038/84379.
 10.
Schuetz R, Kuepfer L, Sauer U: Systematic evaluation of objective functions for predicting intracellular fluxes in Escherichia coli. Mol Syst Biol. 2007, 3: 11910.1038/msb4100162.
 11.
Mahadevan R, Schilling CH: The effects of alternate optimal solutions in constraintbased genomescale metabolic models. Metab Eng. 2003, 5: 26410.1016/j.ymben.2003.09.002.
 12.
Christensen B, Nielsen J: Isotopomer analysis using GCMS. Metab Eng. 1999, 1: 282290. 10.1006/mben.1999.0117.
 13.
Wiechert W, de Graaf AA: Bidirectional reaction steps in metabolic networks: I. Modeling and simulation of carbon isotope labeling experiments. Biotechnol Bioeng. 1997, 55: 101117. 10.1002/(SICI)10970290(19970705)55:1<101::AIDBIT12>3.0.CO;2P.
 14.
Sauer U, Lasko DR, Fiaux J, Hochuli M, Glaser RW, Szyperski T, Wüthrich K, Bailey JE: Metabolic flux ratio analysis of genetic and environmental modulations of Escherichia coli central carbon metabolism. J Bacteriol. 1999, 181: 66796688.
 15.
Fischer E, Sauer U: Metabolic flux profiling of Escherichia coli mutants in central carbon metabolism using GCMS. Eur J Biochem. 2003, 270: 880891. 10.1046/j.14321033.2003.03448.x.
 16.
Klamt S, Stelling J, Ginkel M, Gilles ED: FluxAnalyzer: exploring structure, pathways, and flux distributions in metabolic networks on interactive flux maps. Bioinformatics. 2003, 19: 261269. 10.1093/bioinformatics/19.2.261.
 17.
Zamboni N, Fischer E, Sauer U: FiatFlux–a software for metabolic flux analysis from 13Cglucose experiments. BMC Bioinformatics. 2005, 6: 20910.1186/147121056209.
 18.
Quek LE, Wittmann C, Nielsen LK, Krömer JO: OpenFLUX: efficient modelling software for 13Cbased metabolic flux analysis. Microb Cell Fact. 2009, 8: 2510.1186/14752859825.
 19.
Schellenberger J, Que R, Fleming RMT, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NE, Rahmanian S, Kang J, Hyduke D, Palsson BO: Quantitative prediction of cellular metabolism with constraintbased models: the COBRA Toolbox v2. 0. Nat Protoc. 2011, 6: 12901307. 10.1038/nprot.2011.308.
 20.
Weitzel M, Nöh K, Dalman T, Niedenführ S, Stute B, Wiechert W: 13CFLUX2—highperformance software suite for 13Cmetabolic flux analysis. Bioinformatics. 2013, 29: 143145. 10.1093/bioinformatics/bts646.
 21.
Klamt S, SaezRodriguez J, Gilles ED: Structural and functional analysis of cellular networks with Cell NetAnalyzer. BMC Syst Biol. 2007, 1: 210.1186/1752050912.
 22.
Wright J, Wagner A: The systems biology research tool: evolvable opensource software. BMC Syst Biol. 2008, 2: 5510.1186/17520509255.
 23.
Ravikirthi P, Suthers PF, Maranas CD: Construction of an E. Coli genomescale atom mapping model for MFA calculations. Biotechnol Bioeng. 2011, 108: 13721382. 10.1002/bit.23070.
 24.
Pitkänen E, Jouhten P, Hou J, Syed MF, Blomberg P, Kludas J, Oja M, Holm L, Penttilä M, Rousu J, Arvas M: Comparative genomescale reconstruction of gapless metabolic networks for present and ancestral species. PLoS Comput Biol. 2014, 10: e100346510.1371/journal.pcbi.1003465.
 25.
McAnulty MJ, Yen JY, Freedman BG, Senger RS: Genomescale modeling using flux ratio constraints to enable metabolic engineering of clostridial metabolism in silico. BMC Syst Biol. 2012, 6: 4210.1186/17520509642.
 26.
Rocha I, Maia P, Evangelista P, Vilaça P, Soares S, Pinto J, Nielsen J, Patil K, Ferreira E, Rocha M: OptFlux: an opensource software platform for in silico metabolic engineering. BMC Syst Biol. 2010, 4: 4510.1186/17520509445.
 27.
Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, Arkin AP, Bornstein BJ, Bray D, CornishBowden A, Cuellar AA, Dronov S, Gilles ED, Ginkel M, Gor V, Goryanin II, Hedley WJ, Hodgman TC, Hofmeyr JH, Hunter PJ, Juty NS, Kasberger JL, Kremling A, Kummer U, Le Novere N, Loew LM, Lucio D, Mendes P, Minch E, Mjolsness ED: The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003, 19: 524531. 10.1093/bioinformatics/btg015.
 28.
Ausbrooks R, Buswell S, Carlisle D, Chavchanidze G, Dalmas S, Devitt S, Diaz A, Dooley S, Hunter R, Ion P, Kohlhase M, Lazrek A, Libbrecht P, Miller N, Miner R, Rowley C, Sargent M, Smith B, Soiffer N, Sutor R, Watt S: Mathematical Markup Language (MathML) Version 3.0.W3C recommendations, 2010:115121,
 29.
Lewis NE, Hixson KK, Conrad TM, Lerman JA, Charusanti P, Polpitiya AD, Adkins JN, Schramm G, Purvine SO, LopezFerrer D, Weitz KK, Eils R, König R, Smith RD, Palsson BO: Omic data from evolved E. coli are consistent with computed optimal growth from genomescale models. Mol Syst Biol. 2010, 6: 39010.1038/msb.2010.47.
 30.
Makhorin A: GNU Linear Programming Kit (GLPK)., [http://www.gnu.org/software/glpk/]
 31.
Segre D, Vitkup D, Church GM: Analysis of optimality in natural and perturbed metabolic networks. Proc Natl Acad Sci. 2002, 99: 1511215117. 10.1073/pnas.232349399.
 32.
Edwards JS, Palsson BO: Robustness analysis of the escherichiacoli metabolic network. Biotechnol Prog. 2000, 16: 927939. 10.1021/bp0000712.
 33.
Förster J, Gombert AK, Nielsen J: A functional genomics approach using metabolomics and in silico pathway analysis. Biotechnol Bioeng. 2002, 79: 703712. 10.1002/bit.10378.
 34.
Reed JL, Vo TD, Schilling CH, Palsson BO: An expanded genomescale model of Escherichia coli K12 (iJR904 GSM/GPR). Genome Biol. 2003, 4: R5410.1186/gb200349r54.
 35.
Blank LM, Kuepfer L, Sauer U: Largescale 13Cflux analysis reveals mechanistic principles of metabolic network robustness to null mutations in yeast. Genome Biol. 2005, 6: R4910.1186/gb200566r49.
 36.
Fischer E, Zamboni N, Sauer U: Highthroughput metabolic flux analysis based on gas chromatography–mass spectrometry derived 13C constraints. Anal Biochem. 2004, 325: 308316. 10.1016/j.ab.2003.10.036.
Acknowledgments
The work is partially funded by ERDF  European Regional Development Fund through the COMPETE Programme (operational programme for competitiveness) and by National Funds through the FCT (Portuguese Foundation for Science and Technology) within project ref. COMPETE FCOMP01 0124FEDER015079. RC’s work is funded by a Ph.D. grant from the Portuguese FCT (ref. SFRH/BD/66201/2009).
The authors would also like to thank the FCT Strategic Project PEstOE/EQB/LA0023/2013 and the Projects “BioInd  Biotechnology and Bioengineering for improved Industrial and AgroFood processes”, REF. NORTE070124FEDER000028 and “PEM – Metabolic Engineering Platform”, project number 23060, both cofunded by the Programa Operacional Regional do Norte (ON.2 – O Novo Norte), QREN, FEDER.
Author information
Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
RC, PE, PM, MP, IR and MR were involved in the conception of the algorithms. RC, MR and PV were involved in the design and implementation of the software. RC, IR and MR proposed and prepared the case study. RC, IR, JFT and MR helped to draft the manuscript. All authors read, reviewed and approved the final manuscript.
Electronic supplementary material
12918_2014_123_MOESM1_ESM.xlsx
Additional file 1: Table S1.: Feature comparison of several software tools for constraintbased methods, including a list of CBFA novel features under the framework of OptFlux. (XLSX 49 KB)
12918_2014_123_MOESM2_ESM.pdf
Additional file 2:Document with a full description and mathematical formulation of the implemented Flux Analysis methods.(PDF 603 KB)
Results for the simulations carried out in the first case study: reduced model of
Additional file 4: S. cerevisiae .(XLSX 20 KB)
Results for the simulations carried out in the first case study: GSMM of
Additional file 5: E. coli .(XLSX 124 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Carreira, R., Evangelista, P., Maia, P. et al. CBFA: phenotype prediction integrating metabolic models with constraints derived from experimental data. BMC Syst Biol 8, 123 (2014). https://doi.org/10.1186/s1291801401231
Received:
Accepted:
Published:
Keywords
 Constraintbased modeling
 Metabolic Flux analysis
 Metabolic engineering
 Opensource software