FAME, the Flux Analysis and Modeling Environment

Boele, Joost; Olivier, Brett G; Teusink, Bas

doi:10.1186/1752-0509-6-8

Software
Open access
Published: 30 January 2012

FAME, the Flux Analysis and Modeling Environment

Joost Boele^1,2,
Brett G Olivier^1,2,3 &
Bas Teusink^1,2

BMC Systems Biology volume 6, Article number: 8 (2012) Cite this article

11k Accesses
52 Citations
8 Altmetric
Metrics details

Abstract

Background

The creation and modification of genome-scale metabolic models is a task that requires specialized software tools. While these are available, subsequently running or visualizing a model often relies on disjoint code, which adds additional actions to the analysis routine and, in our experience, renders these applications suboptimal for routine use by (systems) biologists.

Results

The Flux Analysis and Modeling Environment (FAME) is the first web-based modeling tool that combines the tasks of creating, editing, running, and analyzing/visualizing stoichiometric models into a single program. Analysis results can be automatically superimposed on familiar KEGG-like maps. FAME is written in PHP and uses the Python-based PySCeS-CBM for its linear solving capabilities. It comes with a comprehensive manual and a quick-start tutorial, and can be accessed online at http://f-a-m-e.org/.

Conclusions

With FAME, we present the community with an open source, user-friendly, web-based "one stop shop" for stoichiometric modeling. We expect the application will be of substantial use to investigators and educators alike.

Background

In the post-genome era, genome-scale stoichiometric models have gained popularity, as the absence of the need to experimentally determine model parameters one enzyme at a time made it possible to build bigger metabolic models than ever before [1]. However, genome-scale stoichiometric models quickly get so large that efficiently editing them requires advanced tools. It is not even a trivial matter to run a model and visualize the results, as this requires at least a linear solver and some program to interpret the solver's output.

Although tools that facilitate common tasks are available (e.g. Model SEED for automated model generation [2], the COBRA Toolbox for model solving and command-line manipulation [3]), and some cater to more than one need (e.g. OptFlux [4] or the Java-based CellNetAnalyzer [5], which both feature model editing and a form of visualization), none have proven to be the panacea that bridges these gaps. Broad adoption of tools is often impeded by complicated installation procedures, required proprietary software (e.g. Matlab), not scaling up to genome-scale proportions, or results visualization that requires extensive user input in order to produce intelligible results.

We identified and experienced the need for an open-source, user-friendly and portable (web-based) software environment for most routine questions a (systems) biologist would want to ask a genome-scale metabolic model. Based on our own extensive experience in developing and using such models, we have developed FAME: the Flux Analysis and Modeling Environment, a "one stop shop" that addresses these issues.

Comparison with existing tools

In an analysis of available applications, the programs that approach FAME's functionality the closest are the aforementioned COBRA Toolbox, OptFlux, and CellNetAnalyzer. We will discuss these tools here, but also refer to Table 1, where we have summarized a more complete assessment of the alternatives.

Table 1 Comparison between FAME and existing software

Full size table

The COBRA Toolbox [3] is one of the most widely used toolkits for (stoichiometric) systems biology modeling. It has a very complete editing and analysis feature set, and features results visualization on user-supplied network maps. Although the toolbox itself is open-source, it is dependent on Matlab, which may deter impecunious users. Moreover, to perform any routine tasks or data analysis, users must first learn to use Matlab.

OptFlux [4] and CellNetAnalyzer [5] are tools that integrate some or all of FAME's key functionalities, particularly model editing and visualization. However, neither tool has a web interface, and CellNetAnalyzer is based on Matlab, which makes it suffer from similar limitations as the COBRA Toolbox. In both tools, as well as in the COBRA Toolbox, visualization is dependent on user input of the network topology in a tool-specific format, such as a CellDesigner [6] map or COBRA Toolbox-specific "map file". FAME offers supervised visualization in a web interface, and this can be considered an enhancement of existing functionality in three ways: first, users need not supply a custom-made map file to visualize results; second, FAME scours models for meta-information that might aid in the visualization of run results (e.g. EC-numbers); and third, FAME uses this information to generate maps that are interactive, with elements that can be clicked to access additional information. It is the first application to open up this feature set in an installation-free manner, and to harness the functionality of the web for this kind of analysis.

Implementation

Besides the externally visible HTML and CSS that convey its markup, the parts of the FAME web application that do the work are implemented in PHP5 and JavaScript. PHP was chosen because it is a fast, browser independent language that integrates well with other programs. FAME uses the Python-based PySCeS-CBM (Python Simulator of Cellular Systems-Constraint Based Modelling toolkit) [7] as an interface to a linear solver. Model information, which is encoded in XML, is communicated to PySCeS using SOAP, the Simple Object Access Protocol. Run results are retrieved over the same protocol, and this setup of FAME as a SOAP client opens the door for it to gather data from a variety of resources in future releases (e.g. the Kyoto Encyclopedia of Genes and Genomes (KEGG, by Kanehisa and Goto [8])). Moreover, as both FAME and PySCeS have a modular architecture, future expansions of the analysis capabilities of PySCeS-CBM will conveniently translate to cognate expansions of FAME's functionality. When visualizing, FAME generates pathway maps as SVG images (Scalable Vector Graphics), using an algorithm designed specifically for FAME. The SVG format was selected because of its open nature (it is an XML-based format), cross-platform compatibility and scalability. In addition, SVG images natively support the inclusion of hyperlinks, which adds a layer of interactivity to run results. FAME, PySCeS(-CBM), and Mariner, the SOAP interface to PySCeS, are all open source.

Results and Discussion

A web-based application, FAME aims to address the needs of modelers on three key points of focus: model creation, result generation, and interpretation (i.e. visualization, sensitivity analysis, metabolite connectivity, etc.) (Figure 1). Traditionally, transitions between these tasks often impeded work flow and could themselves become a source of errors. For example, running a model after editing it, or visualizing the results after running it, would require the user to save a file, launch another program, and then load the file into the new program. In FAME, these labor-intensive transitions are eliminated by teaming up with Mariner, a SOAP interface of PySCeS-CBM [7]. Throughout this section, we will illustrate FAME's functionality based on an example use case (Figure 2).

Creating and editing models

FAME allows users to either upload their own preexisting model (Figure 2A), or to build a new model based on the information in KEGG. When building from scratch, it is possible to select a subset of pathways from KEGG, foregoing the inclusion of unnecessary reactions that may be present in existing genome-scale reconstructions. To allow for fast model construction, FAME uses a cached copy of the required information from KEGG for the assembly of new models. In addition, the KEGG IDs in such models can be used to find more information from KEGG if the need should arise. FAME's visualization module makes use of these IDs when mapping run results onto KEGG maps. Importing non-KEGG, non-FAME models may inactivate some of these capabilities, although measures have been taken to use metadata that is available in imported models, for example the models from the BiGG database [9]. As an alternative to building from scratch, any stoichiometric model can be loaded into FAME, provided it is encoded in the Systems Biology Mark-up Language (SBML, [10]), the de facto standard for representing such models. A proposed SBML Level 3 package "Flux Balance Constraints" allows the definition of constraint based models [11]. Models that lack information about constraints, however, will also load in FAME, and will be automatically converted to constraint based models as necessary. FAME is intentionally very flexible with respect to the integrity of the input SBML, accepting even a bare minimum of information about a model's stoichiometry.

Once a model is loaded, FAME offers all tools a seasoned constraint-based modeler would need to study physiology. This includes easy editing of the flux bounds on all internal and exchange reactions, editing existing reactions, adding/deleting objectives, recognizing dead-end metabolites (orphans), recognizing synonymous reactions, and assigning reactions to different or new compartments (Figure 2B). Adding exchange reactions is supported, as are performing operations in batch and setting constraints on a per-reaction basis. The current version of a model can be exported as SBML at any time.

Result generation

Analysis commands are forwarded to PySCeS-CBM, which handles the mathematical operations and returns the results to FAME. Operations supported by PySCeS include Flux Balance Analysis (FBA) and Flux Variability Analysis (FVA). Given the more compute-intensive nature of the latter, if a subset of pathways is selected, only reactions in those pathways will be included in the variability analysis. For instance, whereas an FBA of the S. cerevisiae model [12] from BiGG (1266 reactions) takes under ten seconds (including visualization), the equivalent FVA takes roughly ten times as long. In addition, FAME can minimize the sum of absolute fluxes of an FBA solution, which leads to results that are more biologically relevant, as it can reduce complex loops in FBA solutions to their underlying net fluxes. FAME can also perform analyses on metabolites, rather than reactions or fluxes: per metabolite, it can list right hand side sensitivities, shadow prices, and it also features the option of checking whether a specific metabolite can be produced by the model. The latter can also be performed for all metabolites in the model at once. Once generated, results can be visualized, but they are also always presented as a human-readable table (which includes reduced costs for each reaction) and as a machine-readable, tab-separated format file that can be imported in e.g. Excel (Figure 2D).

The included Gene Association Workbench allows users to intuitively take advantage of gene association information present in the model metadata, e.g. by simulating (multiple) knockout mutants. Results are presented in the same interactive manner as the other analyses. If on any occasion the model system is over-determined, FAME will relay the solver's message that the solution status is 'infeasible' and additionally issues a warning to the user. Under-determined systems will run and produce a result; upon interpreting run results, users may run further analyses such as FVA to assess the properties of the solution space.

Visualization

The visualization module generates images in SVG format, based on the analysis results returned by PySCeS. The advantages of using SVG are manifold, some of the more notable being image scalability and ease of editing using third party software. Depending on the web browser used, users may need to download a (free) plug-in to view the images.

For each selected pathway an interactive KEGG-like image is drawn (Figure 2C), on which the run results are superimposed. To the biologist, this readily recognizable representation is an improvement over unsupervised visualization algorithms (e.g. in [13]), and while this approach to data visualization was already applied some years ago [14], to our knowledge, FAME is the first web-based application that both generates data and automatically visualizes results.

Many elements in the results images are clickable (another advantage of the SVG format), to make more information available more conveniently. For instance, clicking a metabolite will display an overview of reactions producing or consuming it, along with the KEGG information page for the metabolite, while clicking a reaction will display that reaction's KEGG information page.

Conclusions

With FAME, we present the community with an easy to use web-based "one stop shop" for the manipulation and execution of stoichiometric models. It enables biologists to create or import models, edit them, run them at the click of a button, and visualize the results from the browser window. We expect that its install-free integration of execution and visualization will appeal to investigators and educators alike. Future releases of FAME will feature integration with web-based annotation services and further analysis options. Finally, the novel SOAP interface to PySCeS-CBM will facilitate the creation of user-friendly interfaces based on PySCeS that will uncover powerful modeling functions that may otherwise remain hidden behind the ever-enigmatic command line cursor.

Availability and Requirements

FAME is intended and offered as a web service, but can also be installed locally, as source code will be made available upon request. FAME can be accessed online at http://f-a-m-e.org/, where a full user manual and guided tutorial are available. PySCeS-CBM and Mariner are also open source, and can be downloaded from http://pysces.sourceforge.net/cbm. FAME and PySCeS/Mariner are covered by their own respective BSD-style licenses, which can be found on the respective web pages and, in short, entail that they are open-source and free to use for both academic and non-academic users.

References

Oberhardt M, Palsson B, Papin J: Applications of genome-scale metabolic reconstructions. Mol Syst Biol. 2009, 5: 320-
Article Google Scholar
Henry M, DeJongh CS, Best A, Frybarger P, Linsay B, Stevens R: High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat Biotechnol. 2010, 28: 977-982. 10.1038/nbt.1672.
Article CAS Google Scholar
Schellenberger J, Que R, Fleming RM, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NE, Rahmanian S, Kang J, Hyduke DR, Palsson B: Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat Protoc. 2011, 6: 1290-1307. 10.1038/nprot.2011.308.
Article CAS Google Scholar
Rocha I, Maia P, Evangelista P, Vilaça P, Soares S, Pinto J, Nielsen J, Patil K, Ferreira E, Rocha M: OptFlux: an open-source software platform for in silico metabolic engineering. BMC Syst Biol. 2010, 4: 45-10.1186/1752-0509-4-45.
Article Google Scholar
Klamt S, Saez-Rodriguez J, Gilles E: Structural and functional analysis of cellular networks with CellNetAnalyzer. BMC Syst Biol. 2007, 1: 2-10.1186/1752-0509-1-2.
Article Google Scholar
Funahashi A, Tanimura N, Morohashi M, Kitano H: CellDesigner: a process diagram editor for gene-regulatory and biochemical networks. BIOSILICO. 2003, 1: 159-162. 10.1016/S1478-5382(03)02370-9.
Article Google Scholar
Olivier B, Rohwer J, Hofmeyr J: Modelling cellular systems with PySCeS. Bioinformatics. 2005, 21: 560-561. 10.1093/bioinformatics/bti046.
Article CAS Google Scholar
Kanehisa M, Goto S: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000, 28: 27-30. 10.1093/nar/28.1.27.
Article CAS Google Scholar
Schellenberger J, Park J, Conrad T, Palsson B: BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions. BMC Bioinformatics. 2010, 11: 213-10.1186/1471-2105-11-213.
Article Google Scholar
Hucka M, Finney A, Sauro H, Bolouri H, Doyle J, Kitano H, Arkin A, Bornstein B, Bray D, Cornish-Bowden A, Cuellar A, Dronov S, Gilles E, Ginkel M, Gor V, Goryanin I, Hedley W, Hodgman T, Hofmeyr J, Hunter P, Juty N, Kasberger J, Kremling A, Kummer U, Le Novère N, Loew L, Lucio D, Mendes P, Minch E, Mjolsness E, Nakayama Y, Nelson M, Nielsen P, Sakurada T, Schaff J, Shapiro B, Shimizu T, Spence H, Stelling J, Takahashi K, Tomita M, Wagner J, Wang J, Forum S: The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003, 19: 524-531. 10.1093/bioinformatics/btg015.
Article CAS Google Scholar
Main Page - SBML.org. [http://sbml.org/]
Duarte NC, Herrgard MJ, Palsson B: Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Res. 2004, 14: 1298-1309. 10.1101/gr.2250904.
Article CAS Google Scholar
Schwarz R, Liang C, Kaleta C, Kühnel M, Hoffmann E, Kuznetsov S, Hecker M, Griffiths G, Schuster S, Dandekar T: Integrated network reconstruction, visualization and analysis using YANAsquare. BMC Bioinformatics. 2007, 8: 313-10.1186/1471-2105-8-313.
Article Google Scholar
Kono N, Arakawa K, Tomita M: MEGU: pathway mapping web-service based on KEGG and SVG. In Silico Biol. 2006, 6: 621-625.
CAS Google Scholar
Cvijovic M, Olivares-Hernández R, Agren R, Dahr N, Vongsangnak W, Nookaew I, Patil K, Nielsen J: BioMet Toolbox: genome-wide analysis of metabolism. Nucleic Acids Res. 2010, 38: W144-149. 10.1093/nar/gkq404.
Article CAS Google Scholar
Shannon P, Markiel A, Ozier O, Baliga N, Wang J, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13: 2498-2504. 10.1101/gr.1239303.
Article CAS Google Scholar
Segre D, Vitkup D, Church GM: Analysis of optimality in natural and perturbed metabolic networks. Proc Natl Acad Sci USA. 2002, 99: 15112-15117. 10.1073/pnas.232349399.
Article CAS Google Scholar

Download references

Acknowledgements

The authors thank Filipe Santos and Anisha Goel for helping improve FAME by test-driving it within their daily modeling practice. BGO is supported by NWO Computational Life Science Grant 635-100-021. BT and JB acknowledge the Centre for Integrative Bioinformatics VU (IBIVU) and the Amsterdam Institute for Molecules, Medicines and Systems (AIMMS) for support.

Author information

Authors and Affiliations

Systems Bioinformatics/AIMMS, VU University Amsterdam, De Boelelaan 1085, 1081HV, Amsterdam, The Netherlands
Joost Boele, Brett G Olivier & Bas Teusink
Netherlands Institute for Systems Biology (NISB), Amsterdam, The Netherlands
Joost Boele, Brett G Olivier & Bas Teusink
Life Sciences, Centrum voor Wiskunde en Informatica (CWI), Science Park 123, 1098XG, Amsterdam, The Netherlands
Brett G Olivier

Authors

Joost Boele
View author publications
You can also search for this author in PubMed Google Scholar
Brett G Olivier
View author publications
You can also search for this author in PubMed Google Scholar
Bas Teusink
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bas Teusink.

Additional information

Authors' contributions

JB created FAME. BGO created Mariner and PySCeS. JB, BGO and BT wrote the paper. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Boele, J., Olivier, B.G. & Teusink, B. FAME, the Flux Analysis and Modeling Environment. BMC Syst Biol 6, 8 (2012). https://doi.org/10.1186/1752-0509-6-8

Download citation

Received: 12 August 2011
Accepted: 30 January 2012
Published: 30 January 2012
DOI: https://doi.org/10.1186/1752-0509-6-8

FAME, the Flux Analysis and Modeling Environment