FAME, the Flux Analysis and Modeling Environment
BMC Systems Biology volume 6, Article number: 8 (2012)
The creation and modification of genome-scale metabolic models is a task that requires specialized software tools. Although such tools are available, subsequently running or visualizing a model often relies on separate, unconnected programs, which adds steps to the analysis routine and, in our experience, renders these applications suboptimal for routine use by (systems) biologists.
The Flux Analysis and Modeling Environment (FAME) is the first web-based modeling tool that combines the tasks of creating, editing, running, and analyzing/visualizing stoichiometric models into a single program. Analysis results can be automatically superimposed on familiar KEGG-like maps. FAME is written in PHP and uses the Python-based PySCeS-CBM for its linear solving capabilities. It comes with a comprehensive manual and a quick-start tutorial, and can be accessed online at http://f-a-m-e.org/.
With FAME, we present the community with an open source, user-friendly, web-based "one stop shop" for stoichiometric modeling. We expect the application will be of substantial use to investigators and educators alike.
In the post-genome era, genome-scale stoichiometric models have gained popularity: because model parameters no longer need to be determined experimentally one enzyme at a time, it has become possible to build bigger metabolic models than ever before. However, genome-scale stoichiometric models quickly become so large that editing them efficiently requires advanced tools. Even running a model and visualizing the results is not trivial, as this requires at least a linear solver and some program to interpret the solver's output.
Although tools that facilitate common tasks are available (e.g. Model SEED for automated model generation, the COBRA Toolbox for model solving and command-line manipulation), and some cater to more than one need (e.g. the Java-based OptFlux or CellNetAnalyzer, which both feature model editing and a form of visualization), none has proven to be the panacea that bridges these gaps. Broad adoption of tools is often impeded by complicated installation procedures, a dependence on proprietary software (e.g. Matlab), a failure to scale to genome-scale models, or results visualization that requires extensive user input to produce intelligible results.
We identified and experienced the need for an open-source, user-friendly and portable (web-based) software environment for most routine questions a (systems) biologist would want to ask a genome-scale metabolic model. Based on our own extensive experience in developing and using such models, we have developed FAME: the Flux Analysis and Modeling Environment, a "one stop shop" that addresses these issues.
Comparison with existing tools
In an analysis of available applications, the programs that approach FAME's functionality the closest are the aforementioned COBRA Toolbox, OptFlux, and CellNetAnalyzer. We will discuss these tools here, but also refer to Table 1, where we have summarized a more complete assessment of the alternatives.
The COBRA Toolbox is one of the most widely used toolkits for (stoichiometric) systems biology modeling. It has a very complete editing and analysis feature set, and features results visualization on user-supplied network maps. Although the toolbox itself is open-source, it is dependent on Matlab, which may deter impecunious users. Moreover, to perform any routine tasks or data analysis, users must first learn to use Matlab.
OptFlux and CellNetAnalyzer are tools that integrate some or all of FAME's key functionalities, particularly model editing and visualization. However, neither tool has a web interface, and CellNetAnalyzer is based on Matlab, so it shares the limitations of the COBRA Toolbox described above. In both tools, as well as in the COBRA Toolbox, visualization depends on the user supplying the network topology in a tool-specific format, such as a CellDesigner map or a COBRA Toolbox-specific "map file". FAME offers supervised visualization in a web interface, which enhances existing functionality in three ways: first, users need not supply a custom-made map file to visualize results; second, FAME scours models for meta-information that might aid in the visualization of run results (e.g. EC-numbers); and third, FAME uses this information to generate maps that are interactive, with elements that can be clicked to access additional information. It is the first application to open up this feature set in an installation-free manner, and to harness the functionality of the web for this kind of analysis.
Results and Discussion
A web-based application, FAME aims to address the needs of modelers on three key points of focus: model creation, result generation, and interpretation (i.e. visualization, sensitivity analysis, metabolite connectivity, etc.) (Figure 1). Traditionally, transitions between these tasks often impeded workflow and could themselves become a source of errors. For example, running a model after editing it, or visualizing the results after running it, would require the user to save a file, launch another program, and then load the file into the new program. In FAME, these labor-intensive transitions are eliminated by teaming up with Mariner, a SOAP interface of PySCeS-CBM. Throughout this section, we will illustrate FAME's functionality based on an example use case (Figure 2).
Creating and editing models
FAME allows users either to upload their own preexisting model (Figure 2A), or to build a new model based on the information in KEGG. When building from scratch, it is possible to select a subset of pathways from KEGG, forgoing the inclusion of unnecessary reactions that may be present in existing genome-scale reconstructions. To allow for fast model construction, FAME uses a cached copy of the required information from KEGG for the assembly of new models. In addition, the KEGG IDs in such models can be used to find more information from KEGG if the need should arise. FAME's visualization module makes use of these IDs when mapping run results onto KEGG maps. Importing non-KEGG, non-FAME models may disable some of these capabilities, although measures have been taken to use metadata that is available in imported models, for example the models from the BiGG database. As an alternative to building from scratch, any stoichiometric model can be loaded into FAME, provided it is encoded in the Systems Biology Mark-up Language (SBML), the de facto standard for representing such models. A proposed SBML Level 3 package "Flux Balance Constraints" allows the definition of constraint-based models. Models that lack information about constraints, however, will also load in FAME, and will be automatically converted to constraint-based models as necessary. FAME is intentionally very flexible with respect to the integrity of the input SBML, accepting even a bare minimum of information about a model's stoichiometry.
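The automatic conversion of an unconstrained model into a constraint-based one can be pictured with a minimal Python sketch. The article does not describe how FAME performs this step, so everything here is an assumption made for illustration: the `Reaction` class, the `with_default_bounds` helper, the reaction IDs, and the default magnitude of 1000. The idea shown is simply that any missing flux bound is filled in from the reaction's reversibility.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical magnitude standing in for an "unbounded" flux; not taken from FAME.
DEFAULT_BOUND = 1000.0


@dataclass
class Reaction:
    rid: str
    reversible: bool
    lower: Optional[float] = None  # None means the input file carried no bound
    upper: Optional[float] = None


def with_default_bounds(rxn: Reaction, big: float = DEFAULT_BOUND) -> Reaction:
    """Return a copy of rxn in which every missing bound has been filled in."""
    if rxn.lower is not None:
        lower = rxn.lower
    else:
        # Irreversible reactions may only carry forward (non-negative) flux.
        lower = -big if rxn.reversible else 0.0
    upper = rxn.upper if rxn.upper is not None else big
    return Reaction(rxn.rid, rxn.reversible, lower, upper)


# Illustrative reactions; the IDs are made up for this example.
model = [
    Reaction("R_HEX1", reversible=False),
    Reaction("R_PGI", reversible=True),
    Reaction("R_EX_glc", reversible=True, lower=-10.0),
]
bounded = [with_default_bounds(r) for r in model]
```

Bounds that are already present are left untouched, so a model that does carry constraints passes through unchanged.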
Once a model is loaded, FAME offers all tools a seasoned constraint-based modeler would need to study physiology. This includes easy editing of the flux bounds on all internal and exchange reactions, editing existing reactions, adding/deleting objectives, recognizing dead-end metabolites (orphans), recognizing synonymous reactions, and assigning reactions to different or new compartments (Figure 2B). Adding exchange reactions is supported, as are performing operations in batch and setting constraints on a per-reaction basis. The current version of a model can be exported as SBML at any time.
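As an illustration of what recognizing dead-end metabolites involves, the following Python sketch flags metabolites that can only be produced or only be consumed. This is one common definition and not necessarily the criterion FAME applies; the data layout, the `find_dead_ends` function, and the toy network are hypothetical.

```python
def find_dead_ends(reactions):
    """Return metabolites that are produced but never consumed, or vice versa.

    reactions maps a reaction ID to (stoichiometry, reversible), where
    stoichiometry maps metabolite IDs to coefficients (negative = substrate).
    """
    produced, consumed = set(), set()
    for stoich, reversible in reactions.values():
        for met, coeff in stoich.items():
            if coeff == 0:
                continue
            # A reversible reaction can both produce and consume its metabolites.
            if coeff > 0 or reversible:
                produced.add(met)
            if coeff < 0 or reversible:
                consumed.add(met)
    # Symmetric difference: metabolites in exactly one of the two sets.
    return sorted(produced ^ consumed)


# Toy network (made up): A is taken up, converted to B, then to C and D;
# only C is exported, so D accumulates and is a dead end.
toy = {
    "EX_A": ({"A": 1}, False),
    "R1": ({"A": -1, "B": 1}, False),
    "R2": ({"B": -1, "C": 1, "D": 1}, False),
    "EX_C": ({"C": -1}, False),
}
dead_ends = find_dead_ends(toy)
```

Adding an exchange reaction for D would remove it from the list, which is the kind of repair such a check is meant to suggest.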
Analysis commands are forwarded to PySCeS-CBM, which handles the mathematical operations and returns the results to FAME. Operations supported by PySCeS include Flux Balance Analysis (FBA) and Flux Variability Analysis (FVA). Given the more compute-intensive nature of the latter, if a subset of pathways is selected, only reactions in those pathways will be included in the variability analysis. For instance, whereas an FBA of the S. cerevisiae model from BiGG (1266 reactions) takes under ten seconds (including visualization), the equivalent FVA takes roughly ten times as long. In addition, FAME can minimize the sum of absolute fluxes of an FBA solution, which leads to results that are more biologically relevant, as it can reduce complex loops in FBA solutions to their underlying net fluxes. FAME can also perform analyses on metabolites, rather than reactions or fluxes: per metabolite, it can list right-hand-side sensitivities and shadow prices, and it can check whether a specific metabolite can be produced by the model. This check can also be performed for all metabolites in the model at once. Once generated, results can be visualized, but they are also always presented as a human-readable table (which includes reduced costs for each reaction) and as a machine-readable, tab-separated file that can be imported into e.g. Excel (Figure 2D).
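For reference, the optimization problems behind these analyses can be written in their usual textbook form. The article does not spell out the exact formulations used by PySCeS-CBM, so the notation below is assumed: S is the stoichiometric matrix, v the flux vector, c the objective coefficients, v^lb and v^ub the flux bounds, and Z* the optimal FBA objective value.

```latex
% Flux balance analysis (textbook form; symbols are assumed, not from the article)
\begin{aligned}
Z^{*} = \max_{v} \quad & c^{\mathsf{T}} v \\
\text{s.t.} \quad & S v = 0, \\
                  & v^{\mathrm{lb}} \le v \le v^{\mathrm{ub}}
\end{aligned}

% Minimizing the sum of absolute fluxes while keeping the FBA optimum
\begin{aligned}
\min_{v} \quad & \sum_{i} \lvert v_i \rvert \\
\text{s.t.} \quad & S v = 0, \quad
                    v^{\mathrm{lb}} \le v \le v^{\mathrm{ub}}, \quad
                    c^{\mathsf{T}} v = Z^{*}
\end{aligned}

% Flux variability analysis: two linear programs per reaction j
\begin{aligned}
v_j^{\min},\; v_j^{\max} = \min_{v} / \max_{v} \quad & v_j \\
\text{s.t.} \quad & S v = 0, \quad
                    v^{\mathrm{lb}} \le v \le v^{\mathrm{ub}}, \quad
                    c^{\mathsf{T}} v = Z^{*}
\end{aligned}
```

A closed loop of reactions can carry arbitrary flux without changing the objective value, so the second problem removes such loops by penalizing every unit of flux. The third formulation also makes the cost of FVA explicit: it solves two linear programs per reaction, which is why restricting the analysis to the reactions of selected pathways saves time.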
The included Gene Association Workbench allows users to intuitively take advantage of gene association information present in the model metadata, e.g. by simulating (multiple) knockout mutants. Results are presented in the same interactive manner as the other analyses. If on any occasion the model system is over-determined, FAME will relay the solver's message that the solution status is 'infeasible' and additionally issues a warning to the user. Under-determined systems will run and produce a result; upon interpreting run results, users may run further analyses such as FVA to assess the properties of the solution space.
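Simulating a knockout from gene association information amounts to evaluating each reaction's gene rule with the deleted genes set to false, and blocking the reactions whose rule no longer holds. The Python sketch below assumes a Boolean rule syntax with `and`, `or`, and parentheses, as commonly found in BiGG-style models; the function names and the example rules are hypothetical, and the article does not describe how the Gene Association Workbench is implemented.

```python
import re


def reaction_active(rule, knocked_out):
    """Evaluate a gene rule such as '(g1 and g2) or g3' for a set of deleted genes."""
    tokens = re.findall(r"\(|\)|[^\s()]+", rule)
    if not tokens:
        return True  # no gene association: the reaction is unaffected
    pos = 0

    def parse_or():
        nonlocal pos
        value = parse_and()
        while pos < len(tokens) and tokens[pos].lower() == "or":
            pos += 1
            rhs = parse_and()  # always parse, so no tokens are skipped
            value = value or rhs
        return value

    def parse_and():
        nonlocal pos
        value = parse_atom()
        while pos < len(tokens) and tokens[pos].lower() == "and":
            pos += 1
            rhs = parse_atom()
            value = value and rhs
        return value

    def parse_atom():
        nonlocal pos
        token = tokens[pos]
        pos += 1
        if token == "(":
            value = parse_or()
            pos += 1  # skip the closing parenthesis
            return value
        return token not in knocked_out  # a gene is "on" unless deleted

    return parse_or()


def knock_out(bounds, rules, genes):
    """Return new flux bounds with every gene-dependent reaction blocked."""
    deleted = set(genes)
    return {
        rid: (lo, hi) if reaction_active(rules.get(rid, ""), deleted) else (0.0, 0.0)
        for rid, (lo, hi) in bounds.items()
    }


# Made-up example: R1 has an isozyme (g3), R2 depends on g1 alone, R3 has no rule.
rules = {"R1": "(g1 and g2) or g3", "R2": "g1", "R3": ""}
bounds = {"R1": (0.0, 1000.0), "R2": (-1000.0, 1000.0), "R3": (0.0, 1000.0)}
single = knock_out(bounds, rules, ["g1"])
double = knock_out(bounds, rules, ["g1", "g3"])
```

The modified bounds would then be passed to the solver for a new FBA run. The rules are parsed with a small hand-written parser rather than `eval`, so arbitrary text in model metadata is never executed.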
The visualization module generates images in SVG format, based on the analysis results returned by PySCeS. The advantages of using SVG are manifold, some of the more notable being image scalability and ease of editing using third party software. Depending on the web browser used, users may need to download a (free) plug-in to view the images.
For each selected pathway an interactive KEGG-like image is drawn (Figure 2C), on which the run results are superimposed. To the biologist, this readily recognizable representation is an improvement over unsupervised visualization algorithms, and while this approach to data visualization was already applied some years ago, to our knowledge, FAME is the first web-based application that both generates data and automatically visualizes results.
Many elements in the results images are clickable (another advantage of the SVG format), to make more information available more conveniently. For instance, clicking a metabolite will display an overview of reactions producing or consuming it, along with the KEGG information page for the metabolite, while clicking a reaction will display that reaction's KEGG information page.
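The clickable elements rely on a standard SVG feature: any shape wrapped in an `a` element becomes a hyperlink. The Python sketch below builds such an image with the standard library. It is not FAME's rendering code (FAME itself is written in PHP); the layout, the colors, the function names, the KEGG reaction IDs, and the KEGG URL pattern are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

SVG_NS = "http://www.w3.org/2000/svg"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("", SVG_NS)
ET.register_namespace("xlink", XLINK_NS)

# Assumed URL pattern for a KEGG entry page.
KEGG_ENTRY_URL = "https://www.kegg.jp/entry/{}"


def reaction_box(parent, kegg_id, flux, x, y):
    """Draw one reaction as a labelled box that links to its KEGG page."""
    link = ET.SubElement(
        parent,
        f"{{{SVG_NS}}}a",
        {f"{{{XLINK_NS}}}href": KEGG_ENTRY_URL.format(kegg_id)},
    )
    ET.SubElement(
        link,
        f"{{{SVG_NS}}}rect",
        {
            "x": str(x), "y": str(y), "width": "110", "height": "24",
            # Reactions carrying flux are highlighted; inactive ones are grey.
            "fill": "#99ff99" if flux != 0 else "#dddddd",
            "stroke": "black",
        },
    )
    label = ET.SubElement(
        link, f"{{{SVG_NS}}}text",
        {"x": str(x + 5), "y": str(y + 16), "font-size": "11"},
    )
    label.text = f"{kegg_id}: {flux:.2f}"
    return link


def render(fluxes):
    """Return an SVG document (as text) with one clickable box per reaction."""
    svg = ET.Element(
        f"{{{SVG_NS}}}svg",
        {"width": "130", "height": str(30 * len(fluxes) + 10)},
    )
    for row, (kegg_id, flux) in enumerate(fluxes.items()):
        reaction_box(svg, kegg_id, flux, 10, 10 + 30 * row)
    return ET.tostring(svg, encoding="unicode")


# Illustrative flux values keyed by KEGG reaction ID.
svg_text = render({"R00299": 1.0, "R00771": 0.0})
```

Because the output is plain XML, the same image can be scaled without loss or opened in third-party vector editors, which are the advantages of SVG noted above.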
With FAME, we present the community with an easy to use web-based "one stop shop" for the manipulation and execution of stoichiometric models. It enables biologists to create or import models, edit them, run them at the click of a button, and visualize the results from the browser window. We expect that its install-free integration of execution and visualization will appeal to investigators and educators alike. Future releases of FAME will feature integration with web-based annotation services and further analysis options. Finally, the novel SOAP interface to PySCeS-CBM will facilitate the creation of user-friendly interfaces based on PySCeS that will uncover powerful modeling functions that may otherwise remain hidden behind the ever-enigmatic command line cursor.
Availability and Requirements
FAME is intended and offered as a web service, but can also be installed locally, as source code will be made available upon request. FAME can be accessed online at http://f-a-m-e.org/, where a full user manual and guided tutorial are available. PySCeS-CBM and Mariner are also open source, and can be downloaded from http://pysces.sourceforge.net/cbm. FAME and PySCeS/Mariner are covered by their own respective BSD-style licenses, which can be found on the respective web pages and, in short, entail that they are open-source and free to use for both academic and non-academic users.
Oberhardt M, Palsson B, Papin J: Applications of genome-scale metabolic reconstructions. Mol Syst Biol. 2009, 5: 320.
Henry CS, DeJongh M, Best A, Frybarger P, Linsay B, Stevens R: High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat Biotechnol. 2010, 28: 977-982. 10.1038/nbt.1672.
Schellenberger J, Que R, Fleming RM, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NE, Rahmanian S, Kang J, Hyduke DR, Palsson B: Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat Protoc. 2011, 6: 1290-1307. 10.1038/nprot.2011.308.
Rocha I, Maia P, Evangelista P, Vilaça P, Soares S, Pinto J, Nielsen J, Patil K, Ferreira E, Rocha M: OptFlux: an open-source software platform for in silico metabolic engineering. BMC Syst Biol. 2010, 4: 45. 10.1186/1752-0509-4-45.
Klamt S, Saez-Rodriguez J, Gilles E: Structural and functional analysis of cellular networks with CellNetAnalyzer. BMC Syst Biol. 2007, 1: 2. 10.1186/1752-0509-1-2.
Funahashi A, Tanimura N, Morohashi M, Kitano H: CellDesigner: a process diagram editor for gene-regulatory and biochemical networks. BIOSILICO. 2003, 1: 159-162. 10.1016/S1478-5382(03)02370-9.
Olivier B, Rohwer J, Hofmeyr J: Modelling cellular systems with PySCeS. Bioinformatics. 2005, 21: 560-561. 10.1093/bioinformatics/bti046.
Kanehisa M, Goto S: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000, 28: 27-30. 10.1093/nar/28.1.27.
Schellenberger J, Park J, Conrad T, Palsson B: BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions. BMC Bioinformatics. 2010, 11: 213. 10.1186/1471-2105-11-213.
Hucka M, Finney A, Sauro H, Bolouri H, Doyle J, Kitano H, Arkin A, Bornstein B, Bray D, Cornish-Bowden A, Cuellar A, Dronov S, Gilles E, Ginkel M, Gor V, Goryanin I, Hedley W, Hodgman T, Hofmeyr J, Hunter P, Juty N, Kasberger J, Kremling A, Kummer U, Le Novère N, Loew L, Lucio D, Mendes P, Minch E, Mjolsness E, Nakayama Y, Nelson M, Nielsen P, Sakurada T, Schaff J, Shapiro B, Shimizu T, Spence H, Stelling J, Takahashi K, Tomita M, Wagner J, Wang J, Forum S: The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003, 19: 524-531. 10.1093/bioinformatics/btg015.
Main Page - SBML.org. [http://sbml.org/]
Duarte NC, Herrgard MJ, Palsson B: Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Res. 2004, 14: 1298-1309. 10.1101/gr.2250904.
Schwarz R, Liang C, Kaleta C, Kühnel M, Hoffmann E, Kuznetsov S, Hecker M, Griffiths G, Schuster S, Dandekar T: Integrated network reconstruction, visualization and analysis using YANAsquare. BMC Bioinformatics. 2007, 8: 313. 10.1186/1471-2105-8-313.
Kono N, Arakawa K, Tomita M: MEGU: pathway mapping web-service based on KEGG and SVG. In Silico Biol. 2006, 6: 621-625.
Cvijovic M, Olivares-Hernández R, Agren R, Dahr N, Vongsangnak W, Nookaew I, Patil K, Nielsen J: BioMet Toolbox: genome-wide analysis of metabolism. Nucleic Acids Res. 2010, 38: W144-149. 10.1093/nar/gkq404.
Shannon P, Markiel A, Ozier O, Baliga N, Wang J, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13: 2498-2504. 10.1101/gr.1239303.
Segre D, Vitkup D, Church GM: Analysis of optimality in natural and perturbed metabolic networks. Proc Natl Acad Sci USA. 2002, 99: 15112-15117. 10.1073/pnas.232349399.
The authors thank Filipe Santos and Anisha Goel for helping improve FAME by test-driving it within their daily modeling practice. BGO is supported by NWO Computational Life Science Grant 635-100-021. BT and JB acknowledge the Centre for Integrative Bioinformatics VU (IBIVU) and the Amsterdam Institute for Molecules, Medicines and Systems (AIMMS) for support.
JB created FAME. BGO created Mariner and PySCeS. JB, BGO and BT wrote the paper. All authors read and approved the final manuscript.