- Methodology article
- Open Access
Systems biology by the rules: hybrid intelligent systems for pathway modeling and discovery
© Bosl; licensee BioMed Central Ltd. 2007
- Received: 21 August 2006
- Accepted: 15 February 2007
- Published: 15 February 2007
Expert knowledge in journal articles is an important source of data for reconstructing biological pathways and creating new hypotheses. An important need for medical research is to integrate this data with high throughput sources to build useful models that span several scales. Researchers traditionally use mental models of pathways to integrate information and development new hypotheses. Unfortunately, the amount of information is often overwhelming and these are inadequate for predicting the dynamic response of complex pathways. Hierarchical computational models that allow exploration of semi-quantitative dynamics are useful systems biology tools for theoreticians, experimentalists and clinicians and may provide a means for cross-communication.
A novel approach for biological pathway modeling based on hybrid intelligent systems or soft computing technologies is presented here. Intelligent hybrid systems, which refers to several related computing methods such as fuzzy logic, neural nets, genetic algorithms, and statistical analysis, has become ubiquitous in engineering applications for complex control system modeling and design. Biological pathways may be considered to be complex control systems, which medicine tries to manipulate to achieve desired results. Thus, hybrid intelligent systems may provide a useful tool for modeling biological system dynamics and computational exploration of new drug targets. A new modeling approach based on these methods is presented in the context of hedgehog regulation of the cell cycle in granule cells. Code and input files can be found at the Bionet website: www.chip.ord/~wbosl/Software/Bionet.
This paper presents the algorithmic methods needed for modeling complicated biochemical dynamics using rule-based models to represent expert knowledge in the context of cell cycle regulation and tumor growth. A notable feature of this modeling approach is that it allows biologists to build complex models from their knowledge base without the need to translate that knowledge into mathematical form. Dynamics on several levels, from molecular pathways to tissue growth, are seamlessly integrated. A number of common network motifs are examined and used to build a model of hedgehog regulation of the cell cycle in cerebellar neurons, which is believed to play a key role in the etiology of medulloblastoma, a devastating childhood brain cancer.
- Fuzzy Logic
- Default Rule
- Hedgehog Pathway
- Fuzzy Logic Model
A key issue in functional genomics is to understand how genetic or epigenetic perturbations to cellular machinery result in disease. Related to this is the question of how to perturb the system so that it functions in a desirable way. The first issue is the concern of hypothesis generation and the second is drug target discovery. Much of the work in these areas is now done by human experts who mentally integrate experimental data reported in the literature and perhaps statistical summaries of data from a variety of sources. Unfortunately, the number of papers published each year on any specific disease pathway continues to grow rapidly, making it difficult for specialists to incorporate all new results into their mental model of a pathway. Furthermore, the amount of data becoming available through new measurement technologies is far greater than the human brain can adequately synthesize. A principal goal of systems biology is to devise computer algorithms to help with the task of integrating expert knowledge, largely gleaned through published results, with high throughput data sources to create new hypotheses and putative drug targets. A particularly vexing problem is that the amount of data available about any pathway is usually incomplete. The human mind is fortunately very good at reasoning with incomplete knowledge, which enables researchers to create new hypotheses even when only incomplete information is available. Systems biology may benefit from emulating human reasoning: less detailed models that can incorporate many sources of incomplete information may yield more useful output .
Many engineering system designs today use descriptive models rather than mathematical models derived from mathematical formulas, such as differential equations. This is often achieved using a modeling approach based on fuzzy logic. Fuzzy logic is a computational method for formulating and transferring human expert knowledge to computational models. It provides a flexible tool for modeling the relationship between input and output information and is distinguished by its robustness with respect to noise and variations in system parameters [2–4]. This characteristic seems to mirror the robustness of biological systems and their remarkable ability to achieve precise functional control from imprecise components . Fuzzy logic and several related, complementary software methodologies are sometimes lumped under the term 'soft computing' or hybrid intelligent systems. These represent a combination of emerging, complementary problem-solving technologies that include fuzzy logic, probabilistic reasoning, Bayesian networks, neural networks, and evolutionary or genetic algorithms. Models based on fuzzy logic are sometimes called rule-based models; these terms will be used interchangeably here.
Fuzzy logic in medicine
Fuzzy logic has become ubiquitous in modern control systems engineering, including medical applications [6–9]. It has been particularly useful in applications where appropriate mathematical models cannot be derived due to the complexity of the problem. Examples include controlling the pump rate in artificial hearts and monitoring the hemodynamic state of patients during surgery , prediction of gait events using the electromyographic (EMG) activity of lower extremity muscles in children with cerebral palsy , pacemaker control , anesthesia and ventilator control , control of robotic prostheses , and numerous other medical  and engineering applications . Perhaps one of the most immediately attractive features of fuzzy logic modeling for biology is that it provides a straightforward method for formulating and transferring human expert knowledge to quantitative computer models. Since much of the data about biological systems is derived from logically-designed, hypothesis-driven experiments and contained in linguistic form in journals, fuzzy logic provides a way for biologists to incorporate data that might otherwise be difficult to incorporate into computer models. Often this data is noisy and imprecise. Fuzzy logic models are naturally robust with respect to noise and variation in system parameters, but allow computation of logical consequences of complex system dynamics with imprecise variables.
At the same time, fuzzy logic models are capable of representing extremely complex systems to high degrees of accuracy when precise data is available. The standard additive model (SAM), a common formulation of a fuzzy logic system, is a universal approximator. That is, a SAM can approximate any nonlinear function as precisely as desired, give appropriate data . Statistical and non-gradient based optimization methods such as genetic algorithms can be used to refine fuzzy logic models that are initially constructed by biologists manually or from pathway databases. Estimation of distribution algorithms, a type of evolutionary algorithm that replaces mutation and crossover operations with probability distributions , provide a potential way to use statistical high throughput data, such protein interaction data, to optimize manually constructed models or to provide putative hypotheses to fill in unknown network components. Thus, the methodology presented here, which is primarily focused on using expert knowledge to build computational models, can use the aforementioned soft computing and statistical methods to incorporate other data sources to optimize parameters, making it an ideal data integration tool. Since many textbooks and papers are available to describe fuzzy logic modeling details [6, 9, 16, 17], the discussion here will focus on a novel implementation of a fuzzy logic system embedded in a network structure and its usefulness for spanning research in basic biology and clinical research.
Fuzzy logic and qualitative methods for systems biology
Several related methods have been developed for fuzzy modeling of gene interaction networks from microarray data. The models are similar, but different methods are used to incorporate data and parameterize the models. Woolf  made fuzzy logic inference tables from triplets of genes to determine co-expression relationships. Sokhansanj, et al.  used an exhaustive search of gene interaction among twelve yeast cell cycle genes to fit the fuzzy model parameters. The model was able to predict transcriptional correlation among the twelve yeast cell cycle genes after tuning the model on a different data set for the same system. Linden, et al.  used a fuzzy logic model in a similar way to reverse engineer a gene interaction network from microarray data using genetic programming to discover and optimize the gene interaction rules. The goal of each of these models was to qualitatively represent coexpression patterns among genes.
Our goal is a broader modeling paradigm than the previous fuzzy logic applications. More along the lines of our goal are efforts to use qualitative differential equations (QDEs) to model biochemical dynamics in cells. Thus far this work has been primarily used for metabolic pathways. A modeling system that requires time series of metabolite concentrations as input data is described by King, et al. . They used a QDE system to model ten reactions involved in glycolosis. Two distinct modeling tasks are identified: simulation of a biochemical system when model parameters are known and system discovery from time series data. For system discovery, machine learning methods from the soft computing paradigm were used to parameterize the QDEs. Time series of concentrations for proteins of known structure were used as input data to build the model, which required considerable computational time on a Beowulf cluster. One of the hindrances to adoption of this modeling approach is its computational representation that has evolved from the Lisp programming language.
Heuristic reasoning for hierarchical complex systems
The methodology presented in this paper is distinguished from previous qualitative modeling approaches in several ways. The initial goal was to develop a model that would allow experimental biologists to use the kind of qualitative data found in typical journal articles to describe the interaction of genes, proteins, and other cellular components to create computer models of large numbers of interacting parts. This arose from a practical need in our research to keep track of myriad components in pathway models that were built from data extracted from dozens of journal articles. Biologists already do this kind of mental modeling every time they make a new hypothesis; a tool was needed to aid in this reasoning. Secondly, with new sources of data becoming available, it was important to design a methodology that could be expanded in the future to integrate new data sources to refine models.
Finally, biological processes span many scales. A kind of heuristic modeling is common in the literature, where molecular interactions are analyzed and used to create new hypotheses about cellular events, tissue processes or disease progression. For example, specific gene mutations accelerate tumor growth in specific tissues. This is a semi-quantitative relationship between two very different scales. Fuzzy network modeling can be used as a tool for aiding human reasoning when many interacting variables participate in complex interaction networks on several scales. Though the interactions can sometimes only be described approximately, the logic of the interactions is rigorous.
The modeling approach presented here was developed in an effort to create a computational tool that would emulate and extend the ability of human experts to create reasonable models from incomplete or ambiguous data. The goal was to allow these models to be constructed directly from the information contained in journal articles that typically report experimental results about molecular pathways. This approach adopts fuzzy logic for biochemical reaction modeling and all cellular processes and embeds this in a network structure. The network structure is similar to functional Petri nets [22–24], which are well-suited to representing molecular pathways and gene interactions, as well as physiological system interactions.
Pathway models can be constructed manually by biologists and manipulated to study the dynamics of alternative pathways. However, the power of this method is that it provides a framework for using various soft computing technologies to integrate diverse data sources to improve and refine models. Rule-based or fuzzy logic models [7, 10, 16, 18, 19] are appropriate for manipulation by genetic or other evolutionary algorithms, which may be useful for drug target discovery. This process will be discussed in future papers that expand the basic model presented here. Details about methods for integration of high-throughput data with expert knowledge will also be reserved for future publications. Because the soft computing paradigm has been widely adopted for many engineering tasks, it is hoped that the framework presented here can be adopted and rapidly expanded by many researchers with expertise in these methods. Input files and code for all examples presented are available at the Bionet website .
The most common childhood brain cancer is medulloblastoma (MB), a devastating disease that can result in permanent serious brain damage even when successfully treated [26–28]. The etiology of MB is believed to involve aberrant activation of Sonic hedgehog (Shh) signalling [27–31]. Potential therapeutic targets are therefore regulators of the hedgehog pathway [32–34]. Although much remains to be learned about the transcriptional hierarchy involved in hedgehog activation, enough experimental data is available to construct pathway models that are quite complex. Hedgehog regulation of medulloblastomas will be used as an example to illustrate our modeling approach.
Basic fuzzy network modeling concepts
We define a dynamic, interacting system by specifying the all of the variables that participate in the system and the processes (reactions) in which each node participates. The participating variables, also called nodes as in graph theory or Petri nets, can represent protein concentrations, gene expression levels, tumor size, temperature or any other variable of interest, including discrete variables.
Reactions or processes define dynamic changes to node values and must involve one or more nodes as reactants. Though the language used leans toward biochemical reactions, a useful feature of this rule-based model is the ability to connect dynamics on multiple scales. Nodes can represent any quantity for which rules of change can be defined. A reaction is defined by a reaction rate constant and a list of participating nodes together with their role in the reaction. The rate constant is a scaling factor that multiplies the reaction rate that is determined by changing reactant values. Reactions can also be defined as discrete events. Reaction rate has no meaning in this case; the event happens when the reactants satisfy the rule for firing and nothing happens when those conditions are not satisfied. This is useful for modeling processes such as chromosome separation or cytokinesis. Discrete events may also be a useful model for representing complex processes like metastasis as a single event that follows a long series of cellular reactions and sets into motion other processes.
For each reaction shown, the reactants have default rules based on their role as substrate, product, activator or inhibitor. These default rules can be overridden by giving new rules explicitly, as will be described in the methods section or may be optimized using automatic methods to integrate other data sources. The latter will not be discussed in detail here. A diagram of a system as in figure 1 is used to construct a computer model by first listing all of the participating nodes. Each reaction is then listed, with a reaction rate. Substrates are variables that are used up in a reaction; products are produced. Activators and inhibitors participate in a reaction, but their value doesn't change. The default rules for each of these roles are as follows:
When a substrate is absent, the reaction rate is zero. As the concentration of substrate increases, the reaction rate increases proportionately. Thus, if the substrate level is low, its contribution to the reaction rate is low. The reaction rate, multiplied by the rate scaling factor or rate constant, determines how fast the substrate is used up in that reaction. By default, substrates have a stoichiometry of -1. This can be changed manually as needed.
A product is produced in a reaction. It does not affect the reaction rate, except that when the product concentration reaches its highest allowable amount the reaction rate goes to zero. No more can be produced. The default stoichiometry is +1.
Activators have the same default rules as a substrate: as the concentration increases, the reaction rate contribution increases. Activator concentration is not changed by a reaction in which it participates as an activator, it only influences the rate. Activators and inhibitors have a stoichiometry of 0, indicating that the reaction does not change their concentration.
An inhibitor causes a reaction rate to slow as its concentration increases. When absent, an inhibitor has no effect on a reaction rate, which is then wholly determined by the concentrations of substrates and activators. As the amount of inhibitor increases, the reaction rate slows. The concentration at which an activator or inhibitor strongly affects a reaction rate can be set when known. By default, the effect is proportionate to the enzyme concentration.
In each reaction, the contributions of all participating reactants are averaged using one of several averaging schemes. We found that either a harmonic average or minimum rule works best. Both of these have the property that if the rule for any reactant makes the rate zero, the rate will be zero. Thus, if a reaction includes two substrates and an activator where the concentration of one substrate is zero, but the activator and the other substrate concentration are high, the output rule is zero: the reaction rate should be zero because one of the substrates is absent.
In rule-based simulations the meaning of "low" and "high" are determined by the dynamic range or universe of discourse for each variable. In the hedgehog model, most variables have a concentration range of [0.0, 1.0] for simplicity in illustrating the dynamics of particular motifs. However, the dynamic range of Shh is defined to be [0.001, 0.1] in this model. This was set to illustrate that variables can be assigned values in any desired range. Rules are defined based on the meaning of "low" and "high" within the context of the appropriate range for a given variable. Thus, a concentration of 0.1 is high for Shh in this model, while 0.1 is low for Gli1. In addition, the meaning of "low" and "high" for a given variable can be different in different reactions. Thus, in reactions where Gli1 is a product, such as a transcription reaction, low, medium and high on a scale of 0 to 1 might be represented by 0.4, 0.6 and 0.8. In a catalytic reaction where Gli1 is a potent non-stoichiometric activator, a concentration of 0.001 might be "high" in that it causes the reaction to achieve its maximum rate. This information can be put into the model rules if available. If not, model simulations can still be run using whatever information is available.
Positive and negative feedback
Feedback motifs are common in biological systems [39–43]. Negative feedback loops are essential for maintaining equilibrium and resisting change while positive feedback structures amplify signals to enable switch-like behaviour . Gli1 is a transcription factor that promotes transcription of Ptc1 [28, 30, 33, 44]. Figure 2 highlights the Gli1 feedback loop in the hedgehog pathway. NmycP and the transforming event required for tumor growth are constitutively "on" in figure 2 simulations in order to isolate the effects of Gli1 feedback on Ptc1 and tumor growth. When Shh signalling is active, Ptc1 is repressed, causing increased Gli1 expression. When the Gli1 feedback loop is active, as in figure 2.a, the system is self-regulating and tends toward an equilibrium level of Ptc1, Smo and Gli1, as increased Ptc1 expression tends to upregulate Gli1 even in the presence of Shh . When the feedback loop is broken, as in figure 2.b, Shh signalling causes rapid suppression of Ptc1 to low levels and growth of Smo and Gli1 to high levels. Disconnecting the feedback loop is accomplished in the model by simply removing Gli1 as a reactant from the production reaction for Ptc1 (rule 3 in figure 1).
The feedforward system in figure 3 offers a potential explanation for the occurrence of medulloblastoma in haploinsufficient Ptc+/- mice. Approximately 14–20% of heterozygous Ptc+/- mice develop medulloblastoma tumors, whereas these tumors are rare in Ptc+/+ mice [33, 34]. Lower production of Ptc1 in heterozygous mice may make the probability that a perturbation to the hedgehog pathway will result in decreased Ptc1 expression for a long enough time to allow tumor growth to initiate. Transient Shh upregulation, as seen in the simulation, is not sufficient to allow sufficient Gli1 increase if Ptc1 production is high.
Single Input Motif (SIM)
Chemical reaction kinetics
Although analytical equations are known for basic reaction kinetics and approximations for enzyme-catalyzed reactions, it is desirable to use simple rules to describe such reactions when the goal is to integrate these with other processes for which analytical equations are not known. A number of software packages are available for modeling reaction dynamics with differential equations and can be found, for example, on the systems biology workbench site . Generic numerical software that might also be suitable can be found in a number of places, including the netlib numerical software library . Rule-based representations of chemical reactions are relatively simple to write and easily modified, which is useful when exploring the system effects of different possible reactions through simulation.
Because of the universal approximation properties of fuzzy logic models, they are able to represent any nonlinear process as accurately as desired, given sufficient data. It may be easier to make very precise chemical reaction models with differential equations, but robust system models that integrate many levels, from approximate cellular chemistry to clinical data, may be more useful for some medical research and more easily constructed with linguistic rules by biologists and clinicians without numerical modeling experience.
Rules to determine reaction rates from substrate concentrations can be determined in a variety of ways. The mathematical details for fuzzy logic rules most appropriate for modeling chemical kinetics remain to be derived. For modeling biological systems, simplicity with reasonable accuracy is a high priority. A second order reaction and an enzyme catalyzed reaction, both involving activation of Smo by Ptc1, are implemented with fuzzy rules and examined here.
An essential part of hedgehog pathway activation involves release of Ptc1 repression of Smo. Smo activates downstream transcription factors of the Gli family, which in turn upregulate cell cycle components [31, 33, 38, 49]. One model suggests that Smo is in a balance between active and inactive states. Smo is activated when it combines stoichiometrically with a small molecule (sm) that changes its conformation. The small molecule must be actively transported across the cell membrane by Ptc1, which acts catalytically.
The model in figure 5 is simulated with the following rules, with only three essential reactions. For each reaction, take the harmonic mean of substrate concentrations (zero/v. low/low/med/high/v. high); then reaction rate is (zero/v. low/low/med/high/v. high). The three reactions are: Reaction 1: Smo_inactive and sm are substrates and active Smo is the product. The rate coefficient is 1.0e-3 s-1. Reaction 2: the reverse reaction, has one substrate, activated Smo-sm complex, and two products, with the rate coefficient 1.0e-3 s-1. Reaction 3: The sm is transported across the membrane at a rate much faster than it is used by Smo and is continually supplied at a constant rate that is catalyzed by Ptc1.
It is important to emphasize that the goal here is not just to show that fuzzy rules can be found to match differential equation models of chemical kinetics. Rather, a model based on intuitively reasonable rules that describe the reaction qualitatively produce reaction dynamics that are very close to dynamics derived from chemical kinetics using rigorous mathematical derivations.
Oscillators, switches and discrete events in the cell cycle
Cell cycle machinery plays a central role in cell proliferation, apoptosis and cell fate determination. Failure to exit the cell cycle at the correct time may cause abnormal development and aberrant re-entry into the cell cycle by terminally differentiated cells may be a primary cause of many cancers. Understanding regulation of the cell cycle is therefore a central issue in the application of molecular biology to medicine. For that reason, a number of differential equation models of the cell cycle in yeast have been constructed for research purposes [51–54]. Regulation of the mammalian cell cycle is more complex than that of yeast, though many homologous genes and proteins and common network motifs are involved.
A model of the mammalian cell cycle was constructed with several important components. Chromosome separation and cell division were included as discrete events that occur when certain conditions are met. The cell cycle is an oscillatory system that involves not only continuous processes, but a series of discrete events . Discrete events are an important mechanism in cells for controlling the precise timing of key events and insuring irreversibility with imprecise components [56, 57]. Thus, even though many cell cycle parameters are poorly constrained, the overall behaviour of the cell cycle system is well characterized and heuristic modeling approaches are suitable for studying the control dynamics or testing possible ideas for new drug targets .
Tumor growth is linked to this cell cycle via cell division. Tumor size in this case is a measure of the number of cell divisions that have occurred. If the cell cycle arrests, tumor size shrinks slowly through a continuous apoptotic reaction. Rules for this cell cycle model are as follows:
1. cyclinD1 production is continuous
2. cyclinD1 decay is proportional to cyclinD1 and APC levels and is rapid
3. cyclinB production is proportional to cyclinD1 level
4. cyclinB decay is zero when APC is less than high, is low when APC is high, and is very high when APC is very high.
5. Cdc14 production is proportional to cyclinB levels
6. Cdc14 decay is continuous and proportional to Cdc14 concentration
7. APC production is proportional to Cdc14 level, but is zero when Cdc14 is very low and low.
8. APC decay is directly proportional to cyclinB, cell mass and entry into G1SG2 and determined by the harmonic average of these.
9. Chromosome separation event occurs when cyclinD1 is medium or higher and G1SG2 is in the on state (G1SG2 is a discrete event, either on or off). G1SG2 turns off when chromosome separation occurs.
10. Cell division occurs when cyclinB is low or less and decreasing at any rate; chromosome separation event must also have occurred. G1SG2 turns on when cell division occurs and chromosome separation turns off.
11. Cell division turns off when cell mass is very low or less and cell division is currently on.
12. Cell mass growth is constant. When cell division occurs (a single event), cell mass is halved. Cell mass has a range of values between one and two.
Modeling and drug target discovery
Computational models are potentially very useful for exploring the effects of manipulating complex pathways through drugs or other means. Rule based models of two features that make them promising tools for drug target discovery. First, it is easy for experts to use their domain knowledge to build models. Rules can be changed easily without mathematical manipulations to carry out thought experiments with many hundreds or thousands of components. Secondly, algorithms for automatically manipulating fuzzy rules have been developed for a variety of engineering applications and include genetic algorithms and neural nets. It may be possible for experts to build first-stage models of pathways and then apply computational methods to optimize the models using other data sources, or to search for optimal drug targets.
To illustrate the use of the model for carrying out computational experiments a rule-based model was constructed using elements of the previously presented pathways. Our goal in this example is to explore the effects of a drug, HhAntag, that reportedly interferes with Gli1 activation of cyclinD1, thereby stopping progression of the cell cycle . HhAntag was introduced as a new component to the model in figure 1. HhAntag inhibits both Gli1 and, to a lesser extent, Gli2 expression downstream of Smo.
It should be emphasized here that the hedgehog pathway is not well understood and the model presented here is in preliminary stages. Nevertheless, it illustrates important modeling ideas. The next step is to replace the tumor growth module with a cell cycle model. First, we develop and test a preliminary cell cycle model using rules. This will then be connected to the hedgehog pathway.
Once started, the cycle will continue as long as the Shh pathway remains active. Mitotic degradation of Nmyc permits neuronal precursor cell cycle exit in the absence of Shh signalling or in the case of an intrinsic program-directed shift toward differentiation. Thus neural fate specification and cancer activation are regulated by the same machinery.
In figure 10.b simulation time courses show the effect of decreasing insulin growth factor (IGF) on the cell cycle. In this figure, IGF is decreased beginning on day 6, allowing GSK-3β to increase. Nmyc-P is then phosphorylated and degraded. Note that Nmyc-P initially decreases, and then recovers for one cell cycle through complicated feedbacks. Incorporating as many of these complicated interactions into a model is essential for identifying potential drug targets, as cells are remarkably robust and have built-in fault tolerance systems that may not be evident from examination of static network or interaction diagrams [56, 62, 63]. The effects of drug dose will be the same for this model as in figure 9, as the primary action of the drug on Gli2 has not been changed. However, adding details to the cell cycle now allows more detailed investigation of the interaction between the drug, its targets, and cell cycle components.
An important goal of computational modeling of biochemical networks is to identify potential drug targets in silico. Computational tools that can enhance the ability of researchers to reason through the complex dynamics of pathways of interest may be useful for researchers trying to keep track of dozens or hundreds of interacting network components, while automatic methods for manipulating pathway models to search for drug targets have the potential to revolutionize the drug development process .
Integrating the rapidly growing quantities of high throughput data into meaningful pathway information is a bottleneck to computational drug screening and discovery . Although informatics tools are currently being used to identify potential pathway interactions and putative targets, target identification necessarily requires detailed understanding of the structure of the pathway and the role of specific genes and proteins. This information is generally available in journal articles as linguistic pathway descriptions or interactions derived from hypothesis-driven experiments .
A major advantage of the modeling approach presented here is that it enables the incorporation of biological expertise into the modeling process. At the same time, it does not prevent integration of multiple high throughput data sources into the model. New evolutionary algorithms based on statistical estimation, called estimation of distribution algorithms [15, 67], enable Bayesian probability distributions to be used as input data to optimize system parameters. Such statistical data is commonly available from bioinformatics analyses of microarray expression data ([68–70]) or proteomics data [71, 72]. The framework presented by the fuzzy logic model presented here thus allows representation of expert knowledge about a pathway, and integration of other sources of numerical data with that data. Further research into efficient algorithms for integration of data from many sources is an important area for further research.
Drug discovery is a difficult challenge for computational and systems biology. Finding a "clever way to throw away the details may be the most important part of model building" . Intelligent hybrid systems are an important method for modeling dynamic networks and complex biological processes with linguistic or qualitative-logical data, while allowing for more accurate models as more data becomes available.
The initial motivation of this modeling approach was to shift the burden of modeling to biological description and biological data. We deliberately sought to develop a modeling paradigm that imitates the logical reasoning that biologists use when analyzing a complex pathway diagram to understand its function and develop new hypotheses to explain observations.
Another motivation for this work, which will be developed and explicated in future publications, was to find a quantitative representation of biological systems that was simple enough to be manipulated by genetic algorithms as a possible approach to computational drug target discovery and for integration of multiple data sources. Such a program will likely be similar to current engineering approaches to fuzzy system discovery [3, 4, 9, 10] and will be a subject for future research.
Fuzzy logic (FL) provides a simple way to arrive at a definite conclusion based upon vague, ambiguous, imprecise, noisy, or missing input information. FL's approach to control problems mimics human reasoning, only much faster. A FL biochemical model requires some numerical parameters in order to operate, such as initial values and rate coefficients, but exact values of these numbers are usually not critical. Since dynamic rules are defined in terms of fuzzy quantities, the logic of the dynamics can be prescribed without precise numerical values.
Bionet implementation of fuzzy network models
A potential drawback of fuzzy logic models is the rapid growth of the rule table when many inputs are involved. Biological networks typically involve hundreds or thousands of interacting variables, so this is a serious concern. If the number of fuzzy sets used for each variable is F, then the number of rules required to define rules for all possible inputs is FN, where N is the number of variables. At least four fuzzy sets are needed; even a few hundred variables causes the number of rules needed to specify a dynamic model to become huge.
Fortunately, a several simplifications are possible when modeling biological networks that greatly reduce the size of the rule base, even when hundreds or thousands of variables are involved. Most importantly, the number of variables involved in any reaction or process is relatively small. Typically only a few species are involved in a reaction; the vast majority of proteins or genes do not interact appreciably. Thus, a biological system can be modelled as many networked reactions or processes.
In general, the number of rules needed for each reaction is equal to the product of the number of fuzzy sets for each component involved in the reaction. If each reactant has six fuzzy sets or states and there are four reactants in a process, then there are 64 = 1296 rules that must be specified. Computations are simplified further by noting that any real number belongs to at most two fuzzy sets. Thus the number of rules that must be evaluated at each time step is 24 = 16 when there are four reactants. Some heuristics can be used to automatically compute the rules. In general, reactions are limited by the minimum rate for any one of the reactants. If a substrate for a reaction has concentration zero, for example, that will limit the rate of reaction regardless of how much of any of the other reactants is present. Similarly, if a strong inhibitor is present at its highest concentration and it shuts down the reaction completely, then it does not matter what the other concentrations are. With similar heuristics, it is sufficient to specify the effect of each component individually on the reaction. Rules for combining reactants are then determined automatically by applying appropriate averaging heuristics. Example simulations below illustrate the input information required.
All simulations in this paper used six fuzzy sets, corresponding to the linguistic variables zero, very low, low, medium, high and highest. The centroid locations for each of these sets, a scale of 0.0 to 1.0, are 0.0, 0.2, 0.4, 0.6, 0.8, and 1.0. When the universe of discourse is other than 0 to 1, centroids are the endpoints and fractional increments of 0.2 define the other centroids. We found that six fuzzy sets were adequate for all simulation requirements presented here. The discretization error in numerical differential equations can be reduced by using a finer grid or by using a variable grid that puts finer grid points where greater accuracy is needed. The standard additive model can achieve greater flexibility for approximating functions in a similar manner, either by increasing the number of fuzzy sets or by moving the centroid locations. The latter was illustrated earlier with the example of catalytic activation of Smo by Ptc1. The centroids for very low, low and medium were defined by 0.0012, 0.0022 and 0.0032, respectively, while leaving the other centroids unchanged. In Bionet, default centroids can be redefined by simply listing the centroid values after the rules for a variable in a reaction. Note that the meaning of a linguistic variable such as "very low" is reaction dependent. When a protein is a stoichiometric substrate in a reaction, the default centroids might be appropriate; while for the same variable acting as a catalyst the meaning of very low might be quite different.
Input information for a Bionet model consists of a list of variables, also called nodes, along with initial values. Each reaction between variables is a fuzzy rule-based inference. A reaction consists of one or more variables and a reaction rate constant. The rate constant scales the reaction rate that is determined as an output from the fuzzy inference and is 1.0 by default. Logical pathway models can be built to test the logic of interactions even if little is known about rate constants. If all reactions are defined to have only two output states, highest and zero, or on and off, the entire network is equivalent to a dynamic Boolean network [74, 75].
Variables play different roles in different reactions. A variable may be a product in a synthesis reaction. In other reactions it may be an inhibitor or activator – or even both depending on its concentration. For simplicity, default rules are defined for four basic roles: substrate, product, activator and inhibitor. Stoichiometry determines how quickly the reactant is produced or used up; activators and inhibitors have stoichiometry of zero in a reaction.
The essence of model dynamics is the fuzzy inference that maps fuzzy input variables to fuzzy output variables. Conversion of precise real values to fuzzy or linguistic values is accomplished by fuzzification. This is illustrated in figure 11. The domain or universe of discourse for a variable can be defined by any real numbers. The minimum and maximum values are constants for the whole simulation. The default range is 0 to 1, since in many cases we are interested in network dynamics with normalized values. However, specifying a different range is trivial, requiring only that the range be specified when the node and its initial value are declared. Most importantly, reaction dynamics are defined in terms of fuzzy sets, not the real values that are attached to those sets by the centroid values. In figure 11.a the domain is divided into six fuzzy sets of equal size. This is the default, though it can also be changed by listing the centroid values with the node declaration. Although the endpoints are fixed, the placement of intermediate centroids may be context dependent and differ with each reaction. This reflects the fact that the meaning of linguistic terms such as "low" may depend on whether a particular molecule is a product or activating enzyme, for example. An example of fuzzification of the real value 0.42 is: 0.42 = 0.9 (0.4) + 0.1 (0.6) = 0.9 low + 0.1 medium.
For wider dynamic range, a logarithmic scale has also been implemented. As illustrated in figure 11.b, triangular membership functions are linear on the log of the values. For example, if the range is [10-3, 102], then 0.0316 = sqrt(0.001) is fuzzified by linearly combining logarithms of the fuzzy set values. Log(0.0316) = -1.5 = 0.5 log(10-2) + 0.5 log(101) = 0.5 very low + 0.5 low. Logarithmic scales are useful when concentrations vary over a large range and changes in biochemical activity require concentration changes over many orders of magnitude. The rate at which the actual (real-valued) concentration changes in each reaction is still determined by the reaction rate. We emphasize again that the reaction rules are still defined in terms of the effect of fuzzy values (low, medium, and so on) on the reaction rate. The fuzzy set definitions, whether linear or logarithmic, provide meaning to the linguistic fuzzy sets in the context of particular reactions.
Node Shh 0.1 log 0.001 0.1
Node Ptc1 0.5
The keyword 'Node' identifies this as a node definition. Initial values 0.0 by default if not given explicitly. Similarly, the domain or universe of discourse is [0,1] if not given. In the above, Shh has an initial value of 0.1 and a domain of [0.001, 0.1] on a logarithmic scale. This was done primarily for illustration, since in this simulation Shh was set to a constant value and maintained. The initial value for Ptc1 is 0.5; all others have initial value 0.0. The meaning of "low" or "high" for each variable is context dependent and is defined for each reaction that a variable participates in.
The model has 8 reactions, which are essentially fuzzy inference engines that map the fuzzy input variables to a fuzzy output rate. The network of reactions is illustrated in figure 12. Reactions rates are computed and fire on each time step. The rate is scaled by the rate constant to give a reaction rate at this time step. Each reaction consists of a reaction name, a rate coefficient, and a list of participating reactants. The reactants can be used up (substrates, stoichiometry < 0), produced (products, stoichiometry > 0) or participate without changing concentration (activator or inhibitor, stoichiometry = 0). The reactions for Smo production and decay, Reactions 2 and 7, are shown here:
Reaction Smo_production 0.002
inh Ptc1 5 2 1 1 0 0
Reaction Smo_decay 0.002
The keyword 'Reaction' defines a reaction definition. The reaction name follows the keyword. The number after the reaction name is the reaction rate constant. If not given the default value is 1.0. The second reaction shown here, Smo_decay, has a single reactant, Smo, which is a substrate. The 'sub' keyword implies the following default values:
Reaction Smo_decay 0.002
sub Smo 0 1 2 3 4 5 stoi -1 centroids 0.0 0.2 0.4 0.6 0.8 1.0
The first six numbers that follow 'sub Smo' are the rules that define how the reactant Smo contributes to the rate of this reaction. The six integers are the output rate for each fuzzy level of Smo. These are a shorthand for the linguistic rules: "when Smo is zero, the rate is 0; when Smo is very low, the rate is 1 or very low; when Smo is low, the rate is 2 or low; when Smo is medium the rate is 3 or medium; when Smo is high, the rate is 4 or high; when Smo is highest, the rate is 5 or highest. If more information is available to give more accurate rules or centroids, they can be written here. More likely, these values will be set by automatic optimization methods when high throughput data is used to refine models that are initially built manually by experts. We have implemented a simple genetic algorithm that manipulates the rules and centroids. Incorporation of procedures for specific data sources remain for future research, but the model is designed for these enhancements.
The Smo_production reaction fires on every time step. The default rules for a product, Smo in this case, are 5 5 5 5 5 0. This means that the output rate is 5 or "highest" when the Smo concentration is zero through high (0 to 4). If Smo reaches the highest concentration, 5, it cannot go higher because that would exceed its defined limit. The rate goes to zero at that point. The inhibitor, Ptc1, has rules 5 2 1 1 0 0, which means that when Ptc1 concentration is high or highest, the rate is zero. Similarly, when Ptc1 is zero, the rate is 5 (unaffected), when Ptc1 is very low, the rate tends to low (2), when Ptc1 is low or medium, the rate is very low (1).
The actual reaction rate is computed as follows: the real concentration of each participating reactant is fuzzified. The rate is a multi-dimensional function of the number of reactants. Following the usual Standard Additive Model, the rate is found by adding fractional membership of each variable to find the total contribution:
If on a particular time step Smo is 0.55 = 0.25 low and 0.75 medium, while Ptc1 is 0.4 high and 0.6 highest, then equation (1) becomes
0.25 * 0.4 * HA(5,1) + 0.75 * 0.4 * HA(5,0) + 0.25 * 0.6 * HA(0,1) + 0.75 * 0.6 * HA(0,0)
= 0.1 * 1.7 + 0.3 * 0 + 0.15 * 0 + 0.45 * 0
All rates are computed on a 0 to 1 scale, and then scaled by the rate coefficient. The final rate for this reaction step is thus 3.4e-4. The value of Smo on the next time step will be 0.55034.
The list of variables and the definition of all reactions is the primary input to define a Bionet model. Start and end times for the simulation and the number of time steps are also needed as input. At each time step, all reactions take place and variables are updated. Bionet simulations are similar to explicit time-stepping schemes in numerical approximations to differential equations. Bionet implements the Standard Additive Model or SAM for each fuzzy reaction . This implementation guarantees the universal approximator property of the model.
Time limits and time step size must be specified as for any dynamic model. This method uses explicit time stepping, so the step size is controlled by the fastest reaction. The condition rate*Δt < Δxmax must hold for the fastest reaction, where xmax is either the difference between the highest and next-to-highest centroid for any variable that is a product substrate in that reaction, or xmax is the difference between the next-to-lowest and lowest centroid for any substrate in that reaction. For all simulations in this paper, the time step was 1 minute. Most reaction rates were on the order of 10-3 minutes, but the cyclin decay rates were faster (1.0 min-1 for cyclin D).
Reaction order is asynchronous to prevent any bias in the simulation. The order of reactions is randomly chosen at every time step. Furthermore, if desired, a probability of firing may be assigned to a reaction. In this case, reactions will occur with some probability on each of the time steps. Models that include stochastic reactions can represent stochastic processes in cells, such as stochastic gene expression  or a probability of tumor formation. Whether or not this is an appropriate or useful model for stochastic biological processes remains to be determined and likely will depend on the particular phenomenon of interest. The flexibility of this approach allows for a variety of modifications to be tried in future research.
Software availability and requirements
Simulation software, source code and input files containing rules for all examples discussed in this paper are available for download on the Bionet web site.
Project name: Bionet
Project home page: http://www.chip.org/~wbosl/Software/Bionet/bionet.html
Operating systems: Platform independent; GUI version for MS-Windows only.
Programming language: java (Java 2 runtime environment)
License: Use of this software is governed by the Gnu General Public License .
This work was supported by NIH grants K01HG003418-02 and GM059964-06S1. The author is especially thankful for laboratory support and encouragement from Isaac Kohane, director of the Children's Hospital Boston Informatics program at Harvard-MIT Division of Health Sciences. Special thanks to collaborator and friend Rong Li at the Stowers Institute for Medical Research for initially motivating this work and for many helpful discussions of cell cycle machinery, modeling, and systems biology.
- Bornholdt S: Systems biology. Less is more in modeling large genetic networks. Science. 2005, 310 (5747): 449-451.PubMedView ArticleGoogle Scholar
- Davidson TJ, Harel S, Arboleda VA, Prunell GF, Shelanski ML, Greene LA, Troy CM: Highly efficient small interfering RNA delivery to primary mammalian neurons induces MicroRNA-like effects before mRNA degradation. J Neurosci. 2004, 24 (45): 10040-10046.PubMedView ArticleGoogle Scholar
- Jang JSR, Sun CT, Mizutani E: Neuro-fuzzy and soft computing : a computational approach to learning and machine intelligence. MATLAB curriculum series. 1997, xxvi, 614-Upper Saddle River, NJ , Prentice HallGoogle Scholar
- Kosko B: Neural networks and fuzzy systems : a dynamical systems approach to machine intelligence. 1992, xxvii, 449-Englewood Ciffs, NJ , Prentice HallGoogle Scholar
- Gerhart J, Kirschner M: Cells, embryos, and evolution : toward a cellular and developmental understanding of phenotypic variation and evolutionary adaptability. 1997, xiii, 642 ,  of plates-Malden, Mass. , Blackwell ScienceGoogle Scholar
- Tsoukalas LH, Uhrig RE: Fuzzy and neural approaches in engineering. Adaptive and learning systems for signal processing, communications, and control. 1997, xix, 587-New York , WileyGoogle Scholar
- Ross TJ: Fuzzy logic with engineering applications. 2004, XXI, 628 s.-Chichester , Wiley, 2ndGoogle Scholar
- Nikravesh M, Zadeh LA, Korotkikh V: Fuzzy Partial Differential Equations and Relational Equations: Reservoir Characterization and Modeling. Studies in Fuzziness & Soft Computing. 2004, Berlin , Springer-VerlagGoogle Scholar
- Negoita MG, Reusch B: Real world applications of computational intelligence. 2005, xvi, 295-Berlin ; New York , SpringerGoogle Scholar
- Rau G, Becker K, Kaufmann R, Zimmermann HJ: Fuzzy logic and control: principal approach and potential applications in medicine. Artif Organs. 1995, 19 (1): 105-112.PubMedView ArticleGoogle Scholar
- Lauer RT, Smith BT, Betz RR: Application of a neuro-fuzzy network for gait event detection using electromyography in the child with cerebral palsy. IEEE Trans Biomed Eng. 2005, 52 (9): 1532-1540.PubMedView ArticleGoogle Scholar
- Shin JW, Yoon JH, Yoon YR: Rate-adaptive pacemaker controlled by motion and respiratory rate using neuro-fuzzy algorithm. Med Biol Eng Comput. 2001, 39 (6): 694-699.PubMedView ArticleGoogle Scholar
- Kwok HF, Linkens DA, Mahfouf M, Mills GH: Rule-base derivation for intensive care ventilator control using ANFIS. Artif Intell Med. 2003, 29 (3): 185-201.PubMedView ArticleGoogle Scholar
- Teodorescu HL, Kandel A, Hall LO: Report of research activities in fuzzy AI and medicine at USF CSE. Artif Intell Med. 2001, 21 (1-3): 177-183.PubMedView ArticleGoogle Scholar
- Larranaga P, Lozano JA: Estimation of distribution algorithms : a new tool for evolutionary computation. Genetic algorithms and evolutionary computation ; 2. 2002, xxxi, 382-Boston , Kluwer Academic PublishersGoogle Scholar
- Kecman V: Learning and soft computing : support vector machines, neural networks, and fuzzy logic models. Complex adaptive systems. 2001, xxxii, 541-Cambridge, Mass. , MIT PressGoogle Scholar
- Wang L: Support vector machines : theory and applications. Studies in fuzziness and soft computing, v 177. 2005, x, 431-Berlin ; New York , SpringerGoogle Scholar
- Woolf PJ, Wang Y: A fuzzy logic approach to analyzing gene expression data. Physiol Genomics. 2000, 3 (1): 9-15.PubMedGoogle Scholar
- Sokhansanj BA, Fitch JP, Quong JN, Quong AA: Linear fuzzy gene network models obtained from microarray data by exhaustive search. BMC Bioinformatics. 2004, 5: 108-PubMed CentralPubMedView ArticleGoogle Scholar
- Linden R, Bhaya A: Evolving fuzzy rules to model gene expression. Biosystems. 2006Google Scholar
- King RD, Garrett SM, Coghill GM: On the use of qualitative reasoning to simulate and identify metabolic pathways. Bioinformatics. 2005, 21 (9): 2017-2026.PubMedView ArticleGoogle Scholar
- Sackmann A, Heiner M, Koch I: Application of Petri net based analysis techniques to signal transduction pathways. BMC Bioinformatics. 2006, 7 (1): 482-PubMed CentralPubMedView ArticleGoogle Scholar
- Doi A, Fujita S, Matsuno H, Nagasaki M, Miyano S: Constructing biological pathway models with hybrid functional Petri nets. In Silico Biol. 2004, 4 (3): 271-291.PubMedGoogle Scholar
- Hardy S, Robillard PN: Modeling and simulation of molecular biology systems using petri nets: modeling goals of various approaches. J Bioinform Comput Biol. 2004, 2 (4): 595-613.PubMedView ArticleGoogle Scholar
- Bionet. http://www.chip.org/~wbosl/Software/Bionet/bionet.html
- Pomeroy SL, Tamayo P, Gaasenbeek M, Sturla LM, Angelo M, McLaughlin ME, Kim JY, Goumnerova LC, Black PM, Lau C, Allen JC, Zagzag D, Olson JM, Curran T, Wetmore C, Biegel JA, Poggio T, Mukherjee S, Rifkin R, Califano A, Stolovitzky G, Louis DN, Mesirov JP, Lander ES, Golub TR: Prediction of central nervous system embryonal tumour outcome based on gene expression. Nature. 2002, 415 (6870): 436-442.PubMedView ArticleGoogle Scholar
- Rubin JB, Rowitch DH: Medulloblastoma: a problem of developmental biology. Cancer Cell. 2002, 2 (1): 7-8.PubMedView ArticleGoogle Scholar
- Wechsler-Reya R, Scott MP: The developmental biology of brain tumors. Annu Rev Neurosci. 2001, 24: 385-428.PubMedView ArticleGoogle Scholar
- Cohen MM: The hedgehog signaling network. Am J Med Genet A. 2003, 123 (1): 5-28.View ArticleGoogle Scholar
- Kenney AM, Rowitch DH: Sonic hedgehog promotes G(1) cyclin expression and sustained cell cycle progression in mammalian neuronal precursors. Mol Cell Biol. 2000, 20 (23): 9055-9067.PubMed CentralPubMedView ArticleGoogle Scholar
- Kim JY, Nelson AL, Algon SA, Graves O, Sturla LM, Goumnerova LC, Rowitch DH, Segal RA, Pomeroy SL: Medulloblastoma tumorigenesis diverges from cerebellar granule cell differentiation in patched heterozygous mice. Dev Biol. 2003, 263 (1): 50-66.PubMedView ArticleGoogle Scholar
- Berman DM, Karhadkar SS, Hallahan AR, Pritchard JI, Eberhart CG, Watkins DN, Chen JK, Cooper MK, Taipale J, Olson JM, Beachy PA: Medulloblastoma growth inhibition by hedgehog pathway blockade. Science. 2002, 297 (5586): 1559-1561.PubMedView ArticleGoogle Scholar
- Kimura H, Stephen D, Joyner A, Curran T: Gli1 is important for medulloblastoma formation in Ptc1+/- mice. Oncogene. 2005, 24 (25): 4026-4036.PubMedView ArticleGoogle Scholar
- Romer JT, Kimura H, Magdaleno S, Sasai K, Fuller C, Baines H, Connelly M, Stewart CF, Gould S, Rubin LL, Curran T: Suppression of the Shh pathway using a small molecule inhibitor eliminates medulloblastoma in Ptc1(+/-)p53(-/-) mice. Cancer Cell. 2004, 6 (3): 229-240.PubMedView ArticleGoogle Scholar
- Corrales JD, Rocco GL, Blaess S, Guo Q, Joyner AL: Spatial pattern of sonic hedgehog signaling through Gli genes during cerebellum development. Development. 2004, 131 (22): 5581-5590.PubMedView ArticleGoogle Scholar
- Chen JK, Taipale J, Cooper MK, Beachy PA: Inhibition of Hedgehog signaling by direct binding of cyclopamine to Smoothened. Genes Dev. 2002, 16 (21): 2743-2748.PubMed CentralPubMedView ArticleGoogle Scholar
- Chen JK, Taipale J, Young KE, Maiti T, Beachy PA: Small molecule modulation of Smoothened activity. Proc Natl Acad Sci U S A. 2002, 99 (22): 14071-14076.PubMed CentralPubMedView ArticleGoogle Scholar
- Sotelo C: Cellular and genetic regulation of the development of the cerebellar system. Prog Neurobiol. 2004, 72 (5): 295-339.PubMedView ArticleGoogle Scholar
- Brandman O, Ferrell JE, Li R, Meyer T: Interlinked fast and slow positive feedback loops drive reliable cell decisions. Science. 2005, 310 (5747): 496-498.PubMed CentralPubMedView ArticleGoogle Scholar
- Cinquin O, Demongeot J: Roles of positive and negative feedback in biological systems. C R Biol. 2002, 325 (11): 1085-1095.PubMedView ArticleGoogle Scholar
- Milo R, Itzkovitz S, Kashtan N, Levitt R, Shen-Orr S, Ayzenshtat I, Sheffer M, Alon U: Superfamilies of evolved and designed networks. Science. 2004, 303 (5663): 1538-1542.PubMedView ArticleGoogle Scholar
- Yeger-Lotem E, Sattath S, Kashtan N, Itzkovitz S, Milo R, Pinter RY, Alon U, Margalit H: Network motifs in integrated cellular networks of transcription-regulation and protein-protein interaction. Proc Natl Acad Sci U S A. 2004, 101 (16): 5934-5939.PubMed CentralPubMedView ArticleGoogle Scholar
- Kern JK: The possible role of the cerebellum in autism/PDD: disruption of a multisensory feedback loop. Med Hypotheses. 2002, 59 (3): 255-260.PubMedView ArticleGoogle Scholar
- Sjostrom SK, Finn G, Hahn WC, Rowitch DH, Kenney AM: The Cdk1 complex plays a prime role in regulating N-myc phosphorylation and turnover in neural precursors. Dev Cell. 2005, 9 (3): 327-338.PubMedView ArticleGoogle Scholar
- Shen-Orr SS, Milo R, Mangan S, Alon U: Network motifs in the transcriptional regulation network of Escherichia coli. Nat Genet. 2002, 31 (1): 64-68.PubMedView ArticleGoogle Scholar
- Milo R, Shen-Orr S, Itzkovitz S, Kashtan N, Chklovskii D, Alon U: Network motifs: simple building blocks of complex networks. Science. 2002, 298 (5594): 824-827.PubMedView ArticleGoogle Scholar
- Systems Biology Workbench. http://sbw.kgi.edu
- Netlib Numerical Software Library. http://www.netlib.org
- Rowitch DH, B SJ, Lee SM, Flax JD, Snyder EY, McMahon AP: Sonic hedgehog regulates proliferation and inhibits differentiation of CNS precursor cells. J Neurosci. 1999, 19 (20): 8954-8965.PubMedGoogle Scholar
- Taipale J, Cooper MK, Maiti T, Beachy PA: Patched acts catalytically to suppress the activity of Smoothened. Nature. 2002, 418 (6900): 892-897.PubMedView ArticleGoogle Scholar
- Zwolak JW, Tyson JJ, Watson LT: Parameter estimation for a mathematical model of the cell cycle in frog eggs. J Comput Biol. 2005, 12 (1): 48-63.PubMedView ArticleGoogle Scholar
- Sveiczer A, Tyson JJ, Novak B: Modelling the fission yeast cell cycle. Brief Funct Genomic Proteomic. 2004, 2 (4): 298-307.PubMedView ArticleGoogle Scholar
- Ciliberto A, Novak B, Tyson JJ: Mathematical model of the morphogenesis checkpoint in budding yeast. J Cell Biol. 2003, 163 (6): 1243-1254.PubMed CentralPubMedView ArticleGoogle Scholar
- Tyson JJ, Novak B: Regulation of the eukaryotic cell cycle: molecular antagonism, hysteresis, and irreversible transitions. J Theor Biol. 2001, 210 (2): 249-263.PubMedView ArticleGoogle Scholar
- Murray AW: Recycling the cell cycle: cyclins revisited. Cell. 2004, 116 (2): 221-234.PubMedView ArticleGoogle Scholar
- Bosl WJ, Li R: Mitotic-exit control as an evolved complex system. Cell. 2005, 121 (3): 325-333.PubMedView ArticleGoogle Scholar
- Georgi AB, Stukenberg PT, Kirschner MW: Timing of events in mitosis. Curr Biol. 2002, 12 (2): 105-114.PubMedView ArticleGoogle Scholar
- Ingolia NT, Murray AW: The ups and downs of modeling the cell cycle. Curr Biol. 2004, 14 (18): R771-7.PubMedView ArticleGoogle Scholar
- Fall CP: Computational cell biology. Interdisciplinary applied mathematics ; v 20. 2002, xx, 468-New York , SpringerGoogle Scholar
- Alberts B: Molecular biology of the cell. 2002, xxxiv, 1463, -New York , Garland Science, 4thGoogle Scholar
- Knoepfler PS, Kenney AM: Neural precursor cycling at sonic speed: N-Myc pedals, GSK-3 brakes. Cell Cycle. 2006, 5 (1): 47-52.PubMedView ArticleGoogle Scholar
- Hornberg JJ, Bruggeman FJ, Westerhoff HV, Lankelma J: Cancer: a Systems Biology disease. Biosystems. 2006, 83 (2-3): 81-90.PubMedView ArticleGoogle Scholar
- Bar-Yam Y: Unifying Themes in Complex Systems. New England Complex Systems Institute Series on Complexity. 2000, Boulder, CO , Westview PressGoogle Scholar
- Nielsen UB, Schoeberl B: Using computational modeling to drive the development of targeted therapeutics. IDrugs. 2005, 8 (10): 822-826.PubMedGoogle Scholar
- Fischer HP: Towards quantitative biology: Integration of biological information to elucidate disease pathways and to guide drug discovery. Biotechnol Annu Rev. 2005, 11: 1-68.PubMedView ArticleGoogle Scholar
- Jiang Z, Zhou Y: Using bioinformatics for drug target identification from the genome. Am J Pharmacogenomics. 2005, 5 (6): 387-396.PubMedView ArticleGoogle Scholar
- Lozano JA, Larrañaga P, Inza I, Bengoetxea E: Towards a New Evolutionary Computation : Advances in the Estimation of Distribution Algorithms. Studies in Fuzziness and Soft Computing. 2006, 192: Berlin , SPRINGERGoogle Scholar
- Pe'er D: Bayesian network analysis of signaling networks: a primer. Sci STKE. 2005, 2005 (281): pl4-PubMedGoogle Scholar
- Ramoni MF, Sebastiani P, Kohane IS: Cluster analysis of gene expression dynamics. Proc Natl Acad Sci U S A. 2002, 99 (14): 9121-9126.PubMed CentralPubMedView ArticleGoogle Scholar
- Sebastiani P, Yu YH, Ramoni MF: Bayesian machine learning and its potential applications to the genomic study of oral oncology. Adv Dent Res. 2003, 17: 104-108.PubMedView ArticleGoogle Scholar
- Nariai N, Kim S, Imoto S, Miyano S: Using protein-protein interactions for refining gene networks estimated from microarray data by Bayesian networks. Pac Symp Biocomput. 2004, 336-347.Google Scholar
- Jansen R, Yu H, Greenbaum D, Kluger Y, Krogan NJ, Chung S, Emili A, Snyder M, Greenblatt JF, Gerstein M: A Bayesian networks approach for predicting protein-protein interactions from genomic data. Science. 2003, 302 (5644): 449-453.PubMedView ArticleGoogle Scholar
- Girault C, Valk R: Petri nets for systems engineering : a guide to modeling, verification, and applications. 2003, XVI, 607 s.-Berlin , SpringerView ArticleGoogle Scholar
- Soule C: Mathematical approaches to differentiation and gene regulation. C R Biol. 2006, 329 (1): 13-20.PubMedView ArticleGoogle Scholar
- Lagomarsino MC, Jona P, Bassetti B: Logic backbone of a transcription network. Phys Rev Lett. 2005, 95 (15): 158701-View ArticleGoogle Scholar
- Raser JM, O'Shea EK: Noise in gene expression: origins, consequences, and control. Science. 2005, 309 (5743): 2010-2013.PubMed CentralPubMedView ArticleGoogle Scholar
- GNU General Public License.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.