- Methodology article
- Open Access
- Published:

# Gene perturbation and intervention in context-sensitive stochastic Boolean networks

*BMC Systems Biology*
**volume 8**, Article number: 60 (2014)

## Abstract

### Background

In a gene regulatory network (GRN), gene expressions are affected by noise, and stochastic fluctuations exist in the interactions among genes. These stochastic interactions are context dependent, thus it becomes important to consider noise in a context-sensitive manner in a network model. As a logical model, context-sensitive probabilistic Boolean networks (CSPBNs) account for molecular and genetic noise in the temporal context of gene functions. In a CSPBN with *n* genes and *k* contexts, however, a computational complexity of *O*(*nk*
^{2}2^{2n}) (or *O*(*nk* 2^{n})) is required for an accurate (or approximate) computation of the state transition matrix (STM) of the size (2^{n} ∙ *k*) × (2^{n} ∙ *k*) (or 2^{n} × 2^{n}). The evaluation of a steady state distribution (SSD) is more challenging. Recently, stochastic Boolean networks (SBNs) have been proposed as an efficient implementation of an instantaneous PBN.

### Results

The notion of stochastic Boolean networks (SBNs) is extended for the general model of PBNs, i.e., CSPBNs. This yields a novel structure of context-sensitive SBNs (CSSBNs) for modeling the stochasticity in a GRN. A CSSBN enables an efficient simulation of a CSPBN with a complexity of *O*(*nLk* 2^{n}) for computing the state transition matrix, where *L* is a factor related to the required sequence length in CSSBN for achieving a desired accuracy. A time-frame expanded CSSBN can further efficiently simulate the stationary behavior of a CSPBN and allow for a tunable tradeoff between accuracy and efficiency. The CSSBN approach is more efficient than an analytical method and more accurate than an approximate analysis.

### Conclusions

Context-sensitive stochastic Boolean networks (CSSBNs) are proposed as an efficient approach to modeling the effects of gene perturbation and intervention in gene regulatory networks. A CSSBN analysis provides biologically meaningful insights into the oscillatory dynamics of the p53-Mdm2 network in a context-switching environment. It is shown that random gene perturbation has a greater effect on the final distribution of the steady state of a network compared to context switching activities. The CSSBN approach can further predict the steady state distribution of a glioma network under gene intervention. Ultimately, this will help drug discovery and develop effective drug intervention strategies.

## Background

Diverse biological functions are regulated through the interactions among genes, proteins and other molecules in a cell. Gene expressions are however affected by the intrinsic and extrinsic noise in a gene network [1]. A major source of the noise is the stochastic fluctuations in gene regulatory interactions [2]. The genetic interactions are also context dependent, that is, certain regulatory functions are active in some cellular states, but inactive in others [3]. This indicates the necessity to consider noise in a context-sensitive manner in the study of gene regulatory networks (GRNs).

Various methods have been proposed to model GRNs; these include logical models [4], continuous models using differential equations [5, 6] and stochastic models at the single-molecule level [7, 8]. As a classic logical model, Boolean networks (BNs) have been widely used to qualitatively model the interactions among genes [4, 9–11]. Probabilistic Boolean networks (PBNs) have been proposed to consider noise in a BN model [12–14]. In a PBN, the next state of a gene is determined by its current state and a Boolean function. If the Boolean function is randomly selected, a PBN is referred to as an instantaneous PBN [12]. As a general model, a context-sensitive PBN (CSPBN) considers the feature of context dependence in a biological network [15]. In a CSPBN, a context is a combination of Boolean functions and each function determines the next state of a gene. A context remains unchanged until a switching occurs. This switching of contexts, possibly caused by external stimuli, is considered to occur randomly in a network.

The study of PBNs has focused on the analysis of steady state distributions (SSDs) under gene perturbation and intervention. A Markov Chain Monte Carlo method is used in [16] to analyze the long run behavior of a PBN; however, this method generally requires a large number of simulations to reach a steady state, due to the slow convergence typically encountered in a Monte Carlo method [17]. An analysis is performed in [18] for finding the SSD of a PBN through the computation of the state transition matrix (STM). However, the application of an analytical approach is generally limited to small networks due to the exponential increase in the size of an STM with gene numbers. The analysis of CSPBNs presents even a greater challenge due to its significantly increased computational complexity. Analytical expressions have been derived for analyzing CSPBNs [19]; however, this method is only applicable to the steady state analysis of a network with small perturbation and switching probabilities. The method in [20] ignores some BNs with very small probabilities for reducing the size of the STM and thus provides a more efficient but approximate solution for computing the SSD of a CSPBN. In [15, 21, 22], gene intervention is investigated for avoiding undesirable states associated with certain diseases (such as cancer). Due to external stimuli, the STM is changed by external control variables, so desirable states can be obtained with larger probabilities in the SSD. In a context-sensitive network with *n* genes and *k* contexts, however, a (2^{n} ∙ *k*) × (2^{n} ∙ *k*) [23] (or 2^{n} × 2^{n}[20]) matrix is required for an accurate (or approximate) analysis of the SSD; this results in a computational complexity of *O*(*nk*
^{2}2^{2n}) (or *O*(*nk* 2^{n})) for an accurate (or approximate) computation of the STM. Hence, the application of current gene network analysis is limited to those of less than a dozen of genes.

Recently, stochastic Boolean networks (SBNs) have been proposed for an efficient computation of the STM and SSD of an instantaneous PBN [24]. The SBN approach can recover biologically-proven regulatory behaviors, such as the oscillatory dynamics of a simplified p53-Mdm2 network [25] and the dynamic attractors in a T cell immune response network [26]. To further exploit the simplicity of logical models, stochastic multiple-valued networks (SMNs) have been utilized to investigate the dynamics of gene networks with multiple-valued gene states [27]. Asynchronous SBNs have also been developed to investigate the asynchronous state-update behavior of genes [28]. In this paper, the notion of SBNs is extended for the general model of PBNs, i.e., the CSPBNs. This presents a novel structure of context-sensitive SBNs (CSSBNs) for modeling the stochasticity in a context-dependent GRN. In particular, gene perturbation and intervention are considered in CSSBNs. Through an efficient simulation of a CSSBN, the computational complexity in the evaluation of a CSPBN is reduced from *O*(*nk*
^{2}2^{2n}) to *O*(*nLk* 2^{n}) for computing the STM, where *L* is a factor related to the required sequence length in CSSBN for achieving a desired accuracy. The use of non-Bernoulli sequences of random permutations of fixed number of 1’s and 0’s further increases computational efficiency and allows for a tunable tradeoff between accuracy and efficiency. A time-frame expanded CSSBN can further simulate the stationary behavior of a CSPBN and produce results that are more accurate than an approximate analysis, thus making the CSSBN useful in modeling complex context-sensitive GRNs. The CSSBN models are applied to the study of a simplified p53-Mdm2 network and the results using an SSD control policy are reported on the effect of external gene intervention in a glioma network [18, 20].

## Methods

### Context-Sensitive Probabilistic Boolean Networks (CSPBNs)

For a network of *n* genes, a probabilistic Boolean network (PBN) is defined by *G*(*V, F*), where *V* = {*x*
_{1}, *x*
_{2}, …, *x*
_{
n
}}, a set of binary-valued nodes, *F* = (*F*
_{1}, *F*
_{2}, …, *F*
_{
n
}), a list of sets of Boolean functions: {\mathit{F}}_{\mathit{i}}=\left\{{\mathit{f}}_{1}^{\left(\mathit{i}\right)},{\mathit{f}}_{2}^{\left(\mathit{i}\right)},\dots ,{\mathit{f}}_{\mathit{l}\left(\mathit{i}\right)}^{\left(\mathit{i}\right)}\right\} and *l*(*i*) is the number of possible functions for gene *i, i* = 1, 2, …, *n*[12–14]. A node *x*
_{
i
} represents the state of gene *i*; *x*
_{
i
} = 1 (or 0) indicates that gene *i* is (or not) expressed. The set *F*
_{
i
} contains the rules that determine the next state of gene *i*. Each {\mathit{f}}_{\mathit{j}\left(\mathit{i}\right)}^{\left(\mathit{i}\right)}:{\left\{0,1\right\}}^{\mathit{n}}\to \left\{0,1\right\} for 1 ≤ *j*(*i*) ≤ *l*(*i*) is a mapping or predictor function determining the state of gene *i*. In a context-sensitive PBN (CSPBN) with *k* contexts, a network function is defined as {\mathit{f}}_{\mathit{j}}=\left({\mathit{f}}_{\mathit{j}}^{\left(1\right)},{\mathit{f}}_{\mathit{j}}^{\left(2\right)},\dots ,{\mathit{f}}_{\mathit{j}}^{\left(\mathit{n}\right)}\right), where {\mathit{f}}_{\mathit{j}}^{\left(\mathit{i}\right)}:{\left\{0,1\right\}}^{\mathit{n}}\to \left\{0,1\right\} is the predictor function for gene *i, i* = 1, 2, …, *n,* in context *j* (*j* = 1, 2, …, *k*) [15, 29].

Due to the stochasticity in genetic networks, the next state of gene *i* is determined by all the *l*(*i*) functions in *F*
_{
i
}, i.e., by {\mathit{f}}_{1}^{\left(\mathit{i}\right)},{\mathit{f}}_{2}^{\left(\mathit{i}\right)},\dots ,{\mathit{f}}_{\mathit{l}\left(\mathit{i}\right)}^{\left(\mathit{i}\right)}, with probabilities {\mathit{c}}_{1}^{\left(\mathit{i}\right)},{\mathit{c}}_{2}^{\left(\mathit{i}\right)},\dots ,{\mathit{c}}_{\mathit{l}\left(\mathit{i}\right)}^{\left(\mathit{i}\right)}. For an instantaneous PBN of *n* genes, there are a total number of {\displaystyle \prod _{\mathit{i}=1}^{\mathit{n}}}\mathit{l}\left(\mathit{i}\right) possible Boolean networks (BNs), each of which is a possible realization of the genetic network. Each BN can be considered as the network function for a context [20]. In the *j* th context, assume the network function is given by {\mathit{f}}_{\mathit{j}}=\left({\mathit{f}}_{\mathit{j}\left(1\right)}^{\left(1\right)},{\mathit{f}}_{\mathit{j}\left(2\right)}^{\left(2\right)},\cdots ,{\mathit{f}}_{\mathit{j}\left(\mathit{n}\right)}^{\left(\mathit{n}\right)}\right)\phantom{\rule{0.12em}{0ex}}, where each {\mathit{f}}_{\mathit{j}\left(\mathit{i}\right)}^{\left(\mathit{i}\right)}:{\left\{0,1\right\}}^{\mathit{n}}\to \left\{0,1\right\} for 1 ≤ *j*(*i*) ≤ *l*(*i*) is a mapping or predictor function determining the state of gene *i*. The probability that the *j* th context is selected, is obtained as {\mathit{C}}_{\mathit{j}}={\displaystyle \prod _{\mathit{i}=1}^{\mathit{n}}}{\mathit{c}}_{\mathit{j}\left(\mathit{i}\right)}^{\left(\mathit{i}\right)} for *j* = 1, 2, …, *k*, where *k* is the number of contexts in a context-sensitive network [20]. The state of a gene is updated in the selected context; thus the next state depends on both the present state and the selected context.

In a context-sensitive network, a context may remain for certain time until a random event occurs. A context switching usually occurs with probability *q*. For *q* = 1, the CSPBN becomes an instantaneous PBN. For *q* < 1, if a new context is to be selected, it is randomly chosen from the set of network functions: {*f*
_{1}, *f*
_{2}, …, *f*
_{
k
}}, with a set of context selection probabilities: {*C*
_{1}, *C*
_{2}, …, *C*
_{
k
}} [29]. If noise is considered in a CSPBN, it is often referred to as a perturbation, by which a gene flips its state with a probability *p* (*p* ≠ 0). It has been shown that a PBN with perturbation is an ergodic Markov chain in that all the states are connected in the PBN [13]. The transition probability for any two states is determined by the values of *p* and *q* pairs. Following [19], one of four mutually exclusive events occurs at time *t* in a CSPBN with perturbation:

∅_{1}: The predictor functions in the currently selected context are applied to update the gene expressions and this context remains for the next transition.

∅_{2}: The predictor functions in the currently selected context are applied to update the states of the genes and then a new context is selected for the next transition.

∅_{3}: A random perturbation occurs and the currently selected context remains for the next transition.

∅_{4}: A random perturbation occurs and a new context is selected for the next transition.

The effect of a switching order, i.e., whether a network switches its context before or after its state transition, is considered in [23], whereas in this paper, we focus on the transition rules, i.e., the network function is applied first and then the context switches.

A gene activity profile (GAP) is defined as a vector for describing the state of a network at time *t*, ** x**(

*t*) = (

*x*

_{1}(

*t*),

*x*

_{2}(

*t*), …,

*x*

_{ n }(

*t*)), where

*x*

_{ i }(

*t*) ∈ {0, 1} for

*i*= 1, 2, …,

*n*. The state of a CSPBN can be represented as a combination of a context and a GAP, i.e.,

**= (**

*S**context j*,

*GAP i*). By this definition, the number of states in a CSPBN increases from 2

^{n}to 2

^{n}

*k*for a network with

*n*genes and

*k*contexts. A GAP can also be represented by its decimal index, i.e., \mathit{d}={\displaystyle \sum _{\mathit{j}=1}^{\mathit{n}}}{2}^{\mathit{n}-\mathit{j}}{\mathit{x}}_{\mathit{j}}\left(\mathit{t}\right)+1. A state of a CSPBN is then given by

**= (**

*S*

*f*_{ i },

*d*) for

*i*∈ {1, 2, …,

*k*} and

*d*∈ {1, 2, …, 2

^{n}}, where

*d*is the decimal index of a GAP. For convenience, a state (

*f*_{ i },

*d*) is referred to as (

*i*,

*d*) in the following analysis.

In a CSPBN with a perturbation probability *p* and a switching probability *q*, the transition probability for any two states ** a** and

**is given by [23]:**

*b*where

and *h* is the Hamming distance between two GAPs with decimal indices *x*
_{1} and *x*
_{2}, *x*
_{1}, *x*
_{2} ∈ {1, 2, …, 2^{n}}. This distance indicates the number of genes with different expressions in the two GAPs *x*
_{1} and *x*
_{2}.

The state transition matrix (STM) of a CSPBN without perturbation is of the size 2^{n} ∙ *k* × 2^{n} ∙ *k*, given by [23]:

and the STM of a CSPBN with perturbation is given by:

where *P*_{
i
} and {\tilde{\mathit{P}}}_{\mathit{i}} denote the STMs for the Boolean network *i, i* ∈ {1, 2, …, *k*}, without and with perturbation respectively.

### Context-Sensitive Stochastic Boolean Networks (CSSBNs)

#### A CSSBN without perturbation

Because of the similarities between logic circuits and biological networks, digital circuits have been used to simulate genetic networks [30], as well as to determine the node vulnerability in cellular networks [31]. Stochastic logic has been demonstrated in several biological applications [32, 33]. Stochastic computation implements probabilistic analysis using Boolean logic by encoding a real number or probability into a random binary bit sequence [34]. A probability is usually represented by a proportional number of 1’s in a stochastic sequence. The complement of a probability can be computed by an inverter and the multiplication of probabilities can be implemented by an AND gate for independent inputs. A multiplexer computes a weighted sum of its input probabilities, with the weights given by the selection inputs. Figure 1 shows an inverter (NOT), an AND gate, a buffer, an OR gate, an XOR gate and a multiplexer [24, 35]. Due to stochastic fluctuations, the computational results by stochastic logic are not deterministic but probabilistic. However, this stochastic fluctuation can be reduced through the use of non-Bernoulli sequences of random permutations of fixed numbers of 1’s and 0’s as initial inputs, thus producing more accurate results than using Bernoulli sequences [35]. It has been shown that a probabilistic distribution of genes’ states can be accurately determined in a stochastic Boolean network (SBN) with a reasonably long stochastic sequence length [24].

In a CSPBN, the selection of a context is determined by the context switching probability. This probability indicates the likelihood to maintain the current context *i* or to select a new context from the *k* contexts (including the currently selected context *i*). Based on the present state and the selected context, a gene’s state is updated. If no perturbation occurs in an *n*-gene CSPBN with a switching probability of *q*, the transition probability from state (*s*, *y*) to (*r*, *x*) is given by [23]:

where

*s* and *r* denote the *s* th and *r* th contexts, *y* and *x* represent two gene activity profiles (GAPs, in decimal indices), and *C*
_{
s
} and *C*
_{
r
} indicate the probability of selecting the *s* th and *r* th contexts respectively.

An SBN structure is proposed in [24] to implement an instantaneous PBN. For a CSPBN, a context-sensitive SBN (CSSBN) is constructed to consider the switching of contexts, as shown in Figure 2. In this CSSBN model, the probabilistic switching is implemented using a multiplexer in stochastic computation and the switching probability *q* is encoded as a random binary sequence *Q* that serves as the control sequence of a 2-to-1 multiplexer (MUX). If the *j* th bit in the sequence *Q* is 1, a new context will be selected for the next transition. Otherwise, the current context will remain. The selection probability of the new context is determined by the original and current context selection probabilities. As shown in the lower section of Figure 2, this process is implemented by 2-to-1 multiplexers with the original and current context selection sequences as inputs and *Q* as the control sequence. The selection probability of the new context is then obtained as encoded in the output stochastic sequences of the multiplexers. If *q =* 0, the CSSBN functions as a fixed Boolean Network (BN). If *q* = 1, the CSSBN is simplified to an instantaneous SBN.

If a switch does occur, the context selection process is implemented by another multiplexer for choosing one from the *k* contexts, according to predefined selection probabilities. As a network function is a combination of each gene’s predictor function, a combination of the *m* control sequences of *S*
_{1} ~ *S*
_{
m
} is used to encode the predefined selection probabilities; this in turn determines the selection probability of each new context. For a CSPBN with *n* genes, the CSSBN needs to be run for each of the 2^{n} input states and sequences need to be generated for the *m* control signals of the multiplexer.

For a switching probability of *q* in the proposed CSSBN, the currently selected context *i* remains at time *t* with a probability of *q* and switches to one of the *k* contexts with a probability of 1 - *q*. If the network transitions from a GAP *y* to *x* in the *s* th context (i.e., *f*
_{
s,y,x
} = 1), then context *s* will remain with probability 1 - *q* + *qC*
_{
s
} at the next time step. Otherwise, the network moves into a new context *r* with probability *qC*
_{
r
}. From this analysis, it can be seen that the CSSBN in Figure 2 computes the transition probability from state (*s*, *y*) to (*r*, *x*) as given by (7). This indicates that the proposed CSSBN model accurately implements the function of a CSPBN.

### A CSSBN with perturbation

In a PBN with random perturbation, a gene may change its state with a probability *p* during a state transition. Following [12], the effect of perturbation is considered to flip a gene’s state. Assume in an *n*-gene CSPBN, the current GAP at time *t* is given by ** x** = (

*x*

_{1},

*x*

_{2}, …,

*x*

_{ n }) and

**is the perturbation vector, the GAP at**

*γ**t*+ 1,

**is given by:**

*x′,*where ⊕ is the addition modulo 2 and *f*_{
j
}(⋅) is the network function for the *j* th context at time *t.* To account for the effect of perturbation, a CSSBN with perturbation is constructed, as shown in Figure 3. In this CSSBN, XOR gates are used to implement the addition modulo 2 of the perturbation vector and the present state, while an *n*-input OR gate is used to compute the probability that a perturbation occurs. The output of the OR gate is then used as the control sequence of a bus (or multiple-bit) multiplexer to decide the selection of sequences with or without perturbation. If the output sequence of the OR gate contains all 0’s, which means that there is no perturbation, then the next state is given by the predictor functions in the currently selected context in the original CSSBN without perturbation; otherwise, the next state is determined by the perturbation effect. A stochastic analysis of the function of the CSSBN with perturbation shows that the next state of the network is given by:

which is equivalent to (9). This indicates that a CSPBN with perturbation can be accurately implemented by a CSSBN with perturbation.

### Intervention in a CSSBN

In contrast to perturbation, gene intervention refers to the process of deliberately changing the states of some genes to guide a network into a desired state [12]. External stimuli are applied to a network to avoid undesirable states that might be associated with certain diseases. For an effective intervention, control policies are developed for different intervention strategies; as several genes may affect the state of the target gene, a single gene that is most influential on the network state is usually identified as the control gene, due to its simplistic biological implications [15]. The STM is then changed by an external intervention [15, 21, 22].

In a CSPBN, the context information is usually hidden and thus difficult to obtain in practice. However, gene expressions can readily be observed, so a GAP within all possible contexts is considered to be undesirable if the GAP is an undesirable state of the network [36]. Hence, gene intervention refers to changing the state of a network as represented by a GAP.

For an *n*-gene network, a vector ** s** = (

*s*

_{1},

*s*

_{2},

*s*

_{3}, …,

*s*

_{ n }) with

*s*

_{ i }∈ {0, 1} for any

*i*∈ {1, 2, …,

*n*}, is defined as the control gene vector, that is, if

*s*

_{ i }= 1, then gene

*i*is selected as a control gene [37]. An intervention vector is defined as \mathit{u}=\left({\mathit{u}}_{1},{\mathit{u}}_{2},{\mathit{u}}_{3},\dots ,{\mathit{u}}_{{2}^{\mathit{n}}}\right),

*u*

_{ j }∈ {0, 1} for any

*j*∈ {1, 2, …, 2

^{n}}, where

*u*

_{ j }= 1 (or 0) indicates a flipping (or remaining) of the state of a control gene at GAP

*j.*If gene

*i*is selected to be a control gene, for example, then

*s*

_{ i }= 1. The expression level of the control gene

*i*is then determined by its current state and the status of

**. If**

*u**u*

_{ j }= 1, i.e.,

*s*

_{ i }

*u*

_{ j }= 1, the state of control gene

*i*is flipped by the external intervention when GAP

*j*emerges; otherwise, the state of control gene

*i*remains unchanged and thus the current state is preserved in GAP

*j*[38, 39]. An intervention vector can be obtained by various methods; in this paper, a control policy using the steady state distribution (SSD) [40] is used to obtain

**.**

*u*The state of a network under intervention, i.e., a GAP, is determined by the control signal, the state prior to the intervention and the intervention vector. Let {\widehat{\mathit{x}}}_{\mathit{t}} and *x*_{
t
} be the GAPs before and after intervention at time *t*; if the *g* th gene is the control gene (i.e., *s*
_{
g
} = 1), the state of the network under intervention is given by [29]:

where *s*_{
g
} is a vector of *n* bits with 0 at every bit except the *g* th bit. 1(∙) is an indicator function: 1\left(\mathit{u}\left({\widehat{\mathit{x}}}_{\mathit{t}}\right)=1\right)=1 if \mathit{u}\left({\widehat{\mathit{x}}}_{\mathit{t}}\right)=1 and 1\left(\mathit{u}\left({\widehat{\mathit{x}}}_{\mathit{t}}\right)=1\right)=0 otherwise. When \mathit{u}\left({\widehat{\mathit{x}}}_{\mathit{t}}\right)=0, the GAP {\widehat{\mathit{x}}}_{\mathit{t}} is maintained; otherwise, the GAP {\widehat{\mathit{x}}}_{\mathit{t}} is an undesirable state, for which the control bit is to be toggled, thereby improving the occurring probability of a desirable state.

The CSSBN model under intervention is given in Figure 4. If gene *i* is selected as the control gene, i.e., *s*
_{
i
} = 1, the output of the AND gate is determined by the state of ** u**. If state

*j*is an undesirable state, i.e.,

*u*

_{ j }= 1, the state of the control gene

*i*is toggled. This is implemented by an XOR gate: if

*s*

_{ i }

*u*

_{ j }= 1, the state of gene

*i,*given by the output of an XOR gate, is flipped at the state, or GAP,

*j*; otherwise, the state is not changed by the intervention. From this analysis, the state of a network after intervention is given by:

which is equivalent to (11). This indicates that the CSSBN model in Figure 4 accurately implements the process of external gene intervention.

### State transition matrix (STM) and steady state analysis

As discussed previously, the state of a CSPBN can be represented as ** S** = (

*context*,

*x*) by considering the selected context and GAP information. Using a CSSBN, the STM can be obtained through the statistics encoded in the output sequences. Given an input state, each simulation of a CSSBN produces the transition probabilities from this input to all output states, i.e., the row in the STM for this input. For a CSPBN with

*n*genes and

*k*contexts, the CSSBN needs to be run for each of the 2

^{n}

*k*input states and an

*O*(

*n*) number of sequences need to be generated for the control signals of the multiplexers. As in [24], a factor,

*L*, is used to account for the computational overhead required by using a longer stochastic sequence. Therefore, a complexity of

*O*(

*nLk*2

^{n}) results for computing the STM of a context-sensitive network. In a CSSBN, the required minimum sequence length is typically on a polynomial order of the numbers of genes and contexts, as shown in the simulation results in Table 1. Since the number of possible BNs,

*k*, generally increases exponentially with the number of predictor functions for each gene, the complexity of using a CSSBN to compute the STM, i.e.,

*O*(

*nLk*2

^{n}), is smaller than

*O*(

*nk*

^{2}2

^{2n}) of an accurate analysis for a CSPBN [23]. As indicated by the shorter average run time in Table 1, this difference becomes significant for a network with a large number of gene states. In Table 1, the simulation results are also provided for CSSBNs with a single context, i.e.,

*k*= 1. In this case, the CSSBN degenerates into a deterministic BN, for which an analytical approach is more efficient, due to the use of random binary bit sequences in the stochastic approach. However, the CSSBN approach is more efficient in computing the STM for a gene network with a large number of states. See Additional file 1: The pseudocode for computing the STM of a CSSBN.

In general, a (2^{n} ∙ *k*) × (2^{n} ∙ *k*) STM is required for an accurate CSPBN analysis [23] while a 2^{n} × 2^{n} STM is needed by an approximate method [20]. As the number of genes increases in a network, a matrix-based analysis becomes infeasible due to a significant increase in the required computational resources. The analysis of the SSD is even more challenging for a CSPBN. For a CSSBN, however, the STM can be accurately and efficiently computed. Furthermore, the SSD can be evaluated through an iterative simulation in the temporal domain (or the so-called time-frame expansion technique [24]). By this technique, an iterative structure of the CSSBN is used to simulate the temporal evolution of a GRN. The required number of iterations is determined by the number of state transitions before reaching a steady state.

Given an initial state distribution *x*^{(0)}, let *x*^{(m)} and *x*^{(m + 1)} be the state distributions after *m* and *m* + 1 transitions respectively. Assume that the STM of the network is given by ** A**, then the state transitions from

*x*^{(m)}to

*x*^{(m + 1)}are described by:

If ||*x*^{(m + 1)} - *x*^{(m)}||_{∞} is used to compute the maximum absolute value of the summed difference of each row in *x*^{(m)} and *x*^{(m + 1)}, the condition for reaching a steady state is given by [18]:

where ϵ indicates a threshold for determining whether the steady state has been reached or not. If (14) is met, a network is considered to have reached a steady state, i.e., *x*^{(∞)} = *x*^{(m)}; thus *x*^{(m)} is considered as the stationary distribution. Alternatively, a reasonable number of network transitions (e.g. a few hundreds) can be run to obtain the SSD in practice. As shown in the Results and Discussion section, a time-frame extended simulation of CSSBNs is more efficient than an analytical approach while producing more accurate results than an approximate analysis. See Additional file 2: The pseudocode for computing the SSD of a CSSBN.

In a time-frame expanded CSSBN, random binary bit streams are generated for predictor function selection, gene perturbation and context switching probabilities. As in [24], non-Bernoulli sequences of random permutations of fixed numbers of 1’s and 0’s are used for encoding initial probabilities. These sequences then propagate through the CSSBNs and the statistics encoded in the output sequences are used to obtain the SSD. Compared to the use of Bernoulli sequences, as shown in [35], the use of the non-Bernoulli sequences produces more accurate results for a given sequence length or requires a shorter sequence length for a desired evaluation accuracy. As an efficient alternative to an STM-based analysis, the stochastic simulation of CSSBNs provides flexibility in achieving a tunable accuracy-efficiency tradeoff by using stochastic sequences of different lengths. Hence, the proposed CSSBN approach is potentially useful in the analysis of large GRNs.

## Results and discussion

### Simulation with a p53-Mdm2 Network

As a tumour suppressor gene, p53 plays an important role in preventing the development and progression of tumour cells [41, 42]. In a p53 network, signaling pathways are triggered by external stimuli. For example, DNA damages activate pathways that involve the genes p53 and Mdm2. It has been shown that the expression level of the p53 protein is reversely related with that of the Mdm2 gene, which leads to an oscillatory behavior of the p53-Mdm2 network [25, 43]. Various Boolean models have been developed to simulate the dynamics of a p53 network [44–46]. In this section, the two-gene probabilistic Boolean network (PBN) model developed in [24] for the p53-Mdm2 network is used to illustrate the applicability of context-sensitive stochastic Boolean networks (CSSBNs). This (instantaneous) PBN of the two genes p53 and Mdm2 consists of *V* = {*x*
_{1}, *x*
_{2}} and the predictor function sets {\mathit{F}}_{1}=\left\{{\mathit{f}}_{1}^{\left(1\right)},{\mathit{f}}_{2}^{\left(1\right)},{\mathit{f}}_{3}^{\left(1\right)},{\mathit{f}}_{4}^{\left(1\right)}\right\} and {\mathit{F}}_{2}=\left\{{\mathit{f}}_{1}^{\left(2\right)},{\mathit{f}}_{2}^{\left(2\right)},{\mathit{f}}_{3}^{\left(2\right)},{\mathit{f}}_{4}^{\left(2\right)}\right\}. The truth table for the state transitions of this PBN is given as Table 2[24].

An internal entry indicates whether a logical 1 or 0 would result at the next state of a gene from a selected Boolean function. \phantom{\rule{0.12em}{0ex}}{\mathit{c}}_{\mathit{j}}^{\left(\mathit{i}\right)} indicates the transition probability by a function {\mathit{f}}_{\mathit{j}}^{\left(\mathit{i}\right)}[24].

In [24], a stochastic Boolean network (SBN) of the p53-Mdm2 network is proposed. Based on the general CSSBN model in Figure 2, a CSSBN for the p53-Mdm2 network is shown in Figure 5. In this model, a network function (or a Boolean network (BN)) is selected at time *t* - 1 for context *i* and at time *t*, the network remains at context *i* with a probability of *q* or switches to one of the *k* contexts (*k* = 16 for this network) with a probability of 1 - *q*. This context switching behavior is modeled by a 2-to-1 multiplexer (MUX) for each gene and the switching probability *q* is encoded in the control bit sequence *Q*. At the same time, the selection probabilities of the new context are determined by the 2-to-1 multiplexers with the original and current context selection sequences as inputs and *Q* as the control sequence, as shown in the upper section of Figure 5.

The context-sensitive p53-Mdm2 network has 2 genes and 16 contexts. If both context and gene activity profile (GAP) are considered, there are 2^{2} × 16 = 64 states for the p53-Mdm2 CSSBN. Given the transition probability for each predictor function (in Table 2), the probability for selecting each context can readily be computed for independent functions, as shown in Table 3.With random perturbation to the genes, a CSSBN with perturbation is constructed as shown in Figure 6. For this two-gene network, a two-input multiplexer is used for each gene to probabilistically select a perturbed state or the original CSSBN state without perturbation.

The CSSBN is then used to obtain the state transition matrix (STM) for the p53-Mdm2 network. See Additional file 3: The Matlab program that describes the structure of the CSSBN in Figure 6 and computes its STM for the p53-Mdm2 network with perturbation.

The norms || ⋅ ||_{1}, || ⋅ ||_{2}, and || ⋅ ||_{∞} are used to measure the differences of the STMs obtained for the CSSBN and CSPBN. || ⋅ ||_{1} and || ⋅ ||_{∞} indicate the maximum absolute values of the summed differences of the columns and rows respectively, while || ⋅ ||_{2} measures the average difference of all entries. Assume *A*_{
CSSBN
} and *A*_{
CSPBN
} are the obtained STMs for the CSSBN and CSPBN respectively. Let *Δ* ** A** =

*A*_{ CSSBN }-

*A*_{ CSPBN }; the norms of the differences of the computed matrices (||

*Δ*

**||**

*A*_{1}, ||

*Δ*

**||**

*A*_{2}and ||

*Δ*

**||**

*A*_{∞}) are then shown in Table 4 for different values of the switching probability

*q*and perturbation rate

*p*. The average run time is also shown for using the CSSBN.

As revealed in Table 4, the difference in the STMs computed using the CSSBN and an analytical CSPBN approach is significantly reduced by increasing the sequence length *L*. However, the inaccuracies, due to the inherent stochastic fluctuations in stochastic computation, are generally small and thus negligible. Hence, the proposed CSSBN model can be used to accurately and efficiently compute the STM of a CSPBN.

A steady state analysis is further performed on the proposed CSSBN. For the p53-Mdm2 network, there are four states or GAPs. The probability of each GAP is given by the sum of the probabilities for all contexts. The simulation results for the four GAPs with respect to different *p* and *q* values are shown in Figure 7 (using a sequence length of 500k bits).As can be seen in Figure 7, while the SSD is determined by both the perturbation and switching probabilities, the perturbation rate has a greater effect on the final distribution of the steady state compared to the switching probability. The SSD changes drastically with the increase of the perturbation rate, whereas the effect of context switching is rather limited for a given perturbation rate.

As *p*, *q* ∈ [0, 1], several *p* and *q* pairs are chosen for further analysis. The difference in the SSDs obtained by using the time-frame expanded CSSBN technique, the approximate [20] and accurate analysis [23] are shown in Table 5.

As revealed in Table 5, the CSSBN approach can compute the SSD more accurately than the approximate analysis. In fact, the difference in the results between the CSSBN and the accurate analysis is negligible when reasonably long stochastic sequences are used. With the STM obtained for a CSSBN, an SSD is evaluated and the results are very accurate compared to those obtained using the accurate approach, as shown in Figure 8. A further analysis shows that the relative error is limited to less than approximately 0.2% for the CSSBN approach.The SSDs obtained using different methods, as well as the corresponding values of norms, are shown in Figure 9. It can be seen that the SSD can be accurately evaluated by the CSSBN model compared to the accurate analytical approach.

It has been shown that a PBN with perturbation evolves as an ergodic Markov chain [12]. For a larger perturbation probability *p,* there is an increased randomness in the network activities, thus the steady states of the network are more evenly distributed [23]. This can be seen in the simulation results in Figure 9. Figures 7 and 9 further show that context switching has little impact on the SSD of the network with perturbation. This is due to the fact that the context switching activity only affects the selection probability of a context, but not the predictor functions. Also shown in Figure 9 is that an oscillation of the states of p53 and Mdm2 exists, indicated by the higher probabilities of the network states (or GAPs) 01 and 10 compared with the other states. However, this two-state oscillation is less evident when the perturbation rate is significantly higher. On the other hand, a modest perturbation to the network only has a minor effect on the oscillatory behavior, confirming the stability and robustness of the p53-Mdm2 network in a dynamic environment [47].

### Experiments on a glioma network

A network in [16] obtained from a human glioma gene expression data set [48] is further used to illustrate the efficiency of the CSSBN model and the time-frame expansion technique. Based on this data set, a PBN was inferred by a method using the coefficients of determination (CODs) and an SSD analysis was performed in [18]. An approximate method was developed in [20] as an extension to estimate the SSD of a CSPBN with perturbation. The gene TOP2A was not considered in either study as it is an input gene with an in-degree of zero. In our study, the network setting is considered the same as in [18, 20] with the gene TOP2A removed; this leads to a total of 2^{14} GAPs. Figure 10 shows a detailed structure of the considered glioma network with double (or single)-headed arrows indicating the bi (or uni)-directional relationships of gene pairs. The selection probabilities for the predictor functions are shown in Table 6[20] and the Boolean functions for each gene are obtained through an analysis of the data in [18].

In Table 6, each column for *F*
_{
i
}, *i* ∈ {1, 2, …, 14}, contains the selection probabilities for the Boolean functions of gene *i*, with the value in the *j* th row as the probability for {\mathit{f}}_{\mathit{j}}^{\left(\mathit{i}\right)}, thus the probabilities sum to 1 in each column. Table 6 also shows that six genes have only a single predictor function; only the state of gene 6 is determined by three functions, while the states of the other genes are determined by two predictor functions.

### Steady state analysis

For the 14-gene glioma network, there are a total of 384 contexts, as can be determined from Table 6. It requires an STM with 2^{14} × 384 = 6291456 columns and rows for an accurate analysis. This makes it infeasible to estimate the steady states of a CSPBN using a matrix-based analysis. The approximate analysis in [20] would require the computation of an STM of the size 2^{14} × 2^{14}. Thus, it is difficult in general to use an analytical approach to evaluate a large network, due to the demanding computational resources required. However, a CSSBN model can be constructed for the glioma network; this CSSBN is based on the constituent SBN, as shown in Figure 11. With the CSSBN, the SSD can be estimated using the aforementioned time-frame expansion technique with a greatly reduced complexity. The obtained SSD is then compared with that obtained from the approximate analysis in [20]. In this paper, a network is considered to have reached a steady state if the discrepancy between two adjacent iterations is smaller than a required threshold ϵ (by (14)) or the number of simulation iterations has reached a maximum constant value. See Additional file 4: The Matlab program that evaluates the steady state distribution using the time-frame expanded CSSBN technique for the glioma network with context switching and perturbation.

The state or GAP of the glioma network can be represented by a binary vector as (*x*
_{1}
*, x*
_{2}
*,* …*, x*
_{14}), *x*
_{
i
} ∈ {0, 1} for *i* ∈ {1, 2, …, 14}, or its decimal index. For example, the state (01001100011011) can be represented as state 4892. The SSDs of the context-sensitive glioma network with all 16384 states, obtained using the CSSBN approach and the approximate analysis [20], are shown in Figure 12.

The norms of the differences of SSDs, obtained using the CSSBN with different sequence lengths and the approximate method in [20], are shown in Table 7.

As shown in Table 7, the CSSBN time-frame expansion technique efficiently evaluates the SSD of the glioma network and produces results comparable to those obtained by the approximate analysis [20]. Evaluation accuracy is further improved by using longer stochastic sequences with yet a shorter runtime compared to the approximate method. Since it is difficult, if not impossible, to compute the STM or SSD of a large GRN by using an accurate or approximate analysis, the CSSBN time-frame expansion approach provides an alternative means to accurately and efficiently estimate the SSD of a large network with a tunable accuracy by using stochastic sequences of different lengths.

### Intervention analysis

In an *n-* gene network, if gene *X*
_{
i
} is the target gene, the states of all genes can be divided into a set of desirable states, D, and a set of undesirable states, U, by the expression level of the target gene [40]. As can be seen in Figure 10, gene *X*
_{14}, the (BCL2A1); BFL1 protein; GRS protein is one of the most influential genes that interacts closely with others, thus it is chosen as the target gene. The desirable and undesirable states are then separated by the expression level of *X*
_{14}. Assume that the inactivation of *X*
_{14} is preferred; the cumulative probabilities of the desirable and undesirable states are given by {\displaystyle \sum {{{}_{\mathit{x}}}_{{}_{14}}}_{=0}{\mathit{\pi}}_{\mathit{x}}} and {\displaystyle \sum {{{}_{\mathit{x}}}_{{}_{14}}}_{=1}{\mathit{\pi}}_{\mathsf{y}}}, respectively, where *π*
_{
x
} and *π*
_{
y
} are elements in the desirable and undesirable SSDs respectively.

As discussed previously, a GRN can be intervened by applying external stimuli to minimize the likelihood of being in an undesirable state. Various methods have been proposed for deriving control vectors for an effective intervention [29, 40]; in this study the SSD algorithm proposed in [40] is used to determine an intervention vector. For simplicity, 16 most significant contexts that account for a total selection probability of 57.27% are chosen from the total 384 contexts of the glioma network for an intervention analysis. For these 16 contexts, the expression levels of gene *X*
_{5} and *X*
_{7} have no effect on the states of other genes, so these two genes are removed; this yields a simplified 12-gene glioma network. The cumulative distributions of the desirable steady states of the context-sensitive 12-gene glioma network are shown in Table 8 for using a different gene as the control gene.

When a different gene is selected as the control gene, the effect of intervention varies with respect to improving the probabilities of desirable states in the SSD. For the glioma network, as revealed in Table 8, gene X_{1}, Tie-2, is the most effective for maximizing the percentage of the desirable states. The cumulative distribution of the desirable states is increased from 45.97% to 70.91% by an intervention using Tie-2 as the control gene. Also revealed in the table is that an intervention via gene *X*
_{4}, (HSP40); (HDJ1; DNAJ1), has nearly no impact on the distribution of the desirable states, as the percentage of the desirable states has not been changed much by the intervention (i.e., 45.90% vs. 45.97%). A modest improvement in the desirable state distribution is obtained by an intervention with another gene as the control gene. For any target gene, this process can be applied to find the most significant gene that maximizes the probabilities of desirable states.

The effects of intervention can also be seen in Figure 13, where three different scenarios are considered: (a) no intervention, (b) an indirect intervention via gene *X*
_{1}, i.e., Tie-2, and (c) a direct intervention via the target gene *X*
_{14}, the (BCL2A1); BFL1 protein; GRS protein. The cumulative probabilities of the desirable states are 45.97% for no intervention, 70.91% and 99.99% for the two intervention strategies. As revealed in these results, a direct intervention of the target gene is perhaps optimal for avoiding the undesirable states and almost certainly taking the network into a desirable state. However, when an intervention on the target gene is not possible, an intervention through Tie-2 is the most effective means to maximize the probability of the down-regulation of the target gene *X*
_{14}.

## Conclusion

Context-sensitive stochastic Boolean networks (CSSBNs) are proposed as an efficient approach to modeling the effects of gene perturbation and intervention in gene regulatory networks (GRNs). In a CSSBN, the state transition matrix can be accurately and efficiently computed with a complexity of *O*(*nLk* 2^{n}), where *n* is the number of genes in a context-sensitive probabilistic Boolean network (CSPBN), *k* is the number of contexts and *L* is a factor determined by the stochastic sequence length. This result is an improvement compared to the previous result of *O*(*nk*
^{2}2^{2n}) for an accurate analysis. The use of non-Bernoulli stochastic sequences further increases computational efficiency and allows for a tunable tradeoff between accuracy and efficiency. A steady state analysis using a time-frame expansion technique has shown a significant speedup and produced more accurate results than an approximate analysis in the computation of the steady state distribution.

CSSBNs are constructed for the analysis of gene perturbation in a p53-Mdm2 network and gene intervention in a glioma network. Biologically meaningful insights are gained into the oscillatory dynamics of the p53-Mdm2 network in a context-switching environment with random gene perturbation. It has been shown that the steady state distribution (SSD) changes drastically with the increase of the perturbation rate, whereas the effect of context switching is rather limited for a given perturbation rate. Therefore, random gene perturbation may have a greater effect on the final distribution of the steady state compared to context switching activities. By predicting the SSD, the CSSBN approach can further evaluate the effectiveness of external gene intervention. A case study of the glioma network shows that the CSSBNs are useful in modeling the effects of gene perturbation and intervention in a complex context-sensitive GRN. This will eventually help drug discovery for an effective drug intervention therapy.

## References

Elowitz MB, Levine AJ, Siggia ED, Swain PS: Stochastic gene expression in a single cell. Sci. 2002, 297: 1183-1186. 10.1126/science.1070919.

Pedraza JM, van Oudenaarden A: Noise propagation in genetic networks. Sci. 2005, 307: 1965-1969. 10.1126/science.1109090.

Dunlop M, Cox RIII, Levine J, Murray R, Elowitz M: Regulatory activity revealed by dynamic correlations in gene expression noise. Nat Genet. 2008, 40: 1493-1498. 10.1038/ng.281.

Kauffman SA: Metabolic stability and epigenesis in randomly constructed genetic nets. Theor Biol. 1969, 22: 437-467. 10.1016/0022-5193(69)90015-0.

Klipp E: Systems Biology in Practice: Concepts, Implementation and Application. 2005, Weinheim, Germany: Wiley-VCH

Qian L, Wang H, Dougherty E: Inference of noisy nonlinear differential equation models for gene regulatory networks using genetic programming and Kalman filtering. IEEE Trans Signal Process. 2008, 56 (7): 3327-3339.

Gillespie DT: A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J Comput Phys. 1976, 22: 403-10.1016/0021-9991(76)90041-3.

Gillespie DT: Exact stochastic simulation of coupled chemical reactions. J Phys Chem. 1977, 81: 2340-2361. 10.1021/j100540a008.

Davidich M, Bornholt S: Boolean network model predicts cell cycle sequence of Fission yeast. PLoS One. 2008, 3 (2): e1672-e2008. 10.1371/journal.pone.0001672.

Giacomantonio CE, Goodhill GJ: A Boolean model of the gene regulatory network underlying Mammalian cortical area development. PLoS Comput Biol. 2010, 6 (9): e1000936-10.1371/journal.pcbi.1000936.

Huang S: Gene expression profiling, genetic networks, and cellular states: an integrating concept for tumorigenesis and drug discovery. J Mol Med. 1999, 77: 469-480. 10.1007/s001099900023.

Shmulevich I, Dougherty ER, Zhang W: From Boolean to probabilistic Boolean networks as models of genetic regulatory networks. Proc IEEE. 2002, 90: 1778-1792. 10.1109/JPROC.2002.804686.

Dougherty ERSI, Zhang W: Gene perturbation and intervention in probabilistic Boolean networks. Bioinform. 2002, 18 (10): 1319-1331. 10.1093/bioinformatics/18.10.1319.

Shmulevich I, Dougherty E: Probabilistic Boolean Networks: The Modeling and Control of Gene Regulatory Networks. 2009, Philadelphia: SIAM

Pal R, Datta A, Bittner ML, Dougherty ER: Intervention in context-sensitive probabilistic Boolean networks. Bioinform. 2005, 21: 1211-1218. 10.1093/bioinformatics/bti131.

Shmulevich I, Gluhovsky I, Hashimoto RF, Dougherty ER, Zhang W: Steady-state analysis of genetic regulatory networks modeled by probabilistic Boolean networks. Comp Funct Genomics. 2003, 4: 601-608. 10.1002/cfg.342.

Rosenthal JS: Minorization conditions and convergence rates for Markov chain Monte Carlo. J Am Stat Assoc. 1995, 90: 558-566. 10.1080/01621459.1995.10476548.

Zhang S, Ching W, Ng M, Akutsu T: Simulation study in probabilistic Boolean network models for genetic regulatory networks. Int J Data Min Bioinform. 2007, 1 (3): 217-240. 10.1504/IJDMB.2007.011610.

Brun M, Dougherty ER, Shmulevich I: Steady-state probabilities for attractors in probabilistic Boolean networks. Signal Process. 2005, 85: 1993-2013. 10.1016/j.sigpro.2005.02.016.

Ching W, Zhang S, Ng M, Akutsu T: An approximation method for solving the steady-state probability distribution of probabilistic Boolean networks. Bioinform. 2007, 23: 1511-1518. 10.1093/bioinformatics/btm142.

Datta A, Choudhary A, Bittner ML, Dougherty ER: External control in markovian genetic regulatory networks. Mach Learn. 2003, 52: 169-191. 10.1023/A:1023909812213.

Datta A, Choudhary A, Bittner ML, Dougherty ER: External control in markovian genetic reegulatory networks: the imperfect information case. Bioinform. 2004, 20: 924-993. 10.1093/bioinformatics/bth008.

Pal R: Context-sensitive probabilistic Boolean networks: steady-state properties, reduction, and steady-state approximation. IEEE Trans Signal Process. 2010, 58 (2): 879-890.

Liang J, Han J: Stochastic Boolean networks: an efficient approach to modeling gene regulatory networks. BMC Syst Biol. 2012, 6: 113-10.1186/1752-0509-6-113.

Lahav G, Rosenfeld N, Sigal A, Geva-Zatorsky N, Levine AJ, Elowitz MB, Alon U: Dynamics of the p53-Mdm2 feedback loop in individual cells. Nat Genet. 2004, 36: 147-150. 10.1038/ng1293.

Martin S, Zhang Z, Martino A, Faulon J-L: Boolean dynamics of genetic regulatory networks infer red from microarray time series data. Bioinform. 2007, 23 (7): 866-874. 10.1093/bioinformatics/btm021.

Zhu P, Han J: Stochastic multiple-valued gene networks. IEEE Transac Biomed Circ Syst. 2014, 8 (1): 42-53.

Zhu P, Han J: Asynchronous stochastic Boolean networks as gene network models. J Comput Biol. 2014, in press

Faryabi B, Vahedi G, Chamberland JF, Datta A, Dougherty ER: Intervention in context-sensitive probabilistic Boolean networks revisited. EURASIP J Bioinform Syst Biol. 2009, 2009: 5-

McAdams HH, Shapiro L: Circuit simulation of genetic networks. Sci. 1995, 269 (5224): 650-10.1126/science.7624793.

Abdi A, Tahoori MB, Emamian ES: Fault diagnosis engineering of digital circuits can identify vulnerable molecules in complex cellular pathways. Sci Signal. 2008, 1 (42): ra10-

Adar R, Benenson Y, Linshiz G, Rosner A, Tishby N, Shapiro E: Stochastic computing with biomolecular automata. PNAS. 2004, 101 (27): 9960-9965. 10.1073/pnas.0400731101.

Benenson Y, Gil B, Ben-Dor U, Adar R, Shapiro E: An autonomous molecular computer for logical control of gene expression. Nat. 2004, 429: 423-429. 10.1038/nature02551.

Gaines BR: Stochastic computing systems. Adv Inf Syst Sci. 1969, 2: 37-172.

Han J, Chen H, Liang J, Zhu P, Yang Z, Lombardi F: A Stochastic Computational Approach for Accurate and Efficient Reliability Evaluation. IEEE Transactions on Computers. 2013, in press. Advance access in IEEE Xplore

Pal R, Datta A, Bittner ML, Dougherty ER: Optimal infinite horizon control for probabilistic Boolean networks. IEEE Trans Signal Process. 2006, 54: 2375-2387.

Faryabi B, Datta A, Dougherty ER: On approximate stochastic control in genetic regulatory networks. IET Syst Biol. 2007, 1 (6): 361-368. 10.1049/iet-syb:20070015.

Layek R, Datta A, Pal R, Dougherty ER: Adaptive intervention in probabilistic Boolean networks. Bioinform. 2009, 25 (16): 2042-2048. 10.1093/bioinformatics/btp349. Doi:

Qian X, Dougherty ER: Effect of function perturbation on the steady-state distribution of genetic regulatory networks: optimal structural intervention. IEEE Trans Signal Process. 2008, 56: 4966-4976.

Qian X, Ivanov I, Ghaffari N, Dougherty ER: Intervention in gene regulatory networks via greedy control policies based on long-run behavior. BMC Syst Biol. 2009, 3: 16-10.1186/1752-0509-3-16.

Weinberg RA: The Biology of Cancer. 2006, New York: Garland Science, 1

Vogelstein B, Lane D, Levine AJ: Surfing the p53 network. Nat. 2000, 408: 307-310. 10.1038/35042675.

Batchelor E, Loewer A, Lahav G: The ups and downs of p53: understanding protein dynamics in single cells. Nat Rev Cancer. 2009, 9 (5): 371-377. 10.1038/nrc2604.

Ciliberto A, Novak B, Tyson JJ: Steady states and oscillations in the p53-Mdm2 network. Cell Cycle. 2005, 4: 486-493.

Abou-Jaoude W, Ouattara D, Kaufman M: From structure to dynamics: frequency tuning in the p53-mdm2 network: I: logical approach. J Theor Biol. 2009, 258 (4): 561-577. 10.1016/j.jtbi.2009.02.005.

Murrugarra D, Veliz-Cuba A, Aguilar B, Arat S, Laubenbacher R: Modeling stochasticity and variability in gene regulatory networks. EURASIP J Bioinform Syst Biol. 2012, 1: 5-

Fuller GN, Rhee CH, Hess KR, Caskey LS, Wang R, Bruner JM, Yung WK, Zhang W: Reactivation of insulin-like growth factor binding protein 2 expression in glioblastoma multiforme: a revelation by parallel gene expression profiling. Cancer Res. 1999, 59: 4228-4232.

Ge H, Qian M: Boolean network approach to negative feedback loops of the p53 pathways: synchronized dynamics and stochastic limit cycles. J Comput Biol. 2009, 16: 119-132. 10.1089/cmb.2007.0181. doi:10.1089/cmb.2007.0181

## Acknowledgements

This work was supported in part by the Natural Sciences and Engineering Research Council (NSERC) of Canada in a Discovery Grant and a Startup fund of the University of Alberta.

## Author information

### Authors and Affiliations

### Corresponding author

## Additional information

### Competing interests

The authors declare that they have no competing interests.

### Authors’ contributions

PZ carried out the GRN studies, performed the statistical analysis and drafted the manuscript. JL developed the initial models and participated in discussion. JH participated in the GRN studies, revised the manuscript and supervised the research. All authors read and approved the final manuscript.

## Electronic supplementary material

### 12918_2013_1332_MOESM1_ESM.pdf

Additional file 1:The pseudocode for computing the state transition matrix (STM) of a context-sensitive stochastic Boolean network (CSSBN).(PDF 126 KB)

### 12918_2013_1332_MOESM2_ESM.pdf

Additional file 2:The pseudocode for computing the steady-state distribution (SSD) of a context-sensitive stochastic Boolean network (CSSBN).(PDF 206 KB)

### 12918_2013_1332_MOESM3_ESM.zip

Additional file 3:**The Matlab program that describes the structure of the context-sensitive stochastic Boolean network (CSSBN) in Figure** 6
**and computes its state transition matrix for the p53-Mdm2 network with perturbation.**(ZIP 2 KB)

### 12918_2013_1332_MOESM4_ESM.zip

Additional file 4:The Matlab program that evaluates the steady state distribution using the time-frame expanded context-sensitive stochastic Boolean network (CSSBN) technique for the glioma network with context switching and perturbation.(ZIP 4 KB)

## Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

## Rights and permissions

**Open Access**
This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License (
https://creativecommons.org/licenses/by/2.0
), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

## About this article

### Cite this article

Zhu, P., Liang, J. & Han, J. Gene perturbation and intervention in context-sensitive stochastic Boolean networks.
*BMC Syst Biol* **8**, 60 (2014). https://doi.org/10.1186/1752-0509-8-60

Received:

Accepted:

Published:

DOI: https://doi.org/10.1186/1752-0509-8-60

### Keywords

- Gene regulatory networks
- Boolean networks
- Stochastic Boolean networks
- Context dependent
- Gene perturbation
- Intervention
- Context switch
- Steady state distribution
- p53 network
- glioma network