- Research article
- Open Access
- Published:

# Information theoretic approach to complex biological network reconstruction: application to cytokine release in RAW 264.7 macrophages

*BMC Systems Biology***volume 8**, Article number: 77 (2014)

## Abstract

### Background

High-throughput methods for biological measurements generate vast amounts of quantitative data, which necessitate the development of advanced approaches to data analysis to help understand the underlying mechanisms and networks. Reconstruction of biological networks from measured data of different components is a significant challenge in systems biology.

### Results

We use an information theoretic approach to reconstruct phosphoprotein-cytokine networks in RAW 264.7 macrophage cells. Cytokines are secreted upon activation of a wide range of regulatory signals transduced by the phosphoprotein network. Identifying these components can help identify regulatory modules responsible for the inflammatory phenotype. The information theoretic approach is based on estimation of mutual information of interactions by using kernel density estimators. Mutual information provides a measure of statistical dependencies between interacting components. Using the topology of the network derived, we develop a data-driven parsimonious input–output model of the phosphoprotein-cytokine network.

### Conclusions

We demonstrate the applicability of our information theoretic approach to reconstruction of biological networks. For the phosphoprotein-cytokine network, this approach not only captures most of the known signaling components involved in cytokine release but also predicts new signaling components involved in the release of cytokines. The results of this study are important for gaining a clear understanding of macrophage activation during the inflammation process.

## Background

Cellular functions and biological processes are regulated by complex biochemical reactions within and between the cells [1, 2]. Bimolecular techniques can be used to measure concentrations of various molecular components, such as proteins and metabolites, allowing a partial reconstruction of the networks involving these components. A goal of systems biology is to reconstruct these underlying networks and to infer associated biological phenomena from large scale measurements [3]. More specifically, reconstruction of biological networks yields a framework for understanding the relationship between molecular measurements and higher-level phenotypes [4, 5].

Analyses of diverse read-outs from cells allow one to map an input onto responses associated with a given phenotype, i.e., to reconstruct the underlying biological network that results in the phenotype. Current computational approaches for network reconstruction include principal component regression (PCR) [6], partial least squares (PLS) regression [7], linear matrix inequalities (LMI) [8], and Bayesian Networks (BNs) [9]. These approaches are briefly described below.

PCR is a regression procedure that uses a principal component analysis to estimate regression coefficients [6]. Usually, principal components with the highest variance are selected in three steps. First, a principal component analysis is performed on the data matrix of explanatory variables. Second, a least-squares regression is applied between the selected components (latent variables) and the output/response variables. Finally, the model’s parameters are calculated for the selected explanatory variables by combining the two steps [10]. In contrast to PCR, PLS regression captures the maximum variance in the output variables while capturing sufficient variance in the input variables [7, 11]. PLS makes a linear model by projecting the input and output variables onto a new space [12, 13]. LMI converts nonlinear convex optimization problems into linear optimization problems [14]. The basic idea of the LMI is to approximate a given input/output modeling problem posed as a quadratic optimization problem with a linear objective and so-called LMI constraint [8]. Approaches such as PCR and PLS essentially work based on a linear model template. Bayesian networks are graphical models that describe causal or pseudo-causal interactions between variables [9, 15]. Nodes of a BN represent random variables in the Bayesian sense and edges represent conditional dependencies among the random variables [16]. BNs have a number of drawbacks related to the so-called representation problem: they require one to choose between discrete or continuous variables and parametric or non-parametric forms of the conditional probability distribution, and to decompose the joint probability distribution into conditional probability distributions among the relevant variables [17]. Information-theoretic approaches provide a non-parametric alternative to Bayesian networks. They construct parsimonious models of biological networks by establishing statistical dependencies of interactions based on their uncertainty reductions [18–20]. Unlike PCR/PLS, this approach does not make any assumptions about the linearity of the system and the functional form of the statistical distribution of the variables [21, 22]. We describe our information-theoretic approach to the reconstruction of biological networks in the next section. Next, this method is used to develop a parsimonious model of phosphoprotein-cytokine network in RAW 264.7 macrophages. In the following sections, we compare the regulatory components captured by our approach with those identified by previous approaches and the knowledge available in scientific literature.

### Shannon’s information theory

Building upon Hartley’s conceptual framework [23], which relates the information of a random variable with its probability, Shannon [24] defined “entropy” of a random variable in terms of its probability distribution. For a random variable *X* given a random sample {*x*
_{1}, …, *x*
_{
n
}} with probabilities *P*(*x*
_{
i
}), entropy H is defined as

Shannon’s information theory defines “mutual information” as the amount of information about a random variable *X* that can be obtained by observing another random variable *Y*. This definition implies that the information that *Y* provides about *X* reduces uncertainty about *X* due to the knowledge of *Y*. Intuitively, mutual information infers the information that *Y* and *X* share by measuring how much knowing one of the variables can reduce the uncertainty about the other [25]. Then, the mutual information of *Y* relative to *X*, or *X* relative to *Y*, is given by

Mutual information provides a metric for measuring statistical dependencies of interactions. It has several advantages over other methods [18–20]: It does not make any assumption about the functional form of the statistical distribution of variables [22]; and, information theoretic approaches are not dependent on the linearity assumption of the model for the ease of computation [21].

### Threshold selection on mutual information

A parsimonious model of a complex system has to capture a necessary and sufficient model of the entire system, while minimizing the number of interacting components, from the measured data for the system. The ultimate goal of data-driven network-reconstruction methods is to find such a necessary and sufficient model. Information theoretic approaches analyze the statistical dependencies of interacting components by measuring the mutual information coefficients of interactions. A mutual information network of a complex system is obtained by computing the mutual information matrix (MIM) and selecting the threshold of mutual information (TMI). MIM is a square matrix, whose elements *MIM*
_{
ij
} = *I (X*
_{
i
}
*, Y*
_{
j
}
*)* are the mutual information between the variables *X*
_{
i
} and *Y*
_{
j
}. TMI defines the threshold of statistical dependencies of interactions. Choosing an appropriate TMI is a non-trivial problem. A straightforward but computationally demanding approach is to perform permutations of measurements several times and to recalculate a distribution of the mutual information for each permutation. Then permuted distributions are averaged and the largest mutual information in the averaged permuted distribution represents the threshold [26]. Some of the algorithms for network reconstruction and threshold selection in biological networks are discussed below.

The Relevance Network (RelNet) constructs a network in which a pair of random variables *X*
_{
i
} and *Y*
_{
j
} is linked by an edge if the mutual information *I*(*X*
_{
i
},*Y*
_{
j
}) is larger than a given threshold [27]. The Context Likelihood of Relatedness (CLR) algorithm derives a score from the empirical distribution of the mutual information for each pair of random variables *X*
_{
i
} and *Y*
_{
j
}[28]. CLR estimates a score

where *μ*
_{
i
} and *σ*
_{
i
} are the mean and standard deviation of the distribution of the mutual information of *X*
_{
i
} and all other variables *Y*
_{
j
} (*j = 1,…,n*). The Minimum Redundancy Network (MRNet) relies on the conditional mutual information to make inference. MRNet is applied to determine regulatory targets and pathways. If two random variables *X* and *Y* have a large mutual information but are conditionally independent given a third random variable *Z*, MRNet considers no statistical dependency between them [29]. ARACNE (Algorithm for the Reconstruction of Accurate Cellular NEtworks) assigns to each pair of nodes a weight equal to their mutual information and removes the weakest edges by applying a proper threshold [30]. ARACNE applies Kernel Density Estimation (KDE) approaches to measure mutual information between nodes and selects the bandwidth of kernels by minimizing the Kullback–Leibler distances between kernel density distributions of variables before and after removing the *i*
^{th} observation. It also applies an information-theoretic property called the Data Processing Inequality (DPI) to remove statistically weak connections. DPI states that, if *X*
_{
i
} interacts with *X*
_{
j
} through a random variable *X*
_{
k
} then *I*(*X*
_{
i
}, *X*
_{
j
}) < *min*{*I*(*X*
_{
i
}, *X*
_{
k
}), *I*(*X*
_{
j
}, *X*
_{
k
})}.

We employ an information-theoretic approach both to reconstruct complex biological networks and to establish a parsimonious model of the entire system. Our strategy is to determine mutual information of interactions using kernel density estimators based on an unbiased cross-validation [31–33] estimation of kernel bandwidths and to analyze statistical dependencies of nodes by selecting a threshold obtained by applying the large deviation theory [18] employed by ARACNE [30].

## Methods

### Information theoretic approach for biological network reconstruction

As mentioned before, MI measures the information that *X* and *Y* share by measuring how much knowing one of these variables will reduce the uncertainty of the other and reflects the statistical dependencies of two variables. Hence, higher MI between an input and an output indicates a larger reduction in uncertainty and suggests a stronger input–output connection. Small (statistically zero) MI between two random variables indicates that variables are independent.

Measuring mutual information with a kernel density estimator (KDE)—a non-parametric method for estimating probability densities of variables—is more advantageous than histogram-based methods in terms of a better mean square error rate of convergence of the estimate to the underlying density [32]. We note than ARACNE also uses KDE [30] to estimate MI. A disadvantage of KDE is the need to specify an optimal kernel bandwidth [33]. Once the optimal kernel bandwidth is obtained and the MI coefficients of the network are measured using KDE, the next step is to select a proper threshold to determine the boundary of statistically significant connections and the weak connections to be removed; similar concept of statistical significance has been used by Pradervand et al. [34] in a PCR-based approach to network reconstruction. Following these three steps, information theoretic model of the network is obtained. It provides a parsimonious network in which the number of false connections are reduced considerably. Our method of MI-based network reconstruction is inspired by (and borrows several components from) the ARACNE framework [30]. However, we employ a different methodology for the selection of optimal kernel bandwidth as described below.

The following subsections present a description of the above-mentioned steps to create a data-driven model of complex networks. These steps are applied to decipher, in a lumped manner, regulatory mechanisms involved in the release of seven cytokines by activation of 22 signaling proteins in RAW 264.7 macrophage. The Alliance for Cellular Signaling (AfCS) has generated a systematic profiling of signaling responses and cytokine release in RAW 264.7 macrophage [35, 36]. This dataset consists of data from stimulation of macrophages by both Toll and non-Toll receptor ligands. The objective is to create an input–output model, in which signaling responses (22 inputs) are used to predict cytokine release (7 outputs).

### Non-parametric estimations of mutual information

Kernel Density Estimation (KDE) is a non-parametric method to determine the Probability Density Function (PDF) of a random variable. Given a random sample {*x*
_{1}, …, *x*
_{
n
}} for a univariate random variable *X* with an unknown density *f*, a kernel density estimator (KDE) estimates the shape of this function as [32]:

where, *h* is the kernel bandwidth. A bivariate kernel density function of two random variables *X* and *Y* given two random samples {*x*
_{1}, …, *x*
_{
n
}} and {*y*
_{1}, …, *y*
_{
n
}} is defined as:

The mutual information of *X* and *Y* is computed as [37]:

where, *n* is the sample size, and *h* is the kernel width.

### Selection of optimal kernel bandwidth

The use of KDEs to evaluate the MI coefficients requires the optimal selection of the kernel bandwidth *h*. The main criterion used to determine the optimal kernel width is the minimization of the expected risk function, defined as the mean integrated squared error (MISE) between the computed and true (unknown) distributions [32, 33],

where, *f*
_{
h
}
*(x)* is the kernel density estimate of *x* for a bandwidth of *h*. MISE cannot be used directly since it involves the unknown density function *f(x)*. To address this issue, several algorithms have been developed to get an estimate of the optimal bandwidth. One of the most commonly used algorithms employs a cross-validation type approach. Based on this approach, if *f*
_{
h
}
*(x)* is the kernel density estimation at *x* for a bandwidth of *h* using all of the data to fit the KDE, then a cross-validated estimate of the bandwidth is the value for *h* that minimizes [31–33]:

where, *f*
_{
(−i),h
}
*(x*
_{
i
}
*)* is the kernel density estimator using the bandwidth *h* at *x*
_{
i
} obtained after removing *i*
^{th} observation. For two vectors *X* and *Y*, the cross-validation method determines the optimal kernel width for each pair of randomly selected set of *n* pairs of variables and the mean of optimal kernel widths for these *n* pairs is used as an approximated kernel width for the entire dataset [38].

### Network reconstruction and threshold selection

Once the optimal kernel width has been selected and the MI matrix has been computed, the next step is to find an appropriate threshold of MI, *I*
_{
0
}. Based on large deviation theory used by ARACNE algorithm [30], the probability that an empirical value of mutual information *I* is greater than *I*
_{
0
}
*,* provided that its true value is $\overline{\mathit{I}}=0$, is

where, *c* is a constant. Taking the natural logarithm of both sides yields

where, *b* is proportional to the sample size *N*. Therefore, ln*P* is a linear function of *I*
_{
0
} with the slope *b*. Using these results, for any dataset with sample size *N* and a desired p-value, the corresponding threshold can be obtained where *a* and *b* are fitted from the data. This threshold is used to remove statistically weak edges. Since each cytokine is explicitly an output we do not employ any further analysis such as DPI [18] to identify and remove indirect connections.

Using the network thus obtained, a predictive model can be developed as described in Appendix A.

### Application to phosphoprotein-cytokine signaling network

We employ this information theoretic approach to reconstruct the phosphoprotein-cytokine network in RAW 264.7 macrophages. To achieve this goal, the first step is the creation of the MI matrix (MIM) of interactions for each Toll and non-Toll data set separately and then finding a proper threshold for each network.

Macrophages play key roles in both innate and adaptive immunity, regulating the immune responses and the development of acute and chronic inflammations by producing a wide array of powerful chemical substances and regulatory factors such as cytokines [39]. Cytokines are a group of proteins and act as mediators between cells. Cytokines locate and interact with the target immune cells by binding to their receptor [40, 41]. The release of immune-regulatory cytokines is regulated by a complex signaling network [34, 42]. Multiple stimuli generate different signals and these signals generate different cytokine responses. Clear delineation of these signaling pathways is a prerequisite for understanding the causes of cytokine release.

#### Description of the data

In order to determine the signaling components involved in the cytokine release, we used the AfCS data on the phosphoproteins and cytokines under Toll and non-Toll conditions in RAW 264.7 macrophages. The phosphoprotein/cytokine data set consists of 22 phosphoproteins (inputs) and 7 cytokines (outputs). The cytokines (outputs) include: Tumor Necrosis Factor alpha (TNFα), Interleukin-1α (IL-1α), Interleukin-6 (IL-6), Interleukin-10 (IL-10), Granulocyte Macrophage Colony Stimulating Factor (GM-CSF), Regulated on Activation, Normal T Expressed and Secreted (RANTES) and Macrophage Inflammatory Protein- 1alpha (MIP-1α). The phosphoproteins (inputs) include: Signal Transducers and Activator of Transcription (STAT) 1α (STAT1α), STAT1β, STAT3, STAT5, Ribosomal Protein S6 (Rps6), Ribosomal S6 kinase (RSK), Glycogen Synthase Kinase (GSK) 3A (GSK3A), GSK3B, Extracellular-signal Regulated Kinases (ERK) 1 (ERK1), ERK2, cyclic Adenosine Monophosphate (cAMP), c-Jun N-terminal Kinases (JNK) long (JNK lg), JNK short (JNK sh), AKT, p40 Phagocyte Oxidase (p40Phox), Ezrin [Ezr]/Radixin [Rdx](Ezr/Rdx), Membrane-organizing Extension Spike Protein (Moesin or MSN), P38, Sma and Mad related proteins 2 (SMAD2), Nuclear Factor Kappa-light-chain-enhancer of activated B cells p65 (NF-κB p65), Protein Kinase C Delta (PKCD) and Protein kinase C μ2 (PKCμ2).

Both the input data and output data are time-averaged since the time-scales of the input data are in minutes (measurements taken at 1, 3 and 10 minutes) whereas that of the output (cytokines) data are in hours (measurements taken at 2, 3 and 4 hrs). Phosphoproteins were measured using western blots (AfCS protocols #PP00000177 and #PP00000181) and cytokines were measured using multiplex suspension arrays (AfCS protocols #PP00000209 and #PP00000223 [36]). More details about the experiments can be found on the AfCS website [36] and the procedure for pre-processing the data is explained by Pradervand et al. [34]. In short, phosphoprotein data corresponded to *log*(fold-change with respect to basal level) and cytokine data corresponded to *log*(treatment – control + 1).

The dataset used consists of Toll and non-Toll data. RAW cells were stimulated with a panel of 22 ligands, in single and double ligand combinations. The Toll data sets refers to the collection of data in which one of the ligands activates Toll-like receptors (TLRs) and results in major inflammation [34]. These TLR-ligands include lipopolysaccharide (LPS), Resiquimod (R-848), PAM2CSK4 (PAM 2) and PAM3CSK4 (PAM 3). The non-Toll data refers to the collection in which the ligands do not activate one or more of the TLRs. Due to the substantial extent of response when TLRs are activated by TLR-ligands (relative to other ligands), the important effect of other ligands gets masked if one of the ligands is a TLR-ligand. Hence, in order to identify the specific connections in the networks mediating information flow during stimulation by other ligands, the data was separated in Toll and non-Toll sets. After removal of rows with missing values across all inputs and outputs, Toll and non-Toll data each consisted of 78 rows or observations (78 × 22 input data matrix and 78 × 7 output data matrix), which were used to estimate MI in each case.

A reduced model of each set was obtained by applying the principles of information theory described above. Combining these two models, we obtained the network model based on the entire data set. The resulting network provides a parsimonious phosphoprotein-cytokine model, in which the number of signaling components involved in cytokine release is minimized considerably. This model not only successfully captures most of the known signaling components involved in cytokine release, but also predicts new signaling components involved in release of cytokines.

Finally, while not the main objective of this work, we also developed predictive models (albeit linear models since log-transformation removed some of the nonlinearity) using the significant inputs (MI > threshold). The procedure to develop the linear models is presented in Appendix A. We used the Toll dataset for developing the linear models. With the intent to validate the predictive linear models, the data set was partitioned in training and test sets. Since different sets of input variables are significant for different outputs, after eliminating the rows with missing values, the effective number of observations for each output was different, which ranged 89–115 for the training set and 33–39 for the test set; about 3:1 ratio for the number of training vs. test samples.

## Results

The proper kernel bandwidth has been estimated by applying the above-mentioned cross-validation approach (equation (7)). For Toll data set, the selected bandwidth (*h*) is 0.14 and for non-Toll data set, *h* is 0.17. Figure 1 shows the probability density functions of seven released cytokines, as inferred by the KDE in equation (3) computed through the MATLAB function *ksdensity*[43] using the estimated value of *h*. All of the estimated densities are highly non-Gaussian. In this figure, x-axis shows the measured values of cytokines after being normalized and the y-axis demonstrates their densities by applying KDE.

Using these kernel density estimators, we used equation (3) to compute the MI coefficients of all protein-cytokine connections for the Toll and non-Toll datasets. Figure 2 shows these coefficients as a bar-graph, with the corresponding thresholds shown by the dashed lines for a p-value = 0.005 (*I*
_{0} = 0.19 for Toll data and *I*
_{0} = 0.17 for non-Toll data). The MI coefficients below these thresholds are considered to be statistically insignificant and discarded without any significant impact. It can be inferred from Figure 2 (both panels) that increase (decrease) in the desired p-value (and hence decrease (increase) in the MI threshold) will result in inclusion (exclusion) of some connections. For example, for non-Toll data, a small increase in MI threshold will make STAT1α and STAT5 insignificant for RANTES; connections to other cytokines will be unaffected. For the Toll data, since the MI values for the pairs cAMP – TNFα and AKT – MIP1α are close to the threshold, a small increase in the threshold will render these connections insignificant. Conversely, a small decrease in MI threshold will make the cAMP – MIP1α connection significant and hence be included in the network. Similar observations were made with the PCR approach as well [34]. Overall, with changing threshold, the network topology changes in a robust manner where just one or two edges appear or disappear.

Figure 3 shows the reconstructed networks obtained from the non-Toll (left panel and orange nodes) and Toll (right panel and pink nodes) data for 22 signaling phosphoproteins and seven cytokines. These two networks are combined to yield the network of the entire system, which is shown in Figure 4. Blue nodes in Figure 4 show phosphoproteins involved in both datasets. This network captures most of the known signaling components involved in cytokine release and confirms the potentially important novel signaling components that have been suggested recently by other methods, such as PCR [34]. Our approach also identifies new signaling components involved in the release of cytokines, including Ribosomal S6 kinase on TNFα.

Since phosphoproteins may also have regulatory impacts on other phosphoproteins, the above mentioned process is applied again to capture all the significant phosphoprotein-phosphoprotein and phosphoprotein-cytokine connections in one network. The mutual information matrix of all interactions is built again and the proper kernel bandwidth and threshold is selected (*h* = 0.14 and *I*
_{0} = 0.20 for Toll data and *h* = 0.17 and *I*
_{0} = 0.17 for non-Toll data). Figure 5 shows the reconstructed networks obtained from the non-Toll (left panel and orange nodes) and Toll (right panel and pink nodes) data and Figure 6 is the final network obtained by combining the two networks in Figure 5 containing significant phosphoprotein-phosphoprotein and phosphoprotein-cytokine connections in the entire system.To demonstrate the robustness of our results, this network is built again by capturing the networks of each cytokine individually and combining the seven reconstructed networks. Figure 7 shows the networks obtained from node-by-node analysis for TNFα (left panel) and IL-6 (right panel). In comparison with the network of Figure 6, such a network doesn’t capture the regulatory effect of PKCμ2 on G-CSF for Toll-data and cAMP on IL-6 and AKT on TNFα from non-Toll data. As the lower panel in Figure 2 shows, the mutual information of these interactions are very close to the selected threshold. All other connections present in Figure 6 are also included in such a network.

The scatter-plot in Figure 8 illustrates the predictive power of the linear models made from the reconstructed network from the Toll data (Figure 3, right panel) for training (dots) and test (open circles) datasets on cytokine release (see Methods). Most of the training and test data points are inside within two root-mean-squared errors of the training data (Appendix A). To provide a measure of the predictive quality of these linear models, we also computed the coefficient of determination *R*
^{2} for each cytokine as described in Appendix A. The *R*
^{2} values range from 0.32 to 0.62. TNFα and MIP-1α yield the best fit (*R*
^{2} > 0.6) and IL-6 and RANTES yield the lowest coefficients of determination. Although the linear model derived based on the significant components identified through the information theoretic approach is in a good agreement with the predictive models obtained with other methods, such as PCR [34] and PLS [44], the low coefficient of determination in these models, even with log-transformation of the data, indicates the non-linear nature of the phosphoprotein-cytokine signaling networks.

## Discussion

The information theoretic approach accurately identifies the main signaling phosphoproteins involved in cytokine release (Figures 3 and 4). We analyzed both Toll and non-Toll ligand response datasets. Non-Toll data is required to identify the regulatory effects of STAT1α, STAT1β, STAT3, STAT5 and cAMP (Figure 3, left panel) and Toll-data provides information about PKCμ2, JNK lg, JNK sh and NF-κB P65 and ERK2 (Figure 3, right panel). ERK1, AKT, P38 and RSK are identified as significant in both datasets. We provide a comparison of the regulatory components necessary for cytokine release identified by the information theoretic approach and other computational methods such as PCR with statistical significance testing [34] and biochemical knowledge available in literature. The results of this comparison are summarized in Table 1.

Activated macrophages secrete cytokines [86]. Various pathways transmit the signals that initiate cytokine production [87, 88]. Cytokines are classified based on their functions or their sources [86, 89]. They can be grouped into anti-inflammatory and pro-inflammatory cytokines based on their functional role in inflammatory responses. Pro-inflammatory cytokines such as TNFα, IL-1α and GM-CSF induce both acute and chronic inflammatory responses. Anti-inflammatory cytokines, such as IL-10 limit the magnitude of inflammation and chemokines, such as MIP and RANTES are involved in chemotaxis of leukocytes.

### Pro-inflammatory cytokines

Granulocyte/macrophage Colony Stimulating Factor (G-CSF) regulates the production of neutrophil G granulocytes and stimulates the function of mature neutrophils [90]. We identify the phosphoproteins PKCμ2 [49], NF-κB p65 [44], JNK lg/sh [46], P38, RSK [53] and ERK1/2 [51] as the main regulators for the production and release of G-CSF. Tumor Necrosis Factor alpha (TNFα) is involved in normal host defense in both mediating inflammatory and immune responses [91]. Our study captures the largest network of regulatory components for TNFα which consists of twelve signaling phosphoproteins: RSK, AKT, RPS6, PKCμ2, GSK3A, cAMP, ERK1/2, JNK sh/lg, NF-κB p65 and P38. Some studies suggest the regulatory impact of STAT1α and STAT1β on TNFα [92]. Both our network and the network from PCR minimal model [34] missed these connections. Interleukin-1alpha (IL-1α) is produced by activated macrophages and is responsible for inflammation [93]. The information theoretic approach identifies cAMP, JNK lg/sh, ERK1/2, P38 and NF-κB p65 as the main regulators of production/release of IL-1α.

As Table 1 shows, this study identifies most of the signaling components of pro-inflammatory cytokines captured by other computational methods and strongly confirms the regulatory effect of P38 which has been proposed by the PCR minimal model [34]. Unlike the PCR minimal model [34], our approach successfully captures the regulatory effects of ERK1 and ERK2 on GCS-F [51] and TNFα [66]. It confirms the regulatory effect of GSK3A on TNFα [59] which have been suggested by studies. NF-κB, ERK, JNK (targets c-Jun [63]) and Sp1 (trans-activating transcription factor 1) are the transcriptional activators of TNFα [58, 94]. In this light, our results show good agreement with other studies by capturing all signaling components identified by the PCR minimal model, in addition to predicting the known regulatory effects of ERK1/2, GSK3A (regulated by c-Jun which is affected by JNK) [44, 59, 94]. The information theoretic approach also identifies RSK, a substrate of ERK [95], as a potentially novel regulatory component involved in the release of TNFα.

P38 (from Toll data) has the strongest and ERK1 (from non-Toll data) has the weakest regulatory impact on TNFα. As Figure 8 shows, TNFα yields the best linear fit in terms of the coefficient of determination (*R*
^{2} = 0.62), which is in good agreement with other models obtained by PCR [34, 96] and PLS [44] methods. NF-κB p65 represents the highest statistical dependency while PKCμ2 has the lowest mutual information coefficient among the captured regulatory network components of GCS-F. JNK lg (from Toll data) shows the highest regulatory effect on IL-1α.

### Anti-inflammatory cytokines

Interleukin-10 (IL-10) is an anti-inflammatory cytokine that has important roles in immune regulation and inflammation [97]. Our approach shows the regulatory effects of PKCμ2 [65], P38 [56], RSK [67], ERK1/2 [61], NF-κB p65 [64] and JNK sh/lg, on IL-10. Macrophage Inflammatory Protein-1α (MIP-1α) belongs to the group of CC chemokines that regulate several inflammatory responses including trafficking and activation of leukocytes, as well as the fever response [98]. We capture the regulatory effects of cAMP [72], AKT [79], RSK, ERK1/2 [74], P38 [57], JNK sh/lg [76] and NF-κB p65 [70] on MIP-1α. One study suggests the regulatory effects of STAT1α/β and STAT3 on MIP-1α [68]. The PCR minimal model [34] only identifies STAT1α as a significant component of MIP-1α. Regulated on Activation, Normal T Expressed and Secreted (RANTES), is a CC chemokine and has a key role in recruiting leukocytes into inflammatory sites [99]. The information theoretic approach suggests that STAT3, STAT5, STAT1α, NF-κB p65, PKCμ2, P38 JNK lg/sh, ERK1/2 and RSK regulate RANTES and unlike the PCR minimal model [34], it is in good agreement with the cytokine literature.

As indicated in Table 1, the network identified by our study includes most of the known signaling components of anti-inflammatory cytokines described in the literature and unlike the PCR minimal model [34], captures the regulatory effects of NF-κB p65 and ERK1/2 on MIP-1α. Some studies suggest that the TLR ligand pathways that activate IL-10 are P38 dependent and NF-κB signaling pathway has no contribution on the activation of IL-10 [100, 101]. However, our study and the PCR model [34] identify the regulatory effects of JNK lg/sh which are activated through NF-κB p65.

The information theoretic approach and PCR [34] models both yield low coefficient of determination for cytokines (*R*
^{2} < 0.8) possibly due to their regulations by unmeasured pathways and/or a nonlinear relationship between the levels of cytokines and the phosphoproteins. In comparison to the PCR approach, information theoretic approach shows a better agreement with known regulatory components in the literature. The high variability of data (low coefficient of determination) might explain this by considering the fact that when noise or variability is high, the threshold used in the PCR approach is high so that it identifies a relatively lesser number of components as being significant. The non-linear nature of the biological processes might be an explanation for the failure of PCR to identify the regulatory effects of ERK1/2, cAMP and RSK on cytokines.

JNK lg (from Toll data) has the strongest effect and AKT (from non-Toll data) has the weakest effect on MIP-1α. Our network shows the highest mutual information (from non-Toll data) for NF-κB and IL-10. PKCμ2 has the weakest regulatory effect on IL-10. JNK lg has the strongest regulatory effect on RANTES and STAT3 shows the lowest statistical dependencies to it.

### Interleukin-6

Interleukin-6 (IL-6) is secreted by macrophages and T cells and acts as both a pro-inflammatory and anti-inflammatory cytokine [102]. Our model identifies the regulatory effects of phosphoproteins RSK, PKCμ2, ERK1/2, JNK sh/lg, P38, NF-κB and cAMP. The regulatory roles of cAMP [54] and P38 [48] which could not be captured by the PCR minimal model [34], are identified by the information theoretic approach. JNK lg (from Toll data) yields the strongest regulatory effect and cAMP (from non-Toll data) yields the weakest regulatory effect on IL-6.

Overall, our network model and quantitative predictions are in good agreement with other studies available in literature and captures most of the known regulatory components involved in cytokine release. Our model confirms the regulatory effect of P38 on G-CSF that has been suggested by the PCR minimal model several years ago [34] and captures one potentially novel regulatory effect of RSK on TNFα. The advantages of the information theoretic method has been demonstrated by comparing the accuracy of its parsimonious model to the models obtained by other computational methods such as PCR minimal models in predicting the regulatory components for cytokines with high variability and low coefficient of determination.

## Conclusions

Identifying the regulatory components for cytokines is critical for understanding the mechanisms that control their production and release in immune cells. In recent years, several computational methods have been applied to develop signaling networks, involved in cytokine release, which have led to an improved understanding of cytokine release in macrophages. In this work, we developed a parsimonious input–output model of regulatory phosphoprotein-cytokine network based on an information theoretic approach. Our model demonstrated the applicability of this approach to the data-driven reconstruction of biological networks. The data, which consisted of a systematic profiling of signaling responses in RAW 264.7 macrophage cells upon treatment with Toll- and non-Toll receptor ligands, was provided by the Alliance for Cellular Signaling (AfCS). Information theoretic approach as a non-parametric method identified the regulatory components (phosphoproteins) on which specific cytokines showed significant statistical dependence (measured in terms of mutual information). The reconstructed network also was able to capture the regulatory network of phosphoprotein interactions. We calculated mutual information of interactions by using kernel density estimator (KDE) and discarded weak connections using proper thresholds. Using such a parsimonious list of significant inputs, a predictive model was also developed for each of the cytokines which predicted a separate test data well. Most of the significant connections are validated against the known literature. Some novel connections, such as Ribosomal S6 kinase for Tumor Necrosis Factor alpha are also identified by the mutual information approach, which were not detected by the PCR approach. These novel regulatory components serve as testable hypotheses.

### Availability of supporting data

The data sets supporting the results of this article are available at the UCSD Signaling Gateway web site [http://www.signaling-gateway.org/data/Data.html].

## Appendices

### Appendix A - development of a predictive model

To develop a predictive model using the reconstructed network, we build the following linear model between the significant inputs (*X*) and a chosen output (*Y*):

where, *ϵ* represents white noise. Generally, one deals with one output at a time because the set of significant inputs differs for different outputs. *X* is mean-centered and normalized by the standard deviation and *Y* is mean-centered. The coefficient matrix, *b*, is estimated by least square method [103] using “training dataset”:

Once $\widehat{\mathit{b}}$ is estimated, the model can be tested on a “test dataset”. The test dataset generally has the same probability distribution as training dataset. Thus, given the input test data *X*
_{test} (normalized by using the mean and standard deviation parameters obtained for the training set), the output test data *Y*
_{test} (offset by the mean of *Y*) is predicted as:

Two metrics used to measure the accuracy of the prediction are Root Mean Square Error (RSME) and coefficient of determination (*R*
^{2}) calculated as [104]:

where, *n* is the number of data points. ${\overline{\mathit{Y}}}_{\mathit{test}}$ is the mean value of the *n* data points for the chosen output. *R*
^{2} is a good quantitative metric indicating the quality of prediction by the linear model.

## References

- 1.
Spirin V, Mirny LA: Protein complexes and functional modules in molecular networks. Proc Natl Acad Sci U S A. 2003, 100: 12123-12128.

- 2.
Li W, Liu Y, Huang HC, Peng Y, Lin Y, Ng WK, Ong KL: Dynamical systems for discovering protein complexes and functional modules from biological networks. IEEE/ACM Trans Comput Biol Bioinform. 2007, 4: 233-250.

- 3.
Cho KH, Choo SM, Jung SH, Kim JR, Choi HS, Kim J: Reverse engineering of gene regulatory networks. IET Syst Biol. 2007, 1: 149-163.

- 4.
Albert R: Network inference, analysis, and modeling in systems biology. Plant Cell. 2007, 19: 3327-3338.

- 5.
Hyduke DR, Palsson BO: Towards genome-scale signalling network reconstructions. Nat Rev Genet. 2010, 11: 297-307.

- 6.
Jolliffe IT: Principal Component Analysis. 1986, New York: Springer-Verlag

- 7.
Kramer R: Chemometric Techniques for Quantitative Analysis. 1998, New York: Marcel Dekker

- 8.
El Ghaoui L, Niculescu S-I: Advances in Linear Matrix Inequality Methods in Control. 1999, Philadelphia, PA: Society for Industrial and Applied Mathematics

- 9.
Neapolitan RE: Learning Bayesian Networks. 2004, Upper Saddle River, NJ: Prentice Hall

- 10.
Reiss PT, Ogden RT: Functional principal component regression and functional partial least squares. J Am Stat Assoc. 2007, 102: 984-996.

- 11.
Toscas PJ, Shaw FD, Beilken SL: Partial least squares (PLS) regression for the analysis of instrument measurements and sensory meat quality data. Meat Sci. 1999, 52: 173-178.

- 12.
Wentzell PD, Montoto LV: Comparison of principal components regression and partial least squares regression through generic simulations of complex mixtures. Chemom Intell Lab Syst. 2003, 65: 257-279.

- 13.
Esposito Vinzi V: SpringerLink (Online Service): Handbook of Partial Least Squares Concepts, Methods and Applications. 2010, Berlin: Springer

- 14.
Cosentino C, Curatola W, Montefusco F, Bansal M, di Bernardo D, Amato F: Linear matrix inequalities approach to reconstruction of biological networks. IET Syst Biol. 2007, 1: 164-173.

- 15.
Geiger D, Verma T, Pearl J: Identifying Independence in Bayesian Networks. Networks. 1990, 20: 507-534.

- 16.
Heckerman D, Geiger D, Chickering DM: Learning Bayesian networks - the combination of knowledge and statistical-data. Mach Learn. 1995, 20: 197-243.

- 17.
Perrin BE, Ralaivola L, Mazurie A, Bottani S, Mallet J, D’Alche-Buc F: Gene networks inference using dynamic Bayesian networks. Bioinformatics. 2003, 19 Suppl 2: ii138-ii148.

- 18.
Cover TM, Thomas JA: Elements of Information Theory. 2006, Hoboken, N.J.: Wiley-Interscience, 2

- 19.
Kraskov A, Stogbauer H, Grassberger P: Estimating mutual information. Phys Rev E Stat Nonlin Soft Matter Phys. 2004, 69: 066138-

- 20.
Hnizdo V, Darian E, Fedorowicz A, Demchuk E, Li S, Singh H: Nearest-neighbor nonparametric method for estimating the configurational entropy of complex molecules. J Comput Chem. 2007, 28: 655-668.

- 21.
Steinfath M, Groth D, Lisec J, Selbig J: Metabolite profile analysis: from raw data to regression and classification. Physiol Plant. 2008, 132: 150-161.

- 22.
Numata J, Ebenhoh O, Knapp EW: Measuring correlations in metabolomic networks with mutual information. Genome Informatics Int Conf Genome Informatics. 2008, 20: 112-122.

- 23.
Hartley RH: Transmission of information. Bell Syst Tech J. 1928, 7: 535-563.

- 24.
Shannon CE: A mathematical theory of communication. Bell Syst Tech J. 1948, 27: 379-423.

- 25.
Kojadinovic I: On the use of mutual information in data analysis: an overview. 11th International Symposium Applied Stochastic Models Data Analysis. 2005, France: Brest, 738-747.

- 26.
Butte AJ, Kohane IS: Mutual information relevance networks: functional genomics clustering using pairwise entropy measurements. Pac Symp Biocomput. 2000, 5: 418-429. URL: http://psb.stanford.edu/psb-online/proceedings/psb00/butte.pdf [access date: June 14, 2014]

- 27.
Butte AJ, Tamayo P, Slonim D, Golub TR, Kohane IS: Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks. Proc Natl Acad Sci U S A. 2000, 97: 12182-12186.

- 28.
Faith JJ, Hayete B, Thaden JT, Mogno I, Wierzbowski J, Cottarel G, Kasif S, Collins JJ, Gardner TS: Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles. PLoS Biol. 2007, 5: e8-

- 29.
Peng H, Long F, Ding C: Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell. 2005, 27: 1226-1238.

- 30.
Margolin AA, Nemenman I, Basso K, Wiggins C, Stolovitzky G, Dalla Favera R, Califano A: ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinformatics. 2006, 7 Suppl 1: S7-

- 31.
Silverman BW: Density Estimation for Statistics and Data Analysis. 1986, London, New York: Chapman and Hall

- 32.
Raykar VC, Duraiswami R: Fast optimal bandwidth selection for kernel density estimation. sixth SIAM International Conference on Data Mining. Edited by: Ghosh J, Lambert D, Skillicorn D, Srivastava J. 2006, Bethesda: Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 524-528.

- 33.
Mugdadi AR, Ahmad IA: A bandwidth selection for kernel density estimation of functions of random variables. Comput Stat Data Anal. 2004, 47: 49-62.

- 34.
Pradervand S, Maurya MR, Subramaniam S: Identification of signaling components required for the prediction of cytokine release in RAW 264.7 macrophages. Genome Biol. 2006, 7: R11-

- 35.
Gilman AG, Simon MI, Bourne HR, Harris BA, Long R, Ross EM, Stull JT, Taussig R, Bourne HR, Arkin AP, Cobb MH, Cyster JG, Devreotes PN, Ferrell JE, Fruman D, Gold M, Weiss A, Stull JT, Berridge MJ, Cantley LC, Catterall WA, Coughlin SR, Olson EN, Smith TF, Brugge JS, Botstein D, Dixon JE, Hunter T, Lefkowitz RJ, Pawson AJ: Overview of the alliance for cellular signaling. Nature. 2002, 420: 703-706.

- 36.
The Alliance for Cellular Signalling (AfCS). [http://signaling-gateway.org]

- 37.
Moon YI, Rajagopalan B, Lall U: Estimation of mutual information using kernel density estimators. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 1995, 52: 2318-2321.

- 38.
Turlach BA: Bandwidth selection in kernel density estimation: A review. Technical Report. 1993, Place de l'Université 1, 1348, Belgium: Univ. Catholique de Louvain

- 39.
Mosser DM, Edwards JP: Exploring the full spectrum of macrophage activation. Nat Rev Immunol. 2008, 8: 958-969.

- 40.
Saito S: Cytokine cross-talk between mother and the embryo/placenta. J Reprod Immunol. 2001, 52: 15-33.

- 41.
Feghali CA, Wright TM: Cytokines in acute and chronic inflammation. Front Biosci. 1997, 2: d12-d26.

- 42.
Stanley AC, Lacy P: Pathways for cytokine secretion. Physiology. 2010, 25: 218-229.

- 43.
MATLAB and Statistics Toolbox Release 2012b. [http://www.mathworks.com/help/stats/ksdensity.html]

- 44.
Wu Y, Johnson GL, Gomez SM: Data-driven modeling of cellular stimulation, signaling and output response in RAW 264.7 cells. J Mol Signal. 2008, 3: 11-

- 45.
Reeves MB, Compton T: Inhibition of inflammatory interleukin-6 activity via extracellular signal-regulated kinase-mitogen-activated protein kinase signaling antagonizes human cytomegalovirus reactivation from dendritic cells. J Virol. 2011, 85: 12750-12758.

- 46.
Cho YH, Lee CH, Kim SG: Potentiation of lipopolysaccharide-inducible cyclooxygenase 2 expression by C2-ceramide via c-Jun N-terminal kinase-mediated activation of CCAAT/enhancer binding protein beta in macrophages. Mol Pharmacol. 2003, 63: 512-523.

- 47.
Yin T, Yang YC: Mitogen-activated protein kinases and ribosomal S6 protein kinases are involved in signaling pathways shared by interleukin-11, interleukin-6, leukemia inhibitory factor, and oncostatin M in mouse 3 T3-L1 cells. J Biol Chem. 1994, 269: 3731-3738.

- 48.
Ahmed ST, Mayer A, Ji JD, Ivashkiv LB: Inhibition of IL-6 signaling by a p38-dependent pathway occurs in the absence of new protein synthesis. J Leukoc Biol. 2002, 72: 154-162.

- 49.
Kothari SS, Abrahamsen MS, Cole T, Hammond WP: Expression of granulocyte colony stimulating factor (G-CSF) and granulocyte/macrophage colony stimulating factor (GM-CSF) mRNA upon stimulation with phorbol ester. Blood Cells Mol Dis. 1995, 21: 192-200.

- 50.
Sanders JL, Stern PH: Protein kinase C involvement in interleukin-6 production by parathyroid hormone and tumor necrosis factor-alpha in UMR-106 osteoblastic cells. J Bone Miner Res. 2000, 15: 885-893.

- 51.
Suzuki K, Hino M, Hato F, Tatsumi N, Kitagawa S: Cytokine-specific activation of distinct mitogen-activated protein kinase subtype cascades in human neutrophils stimulated by granulocyte colony-stimulating factor, granulocyte-macrophage colony-stimulating factor, and tumor necrosis factor-alpha. Blood. 1999, 93: 341-349.

- 52.
Faggioli L, Costanzo C, Donadelli M, Palmieri M: Activation of the interleukin-6 promoter by a dominant negative mutant of c-Jun. Bba Mol Cell Res. 2004, 1692: 17-24.

- 53.
Joseph DE, Paul CC, Baumann MA, Gomez-Cambronero J: S6 kinase p90rsk in granulocyte-macrophage colony-stimulating factor-stimulated proliferative and mature hematopoietic cells. J Biol Chem. 1996, 271: 13088-13093.

- 54.
Dendorfer U, Oettgen P, Libermann TA: Multiple regulatory elements in the interleukin-6 gene mediate induction by prostaglandins, cyclic AMP, and lipopolysaccharide. Mol Cell Biol. 1994, 14: 4443-4454.

- 55.
Ozes ON, Akca H, Mayo LD, Gustin JA, Maehama T, Dixon JE, Donner DB: A phosphatidylinositol 3-kinase/Akt/mTOR pathway mediates and PTEN antagonizes tumor necrosis factor inhibition of insulin signaling through insulin receptor substrate-1. Proc Natl Acad Sci U S A. 2001, 98: 4640-4645.

- 56.
Dobreva ZG, Miteva LD, Stanilova SA: The inhibition of JNK and p38 MAPKs downregulates IL-10 and differentially affects c-Jun gene expression in human monocytes. Immunopharmacol Immunotoxicol. 2009, 31: 195-201.

- 57.
Dean JLE, Sarsfield SJ, Tsounakou E, Saklatvala J: p38 mitogen-activated protein kinase stabilizes mRNAs that contain cyclooxygenase-2 and tumor necrosis factor AU-rich elements by inhibiting deadenylation. J Biol Chem. 2003, 278: 39470-39476.

- 58.
Kuprash DV, Udalova IA, Turetskaya RL, Kwiatkowski D, Rice NR, Nedospasov SA: Similarities and differences between human and murine TNF promoters in their response to lipopolysaccharide. J Immunol. 1999, 162: 4045-4052.

- 59.
Eto M, Kouroedov A, Cosentino F, Luscher TF: Glycogen synthase kinase-3 mediates endothelial cell activation by tumor necrosis factor-alpha. Circulation. 2005, 112: 1316-1322.

- 60.
Johannes FJ, Horn J, Link G, Haas E, Siemienski K, Wajant H, Pfizenmaier K: Protein kinase Cmu downregulation of tumor-necrosis-factor-induced apoptosis correlates with enhanced expression of nuclear-factor-kappaB-dependent protective genes. Eur J Biochem. 1998, 257: 47-54.

- 61.
Qian C, Jiang X, An H, Yu Y, Guo Z, Liu S, Xu H, Cao X: TLR agonists promote ERK-mediated preferential IL-10 production of regulatory dendritic cells (diffDCs), leading to NK-cell activation. Blood. 2006, 108: 2307-2315.

- 62.
Ollivier V, Parry GC, Cobb RR, de Prost D, Mackman N: Elevated cyclic AMP inhibits NF-kappaB-mediated transcription in human monocytic cells and endothelial cells. J Biol Chem. 1996, 271: 20828-20835.

- 63.
Diaz B, Lopez-Berestein G: A distinct element involved in lipopolysaccharide activation of the tumor necrosis factor-alpha promoter in monocytes. J Interferon Cytokine Res. 2000, 20: 741-748.

- 64.
Wen AY, Sakamoto KM, Miller LS: The role of the transcription factor CREB in immune function. J Immunol. 2010, 185: 6413-6419.

- 65.
Tremblay P, Houde M, Arbour N, Rochefort D, Masure S, Mandeville R, Opdenakker G, Oth D: Differential effects of PKC inhibitors on gelatinase B and interleukin 6 production in the mouse macrophage. Cytokine. 1995, 7: 130-136.

- 66.
Means TK, Pavlovich RP, Roca D, Vermeulen MW, Fenton MJ: Activation of TNF-alpha transcription utilizes distinct MAP kinase pathways in different macrophage populations. J Leukoc Biol. 2000, 67: 885-893.

- 67.
Naqvi S, Macdonald A, McCoy CE, Darragh J, Reith AD, Arthur JS: Characterization of the cellular action of the MSK inhibitor SB-747651A. Biochem J. 2012, 441: 347-357.

- 68.
Guzzo C, Mat NFC, Gee K: Interleukin-27 induces a STAT1/3- and NF-kappa B-dependent proinflammatory cytokine profile in human monocytes. (vol 285, pg 24404, 2010). J Biol Chem. 2012, 287: 8661-8671.

- 69.
Kovacic JC, Gupta R, Lee AC, Ma M, Fang F, Tolbert CN, Walts AD, Beltran LE, San H, Chen G, Hilaire CS, Boehm M: Stat3-dependent acute Rantes production in vascular smooth muscle cells modulates inflammation following arterial injury in mice. J Clin Invest. 2010, 120: 303-314.

- 70.
Brueckmann M, Hoffmann U, Dvortsak E, Lang S, Kaden JJ, Borggrefe M, Haase KK: Drotrecogin alfa (activated) inhibits NF-kappa B activation and MIP-1-alpha release from isolated mononuclear cells of patients with severe sepsis. Inflamm Res. 2004, 53: 528-533.

- 71.
Kelly J, Spolski R, Imada K, Bollenbacher J, Lee S, Leonard WJ: A role for Stat5 in CD8+ T cell homeostasis. J Immunol. 2003, 170: 210-217.

- 72.
Amella CA, Sherry B, Shepp DH, Schmidtmayerova H: Macrophage inflammatory protein 1alpha inhibits postentry steps of human immunodeficiency virus type 1 infection via suppression of intracellular cyclic AMP. J Virol. 2005, 79: 5625-5631.

- 73.
Ohmori Y, Schreiber RD, Hamilton TA: Synergy between interferon-gamma and tumor necrosis factor-alpha in transcriptional activation is mediated by cooperation between signal transducer and activator of transcription 1 and nuclear factor kappaB. J Biol Chem. 1997, 272: 14899-14907.

- 74.
Ottonello L, Montecucco F, Bertolotto M, Arduino N, Mancini M, Corcione A, Pistoia V, Dallegri F: CCL3 (MIP-1alpha) induces in vitro migration of GM-CSF-primed human neutrophils via CCR5-dependent activation of ERK 1/2. Cell Signal. 2005, 17: 355-363.

- 75.
Hiura TS, Kempiak SJ, Nel AE: Activation of the human RANTES gene promoter in a macrophage cell line by lipopolysaccharide is dependent on stress-activated protein kinases and the IkappaB kinase cascade: implications for exacerbation of allergic inflammation by environmental pollutants. Clin Immunol. 1999, 90: 287-301.

- 76.
Leyva-Illades D, Cherla RP, Lee MS, Tesh VL: Regulation of cytokine and Chemokine expression by the ribotoxic stress response elicited by Shiga toxin type 1 in human macrophage-like THP-1 cells. Infect Immun. 2012, 80: 2109-2120.

- 77.
Dai X, Sayama K, Tohyama M, Shirakata Y, Yang L, Hirakawa S, Tokumaru S, Hashimoto K: The NF-kappaB, p38 MAPK and STAT1 pathways differentially regulate the dsRNA-mediated innate immune responses of epidermal keratinocytes. Int Immunol. 2008, 20: 901-909.

- 78.
Jordan NJ, Watson ML, Yoshimura T, Westwick J: Differential effects of protein kinase C inhibitors on chemokine production in human synovial fibroblasts. Br J Pharmacol. 1996, 117: 1245-1253.

- 79.
Lentzsch S, Gries M, Janz M, Bargou R, Dorken B, Mapara MY: Macrophage inflammatory protein 1-alpha (MIP-1 alpha) triggers migration and signaling cascades mediating survival and proliferation in multiple myeloma (MM) cells. Blood. 2003, 101: 3568-3573.

- 80.
Zhang Y, Zhai Q, Luo Y, Dorf ME: RANTES-mediated chemokine transcription in astrocytes involves activation and translocation of p90 ribosomal S6 protein kinase (RSK). J Biol Chem. 2002, 277: 19042-19048.

- 81.
Hobbs RM, Watt FM: Regulation of interleukin-1alpha expression by integrins and epidermal growth factor receptor in keratinocytes from a mouse model of inflammatory skin disease. J Biol Chem. 2003, 278: 19798-19807.

- 82.
Bird TA, Schule HD, Delaney P, de Roos P, Sleath P, Dower SK, Virca GD: The interleukin-1-stimulated protein kinase that phosphorylates heat shock protein hsp27 is activated by MAP kinase. FEBS Lett. 1994, 338: 31-36.

- 83.
Hashimoto S, Matsumoto K, Gon Y, Maruoka S, Kujime K, Hayashi S, Takeshita I, Horie T: p38 MAP kinase regulates TNF alpha-, IL-1 alpha- and PAF-induced RANTES and GM-CSF production by human bronchial epithelial cells. Clin Exp Allergy. 2000, 30: 48-55.

- 84.
Bailly S, Fay M, Israel N, Gougerot-Pocidalo MA: The transcription factor AP-1 binds to the human interleukin 1 alpha promoter. Eur Cytokine Netw. 1996, 7: 125-128.

- 85.
Mori N, Prager D: Transactivation of the interleukin-1 alpha promoter by human T-cell leukemia virus. Leuk Lymphoma. 1997, 26: 421-433.

- 86.
Ma J, Chen T, Mandelin J, Ceponis A, Miller NE, Hukkanen M, Ma GF, Konttinen YT: Regulation of macrophage activation. Cell Mol Life Sci. 2003, 60: 2334-2346.

- 87.
Shaw G, Kamen R: A conserved AU sequence from the 3′ untranslated region of GM-CSF mRNA mediates selective mRNA degradation. Cell. 1986, 46: 659-667.

- 88.
Kontoyiannis D, Pasparakis M, Pizarro TT, Cominelli F, Kollias G: Impaired on/off regulation of TNF biosynthesis in mice lacking TNF AU-rich elements: implications for joint and gut-associated immunopathologies. Immunity. 1999, 10: 387-398.

- 89.
Oppenheim JJ, Murphy WJ, Chertox O, Schirrmacher V, Wang JM: Prospects for cytokine and chemokine biotherapy. Clin Cancer Res. 1997, 3: 2682-2686.

- 90.
Ogawa T, Kusumoto M, Kuroki S, Nagata S, Yamanaka N, Kawano R, Yoshida J, Shinohara M, Matsuo K: Adjuvant GM-CSF cytokine gene therapy for breast cancer. Gan to kagaku ryoho Cancer Chemother. 2001, 28: 1512-1514.

- 91.
Beutler B, Cerami A: The biology of cachectin/TNF–a primary mediator of the host response. Annu Rev Immunol. 1989, 7: 625-655.

- 92.
Wang Y, Wu TR, Cai S, Welte T, Chin YE: Stat1 as a component of tumor necrosis factor alpha receptor 1-TRADD signaling complex to inhibit NF-kappaB activation. Mol Cell Biol. 2000, 20: 4505-4512.

- 93.
Dinarello CA: The biological properties of interleukin-1. Eur Cytokine Netw. 1994, 5: 517-531.http://www.ncbi.nlm.nih.gov/pubmed/7727685,

- 94.
Tsai EY, Falvo JV, Tsytsykova AV, Barczak AK, Reimold AM, Glimcher LH, Fenton MJ, Gordon DC, Dunn IF, Goldfeld AE: A lipopolysaccharide-specific enhancer complex involving Ets, Elk-1, Sp1, and CREB binding protein and p300 is recruited to the tumor necrosis factor alpha promoter in vivo. Mol Cell Biol. 2000, 20: 6084-6094.

- 95.
Frodin M, Gammeltoft S: Role and regulation of 90 kDa ribosomal S6 kinase (RSK) in signal transduction. Mol Cell Endocrinol. 1999, 151: 65-77.

- 96.
Asadi B, Maurya MR, Tartakovsky DM, Subramaniam S: Comparison of statistical and optimisation-based methods for data-driven network reconstruction of biochemical systems. IET Syst Biol. 2012, 6: 155-163.

- 97.
Rensink AA, Gellekink H, Otte-Holler I, ten Donkelaar HJ, de Waal RM, Verbeek MM, Kremer B: Expression of the cytokine leukemia inhibitory factor and pro-apoptotic insulin-like growth factor binding protein-3 in Alzheimer’s disease. Acta Neuropathol. 2002, 104: 525-533.

- 98.
Davatelis G, Tekamp-Olson P, Wolpe SD, Hermsen K, Luedke C, Gallegos C, Coit D, Merryweather J, Cerami A: Cloning and characterization of a cDNA for murine macrophage inflammatory protein (MIP), a novel monokine with inflammatory and chemokinetic properties. J Exp Med. 1988, 167: 1939-1944.

- 99.
Rossi D, Zlotnik A: The biology of chemokines and their receptors. Annu Rev Immunol. 2000, 18: 217-242.

- 100.
Bondeson J, Browne KA, Brennan FM, Foxwell BM, Feldmann M: Selective regulation of cytokine induction by adenoviral gene transfer of IkappaBalpha into human macrophages: lipopolysaccharide-induced, but not zymosan-induced, proinflammatory cytokines are inhibited, but IL-10 is nuclear factor-kappaB independent. J Immunol. 1999, 162: 2939-2945.

- 101.
Ma W, Lim W, Gee K, Aucoin S, Nandan D, Kozlowski M, Diaz-Mitoma F, Kumar A: The p38 mitogen-activated kinase pathway regulates the human interleukin-10 promoter via the activation of Sp1 transcription factor in lipopolysaccharide-stimulated human macrophages. J Biol Chem. 2001, 276: 13664-13674.

- 102.
Scheller J, Chalaris A, Schmidt-Arras D, Rose-John S: The pro- and anti-inflammatory properties of the cytokine interleukin-6. Biochim Biophys Acta. 2011, 1813: 878-888.

- 103.
Bretscher O: Linear Algebra With Applications. 2013, Boston: Pearson Education, 5

- 104.
DeGroot MH, Schervish MJ: Probability and statistics. 2012, Boston: Addison-Wesley, 4

## Acknowledgments

This research was supported by the National Science Foundation (NSF) collaborative grants DBI-0835541 and STC-0939370, and National Institutes of Health (NIH) collaborative grants U54GM69338, R01HL106579 and R01HL108735 to SS.

## Author information

## Additional information

### Competing interests

The authors declare that they have no competing interests.

### Authors’ contributions

Research design and supervision: SS, DMT, MRM. Algorithm: FF, MRM, SS, DMT. Computer program: FF, MRM. Writing and revision: FF, MRM, SS, DMT. FF and MRM contributed equally to this work. All authors read and approved the final manuscript.

Farzaneh Farhangmehr, Mano Ram Maurya contributed equally to this work.

## Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

## Rights and permissions

## About this article

#### Received

#### Accepted

#### Published

#### DOI

### Keywords

- Bioinformatics
- Data mining
- Network inference
- Data-driven network reconstruction
- Information theory
- Mutual information
- Probabilistic algorithm
- Statistical methods