Volume 1 Supplement 1

BioSysBio 2007: Systems Biology, Bioinformatics, Synthetic Biology

Open Access

Gene regulatory network of human adipocyte differentiation

  • Judit Kumuthini1Email author,
  • Conrad Bessant1,
  • Paul Wilson2 and
  • Daniel Crowther2
BMC Systems Biology20071(Suppl 1):P57

DOI: 10.1186/1752-0509-1-S1-P57

Published: 8 May 2007

Background

In this article we demonstrate novel pre-processing methods to reduce data dimensionality of human adipocyte differentiation microarray data. Genetic networks of the insulin receptor family, ppar family, fox family, cebp family mef2, fabp, add1 and klf, and probes with highly significant change in gene expression level were learned separately using a Bayesian frame work. The extracted networks were validation of genetic network against many publicly available and as well as in house interaction and literature databases available at GSK.

Methods

Multidimensional, hMAD microarray data provided by GSK was used to generate additional artificial experiments using a novel technique and the differentially expressed probes were filtered. Through Gaussian clustering 45 clusters plus the outliers extracted were used to learn the genetic network using taboo search algorithm using BayesiaLab®.

Results

See Figure 1 and Figure 2.

Conclusion

The choices of pre-processing methods and dimensionality reduction techniques applied in this work have a major impact on Bayesian network extraction. The Bayesian networks extracted were validated against a proprietary Network warehouse database at GSK. Many novel genetic interactions were identified. We suggest that these pre-processing methods can be widely used genetic network extraction, even in the absence of many experiments but replicates. Thus improves the prediction of significant changes in gene expression for microarray experiments and reduce both false positives and negatives.
https://static-content.springer.com/image/art%3A10.1186%2F1752-0509-1-S1-P57/MediaObjects/12918_2007_Article_130_Fig1_HTML.jpg
Figure 1

GRN of genes of interest, colour coded according to the biological family they belong to. The respective Affymetrics probe ID annotation from Tiger database is associated with each node as indicated by the sign.

https://static-content.springer.com/image/art%3A10.1186%2F1752-0509-1-S1-P57/MediaObjects/12918_2007_Article_130_Fig2_HTML.jpg
Figure 2

Pearson correlation coefficient of optimised 45 Clusters TN represented by the wavy circular node. The thickness of the arc represents the information contributed to the global structure. The strength of the arc is proportional to the strength of the probabilistic relation. The association of marginal probability to the modalities (outcome) of nodes is computed R, the Pearson's linear correlation coefficient between two nodes linked by an arc. The thickness of an arc is directly proportional to the absolute value of R. The blue indicated positive correlation and red a negative correlation. The exact value of the correlation for each arc is temporary displayed in the comment of the arc as indicated by the sign. The association of values to nodes' modalities and R, the Pearson's linear correlation coefficient between two nodes linked by an arc. If the modalities don't have associated values, default values are defined in order to compute R (from 0 to n-1 for a node with n modalities). The thickness of an arc is directly proportional to the absolute value of R, its colour represents the sign of R (blue if positive and red if not). The exact value of the correlation for each arc is temporary displayed in the comment of the arc.

Authors’ Affiliations

(1)
Cranfield University
(2)
GSK, Bioinformatics Research and Development

Copyright

© Kumuthini et al; licensee BioMed Central Ltd. 2007

This article is published under license to BioMed Central Ltd.

Advertisement