Volume 5 Supplement 2
Combinatorial motif analysis of regulatory gene expression in Mafb deficient macrophages
© Morita et al; licensee BioMed Central Ltd. 2011
Published: 14 December 2011
Deficiency of the transcription factor MafB, which is normally expressed in macrophages, can underlie cellular dysfunction associated with a range of autoimmune diseases and arteriosclerosis. MafB has important roles in cell differentiation and regulation of target gene expression; however, the mechanisms of this regulation and the identities of other transcription factors with which MafB interacts remain uncertain. Bioinformatics methods provide a valuable approach for elucidating the nature of these interactions with transcriptional regulatory elements from a large number of DNA sequences. In particular, identification of patterns of co-occurrence of regulatory cis-elements (motifs) offers a robust approach.
Here, the directional relationships among several functional motifs were evaluated using the Log-linear Graphical Model (LGM) after extraction and search for evolutionarily conserved motifs. This analysis highlighted GATA-1 motifs and 5’AT-rich half Maf recognition elements (MAREs) in promoter regions of 18 genes that were down-regulated in Mafb deficient macrophages. GATA-1 motifs and MafB motifs could regulate expression of these genes in both a negative and positive manner, respectively. The validity of this conclusion was tested with data from a luciferase assay that used a C1qa promoter construct carrying both the GATA-1 motifs and MAREs. GATA-1 was found to inhibit the activity of the C1qa promoter with the GATA-1 motifs and MafB motifs.
These observations suggest that both the GATA-1 motifs and MafB motifs are important for lineage specific expression of C1qa. In addition, these findings show that analysis of combinations of evolutionarily conserved motifs can be successfully used to identify patterns of gene regulation.
In recent years, genomic analyses have identified many short DNA sequences that function as transcriptional regulatory elements and also show evolutionary conservation. These signature sequences are usually referred to as ”motifs”. There is considerable interest in these motifs because variations in gene expression play crucial roles in many biological functions and are also of importance in disease etiology. Much of the information on the roles of these motifs has been obtained using microarray or qRT-PCR analyses to investigate the dynamics of gene expression. These technologies also provide insights into variation in cell fate decision, such as (re)differentiation or (dys)function. The genomic elements that control variations in gene expression, and hence cell fate, are those associated with transcription factors. As a consequence, considerable efforts are being made to detect and characterize these motifs.
Three procedures are widely employed to identify motifs: sequence alignment, motif extraction and motif search. Sequence alignments make it possible to identify biologically meaningful regions . In order to expedite investigation of long and complex mammalian genomes, it was necessary to develop computer science methods that permitted analyses to be performed in real-time with high-sensitivity and high-precision. One such approach is the Smith-Waterman algorithm , which permits sequence ambiguity, but also provides high accuracy and flexibility, although not with high processing speed. Subsequent development of this algorithm led to the Smith-Waterman-Gotoh (SWG) method, which overcame these problems in efficiency . The SWG method is now the standard algorithm for optimal local alignment, which is a representative method in sequence analysis. The PRRN algorithm, which provides one of the highest-precision approaches, uses a doubly nested randomized iterative (DNR) method for efficient production of multiple alignments [5–7]. Once the multiple alignments have been produced, it is then necessary to undertake motif extraction from conserved common patterns in the set of consensus sequences. Two main methods can be used for this step, namely, the numeration method and the probabilistic method, based on the Weeder  and MEME  algorithms, respectively. Motif research is also performed to find already-known motifs, for example, using the MAST , TFSEARCH  and TFBIND  algorithms.
Many approaches have been employed to investigate motif interaction, such as regression methods . However, very few studies have focused on regulatory interactions in a large-scale combination of motifs. One such study made use of the Log-linear Graphical Model (LGM)  for statistical tests and estimations of the causal relationships among motifs . The LGM is a multivariate analysis and probabilistic model presented as a graph model with probabilistic conditional independency. However, volume of many motifs may be beyond the scaling limits of real-time calculation. It is important to remember, therefore, that current methods require selection of motifs. Additionally, it is important that conclusions derived from computer analyses of interactions among motifs are confirmed by practical methodologies.
With regard to methods for confirmation of interactions, the use of knockout (KO) mice or induced overexpression of transcription factors can provide evidence with a high confidence level. However, these methods generally focus on only one transcription factor at a time. Interactions involving two or more motifs are difficult to identify due to complications arising from motif ambiguity. In our laboratory, we have generated Mafb KO mice . MafB is a transcription factor and is known to regulate genes that are expressed in macrophages, a type of leukocyte, in almost all species. MafB is a DNA binding protein with an acidic domain, a basic region and a leucine zipper structure (b-zip structure); it can form a homodimer or a heterodimer with a b-zip structure protein. The protein binds to the Maf recognition element [17, 18] and 5’AT-rich half-MARE  (MAREs) in regulatory promoter regions. However, despite many investigations, little is known about how MafB achieves regulation of the expression of target genes, or of the cooperation of MafB with other transcription factors. This uncertainty could be surmounted in a timely and cost-effective manner by use of motif detection protocols to sift through large amounts of DNA sequence data.
In this study, we showed bioinformatics methods to identify regulatory motifs and transcription factors which interact with MafB. Through use of the LGM after multiple alignments for finding evolutionarily conserved motifs from a large number of DNA sequences, the relationships of several functional motifs were investigated. In an attempt to understand the motif information, we focused on our analysis of MafB, which is bound to the MARE in the promoter region of genes that are down-regulated in Mafb deficient macrophages. The relationships among several motifs were elucidated using data from prior biochemical experiments and from a new study. The observations also suggest that combinations of evolutionarily conserved motifs can be used to predict gene regulation. These techniques for investigating combinations of regulatory motifs in evolutionarily conserved regions should help to accelerate development of applications in medical sciences and lead to elucidation of causes of diseases.
Three approaches for the input sequences
Three approaches were employed for input sequences to discover motifs among multiple sequences . Multiple genes, single species: this is based on the supposition that regulatory motifs are conserved among co-regulated genes within a species, and that the level of gene expression is constant under the chosen experimental conditions. Different transcription factors might have an indirect influence on the same function. Single gene, multiple species: the rate of mutation of a regulatory motif is presumed to be slowed by selective pressure. In a single gene group, therefore, universally conserved regions contain regulatory motifs among cross-species. Conserved regions among closely related species may contain less-functional motifs as noise. On the other hand, alignment can be problematic due to changes in function during evolution across species with large evolutionary distances. Multiple genes, multiple species: alignment of orthologous sequences is used to identify conserved regions; these regions are then analyzed as above in Multiple genes and single species. Since potential scores for motif predictions can be improved by alignment of multiple genomes , this approach here was adopted to detect regulatory motifs.
The major steps of the analysis
Step1-1; Sequence data collection and arrangement
Gene expression data from a previous study were used to identify the promoter sequences of MafB target genes. As Mafb KO mice die shortly after birth, embryonic tissues were screened. Microarray (GEO, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM511101) and qRT-PCR data were obtained from a set of three independently cultured macrophages from Mafb KO mouse fetal liver at embryonic day 14.5 . The DNA microarray analysis was outsourced to JGS Co., Ltd. The National Center for Biotechnology Information HomoloGene database (NCBI, http://www.ncbi.nlm.nih.gov/homologene) was then used to find orthologues of the identified down-regulated and non-regulated genes in 6 mammalian model species. Promoter sequences -2000 bp upstream and +300 bp downstream from each of the transcription start sites were extracted from NCBI and DBTSS  core nucleotide databases, using the annotated mRNA Reference Sequence in FASTA format.
Step1-2; Identification of consensus sequences by multiple alignments
In order to avoid aligning sequences with an extremely low level of similarity, pairwise alignments were firstly made with the SWG program using data from the mouse and other species with default parameters. Species with a score value of more than 200 and the consensus length of 500bp to 1000bp were selected. Multiple sequence alignments were then carried out for these selected species using SWG, following a progressive method. The PRRN program was then used to refine the alignments to improve the precision of those with lower sequence similarities. Non-conserved regions were masked for possible loss of functions by substitution using the letter ”N” in the nucleotide sequences, the mouse sequences were then collected.
Step1-3; Motif extraction for finding MafB binding genes
MafB binding genes in down-regulated genes were first screened using nine short sequences previously confirmed by biochemical experiments . Four categories of consensus motifs, termed here ”MafB motifs”, were generated by the MEME program; nucleotide lengths of N=8, N=9, N=10, and N=11 were used since it is known that an AT-rich sequence is located about three nucleotides upstream of the 6bp MARE. The parameters were set to allow sites on + or - DNA strands; revcomp and distribution of motifs; zoops. Next, each mouse consensus sequence was searched for MafB motifs by the MAST program of the MEME suite. Thus, genes with MafB motifs conserved at least in man and mouse were selected; these genes were named the ”MafB binding gene set”.
Step2-1; Motif extraction from the MafB binding gene set
At Step 1-3, MafB motif extraction was performed using nine short sequences previously confirmed by biochemical experiments. Motif extraction was then performed again using MEME to identify other consensus motifs among mouse sequences of the MafB binding gene set with the MafB motifs. Parameters were set to allow sites on + or - DNA strands; revcomp and distribution of motifs; zoops. To aid efficient capture by MEME, nucleotide motif lengths were assigned as N=6,8,10,12,14,16, or 18, because motifs are generally 6~20bp. Longer-length motifs may be palindromes. The ten most significant motifs for each of the 7 length variants were extracted.
Step2-2; Search for consensus motifs for transcription factor binding sites
Motif search programs were performed for the 70 motifs extracted at Step2-1 and the 4 MafB motifs at Step1-3 with the TFSEARCH and TFBIND algorithms, which are libraries of preexisting motifs for transcription factors. The options for the TFSEARCH were set as matrix; vertebrate and the threshold value; more than 65 because of using only highly conserved regions. These conditions showed great promise as similar results were obtained with TFBIND. For different tops obtained by each algorithm, the rule that the two tops were taken when the top of two transcription factors were the same with TFSEARCH and TFBIND, and that only the first top was required from each when the top was different with each algorithm was applied.
Step2-3; Identifying functional motif candidates
Among motifs for transcription factors, candidate regulatory motifs, the so-called ”functional motifs” were sought. An Over-Representation Index (ORI) score  to identify over-represented motifs in a group was calculated as , where Patt p is the number of a kind of motif present in the down-regulated gene promoters, Patt np is that in the non-regulated gene promoters, N p is the number of promoters with a motif in the down-regulated genes and N promoter is the total number of promoters in the down-regulated genes. A high ORI score indicates that the motif is present evenly among all promoters of down-regulated genes, while a low score indicates that the motif occurs many times in a part of promoters of down-regulated genes. The number of each motif presence in a promoter was then counted with using Weeder-motif locator program, with options as Minimum match percentage; 90 percent, Search the motif in both strands; check, Maximum number of substitutions; substitutions: 1 ; N=6,8,: 2 ; N=10,12, : 3 ; N=14,16, : 4 ; N=18. The motif detection algorithm is different to MEME, allowing ambiguous motifs to be obtained in addition to those identified by MEME. Peak ORI scores were labeled ”functional motif candidates”; these sequences contained a MafB motif(Step1-3).
Step3-1; Directional relationships of functional motifs
Using the results from the Weeder-motif locator, the patterns of the top motifs by ORI in each promoter were assigned as ”2” when present and ”1” when not present. The patterns were input into the L-GM program  to search for directional interactions among the functional motifs. A model was evaluated objectively with deviance and p-value for Reduced Model(RM t ) by a backward elimination method from Full Model(FM) and it was also presented as an independent graph model with edges and lines. Several combinations of directorial motifs were shown by the final model, and they were checked in all genes.
Step3-2; Modeling for the co-occurrence of functional motifs and validation of the hypothesis
A hypothesis and regulatory modeling were derived from the results of the analysis of directional relationships between the MafB motifs and other motifs. The MafB motifs were compared with the results of biochemical experiments to determine whether MafB could actually bind to them to regulate transcription. Validation of the hypothesis was also tested by the results of a luciferase assay using the C1qa gene promoter.
Collection and multiple alignment of promoters of MafB target genes
Fifty-two down-regulated genes in macrophages from MafB deficient mice
Lbp, Slc43a3, Rtp4, Gdf15, Isg15, Lgals3bp, Mafb, Psd3, Chst7, Col18a1, C1qa, Bambi, Slc9a3r1, Glt25d1, Ifit3, Tmem66, Ifi44, Usp18,
Clu, Gad1, Ndrg4, Cnrip1, Folr2, C1qb, Trib3, Daf2, Htr2b, Irf7, Bst2, Leprotl1, Adamts1, Cxcl10, Igfbp1, Ifit1, Rbp4, Rab15, Cd55,
Gas6, Ifit2, Defb29, Setd2, Iigp1, Gatad2a, Ccl12, Krt18, 1810011O10Rik, Fgb, Phgdh, Hpgd, Ambp, Cd5l(Api6), Emr1(F4/80)
Four patterns of predicted MafB motifs
MafB binding sequence
Eighteen predicted MafB binding genes
Adamts1, Ambp, Bambi, C1qa, C1qb, Cd5l(Api6), Chst7, Clu, Cxcl10, Emr1(F4/80), Gad1, Gas6, Igfbp1, Krt18, Lbp, Mafb, Slc43a3, Slc9a3r1
[NCBI Accession Number (respectively): NM_009621, NM_007443, NM_026505, NM_007572, NM_009777, NM_009690, NM_021715, NM_013492, NM_021274, NM_010130, NM_008077, NM_019521, NM_008341, NM_010664, NM_008489, NM_010658, NM_021398, NM_012030 ]
Identification of 10 functional motif candidates
Ten functional motif candidates
Motif (ORI ≥ 61, Underline; MARE)
(Partially same strand of 14-2)
Only group ”A” is extracted at Step1.
(Complementary strand of 14-2, 16-2)
(Complementary strand of 12-7)
(Partially same strand of 12-2)
Co-occurrence of functional motifs
Occurrence patterns of functional motif candidates in all 229 genes
The presence of the 10 functional motif candidates were examined in 229 gene promoters (the 18 down-regulated and the 211 non-regulated genes) to investigate the co-occurrence of functional motifs. A large gene dataset is valuable for this type of investigation as it enables to compare the patterns of occurrence of motifs in both down-regulated and non-regulated genes. The occurrence of a motif was labeled as ”2”, its absence as ”1”. Most motifs fell into the former category in the down-regulated genes, but into the latter in the non-regulated genes. Several motifs showed a similar pattern in all 229 genes when they were (reverse) complementary with the same transcription factor, as (IDs;10-3, 12-7), (IDs;12-2, 16-4) and (IDs;12-2, 14-2, 16-2). The motif pattern data were combined by similarity, except MafB motif [10 AP4/AML ATCTGCTGAC] from Step1-3.
LGM for 10 combinations of functional motif candidates
Co-occurrence of MafB motif and GATA-1 motif
Combinations of functional motif candidates in each gene
(P < 0.001) Linked motif
Hypothesis and Modeling
Results of biochemical experiments and hypothesis validation
Evaluation of the MafB motif using the results of biochemical experiments
The results from biochemical experiments were used to confirm the binding of MafB to the MafB motif and its regulation of transcription. Two of the mouse genes present in the set of 18 down-regulated genes were subjected to a luciferase assay. Mutations of MARE in C1qa and Cd5l (Api6, AIM) promoters considerably impaired promoter activity in RAW264.7 macrophage cells. Cotransfection of the luciferase reporters driven by the gene promoters along with MafB expression vectors dramatically decreased luciferase gene expression.
In this study, information from an LGM analysis on the directional relationships of several functional motifs is presented. The results suggest that both GATA-1 motifs and MafB motifs in the promoter region are important for lineage specific expression of genes that are down-regulated in Mafb deficient macrophages. It was also possible to validate the conclusions from the LGM analysis using the results of biochemical experiments. Overall, the findings indicate that it is possible to predict combinations of evolutionarily conserved motifs for gene regulation from large-scale DNA sequences. These directional motifs were obtained using objective application of a final model at a p-value of < 0.001 (Figure 2-a) and an independent graph by LGM (Figure 2-b). The promoter regions of 15 genes down-regulated in Mafb deficient macrophages contained GATA-1 motifs and 5'AT-rich half Maf recognition elements (MAREs) (Table 5). This suggests that both the GATA-1 motif and the MafB motif have a role in the negative and positive regulation of expression of these genes, respectively. The GATA-1 motif had high score in an ORI, possibly indicating overrepresentation of the motif in the down-regulated genes (Table 4). To investigate this possibility a luciferase assay was performed using a C1qa promoter that had both the GATA-1 motif and MARE. GATA-1 inhibited the activity of this promoter (Figure 4).
The 4 nucleotides in GATA-1 motif are often the same as the 6 nucleotides of MAREs. This suggests that MafB binding may be prevented by GATA-1, or that both of motifs with these proteins may bind each other for transcriptional or translational repression. GATA-1 and MafB may influence the level of mRNA expression in MafB target genes and thereby cause the generation of abnormal protein levels. As a consequence, defective MafB regulation may underlie abnormal functions in macrophages. GATA-1 has been shown to suppress monocyte differentiation in bone marrow, and to inhibit macrophage differentiation and apoptosis [26, 27]. These motifs therefore regulate gene expression patterns to support the direction of macrophage differentiation. In hematopoietic stem cells, GATA-1 activates HDAC(histone deacetylase) and down-regulates the GM-CSF (Granulocyte Macrophage Colony-Stimulating Factor) gene to promote differentiation of erythrocytes . However, since GATA-2 and MZF-1 down-regulate HDAC, they are thought to be related to hematopoietic differentiation. In erythroid progenitor cells, GATA-2 acts with MZF1 to suppress macrophage differentiation, while the loss of GATA-1 results in up-regulation of GM-CSFR and differentiation into macrophages. GATA-1 and MZF1 are therefore considered to be related to macrophage differentiation, and the cooccurrence of MZF1 and GATA-1 motifs are likely to influence macrophage differentiation.
The GATA-1 is generally required for the differentiation and proliferation of erythrocytes, and the MafB is known to be essential for maintaining macrophage function. The co-occurrence of GATA-1 and MafB motifs initially seems to be contradictory with respect to macrophages. However, it has been reported that in macrophages MafB is a repressor of Ets-1, an inhibitor of erythrocyte differentiation in the chicken . Another study showed that ectopic expression of GATA-1 in myeloid cells increases erythrocyte differentiation . When Ets-1 is expressed in both erythroid and myeloid cells, MafB expression is limited to the latter . Thus, MafB is an essential factor for differentiation or functional maintenance of myeloid cells. However, the decrease of MafB expression is mediated by Ets-1, and the expression of GATA-1 reactivates erythrocyte differentiation. This effect was presumed to be a result of the loss of MafB that stimulated GATA-1 to change the direction of differentiation. These observations also support the hypothesis that MafB and GATA-1 motifs co-occur in macrophages. MafB deficiency may enable cells to re-differentiate erythrocytes from macrophages. It will be necessary to determine whether the MafB and GATA-1 motifs in macrophages are involved in erythrocyte differentiation. Deficiency of GATA-1 allows erythroid cells to re-differentiate into macrophages [30–33]. If the identified GATA-1 motif of macrophages is also present in erythroid cells, then it may be capable of inducing the re-differentiation of macrophages to erythrocytes. Furthermore, ectopic expression of C/EBPα, GATA-1, or GATA-2 strongly enhances the macrophage differentiation potential of Pax5 KO pro-B cells under lymphoid culture conditions, whereas GATA-1 expression induces erythroblast development [34, 35](MACPAK: Simulatable Macrophage Pathway Knowledgebase database, http://macpak.csml.org/click/index.php?q=d:18472258). The transdifferentiation of pro-B cells to macrophages or erythrocytes may be determined by their MafB and GATA motifs. Thus, through investigation of co-occurring motifs it may be possible to identify the switches for cell differentiation.
Identification of combinations of transcriptional regulatory elements from large amounts of DNA sequences can benefit greatly from proposed bioinformatics methods as the multiple alignments and the LGM. In particular, the pattern of co-occurrence of motifs is proposed to provide a strong means for directing potential interaction of transcription factors not known from biochemical studies. The hypothesis postulates that GATA-1 motifs and MafB motifs negatively and positively regulate expression of 18 genes that are down-regulated in Mafb deficient macrophages, respectively. This result was verified experimentally. By combining the bioinformatics analysis and the experimental approach, it was suggested that both the GATA-1 motifs and the MafB motifs are important for expression of C1qa. Through the use of the bioinformatics methods, the regulatory motifs related to MafB have been identified and their interactions were suggested. The findings here also suggest that combinations of evolutionarily conserved motifs are capable of predicting gene regulation. It is to be expected that future studies will elaborate on and develop these findings. For example, further research on the control of gene expression may discover rules for the co-occurrence of particular motifs in the genomes. Eventually, these new techniques for identifying regulatory elements should help to accelerate development of applications in medical sciences and lead to elucidation of the causes of different diseases.
MM developed the bioinformatics methods, implemented the algorithms, carried out the data analysis, and drafted and wrote the manuscript. MH had made substantial contributions to acquisition of microarray data, and was involved in critical study of important intellectual content. MH designed, MN and MM carried out the experiments. ST conducted the supervision of the study and manuscript preparation, and gave final approval of the version to be published. All authors read and approved the final manuscript.
List of abbreviations used
acute myeloid leukemia 1 gene
transcription factor ap4
- C1qa :
complement component 1, q subcomponent, a chain
database of transcriptional start sites
doubly-nested randomized iterative
granulocyte/macrophage-colony stimulating factor receptor
log-linear graphical model
maf recognition element
motif-based sequence analysis tools
myeloid zinc finger gene 1
national center for biotechnology information
zinc fingerprotein with interaction domain
multiple sequence alignment program
The authors thank staff of University of Tsukuba for experimental helps and comments. Also, the authors thank Drs. Osamu Gotoh, Tetsushi Yada and Natsuhiro Ichinose of the Graduate School of Informatics, Kyoto University, Dr. Kenta Nakai and the staff of the Human Genome Center, Institute of Medical Science, the University of Tokyo for their valuable advice. The authors would also like to thank staff at the Computational Biology Research Center, Advanced Industrial Science and Technology, Japan for helpful suggestions.
This article has been published as part of BMC Systems Biology Volume 5 Supplement 2, 2011: 22nd International Conference on Genome Informatics: Systems Biology. The full contents of the supplement are available online at http://www.biomedcentral.com/1752-0509/5?issue=S2.
- Moretti S, Armougom F, Wallace I, Higgins D, Jongeneel C, NotredameM C: The M-Coffee web server: a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Nucleic Acids Res. 2007, 35: W645-648. 10.1093/nar/gkm333.PubMed CentralView ArticlePubMedGoogle Scholar
- Wallace I, O’Sullivan O, Higgins D: Evaluation of iterative alignment algorithms for multiple alignment. Bioinformatics. 2005, 21: 1408-1414. 10.1093/bioinformatics/bti159.View ArticlePubMedGoogle Scholar
- Smith T, Waterman M: Identification of common molecular subsequences. J.Mol.Biol. 1981, 147: 195-197. 10.1016/0022-2836(81)90087-5.View ArticlePubMedGoogle Scholar
- Gotoh O: An improved algorithm for matching biological sequences. Journal of Molecular Biology. 1982, 162: 705-708. 10.1016/0022-2836(82)90398-9. [Http://www.genome.ist.i.kyoto-u.ac.jp/~aln_user/prrn/index.html] 10.1016/0022-2836(82)90398-9View ArticlePubMedGoogle Scholar
- Gotoh O: A weighting system and algorithm for aligning many phylogenetically related sequences. CABIOS. 1995, 11: 543-551. [http://www.genome.ist.i.kyoto-u.ac.jp/~aln_user/archive/CABIOS95.pdf]PubMedGoogle Scholar
- Gotoh O: Significant improvement in accuracy of multiple protein sequence alignments by iterative refinement as assessed by reference to structural alignments. J. Mol. Biol. 1996, 264: 823-838. 10.1006/jmbi.1996.0679.View ArticlePubMedGoogle Scholar
- Gotoh O: Multiple sequence alignment: algorithms and applications. Adv. Biophys. 1999, 36: 159-206. [http://www.genome.ist.i.kyoto-u.ac.jp/~aln_user/archive/ADBP99.pdf]View ArticlePubMedGoogle Scholar
- Pavesi G, Mereghetti P, Zambelli F, Stefani M, Mauri G, Pesole G: MoD Tools: regulatory motif discovery in nucleotide sequences from co-regulated or homologous genes. Nucleic Acids Res. 2006, 34 (Web Server issue): W566-W57. [Http://188.8.131.52/modtools/]PubMed CentralView ArticlePubMedGoogle Scholar
- Bailey TL, Elkan C: Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol. 1994, 2: 28-36. http://www.sdsc.edu/~tbailey/papers/ismb94.pdf, [http://meme.sdsc.edu/meme/intro.html]PubMedGoogle Scholar
- Bailey TL, Gribskov M: Combining evidence using p-values: application to sequence homology searches. Bioinformatics. 1998, 14: 48-54. 10.1093/bioinformatics/14.1.48. [http://meme.sdsc.edu/meme/intro.html] 10.1093/bioinformatics/14.1.48View ArticlePubMedGoogle Scholar
- Heinemeyer T, Wingender E, Reuter I, Hermjakob H, Kel AE, Kel OV, Ignatieva EV, Ananko EA, Podkolodnaya OA, Kolpakov FA, Podkolodny NL, Kolchanov NA: Databases on transcriptional regulation: TRANSFAC, TRRD and COMPEL. Nucleic Acids Res. 1998, 26: 362-367. 10.1093/nar/26.1.362. Yutaka Akiyama: ”TFSEARCH: Searching Transcription Factor Binding Sites”, http://mbs.cbrc.jp/research/db/TFSEARCH.html 10.1093/nar/26.1.362PubMed CentralView ArticlePubMedGoogle Scholar
- Tsunoda T, Takagi T: Estimating Transcription Factor Bindability on DNA. Bioinformatics. 1999, 15 (7/8): 622-630.http://www.hgc.jp/japanese/software.html, [http://tfbind.hgc.jp/]http://tfbind.hgc.jp/View ArticlePubMedGoogle Scholar
- Das D, Pellegrini M, Gray JW: A Primer on Regression Methods for Decoding cis-Regulatory Logic. PLoS Computational Biology. 2009, 5: e1000269-10.1371/journal.pcbi.1000269.PubMed CentralView ArticlePubMedGoogle Scholar
- JSQC: Graphical Modeling. Graphical Modeling. 1999, NikkagirenGoogle Scholar
- Park S, Ichinose N, Yada T: Probabilistic Graphical Modeling for Large-scale Combinatorial Regulation of Transcription Factors. Proc. of the workshop on Knowledge, Language, and Learning in Bioinformatics (KLLBI). 2008, 72-86. [http://www.genome.ist.i.kyoto-u.ac.jp/~park/pdf/park08_kllbi.pdf]Google Scholar
- Moriguchi T, Hamada M, Morito N, Terunuma T, Hasegawa K, Zhang C, Yokomizo T, Esaki R, Kuroda E, Yoh K, Kudo T, Nagata M, Engel J, Yamamoto M, Takahashi S: MafB is essential for renal development and F4/80 expression in macrophage. Mol Cell Biol. 2006, 26: 5715-5727. 10.1128/MCB.00001-06. [http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE20419] 10.1128/MCB.00001-06PubMed CentralView ArticlePubMedGoogle Scholar
- Blank V, Nancy CA: The Maf transcription factors: regulators of differentiation. Trends in Biochemical Sciences. 1997, 22: 437-441. 10.1016/S0968-0004(97)01105-5.View ArticlePubMedGoogle Scholar
- Kataoka K, Fujiwara KT, Noda M, Nishizawa M: MafB, a new Maf family transcription activator that can associate with Maf and Fos but not with Jun. Mol Cell Biol. 1994, 14: 7581-7591.PubMed CentralView ArticlePubMedGoogle Scholar
- Yoshida T, Ohkumo T, Ishibashi S, Yasuda K: The 5’-AT-rich half-site of Maf recognition element: a functional target for bZIP transcription factor Maf. Nucleic Acids Research. 2005, 33 (11): 3465-3478. 10.1093/nar/gki653.PubMed CentralView ArticlePubMedGoogle Scholar
- Wang T, Stormo G: Combining phylogenetic data with co-regulated genes to identify regulatory motifs. Bioinformatics. 2003, 19: 2369-2380. 10.1093/bioinformatics/btg329.View ArticlePubMedGoogle Scholar
- Kolbe D, Taylor J, Elnitski L, Eswara P, Li J, Miller W, Hardison R, Chiaromonte F: Regulatory Potential Scores From Genome-Wide Three-Way Alignments of Human, Mouse, and Rat. Genome Research. 2004, 14: 700-707. 10.1101/gr.1976004.PubMed CentralView ArticlePubMedGoogle Scholar
- Blanchi B, Kelly L, Viemari J, Lafon I, Burnet H, Bevengut M, Tillmanns S, Daniel L, Graf T, Hilaire G, Sieweke M: MafB deficiency causes defective respiratory thythmogenesis and fatal central apnea at birth. Nat Neurosci. 2003, 6: 1091-1100. 10.1038/nn1129.View ArticlePubMedGoogle Scholar
- Suzuki Y, Yamashita R, Nakai K, Sugano S: DBTSS:DataBase of human Transcriptional Start Sites and full-length cDNAs. Nucleic Acids Res. 2002, 30: 328-331. 10.1093/nar/30.1.328.PubMed CentralView ArticlePubMedGoogle Scholar
- Bajic VB, Choudhary V, Hock CK: Content analysis of the core promoter region of human genes. In Silico Biol. 2004, 4: 109-125. [http://www.bioinfo.de/isb/2003040011/]PubMedGoogle Scholar
- Motohisa H: G-GM & L-GM Systems for Graphical Modelling. Bulletin of the Computational Statistics of Japan. 2002, 15: 63-74. [http://sciencelinks.jp/j-east/article/200407/000020040704A0154465.php]Google Scholar
- Kulessa H, Frampton J, Graf T: GATA-1 reprograms avian myelomonocytic cell lines into eosinophils, thromboblasts, and erythroblasts. Genes Dev. 1995, 9: 1250-1262. 10.1101/gad.9.10.1250.View ArticlePubMedGoogle Scholar
- Tanaka H, Matsumura I, Nakajima K, Daino H, Sonoyama J, Yoshida H, Oritani K, Machii T, Yamamoto M, Hirano T, Kanakura Y: GATA-1 blocks IL-6-induced macrophage differentiation and apoptosis through the sustained expression of cyclin D1 and Bcl-2 in a murine myeloid cell line M1. Blood. 2000, 95: 1264-1273.PubMedGoogle Scholar
- Wada T, Kikuchi J, Nishimura N, Shimizu R, Kitamura T, Furukawa Y: Expression levels of histone deacetylases determine the cell fate of hematopoietic progenitores. J Biol Chem. 2009, 284: 30673-30683. 10.1074/jbc.M109.042242.PubMed CentralView ArticlePubMedGoogle Scholar
- Sieweke M, Tekotte H, Frampton J, Graf T: MafB is an interaction partner and repressor of Ets-1 that inhibits erythroid differentiation. Cell. 1996, 85: 49-60. 10.1016/S0092-8674(00)81081-8.View ArticlePubMedGoogle Scholar
- Kitajima K, Tanaka M, Zheng J, Yen H, Sato A, Sugiyama D, Umehara H, Sakai E, Nakano T: Redirecting differentiation of hematopoietic progenitors by a transcriptionfactor, GATA-2. Blood. 2006, 107: 1857-1863. 10.1182/blood-2005-06-2527.View ArticlePubMedGoogle Scholar
- Kitajima K, Zheng J, Yen H, Sugiyama D, Nakano T: Multipotential differentiation ability of GATA-1-null erythroid-committed cells. Genes Dev. 2006, 20: 654-659. 10.1101/gad.1378206.PubMed CentralView ArticlePubMedGoogle Scholar
- Zheng J, Kitajima K, Sakai E, Kimura T, Minegishi N, Yamamoto M, Nakano T: Differential effects of GATA-1 on proliferation and differentiation of erythroid lineage cells. Blood. 2006, 107: 520-527. 10.1182/blood-2005-04-1385.View ArticlePubMedGoogle Scholar
- Sugiyama D, Tanaka M, Kitajima K, Zheng J, Yen H, Murotani T, Yamatodani A, Nakano T: Differential context-dependent effects of friend of GATA-1 (FOG-1) on mast-cell development and differentiation. Blood. 2008, 111: 1924-1932. 10.1182/blood-2007-08-104489.View ArticlePubMedGoogle Scholar
- Cobaleda C, Busslinger M: Developmental plasticity of lymphocytes. Curr Opin Immunol. 2008, 20 (2): 139-148. 10.1016/j.coi.2008.03.017.View ArticlePubMedGoogle Scholar
- Heavey B, Charalambous C, Cobaleda C, Busslinger M: Myeloid lineage switch of Pax5 mutant but not wildtype B cell progenitors by C/EBPalpha and GATA factors. EMBO J. 2003, 22: 3887-3897. 10.1093/emboj/cdg380.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.