Evolution of context dependent regulation by expansion of feast/famine regulatory proteins
© Plaisier et al.; licensee BioMed Central Ltd. 2014
Received: 2 July 2014
Accepted: 16 October 2014
Published: 14 November 2014
Expansion of transcription factors is believed to have played a crucial role in evolution of all organisms by enabling them to deal with dynamic environments and colonize new environments. We investigated how the expansion of the Feast/Famine Regulatory Protein (FFRP) or Lrp-like proteins into an eight-member family in Halobacterium salinarum NRC-1 has aided in niche-adaptation of this archaeon to a complex and dynamically changing hypersaline environment.
We mapped genome-wide binding locations for all eight FFRPs, investigated their preference for binding different effector molecules, and identified the contexts in which they act by analyzing transcriptional responses across 35 growth conditions that mimic different environmental and nutritional conditions this organism is likely to encounter in the wild. Integrative analysis of these data constructed an FFRP regulatory network with conditionally active states that reveal how interrelated variations in DNA-binding domains, effector-molecule preferences, and binding sites in target gene promoters have tuned the functions of each FFRP to the environments in which they act. We demonstrate how conditional regulation of similar genes by two FFRPs, AsnC (an activator) and VNG1237C (a repressor), have striking environment-specific fitness consequences for oxidative stress management and growth, respectively.
This study provides a systems perspective into the evolutionary process by which gene duplication within a transcription factor family contributes to environment-specific adaptation of an organism.
KeywordsTranscription factor Expansion Systems biology
Expansion of transcription factor (TF) families via gene duplication enables an organism to adapt to new environments by providing a means to rewire its gene regulatory network . The process of rewiring is accomplished through natural selection of random mutations that maneuver each TF homolog into a distinct niche. Mutations that alter the set of target genes regulated by a TF can lead to functionally different effects. This process of functional divergence has two primary outcomes: 1) neo-functionalization where a TF gains a new function not present in the ancestral TF, and 2) sub-functionalization where the homologous TFs divide the functions of the ancestral TF ,. Mutations that change the context where TF homologs are expressed can also be very important as they can relocate an advantageous function to a new context . This complementary process of contextual divergence also has two primary outcomes: 1) neo-contextualization can bring an advantageous function to a new context, and 2) sub-contextualization where TF homologs split up the contexts of the ancestral TF ,. Thus, mutations causing functional and contextual divergence of duplicated TFs allow organisms to explore a large space of new environmental or nutritional niches. Interestingly, homologs that are co-expressed in a particular context tend to have divergent DNA recognition motifs (i.e., they act in similar contexts but regulate different genes) and homologs expressed in different contexts often retain similar DNA recognition motifs (i.e., they regulate the same genes albeit in different environments) . Thus, through TF duplication followed by functional and contextual divergence an organism can rewire its gene regulatory network to deal with new nutritional and environmental challenges.
Feast/famine regulatory proteins (FFRPs)  or Lrp-like proteins  of the Lrp/AsnC family (PF01037, AsnC_trans_reg) represent one of the oldest and largest families of prokaryotic transcriptional regulators. This ancient family of TFs is found both in archaea and bacteria suggesting that their common ancestor had at least one FFRP-like protein . It is striking that on average each sequenced archaeal genome encodes 5 (α4) FFRPs, which suggests that expansions in the FFRP family had already occurred in a common ancestor (Additional file 1: Table S1). For instance, in the archaeal family of halobacteriaceae FFRP expansions have led to an average of 10 (α2) FFRP homologs per sequenced genome. Thus, it is safe to assume that the FFRP gene family has evolved through numerous expansions prior to and after evolution of the archaeal lineage  and that these expansions provide one possible means for organisms to adapt to changes in nutritional and/or environmental conditions ,.
Our research focuses on the genome of H. salinarum NRC-1 from the halobacteriaceae family, which encodes eight full-length FFRP homologs as well as an additional putative FFRP homolog that is missing a DNA binding domain (Additional file 2: Figure S1) . Structurally FFRP proteins are comprised of a helix-turn-helix (HTH) DNA binding domain connected through a flexible linker to a ‘regulation of amino acid metabolism’ (RAM) domain that typically binds amino acids to modulate regulatory activity ,-. RAM domains in some FFRPs have strong specificity for a single amino acid ,,, some are activated by two or more amino acids ,,, and others have evolved specificity to non-amino acid effector molecules ,,. The presence of a TrkA-C domain  in Trh2 and a TRASH domain , in VNG1179C suggest these FFRPs may be involved in the sensing and regulation of genes in response to changes in K+/NAD+and metals (e.g. Cu(II) ), respectively (Additional file 2: Figure S2). However, the contexts in which the eight FFRPs act and the specific genes they regulate are largely unknown. This information is essential to understand how the eight FFRP family members in H. salinarum NRC-1 have functionally and contextually diverged.
Here we have characterized the functional and contextual divergence of expanded FFRP family members in H. salinarum NRC-1 to understand how TF homologs evolve to occupy different niches. The key features defining an FFRP’s niche are the repertoire of target genes that it regulates, the contexts in which it is expressed, and the effector molecules that modify its activity. We experimentally mapped genome-wide binding locations for all eight FFRPs, analyzed their expression across 466 gene expression microarrays from 35 different growth conditions which mimic environmental and nutritional contexts H. salinarum NRC-1 is likely to experience in the wild, and inferred their effector-molecule preferences. This integrated analysis provided evidence for both functional and contextual divergence in the evolution of distinct conditionally active regulatory networks for five of the eight FFRPs. We have performed follow-up experiments that validate conditional regulation by two FFRPs, and demonstrate a context dependent fitness benefit for the regulation. Our results demonstrate that the eight FFRPs in H. salinarum NRC-1 have evolved to occupy distinct niches through variations in one or all of the three known determinants of their functions: which genes they regulate, when they are expressed, and what effector-molecules they bind. Importantly, these results illustrate how interrelated variations in these three properties tune function of each FFRP to the environmental context in which it acts.
Results and discussion
Evolution of Homologous FFRPs in H. salinarum NRC-1
Pairwise functional divergence of FFRP family members
Type-I functional divergence (θijαSE)
3.3 ß 10−3
4.1 ß 10−9
2.7 ß 10−11
4.6 ß 10−4
7.1 ß 10−13
1.5 ß 10−15
3.2 ß 10−5
3.3 ß 10−5
2.8 ß 10−11
1.4 ß 10−10
4.4 ß 10−6
2.2 ß 10−14
2.6 ß 10−23
1.4 ß 10−24
4.1 ß 10−23
3.3 ß 10−3
3.8 ß 10−5
9.3 ß 10−11
6.5 ß 10−24
4.7 ß 10−5
6.0 ß 10−24
7.6 ß 10−3
5.5 ß 10−4
6.8 ß 10−14
1.5 ß 10−8
2.5 ß 10−7
3.9 ß 10−24
2.0 ß 10−9
Genome-wide binding locations of Feast/Famine Regulatory Proteins (FFRPs)
We then mapped the genomic binding locations of all eight FFRPs from H. salinarum NRC-1 to understand how the homologous TFs might have diverged to perform different functions. Each FFRP was over-expressed with an epitope tag, chromatin immunoprecipitation (ChIP) was performed, and its genome-wide binding locations were mapped by tiling microarray hybridization (ChIP-chip). The over-expression of the epitope tagged FFRPs allows the identification of an FFRP’s binding sites independent of the condition in which the ChIP-chip study was performed. The genomic distribution of FFRP binding sites between intergenic and genic sequences (18% and 82%, respectively) was equivalent to the fraction of intergenic and coding sequences in the genome (14% and 86%, respectively; Additional file 5: Table S4). The eight FFRPs were found to regulate between 34 and 356 genes whose promoters harbor their experimentally mapped binding sites (i.e. when the binding site was within 250 bp upstream and 50 bp downstream of the start codon of a gene; Additional file 5: Table S4). The DNA-binding map revealed that approximately 30% of all genes (n =712) in H. salinarum NRC-1 were putatively regulated by one or more FFRPs. Interestingly, nearly half of these genes (i.e., 341 out of 712, permuted p-value <1 ß 10−5) had at least two FFRP binding sites in their promoter region, generating a highly overlapping set of interactions. The high degree of overlap between FFRP target genes could be explained by similarity of FFRP DNA recognition motifs  and/or formation of hetero-oligomeric structures .
Evidence of functional divergence between FFRPs
Evidence of contextual divergence of FFRPs
One plausible explanation for the evolutionary retention of two or more FFRPs with similar target genes is that they might contribute to fitness in different contexts or respond to different effector molecules. We looked for evidence of contextual divergence by comparing the expression patterns and putative effector molecule dependencies of the eight FFRPs.
First, we computed pairwise expression correlations of the eight FFRPs across a compendium of 466 transcriptome profiles of H. salinarum NRC-1 from 35 different growth conditions (high temperature, copper, high H2O2, etc.; Additional file 11: Table S10) ,,-. Interestingly, five FFRPs (AsnC, Trh2, Trh6, VNG1179C and VNG1237C) that putatively regulate different sets of genes had similar expression patterns (pair-wise correlation coefficient >0.5 and p-value <0.05; Figure 2 green edges; Additional file 2: Figure S4; Additional file 12: Table S11). By contrast, with the exception of Trh2 and VNG1237C, the expression patterns of FFRPs that regulate similar sets of genes (e.g., AsnC, Trh3 and Trh4) were not correlated (Figure 2).
Second, we observed significant evidence for functional divergence between the RAM domains of many FFRPs (Additional file 13: Table S12). Again, we converted this functional divergence measure into a distance metric and used it to analyze relationships of the RAM domains of the 8 FFRPs (Figure 1D). This led to targeted analysis of key residues in the RAM domain ,, which further enabled the discovery of the most likely effector molecules for each of the five FFRPs (AsnC, Trh3, Trh4, Trh6 and Trh7; Figure 1E and F; Figure 2 purple edges; Additional file 2: Figure S2). The additional TrkA-C domain  in Trh2 and the TRASH domain , in VNG1179C suggested that these FFRPs might regulate genes in response to changes in K+/NAD+and metal ions (e.g. Cu(II) ), respectively (Figure 1F; Figure 2 purple edges; Additional file 2: Figure S2). Impressively, the structure of functional distance between FFRP RAM domains parses the FFRPs into clusters that explain their effector molecule preferences (Figure 1F). Firstly, functional distance grouped together Trh3 and Trh4 and predicted that they have similar preferences for Arg, Gln, and Lys. Similarly, Trh3, Trh4, Trh6 and Trh7 were predicted to share a preference for polar amino acids. By contrast, AsnC and VNG1237C are most likely modulated by nonpolar amino acids. Finally, the co-clustering of Trh2 and VNG1179C is most likely because they are most diverged and their RAM domains are likely non-functional. Instead, their effector molecule preferences originate from the fused domains (K+(TrkA-C domain) for Trh2, and Cu2+ (TRASH domain) for VNG1179C). Thus, the responsiveness to different effector molecules explains how FFRPs that regulate a similar set of genes or have similar expression patterns across 35 environmental contexts (e.g. Trh6 and AsnC) might have sub- or neo-contextualized (Figure 2).
FFRPs evolved into distinct roles through both functional and contextual divergence
Altogether, the evidence for functional and contextual divergence demonstrates that no two FFRPs are similar in all respects (Figure 2). VNG1179C and Trh2 provide an excellent example of functional and contextual divergence. The two FFRPs are highly co-expressed (correlation coefficient =0.85, p-value =6.7 ß 10−8; Figure 2 red highlight with red dashed outline) but have functionally diverged because of variations in their binding motifs (p-value =0.68), and their functions are further contextualized by their differential responsiveness to K+(Trh2) and Cu(II) (VNG1179C). On the other hand, Trh3 and Trh4 (Figure 2 green highlight with green dashed outline) have very similar DNA recognition motifs, similar preference for effector molecules (lysine and arginine), but have contextually diverged through differential expression across environments (correlation coefficient=-0.32, p-value =5.8 ß 10−2). Thus, the eight FFRPs in H. salinarum NRC-1 have evolved to take on distinct roles based on who they regulate (variations in DNA-binding domain), when they are expressed (promoter variations), or which effector molecules modulate their activity (variations in RAM domain, or fusion of an additional effector molecule binding domain).
Context dependent regulation of FFRP target genes
Conditional activation of 158 genes by AsnC contributes to fitness in sub-inhibitory levels of paraquat
Conditional repression of 47 genes by VNG1237C is important for normal growth
Significance of effects of deleting VNG1237C on growth curve are not observed for other FFRPs
FFRP deletion strain
Max growth ratep-value
Time to max growth ratep-value
Area under growth curve (AUC)p-value
2.7 ß 10−1
4.3 ß 10−2
8.9 ß 10−2
9.1 ß 10−1
7.8 ß 10−1
4.3 ß 10−2
5.9 ß 10−1
1.9 ß 10−1
8.6 ß 10−2
9.4 ß 10−1
3.4 ß 10−1
8.9 ß 10−1
1.4 ß 10−2
3.4 ß 10−1
1.8 ß 10−1
1.5 ß 10−2
4.5 ß 10−2
3.5 ß 10−3
7.5 ß 10−5
Our results demonstrate how divergence across regulators of the FFRP family has rewired the H. salinarum NRC-1 gene regulatory network to differentially regulate genes and bring physiologically relevant functions to specific environmental conditions. The specialization of each FFRP function is a product of changes to three properties: 1) variations in the DNA-binding domain, which determine which promoters it binds; 2) promoter variations, which determine when it is expressed; and 3) variations in the RAM domain or fusion to an entirely different ligand-binding domain, which alters specificity for effector-molecules to determine when it is post-translationally activated or inactivated. We demonstrated significant functional divergence was present at the protein sequence level between each FFRP. Thus it is not surprising that none of the FFRPs are alike in all three respects, because variations in one or more of these three key properties has potentially given each a unique and physiologically relevant capability, such PQ-responsive regulation by AsnC and growth-specific regulation by VNG1237C. There was significant overlap between the set of genes that were conditionally regulated by AsnC and VNG1237C (p-value =2.0 × 10−33). In addition to regulating a common set of core functions such as ribosomes, translation factors, ATP synthase, and cobalamin biosynthesis, the two FFRPs also regulated few unique genes, such as DNA repair enzymes (AsnC) and a Na+/H+antiporter complex (VNG1237C). Yet, conditional regulation of similar genes by the two FFRPs had distinct environment-specific consequences, demonstrating how contextual and functional divergence of FFRPs has physiologically important consequences for fitness of H. salinarum NRC-1 under routine and stressful conditions.
Early work on the mechanisms by which TFs diverge into unique roles focused primarily on variation at the level of DNA recognition motif, i.e. functional divergence . More recent work has shown that variation in the promoters of TFs altering the contexts in which they are expressed is also important, i.e. contextual divergence ,. Our work demonstrates that contextual divergence can also arise from variation in ligand binding domains that result in preference for different effector molecules. Interestingly, variations in these three properties were interrelated in that divergence in one excludes the need for divergence in the other two but they are not directly predictive of each other. As was observed previously for human TF homologs , there is no correlation between pairwise similarity of expression and similarity of DNA recognition motifs across all FFRPs (Kendall’s tau =0.02, p-value =0.87). By integrating across the different levels of divergence it was possible to come up with specific predictions about physiologically relevant conditional regulation of FFRP target genes. For instance, AsnC was down-regulated in response to PQ treatment, and the down-regulation of its target genes in this context ultimately influenced the fitness of H. salinarum under oxidative stress conditions. Similarly, VNG1237C was up-regulated during growth, it was active during growth, and the repression of its target genes was important for wild type growth characteristics. These interrelationships underlie the success and power of our strategy of integrating binding and gene expression data to elucidate conditional regulation by each member of this expanded FFRP family of regulators. Specifically, by over-expressing a tagged TF we were able to generate comprehensive DNA-binding maps in a relatively context independent manner. The assessment of correlation between transcriptional changes of the FFRP and its target genes revealed the subset of binding events that were conditionally functional in specific environmental contexts. This is a generic strategy that can readily be applied to elucidate conditionally active gene regulatory networks in any organism, just from a map of genome-wide TF-DNA binding locations and a compendium of transcriptome profiles from diverse environments. Future studies employing ChIP-exo  will enhance this generic strategy by improving the resolution of FFRP binding events which makes the discovery of FFRP target genes and motifs more straightforward.
While we had some success in identifying the effector molecules for each FFRP homolog, these predictions will have to be experimentally verified ,-. Recent studies in a closely related species H. salinarum R1 validated the preference of Trh4 for glutamine and determined that the novel effector molecule binding residues of Trh7 indicate a preference for aspartic acid . The preferences, specificity, and affinity for effector molecules, and how they influence the activity of an FFRP are critical pieces of information as they will shed insight into how an organism senses and responds to dynamic intra-cellular and external environmental changes. The systems level integration and coordination of varied functions of expanded FFRPs, as well as of all other regulators will ultimately reveal how variations in the promoter, DNA-binding domain and effector-molecule recognition in an FFRP co-evolve to generate novel coordination of different cellular processes that is better suited for the new environment.
Evolutionary analyses of FFRPs in H. salinarum NRC-1
Protein sequences were collected from MicrobesOnline  for FFRPs and closely related homologs from other species. Homologs from other species were discovered using FastBLAST  with ?50% identity with an FFRP in H. salinarum NRC-1. Using MEGA6  protein sequences were aligned using MUSCLE  and a phylogenetic tree was reconstructed using the Minimum Evolution method . The aligned protein sequences and phylogenetic tree were then used to compute type-I functional divergence coefficients (I) using the maximum likelihood based method in the DIVERGE  software package. Significance of type-I functional divergence coefficients were calculated using a chi-square test from the log of the likelihood ratio test statistic . Pairwise functional distances were calculated from type-I functional divergence as the −ln(1 θ ij) , where i and j are FFRPs. Functional distances were then converted to a distance matrix and clustered using complete linkage hierarchical clustering. Analyses were repeated for the DBD and RAM domain by restricting analyses to only those protein coding sequences corresponding to those regions as described on MicrobesOnline .
Strains construction and culturing conditions
The two FFRP deletion strains for asnC and VNG1237C were derived from H. salinarum NRC-1 △ura3 strain via a two-step gene replacement strategy  using the primers and plasmids described in (Additional file 17: Table S16). C-Myc tagged strains for ChIP-chip were constructed as previously described  using the primers and plasmids described in Additional file 18: Table S17. All H. salinarum NRC-1 strains were cultured in standard growth conditions in complete medium (CM, 250 NaCl, 20 g/L MgSO4 7H2O, 3 g/L sodium citrate, 2 g/L KCl, 10 g/L peptone) or complete defined medium with 19 amino acids (CDM, Additional file 19: Table S18). All deletion strains and the control/parental strains (?ura3) of deletions were grown in CM or CDM with an added 0.05 mg/ml uracil to compensate for their uracil deficiency.
Genome wide FFRP binding site analysis (ChIP-chip)
Chromatin immuno-precipitation and microarray hybridization (ChIP-chip) experiments were carried out for all 8 FFRPs in H. salinarum NRC-1 using the Agilent-030521 Halobacterium sp. NRC-1 Tiling V1 013324 array (GPL13426) . To study the FFRP localization in both nutrient replete and deplete conditions, all strains were grown in CDM, and samples were harvested in both early log phase and late log phase. ChIP of c-Myc tagged protein complexes were conducted as described ,. Enriched DNA from ChIP complexes and unenriched non-IP DNA were each labeled and hybridized to the whole genome tiling array (GSE62052). Each ChIP-chip consisted of at least two independent biological replicates, with at least 16 replicate spots in each. Resulting localization data was median normalized and further analyzed for statistically significant enrichment using MeDiChI, a regression model that learns a generative model of joint binding events . MeDiChI provides a list of putative binding sites with a resolution of 50 bp. By calculating the average intergenic region upstream of the transcriptional start site for all genes in H. salinarum NRC-1 the promoter region was determined to be ~200 bp (195 bp) and added a 50 bp buffer to each end accounting for the resolution of MeDiChI for a total or ~250 bp upstream and +50 bp downstream of the transcriptional start site. Genes targeted by a particular FFRP were identified as having a MeDiChI p-value 0.01 in their promoter region or by being part of an operon with a binding site in the promoter region of an upstream gene in the operon. Pairwise FFRP overlap was computed using hypergeometric enrichment p-values and percent overlap uses the smallest target gene set as the total.
Functional enrichment analysis
We used the Bioconductor package topGO  to discover significantly enriched GO biological process terms in gene sets of interest.
Discovering FFRP DNA recognition motifs from ChIP-chip target genes
Discovery of putative DNA recognition motifs was conducted as described in Ashworth, et al. 2014 . Briefly, gene promoters in Halobacterium salinarum were considered binding targets if a ChIP-chip peak with a p-value less than 0.10 was present within 100 bp upstream of the transcriptional start site. For de novo discovery of genome-wide promoter DNA binding sites from ChIP experiments for each FFRP, MEME  was performed on the upstream non-coding promoter sequences of all genes with evidence of ChIP binding. The following parameters were used to run MEME: -minw 13, -maxw 17, -nmotifs 2, and MEME was supplied with a first-order background Markov model computed over all input sequences. Upstream sequence regions tested for de novo motif detection were −100 to 0 bp relative to gene CDS starts or transcriptional start sites. For the purpose of inferring gene promoters which are bound by FFRPs, FIMO was used  to identify potential FFRP transcription factor binding sites (TFBS) in promoter regions from DNA-binding position weight matrices (PMWs) with a motif p-value below the default threshold (1×10−4). The similarity between DNA recognition motifs was computed using TOMTOM .
Discovering contextual divergence across FFRPs
Similarity in expression between FFRPs was computed though pair-wise Spearman’s rank based correlations of expression between all eight FFRPs across 35 condition sets (Additional file 11: Table S10). A correlation coefficient greater than or equal to 0.5 and p-value less than or equal to 0.05 was used as a cutoff for significant similarity between two FFRPs expression.
Discovery of putative effector molecules for FFRPs utilized previous work that discovered a set of nine key amino acids that specify the code for effector molecule preference . First, all eight FFRP protein sequences were aligned with five homologous FFRPs (Additional file 2: Figure S2) . Then similarity between the nine effector molecule specifying residues were scored using the BLOSSUM62 matrix and average distance hierarchical clustering was used to define clusters of FFRPs with similar effector molecule preferences.
Discovering context dependent regulation by FFRPs
In total 35 condition sets were utilized for the analysis (Additional file 11: Table S10). These condition sets were further filtered to include only sets where the expression of a FFRP has at least 1.75 fold-change in expression across a condition set. We chose to use 1.75 fold-change as it provided a reasonable number of 127 filtered experimental contexts where an FFRP significantly changed. This resulted in 24, 26, 12, 14, 9, 4, 14, and 21 EO terms for AsnC, VNG1179C, VNG1237C, Trh2, Trh3, Trh4, Trh6, and Trh7 (Additional file 14: Table S13). Median correlation coefficient between the expression of a FFRP and its ChIP-chip targets were calculated for each filtered condition set. Positive and negative correlations were calculated separately, where a positive correlation indicates an activator role and a negative correlation co-efficient indicates a repressor role. Median correlation coefficients between a FFRP and its ChIP-chip targets in a condition set were compared to randomly sampled gene sets of the same size. In total 100,000 permutations were performed and significance of the median correlation coefficient between a FFRP and its target genes under each condition set was calculated based on the resulting permuted p- values (Benjamini-Hochberg multiple hypothesis correction <0.05). As a final filter the variance of an FFRP’s target genes were required to be significantly correlated with the FFRP’s expression (Benjamini-Hochberg multiple hypothesis correction <0.05). Genes whose magnitude of correlation coefficient was greater than the median correlation coefficient of the FFRP’s target genes were considered to be significantly correlated with the FFRP’s expression under that condition set.
Microarray analysis of FFRP deletion strains in relevant condition sets
Total RNA was prepared from cell pellets using the mirVANA RNA kit (Ambion, Austin, TX) according to the manufacturer’s instructions. Whole-genome tiling array, RNA labeling, hybridization and normalization were conducted as previously described . All microarray conditions were collected in biological triplicates. The asnC deletion strain (△ura3 △asnC) and the control △ura3 strain were grown in 4 mM paraquat (PQ) and were sampled at 1 and 160 minutes after PQ addition. Relative activation was calculated for each correlated ChIP-chip target gene separately as the expression level at the 1 minute time point minus the 160 minute time point. The VNG1237C deletion strain (△ura3 △VNG1237C) and the control △ura3 strain were grown in CM media and sampled when the △ura3 optical density at 600 nm (OD600) ~0.18 and ~1.15 (corresponding △VNG1237C OD600~0.15 and 0.69, respectively). Sampling timing for △VNG1237C was designed such that the starting OD600s were as similar as possible and once an OD600 of 1.17 was reached for △ura3 the △VNG1237C was sampled at this same time point. Relative repression was calculated for each correlated ChIP-chip target gene separately as the expression level at the larger OD600 minus the starting OD600. Significant differences between the relative activation and repression were calculated by taking the median of all correlated target genes and comparing the deletion FFRP strain to the △ura3 control strain using an un-paired one-sided Wilcoxon rank-sum test. Deletion microarray data reported in this paper have been deposited in the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO) as GSE62052.
Growth curve analysis
Growth assays were performed in multi-plex using a Bioscreen C instruments (Growth Curves USA, Piscataway, NJ). Each experimental condition was done in technical duplicate and biological triplicate. Cultures for inoculation were titrated from starter cultures to have a similar OD600 which greatly increased the consistency and reproducibility of the growth curve experiments. OD600 was measured every 30 minutes for 4 days after inoculation with a starting OD600 of 0.09. We have developed an R package entitled ‘Growth Curve Analysis Function’ that automates the extraction of the parameters maximum growth rate, time to maximum growth rate and area under the growth curve . Technical duplicates were collapsed based upon the mean of the values. All three parameters were calculated for each set of biological triplicates and then the deletion FFRP strain to the △ura3 control strain were compared using an un-paired two-sided Student’s T-test.
Availability of supporting data
Genome-wide ChIP-chip binding profiles for each FFRP studied and expression profiles for knock-out validation studies are provided under GSE62052. The 446 expression profiles from the 35 different experimental conditions are available under GSE1040, GSE4890 to GSE48900, GSE4925, GSE5557, GSE5924, GSE5925, GSE5929, GSE6776, GSE7559, GSE7609 to GSE7613, GSE7709 to GSE7740, GSE29706, GSE13150, and GSE17515.
This work conducted by ENIGMA was supported by the Office of Science, Office of Biological and Environmental Research, of the U. S. Department of Energy under Contract No. DE-AC02-05CH11231. Additional funding was provided by grants from: the U.S. Department of Energy (DE-FG02-04ER64685 to NB, DE-FG02-07ER64327 to NB, DE-FG02-08ER64685 to NB); the U.S. National Science Foundation (EAGER – MSB-1237267 to NB, DB-1262637 to NB); and by the U.S. National Institutes of Health (2P50GM076547 and 1R01GM077398-01A2 to NB). CLP was supported by an American Cancer Society Research Scholar grant. JA is a Gordon and Betty Moore Foundation Fellow of the Life Sciences Research Foundation. ANB is supported by the Department of Energy Office of Science Graduate Fellowship Program (DOE SCGF), made possible in part by the American Recovery and Reinvestment Act of 2009, administered by ORISE-ORAU under contract no. DE-AC05-06OR23100.
- Brooks AN, Turkarslan S, Beer KD, Lo FY, Baliga NS: Adaptation of cells to new environments. Wiley Interdiscip Rev Syst Biol Med. 2011, 3: 544-561. 10.1002/wsbm.136.PubMed CentralView ArticlePubMedGoogle Scholar
- Singh LN, Hannenhalli S: Functional diversification of paralogous transcription factors via divergence in DNA binding site motif and in expression. PLoS One. 2008, 3: e2345-10.1371/journal.pone.0002345.PubMed CentralView ArticlePubMedGoogle Scholar
- Singh LN, Hannenhalli S: Correlated changes between regulatory cis elements and condition-specific expression in paralogous gene families. Nucleic Acids Res. 2010, 38: 738-749. 10.1093/nar/gkp989.PubMed CentralView ArticlePubMedGoogle Scholar
- Yokoyama K, Ishijima SA, Clowney L, Koike H, Aramaki H, Tanaka C, Makino K, Suzuki M: Feast/famine regulatory proteins (FFRPs): Escherichia coli Lrp, AsnC and related archaeal transcription factors. FEMS Microbiol Rev. 2006, 30: 89-108. 10.1111/j.1574-6976.2005.00005.x.View ArticlePubMedGoogle Scholar
- Calvo JM, Matthews RG: The leucine-responsive regulatory protein, a global regulator of metabolism in Escherichia coli. Microbiol Rev. 1994, 58: 466-490.PubMed CentralPubMedGoogle Scholar
- Peeters E, Charlier D: The Lrp family of transcription regulators in archaea. Archaea Vanc BC. 2010, 2010: 750457-Google Scholar
- Pérez-Rueda E, Janga SC: Identification and genomic analysis of transcription factors in archaeal genomes exemplifies their functional architecture and evolutionary origin. Mol Biol Evol. 2010, 27: 1449-1459. 10.1093/molbev/msq033.PubMed CentralView ArticlePubMedGoogle Scholar
- Ng WV, Kennedy SP, Mahairas GG, Berquist B, Pan M, Shukla HD, Lasky SR, Baliga NS, Thorsson V, Sbrogna J, Swartzell S, Weir D, Hall J, Dahl TA, Welti R, Goo YA, Leithauser B, Keller K, Cruz R, Danson MJ, Hough DW, Maddocks DG, Jablonski PE, Krebs MP, Angevine CM, Dale H, Isenbarger TA, Peck RF, Pohlschroder M, Spudich JL: Genome sequence of Halobacterium species NRC-1. Proc Natl Acad Sci U S A. 2000, 97: 12176-12181. 10.1073/pnas.190337797.PubMed CentralView ArticlePubMedGoogle Scholar
- Leonard PM, Smits SH, Sedelnikova SE, Brinkman AB, de Vos WM, van der Oost J, Rice DW, Rafferty JB: Crystal structure of the Lrp-like transcriptional regulator from the archaeon Pyrococcus furiosus. EMBO J. 2001, 20: 990-997. 10.1093/emboj/20.5.990.PubMed CentralView ArticlePubMedGoogle Scholar
- Ouhammouch M, Geiduschek EP: A thermostable platform for transcriptional regulation: the DNA-binding properties of two Lrp homologs from the hyperthermophilic archaeon Methanococcus jannaschii. EMBO J. 2001, 20: 146-156. 10.1093/emboj/20.1.146.PubMed CentralView ArticlePubMedGoogle Scholar
- Ettema TJG, Brinkman AB, Tani TH, Rafferty JB, Van Der Oost J: A novel ligand-binding domain involved in regulation of amino acid metabolism in prokaryotes. J Biol Chem. 2002, 277: 37464-37468. 10.1074/jbc.M206063200.View ArticlePubMedGoogle Scholar
- Ouhammouch M, Geiduschek EP: An expanding family of archaeal transcriptional activators. Proc Natl Acad Sci U S A. 2005, 102: 15423-15428. 10.1073/pnas.0508043102.PubMed CentralView ArticlePubMedGoogle Scholar
- Okamura H, Yokoyama K, Koike H, Yamada M, Shimowasa A, Kabasawa M, Kawashima T, Suzuki M: A structural code for discriminating between transcription signals revealed by the feast/famine regulatory protein DM1 in complex with ligands. Struct Lond Engl 1993. 2007, 15: 1325-1338.Google Scholar
- Schwaiger R, Schwarz C, Furtwängler K, Tarasov V, Wende A, Oesterhelt D: Transcriptional control by two leucine-responsive regulatory proteins in Halobacterium salinarum R1. BMC Mol Biol. 2010, 11: 40-10.1186/1471-2199-11-40.PubMed CentralView ArticlePubMedGoogle Scholar
- Hart BR, Blumenthal RM: Unexpected coregulator range for the global regulator Lrp of Escherichia coli and Proteus mirabilis. J Bacteriol. 2011, 193: 1054-1064. 10.1128/JB.01183-10.PubMed CentralView ArticlePubMedGoogle Scholar
- Song N, Nguyen Duc T, van Oeffelen L, Muyldermans S, Peeters E, Charlier D: Expanded target and cofactor repertoire for the transcriptional activator LysM from Sulfolobus. Nucleic Acids Res. 2013, 41: 2932-2949. 10.1093/nar/gkt021.PubMed CentralView ArticlePubMedGoogle Scholar
- Vassart A, Van Wolferen M, Orell A, Hong Y, Peeters E, Albers S-V, Charlier D: Sa-Lrp from Sulfolobus acidocaldarius is a versatile, glutamine-responsive, and architectural transcriptional regulator. Microbiology Open. 2013, 2: 75-93. 10.1002/mbo3.58.PubMed CentralView ArticlePubMedGoogle Scholar
- Liu H, Orell A, Maes D, van Wolferen M, Lind’s A-C, Bernander R, Albers S-V, Charlier D, Peeters E: BarR, an Lrp-type transcription factor in Sulfolobus acidocaldarius, regulates an aminotransferase gene in a ?-alanine responsive manner. Mol Microbiol. 2014, 92: 625-639. 10.1111/mmi.12583.View ArticlePubMedGoogle Scholar
- Anantharaman V, Koonin EV, Aravind L: Regulatory potential, phyletic distribution and evolution of ancient, intracellular small-molecule-binding domains. J Mol Biol. 2001, 307: 1271-1292. 10.1006/jmbi.2001.4508.View ArticlePubMedGoogle Scholar
- Kaur A, Pan M, Meislin M, Facciotti MT, El-Gewely R, Baliga NS: A systems view of haloarchaeal strategies to withstand stress from transition metals. Genome Res. 2006, 16: 841-854. 10.1101/gr.5189606.PubMed CentralView ArticlePubMedGoogle Scholar
- Pang WL, Kaur A, Ratushny AV, Cvetkovic A, Kumar S, Pan M, Arkin AP, Aitchison JD, Adams MWW, Baliga NS: Metallochaperones regulate intracellular copper levels. PLoS Comput Biol. 2013, 9: e1002880-10.1371/journal.pcbi.1002880.PubMed CentralView ArticlePubMedGoogle Scholar
- Gu X, Zou Y, Su Z, Huang W, Zhou Z, Arendsee Z, Zeng Y: An update of DIVERGE software for functional divergence analysis of protein family. Mol Biol Evol. 2013, 30: 1713-1719. 10.1093/molbev/mst069.View ArticlePubMedGoogle Scholar
- Yokoyama K, Nogami H, Kabasawa M, Ebihara S, Shimowasa A, Hashimoto K, Kawashima T, Ishijima SA, Suzuki M: The DNA-recognition mode shared by archaeal feast/famine-regulatory proteins revealed by the DNA-binding specificities of TvFL3, FL10, FL11 and Ss-LrpB. Nucleic Acids Res. 2009, 37: 4407-4419. 10.1093/nar/gkp378.PubMed CentralView ArticlePubMedGoogle Scholar
- Nguyen-Duc T, van Oeffelen L, Song N, Hassanzadeh-Ghassabeh G, Muyldermans S, Charlier D, Peeters E: The genome-wide binding profile of the Sulfolobus solfataricus transcription factor Ss-LrpB shows binding events beyond direct transcription regulation. BMC Genomics. 2013, 14: 828-10.1186/1471-2164-14-828.PubMed CentralView ArticlePubMedGoogle Scholar
- Kawashima T, Aramaki H, Oyamada T, Makino K, Yamada M, Okamura H, Yokoyama K, Ishijima SA, Suzuki M: Transcription regulation by feast/famine regulatory proteins, FFRPs, in archaea and eubacteria. Biol Pharm Bull. 2008, 31: 173-186. 10.1248/bpb.31.173.View ArticlePubMedGoogle Scholar
- Peeters E, Wartel C, Maes D, Charlier D: Analysis of the DNA-binding sequence specificity of the archaeal transcriptional regulator Ss-LrpB from Sulfolobus solfataricus by systematic mutagenesis and high resolution contact probing. Nucleic Acids Res. 2007, 35: 623-633. 10.1093/nar/gkl1095.PubMed CentralView ArticlePubMedGoogle Scholar
- Yokoyama K, Ishijima SA, Koike H, Kurihara C, Shimowasa A, Kabasawa M, Kawashima T, Suzuki M: Feast/famine regulation by transcription factor FL11 for the survival of the hyperthermophilic archaeon Pyrococcus OT3. Struct Lond Engl 1993. 2007, 15: 1542-1554.Google Scholar
- Baliga NS, Bjork SJ, Bonneau R, Pan M, Iloanusi C, Kottemann MCH, Hood L, DiRuggiero J: Systems level insights into the stress response to UV radiation in the halophilic archaeon Halobacterium NRC-1. Genome Res. 2004, 14: 1025-1035. 10.1101/gr.1993504.PubMed CentralView ArticlePubMedGoogle Scholar
- Whitehead K, Kish A, Pan M, Kaur A, Reiss DJ, King N, Hohmann L, DiRuggiero J, Baliga NS: An integrated systems approach for understanding cellular responses to gamma radiation. Mol Syst Biol. 2006, 2: 47-10.1038/msb4100091.PubMed CentralView ArticlePubMedGoogle Scholar
- Facciotti MT, Reiss DJ, Pan M, Kaur A, Vuthoori M, Bonneau R, Shannon P, Srivastava A, Donohoe SM, Hood LE, Baliga NS: General transcription factor specified global gene regulation in archaea. Proc Natl Acad Sci U S A. 2007, 104: 4630-4635. 10.1073/pnas.0611663104.PubMed CentralView ArticlePubMedGoogle Scholar
- Schmid AK, Reiss DJ, Kaur A, Pan M, King N, Van PT, Hohmann L, Martin DB, Baliga NS: The anatomy of microbial cell state transitions in response to oxygen. Genome Res. 2007, 17: 1399-1413. 10.1101/gr.6728007.PubMed CentralView ArticlePubMedGoogle Scholar
- Bonneau R, Facciotti MT, Reiss DJ, Schmid AK, Pan M, Kaur A, Thorsson V, Shannon P, Johnson MH, Bare JC, Longabaugh W, Vuthoori M, Whitehead K, Madar A, Suzuki L, Mori T, Chang D-E, Diruggiero J, Johnson CH, Hood L, Baliga NS: A predictive model for transcriptional control of physiology in a free living cell. Cell. 2007, 131: 1354-1365. 10.1016/j.cell.2007.10.053.View ArticlePubMedGoogle Scholar
- Schmid AK, Reiss DJ, Pan M, Koide T, Baliga NS: A single transcription factor regulates evolutionarily diverse but functionally linked metabolic pathways in response to nutrient availability. Mol Syst Biol. 2009, 5: 282-10.1038/msb.2009.40.PubMed CentralView ArticlePubMedGoogle Scholar
- Koide T, Reiss DJ, Bare JC, Pang WL, Facciotti MT, Schmid AK, Pan M, Marzolf B, Van PT, Lo F-Y, Pratap A, Deutsch EW, Peterson A, Martin D, Baliga NS: Prevalence of transcription promoters within archaeal operons and coding sequences. Mol Syst Biol. 2009, 5: 285-10.1038/msb.2009.42.PubMed CentralView ArticlePubMedGoogle Scholar
- Kaur A, Van PT, Busch CR, Robinson CK, Pan M, Pang WL, Reiss DJ, DiRuggiero J, Baliga NS: Coordination of frontline defense mechanisms under severe oxidative stress. Mol Syst Biol. 2010, 6: 393-10.1038/msb.2010.50.PubMed CentralView ArticlePubMedGoogle Scholar
- Schmid AK, Pan M, Sharma K, Baliga NS: Two transcription factors are necessary for iron homeostasis in a salt-dwelling archaeon. Nucleic Acids Res. 2011, 39: 2519-2533. 10.1093/nar/gkq1211.PubMed CentralView ArticlePubMedGoogle Scholar
- Platko JV, Willins DA, Calvo JM: The ilvIH operon of Escherichia coli is positively regulated. J Bacteriol. 1990, 172: 4563-4570.PubMed CentralPubMedGoogle Scholar
- Lin R, D’Ari R, Newman EB:Lambda placMu insertions in genes of the leucine regulon: extension of the regulon to genes not regulated by leucine. J Bacteriol. 1992, 174: 1948-1955.PubMed CentralPubMedGoogle Scholar
- Itzkovitz S, Tlusty T, Alon U: Coding limits on the number of transcription factors. BMC Genomics. 2006, 7: 239-10.1186/1471-2164-7-239.PubMed CentralView ArticlePubMedGoogle Scholar
- Rhee HS, Pugh BF: Comprehensive genome-wide protein-DNA interactions detected at single-nucleotide resolution. Cell. 2011, 147: 1408-1419. 10.1016/j.cell.2011.11.013.PubMed CentralView ArticlePubMedGoogle Scholar
- Dehal PS, Joachimiak MP, Price MN, Bates JT, Baumohl JK, Chivian D, Friedland GD, Huang KH, Keller K, Novichkov PS, Dubchak IL, Alm EJ, Arkin AP: MicrobesOnline: an integrated portal for comparative and functional genomics. Nucleic Acids Res. 2010, 38 (Database issue): D396-D400. 10.1093/nar/gkp919.PubMed CentralView ArticlePubMedGoogle Scholar
- Tamura K, Stecher G, Peterson D, Filipski A, Kumar S: MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013, 30: 2725-2729. 10.1093/molbev/mst197.PubMed CentralView ArticlePubMedGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.PubMed CentralView ArticlePubMedGoogle Scholar
- Rzhetsky A, Nei M: METREE: a program package for inferring and testing minimum-evolution trees. Comput Appl Biosci CABIOS. 1994, 10: 409-412.PubMedGoogle Scholar
- Marzolf B, Deutsch EW, Moss P, Campbell D, Johnson MH, Galitski T: SBEAMS-microarray: database software supporting genomic expression analyses for systems biology. BMC Bioinformatics. 2006, 7: 286-10.1186/1471-2105-7-286.PubMed CentralView ArticlePubMedGoogle Scholar
- Reiss DJ, Facciotti MT, Baliga NS: Model-based deconvolution of genome-wide DNA binding. Bioinforma Oxf Engl. 2008, 24: 396-403. 10.1093/bioinformatics/btm592.View ArticleGoogle Scholar
- Alexa A, Rahnenführer J, Lengauer T: Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinforma Oxf Engl. 2006, 22: 1600-1607. 10.1093/bioinformatics/btl140.View ArticleGoogle Scholar
- Ashworth J, Plaisier CL, Lo FY, Reiss DJ, Baliga NS: Inference of expanded Lrp-like feast/famine transcription factor targets in a non-model organism using protein structure-based prediction. PLoS One. 2014, 9: e107863-10.1371/journal.pone.0107863.PubMed CentralView ArticlePubMedGoogle Scholar
- Bailey TL, Elkan C: Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol ISMB Int Conf Intell Syst Mol Biol. 1994, 2: 28-36.Google Scholar
- Gupta S, Stamatoyannopoulos JA, Bailey TL, Noble WS: Quantifying similarity between motifs. Genome Biol. 2007, 8: R24-10.1186/gb-2007-8-2-r24.PubMed CentralView ArticlePubMedGoogle Scholar
- Turkarslan S, Reiss DJ, Gibbins G, Su WL, Pan M, Bare JC, Plaisier CL, Baliga NS: Niche adaptation by expansion and reprogramming of general transcription factors. Mol Syst Biol. 2011, 7: 554-10.1038/msb.2011.87.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.