The use of genome-scale metabolic network reconstruction to predict fluxes and equilibrium composition of N-fixing versus C-fixing cells in a diazotrophic cyanobacterium, Trichodesmium erythraeum

Gardner, Joseph J.; Boyle, Nanette R.

doi:10.1186/s12918-016-0383-z

Research article
Open access
Published: 19 January 2017

The use of genome-scale metabolic network reconstruction to predict fluxes and equilibrium composition of N-fixing versus C-fixing cells in a diazotrophic cyanobacterium, Trichodesmium erythraeum

Joseph J. Gardner¹ &
Nanette R. Boyle¹

BMC Systems Biology volume 11, Article number: 4 (2017) Cite this article

5118 Accesses
13 Citations
Metrics details

Abstract

Background

Computational, genome based predictions of organism phenotypes has enhanced the ability to investigate the biological phenomena that help organisms survive and respond to their environments. In this study, we have created the first genome-scale metabolic network reconstruction of the nitrogen fixing cyanobacterium T. erythraeum and used genome-scale modeling approaches to investigate carbon and nitrogen fluxes as well as growth and equilibrium population composition.

Results

We created a genome-scale reconstruction of T. erythraeum with 971 reactions, 986 metabolites, and 647 unique genes. We then used data from previous studies as well as our own laboratory data to establish a biomass equation and two distinct submodels that correspond to the two cell types formed by T. erythraeum. We then use flux balance analysis and flux variability analysis to generate predictions for how metabolism is distributed to account for the unique productivity of T. erythraeum. Finally, we used in situ data to constrain the model, infer time dependent population compositions and metabolite production using dynamic Flux Balance Analysis. We find that our model predicts equilibrium compositions similar to laboratory measurements, approximately 15.5% diazotrophs for our model versus 10-20% diazotrophs reported in literature. We also found that equilibrium was the most efficient mode of growth and that equilibrium was stoichiometrically mediated. Moreover, the model predicts that nitrogen leakage is an essential condition of optimality for T. erythraeum; cells leak approximately 29.4% total fixed nitrogen when growing at the optimal growth rate, which agrees with values observed in situ.

Conclusion

The genome-metabolic network reconstruction allows us to use constraints based modeling approaches to predict growth and optimal cellular composition in T. erythraeum colonies. Our predictions match both in situ and laboratory data, indicating that stoichiometry of metabolic reactions plays a large role in the differentiation and composition of different cell types. In order to realize the full potential of the model, advance modeling techniques which account for interactions between colonies, the environment and surrounding species need to be developed.

Background

Nitrogen serves a critical role in the metabolism of all organisms. As a key component in nucleic acids and proteins, it is required for healthy growth and it is often one of the most limiting nutrients for optimal yield. Human intervention via the Haber-Bosch process for the production of ammonia has greatly shifted the global nitrogen cycle, however many ecosystems still rely heavily on biological nitrogen fixation. One such ecosystem is in the open ocean, which is a nutrient-limited environment and organisms that thrive here have evolved to thrive in deplete conditions. Trichodesmium is a genus of filamentous diazotrophic (nitrogen fixing) cyanobacteria that not only flourishes in this environment but provides bio-available nitrogen for surrounding species. Trichodesmium is responsible for fixing roughly 100 TgNy^-1 of nitrogen annually (42% of global N fixation) [1] and has been reported to ‘leak’ 30-50% of the nitrogen it fixes [2]. The genus is ubiquitous in marine environments; it is found in environments as diverse as the Mediterranean Sea [3], the Pacific Ocean [4–6], and the Great Barrier Reef where it has implications not only as a source of nitrogen, but also as a center for eutrophication [7]. It dwells primarily near the surface [8] and can swell to occupy acres of the ocean or sea. Despite its prominence in the global nitrogen cycle, most research efforts have focused on in situ sampling and therefore little has been done to model and or predict the effect of different environmental factors on the growth and nitrogen fixation rates in Trichodesmium.

Trichodesmium is a colonial cyanobacteria which grows in multicellular filaments called trichomes, each containing about 130 cells [9]. Trichodesmium is a non-heterocystous cyanobacterium which means it does not employ specialized cells (heterocysts) for nitrogen fixation. Instead, nitrogen fixation and photosynthesis can occur within the same cell. Most non-hetrocystous cyanobacteria separate oxygen producing photosynthesis from nitrogenase by using temporal separation; they fix nitrogen at night when the cellular metabolism is in respiration mode (consuming carbohydrates stored during the day by photosynthesis). Trichodesmium is unique in its mechanism to fix nitrogen, it fixes nitrogen during the day while simultaneously fixing carbon via photosynthesis. Respiration rates in Trichodesmium are reported to be higher than other cyanobacteria, which ensures a micro- or anaerobic environment and thus minimizes the potential poisoning of nitrogenase by oxygen [10, 11]. Nitrogenase is only expressed in a subset (10-20%) of cells consecutively arranged in the middle of the trichome. These diazotrophic cells only express photosystem I because photosystem II produces oxygen [10, 12–15]. Current characterization of Trichodesmium is limited predominantly to population level observations due to its genetic intractability and difficulty to culture. While several laboratory studies investigating the complex genome [16–18], transcriptome [19, 20], and proteome [21] have been published, most relate to populations level or sparse in situ studies in diverse, non-ideal growth conditions. A handful of other recent studies report on the morphology/structure of the cells [8, 10, 22, 23] and how cells respond to iron, nickel, and other nutrient stresses [24–27]. Despite the availability of these studies, they are limited in scope and do not provide a complete picture of Trichodesmium on a cellular scale. The long doubling time (57-98 h), low growth density (~100mg/L) [24, 28–30], and lack of genetic tools have limited laboratory based research on Trichodesmium, especially compared to other diazotrophic cyanobacteria such as Anabaena and Cyanothece.

This work presents the first genome-scale reconstruction of a colony forming diazotrophic cyanobacterium, T. erythraeum, a leader in marine nitrogen fixation. It models biological optimization of metabolic exchange and biomass creation through Flux Balance Analysis (FBA) and Flux Variability Analysis (FVA) [31] to predict the different metabolic behaviors of the two cell types formed by T. erythraeum, nitrogen fixing cells and photoautotrophic cells, constrained by laboratory or published data/observations. The models described in this work illustrate how T. erythraeum divides labor between two cells stoichiometrically and applies the first step towards a multi-objective framework of these bilaterally operating cells. These results are extended to understand overall population compositions and metabolite production rates to visualize what role metabolite passage plays in formation of these complex colonies via dynamic Flux Balance Analysis (dFBA) [32] and a population co-optimization algorithm. This model lays the foundation for future colonial cyanobacteria characterization and integration with in situ and transcriptomic data for T. erythraeum.

Results

Biomass composition and growth rates

Cells were grown in YBC-II medium in ambient air (79% N₂) or supplemented with either 100 μ mol KNO₃ as a nitrogen source. Growth was monitored by measuring total chlorophyll content (see Fig. 1a). These data were then used to calculate the growth rate and doubling time assuming exponential growth (Table 1).

Table 1 Growth rates and doubling times of T. erythraeum

Full size table

The biomass composition of T. erythraeum was measured in order to formulate an accurate biomass formation equation. The composition of cells grown on both ambient air and nitrate were measured (see Table 2 as described in the methods section. The “soluble pool” is a collection of soluble metabolites that are well conserved between organisms for survival and includes small sugars, energy carrying molecules, and other small molecules. To accurately represent proteins and lipids, the amino acid and fatty acid composition of cells were measured as well (see Additional file 8: Tables S1 and S2). We attempted to grow cells on ammonium because we hypothesized that this would remove the necessity of diazotroph formation and enable all the cells in the trichome to be carbon-fixing. This would allow biomass measurement of photoautotrophs directly; however, growth on ammonium in the laboratory was not possible as seems to be consistent with some other cyanobacteria. Therefore we used the composition of cells grown on N₂ as the average biomass composition for all subsequent modeling efforts.

Table 2 Biomass composition of T. erythraeum

Full size table

Network reconstruction and manual curation

Reconstructing a complete genome-scale metabolic network from a genomic sequence required several iterations. The first draft of the network was created using an automated genome-scale model algorithm, the SEED RAST [33] and contained 956 reactions. Automated metabolic network reconstructions rely on homology to well characterized and annotated model organisms such as E. coli or S. cerevisiae. Therefore, the unique metabolic pathways for photosynthesis and nitrogen fixation were not accounted for in the initial draft. The initial automated draft had several gaps and missing reactions, therefore several iterations of manual curation were necessary to fill in missing reactions and infer the presence of reactions that were not predicted based on homology to model organisms. First, we focused on closing the balance on reactions associated with biomass and cellular energy pathways (light harvesting, ATP cycling, and Redox reactions); this was done by referencing metabolic pathway databases such as BioCyc [34] and KEGG [35]. We then compared our draft network to genome scale reconstructions of other related organisms, including Cyanothece ATCC 51142 [36], Synechocystis sp. PCC 6803 [37], Synechococcus sp. PCC 7002 [38], Arabidopsis thaliana [39], Phaeodactylum tricornutum [40], and Chlamydomonas reinhardtii [41, 42]. Through comparison, we added more photosynthesis-specific metabolic reactions which were predicted to be present in the genome based on BLAST results. Finally, transport reactions included in the model were selected based on proteomic data or diffusion (CO₂, H₂O, N_2, etc.) [16]. Manual curation efforts built the model out to a maximum of 1035 reactions; closer inspection of the reactions revealed that several were predicted by the SEED algorithm but had no significant homology to the T. erythraeum genome and were non-essential, therefore they were removed. The current draft of the genome-scale metabolic model (Additional file 1: Table S1 and Additional file 2: Table S2) for T. erythraeum contains 971 reactions: 1 biomass formation equation (based on experimental data), 9 macromolecule synthesis and condensation reactions, 27 exchange reactions, 38 transport reactions (validated by proteomics data), and 907 metabolic reactions. These reactions involve 647 unique genes and 986 metabolites. Despite our manual curation efforts, the model still has 215 dead end reactions: 113 are involved in lipid, amino acid, or pyrimidine/purine metabolism and are bypassed by summary reactions (see Additional file 3 for a complete list). Manual curation of the model led us to identify 9 genes which are present in the genome based on homology but not annotated, 5 genes encoding enzymes with related functions we assume to be promiscuous and 1 gene which is required for the production of biomass but was not present in the genome based on homology (see Table 3).

Table 3 Unannotated metabolic reactions in the T. erythraeum genome but included in the model based on homology to related organisms and/or to close gaps for biomass formation

Full size table

Flux balance analysis

A T. erythraeum trichome is made up of cells with two distinct metabolic modes: photoautotrophic and diazotrophic. Each cell type was modeled separately and thus required a different set of constraints to define the cell type. The specific constraints applied to each cell type based on literature and experimental data are provided in the methods section (Table 6). Growth associated ATP demand is assumed to be identical to Cyanothece sp. ATCC 51142 [36]: 544 mmol ATP (g DW h)^-1. Maintenance energy, represented by the reaction EN_ATP: ATP + H₂O → ADP + H⁺ + Pi was adjusted until the predicted growth rate matched published experimental growth rates (0.0146 h^-1) [29]. Maintenance energy demands were found to be: 64.3 mmol ATP (g DW h)^-1 for photoautotrophs and 67.2 mmol ATP (g DW h)^-1 for diazotrophs. We hypothesize that this number is significantly higher than heterotrophic bacteria for two reasons: (i) maintaining a micro-aerobic or anaerobic environment in the cells for nitrogenase is energetically demanding and (ii) we do not constrain the photon absorption beyond the amount provided to the cells in the laboratory despite knowledge that cells aren’t 100% efficient at light harvesting. The COBRA toolbox [43] was used to evaluate the biomass yields for each cell type separately subject to published growth rates, carbon dioxide uptake, and nitrogenase activity (see Table 4). An .m file to run the model in Matlab with appropriate constraints has been included in the supplemental files (Additional file 4).

Table 4 Predicted yields and selected fluxes for T. erythraeum

Full size table

Flux maps were generated in order to visualize carbon trafficking within the cell for both cell types (illustrated in Fig. 2 with data in Additional file 5: Table S5) subject to the constraints specified in Table 6. As expected, the model for a photoautotrophic cell exhibited high flux through the non-oxidative pentose phosphate pathway and gluconeogenesis. Moreover, the highest recorded flux is through ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO), the enzyme responsible for carbon fixation. The TCA and glyoxylate cycles are partially inactive; the majority of energy is produced through lower glycolysis or photosynthesis and the TCA Cycle’s main utility is in precursor biosynthesis. The diazotroph, on the other hand, has higher flux in the pathways associated with respiration: the oxidative pentose phosphate pathway and TCA cycle in particular displayed substantial activity. The glyoxylate shunt also has high flux, indicating crucial differences in how metabolism is regulated in the different cell types.

Another utility of a FBA model is the ability to predict essential genes. We performed an in silico gene knockout analysis and identified 275 genes as essential in phototrophic cells and 253 in diazotrophic cells (see red-coded genes in Additional file 6: Genes Table S6). Essential genes are frequently linked to biomass relevant compound synthesis (like pigments and amino acids), carbon and/or nitrogen fixation, or glycolysis. Genes and reactions which decrease growth rate but were not lethal were frequently linked to central carbon processing. These gene knockout results are corroborated by reaction analysis, where reactions are removed from the model instead of genes; this analysis found 370 reactions essential in a photoautotroph and 363 in a diazotroph (see red coded reactions in Additional file 6: Reactions Table S6). Most reactions overlapped, but 6 carbon fixation reactions and 3 gluconeogenesis reactions were unique to essentiality in photoautotrophs while ammonium output and 2 nitrogen fixation reactions were unique to diazotrophs. Unfortunately, T. erythraeum has not been reported to be genetically tractable and therefore it is not possible to experimentally validate these results.

Flux variability analysis (FVA)

Flux balance analysis uses optimization techniques to predict fluxes in the cell, but most (if not all) genome-scale models are underdetermined so there exists the possibility of multiple flux distributions that lead to the same solution (in our case biomass/growth rate). To evaluate the flexibility in our model, we performed flux variability analysis (FVA) to estimate how much variability a particular reaction can tolerate and still give the same solution. Enzymes, selected from important pathways and by comparing different metabolic distributions in the diazotrophic and photoautotrophic FBA, are shown in Fig. 3 and a table of bounds provided as Additional file 7: Table S7. Overall, photoautotrophic cells display tighter metabolic regulation and have required flux through every major pathway except for nitrogen fixation. In comparison, the diazotrophic model displays high variability through these central pathways with the exception of nitrogenase function. For the photoautotrophic model, two genes exhibit fully nonzero flux: phosphoglycerate kinase (PGK) and ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO). Otherwise, two genes, D-fructose-1,6-bisphosphate D-glyceraldehyde 3-phosphate lyase (FPAL) in upper glycolysis, and sedoheptulose-7-phosphate: D-glyceraldehyde-3-phosphate (TAGSFE) in the pentose phosphate pathway exhibit wide ranges with TAGSFE possibly functioning in both directions while the same optimum is achieved. The complete tricarboxylic acid (TCA) cycle is invariably nonfunctional for energy metabolism in optimization of the photoautotrophs (ACCOASYN, MDH, ICLY, and ICIT) and only functions for precursor biosynthesis. Diazotrophs ultimately demonstrate more variability due to their energy source as more redundancy exists in carbon oxidation than in photosynthesis: all reactions can accommodate zero flux except nitrogen fixation. FPAL and TAGSFE both show reversibility (same as photoautotroph with respect to TAGSFE). Although there are similar patterns in the different cell types, energy metabolism is significantly different.

Dynamic flux balance analysis

Dynamic Flux Balance Analysis (dFBA) was conducted to predict how well the model performed when compared to laboratory experiments using the same data to constraint the model as reported earlier (see Table 6). No additional constraints on diatomic nitrogen or carbon dioxide uptake were applied except for the demand from the other cell type We used the dFBA model to test a number of different growth conditions and hypotheses. First, to predict the equilibrium trichome composition, the model was run with an inoculum of equal parts photoautotrophic and diazotrophic cells. Once equilibrium was determined, initial concentrations of each cell type were calculated from experimental data and the simulation was re-run to validate that the most efficient growth and metabolite production occurred at this equilibrium. The resulting colony fractions to which the cells invariably converged were found to be 0.15:0.85 diazotrophs:photoautotrophs, similar to experimental evidence [10]. To understand how the initial inoculum effects the lag phase of the cells, we ran simulations for a number of different inoculums. Given that the algorithm requires a sequential progression of metabolism (with the photoautotroph calculating its metabolism and then the diazotroph reacting), a “grace period” of 3 time steps was built in where the cells could borrow substrates from an arbitrary cache in their environment so that they could grow initially. However, if a population is unable to produce metabolites or biomass after those three consecutive time steps, the cells were assumed to be unviable and were terminated. How the growth rate evolves over time to reach the experimental value (0.0146 h^-1) is shown in Fig. 4. The expected growth rate is obtained rapidly for near-equilibrium starting populations (15:85, 20:80) and more slowly for populations far from equilibrium (90:10). Populations of 0:100 and 100:0 were nonviable. More detailed figures of how the model predicts composition changes when far from equilibrium is shown in Fig. 5 with initial equality of cell types. Cells appear to have three distinct growth phases, the first where they are adjusting their ratios to move closer toward equilibrium (lag phase) while not able to leak metabolites. When the cells are near equilibrium (at ~0.22 diazotroph: 0.78 photoautotroph), they are able to grow exponentially, but are unable to leak ammonium consistent with their exponential growth (Fig. 5c). When they finally reach equilibrium, growth and ammonium leakage is exponential, indicating most efficient growth. In the dFBA model, photoautotrophic cells are unable to grow without diazotrophic cells and vice versa; however, if the models are optimized assuming nutrient unlimited environments (unrestricted availability of glycogen or ammonium), the photoautotroph can grow at a rate of 0.0182 h^-1 and the diazotroph can grow at a maximum rate of 0.0188 h^-1. From these results, the equilibrium found in nature and this study represents the best-case growth scenario for T. erythraeum; this growth appears to be stoichiometrically and metabolically motivated instead of regulated through other means.

This experiment generated several important parameters to compare against literature. Intercellular metabolite production, cell fraction, and percent released of total fixed nitrogen were all predicted using dFBA. The model was run for 1000 iterations with different inoculum compositions to investigate the effect of initial compositions of population development. The generated values for an ideal population (beginning with equilibrium concentrations of each cell type) are summarized in Table 5. The model over predicts nitrogenase flux (0.490 mol N₂ (g DW h)^-1 for the model versus 0.132 mol N₂ (g DW h)^-1 for published measurements but is similar to literature values for carbon dioxide exchange (0.922 mol CO₂ (g DW h)^-1 for the model versus 0.927 mol CO₂ (g DW h)^-1 for laboratory results) [29]. The model resulted in a total biomass of 13.8 mg/L when simulated at equilibrium starting composition, compared to the 10–40 mg/L found in our laboratory experiments. Variability in growth occurred as illustrated in Fig. 6: growth rates peaked at 0.0146 h^-1 for growth rate as expected due to constraints (47.5 h for doubling time) at initial cell fractions of 0.15 diazotrophs: 0.85 photoautotrophs). Initial diazotroph concentrations at zero resulted in no growth while photoautotroph dominated inoculums had the minimum nonzero values of 2.80⨯10^-4 h^-1. Fixed nitrogen release rate distributions and growth rate distributions through all iterations are illustrated in Fig. 7a and b respectively.

Table 5 Productivities of T. erythraeum according to literature, laboratory experiments, and the dFBA model

Full size table

Finally, dFBA was conducted to investigate the effect of different nitrogen sources on the growth rate of T. erythraeum. All other constraints on the mode were held constant and the resulting growth curves are given in Fig. 8. As expected, growth rates increase with increasing levels of nitrogen reduction and carbon in the nitrogen source. Unfortunately, growth on nitrogen sources other than N₂ or NO₃ have proven to be problematic and there is no evidence of T. erythraeum’s ability to utilize other nitrogen sources.

Discussion

In this work, we present a genome-scale metabolic network reconstruction of T. erythraeum which has been experimentally validated and used to predict growth under a variety of conditions.

Experimental data

An important aspect of any metabolic model is to collect experimental data to aid in the development of the model (biomass equation) and for validation (metabolic production and cellular equilibrium). Major biomass constituents were measured experimentally with direction from literature to define the scope of the biomass constituents (such as inclusion of biliproteins and cyanophycin [8, 23]). It should be noted that T. erythraeum differs from other nitrogen fixing organisms and cyanobacteria. Species like Cyanothece and Anabaena either temporally regulate nitrogen fixation using circadian rhythms or through spatial segregation by forming special cells called heterocysts. T. erythraeum, while it does exhibit some circadian regulation [44], can fix nitrogen at all times, day or night. It does form two cell types, but the only evidence of different structures is indicated by accumulation of starches [23]. Cyanophycin granules exist, but are distributed throughout the cells, indicating that they are nitrogen storage compounds rather than structural devices. After assessing the literature based behavior of T. erythraeum and its differences from other bacteria, biomass was evaluated when grown on nitrogen or nitrate. The existence of two separate cell types (diazotrophic and photoautotrophic) within a trichome implies a locally nitrogen limited and nitrogen replete environment. We sought to mimic these effects. As expected, the nitrogen provided to the cell culture has a significant impact on biomass composition, and cells partition their carbon in different ways depending on availability of reduced nitrogen. Since the reduction of diatomic nitrogen to ammonia requires a large amount of energy (16 ATP), it is likely that the cells activate their nitrogen sparing mechanisms in order to limit the amount of nitrogen needed for growth. This is evident in the ratio between carbohydrates and lipids (Fig. 1a), which is significantly lower – only 54% – in cells grown on diatomic nitrogen compared to those grown on nitrate. This is similar to reports of algae which accumulate lipids in response to nitrogen starvation [45–50]. Here, we assume that growth on diatomic nitrogen causes the cells to act like nitrogen is limiting. Cells grown on nitrate have an increased protein content, again highlighting that the cell is no longer acting as if nitrogen is limiting. This increase in protein is also commensurate with an increase in the major photosynthetic pigment-protein complex phycoerythrin. Cultures grown on diatomic nitrogen convert their fixed nitrogen to higher quantities of cyanophycin, a molecule used for nitrogen storage. Meanwhile, this study also sought to characterize the laboratory growth rate of T. erythraeum grown on different sources of nitrogen. Unfortunately, T. erythraeum showed a reticence to grow on ammonium in the laboratory and has no evidence of a putative transporters for amino acids, commensurate with heuristic knowledge of other cyanobacteria, which limited this experimentation. Even so, growth rates are similar to previous laboratory measurements (0.0108 h^-1 for the Boyle Lab and 0.0146 h^-1 from [29]) but are dissimilar to in situ studies with values measured as low as 1.46 × 10^-4 h^-1 in the North Atlantic [22]. This is not surprising, it is well known that the open ocean is extremely nutrient deplete and thus a challenging growth environment. Also, our measurements of growth on different nitrogen sources do not indicate a statistically significant disparity in growth rate based on nitrogen source despite significant changes in cellular composition. As such, the data implies that nitrogen availability is not the major limiting factor in cellular growth. This is supported by literature which demonstrates that T. erythraeum is much more efficient in higher CO₂ concentrations [51]. This means that the major effect of nitrogen source is on cellular composition and implies that carbon fixation is the growth limiting function of T. erythraeum.

Metabolic network reconstruction

Equipped with a high-level biological understanding of the organism, a genome scale reconstruction was built using the annotated genome, enzyme databases, models of related organisms, and the laboratory biomass measurements. The first draft of the genome-scale metabolic network for T. erythraeum was created using the SEED RAST algorithm [33] followed by manual curation. Manual curation was conducted by adding all possible reactions predicted by databases, similar organisms, and gaps based on data in primary literature and publicly available databases [52]. This was pruned to the current model (986 metabolites, 647 unique genes and 971 reactions) by vetting each reaction through BLAST and essentiality. These numbers are comparable to metabolic network reconstructions of similar organisms (Additional file 8: Table S3). Reactions relating to central metabolism, including carbon fixation, glycolysis, the TCA cycle, photosynthesis and the pentose phosphate pathway, are conserved between T. erythraeum and the other cyanobacteria listed in Additional file 8: Table S3. Biomass equations were based on the Cyanothece ATCC 51142 model [36] for lipid and protein formation. The reactions were mass balanced to predict molar masses, and the biomass measurements were interpreted using these computational predictions. Finally, closure was obtained through proteomic assessment of transport reactions in conjunction with the metabolic network and biomass equations. Overall, the manually curated model of T. erythraeum is a strong summary of the majority of metabolic processes and relates well to previous models.

One of the more challenging aspects of building a metabolic network of a non-model organism is the lack of (or completeness of) genome annotation. However, in performing a genome-scale reconstruction, shortcomings in annotation can be identified. In order to achieve closure, 16 reactions were added that are not linked directly to a gene in the T. erythraeum genome by assuming enzyme promiscuity, hypothetical protein function, or a lack of evidence for its existence. To validate or refute the presence of these genes in the genome, we did extensive BLAST analysis with related organisms with more complete annotations (other cyanobacteria, E. coli and A. thaliana). We identified 5 genes encoding enzymes which had evidence of the desired function but were annotated differently (Table 3); we assumed these enzymes to be promiscuous and perform the necessary functions to fill gaps in the network. These assumptions were possible because the genes are often found to have redundant or generic function (the promiscuous genes include phosphatases and transaminases). We identified an additional 10 genes which were previously unannotated but are predicted to be present in the T. erythraeum based on homology to related organisms (Table 3). Most of these genes encode enzymes associated with the glyoxylate cycle, aminotransferases, and amino acid metabolism. Again, some of these enzymes have traditionally exhibited generic behavior (like transaminases), but other enzymes were unannotated due to their single-function synthetic purposes (like amino acid synthesis). Interestingly, enzymes associated with the glyoxylate cycle show the most similarity with ammonium oxidizing bacteria. The aminotransferases and amino acid metabolism enzymes, which generate some of the carbon molecules that participate in the glyoxylate cycle, show most similarity to other colony forming cyanobacteria like Leptolyngbya and Nostoc. This indicates that nitrogen metabolism may have impacts on peripheral carbon metabolism that merits future investigation. Finally, one enzyme which was necessary for the model to produce biomass, (R)-pantoate: NADP⁺ 2-oxidoreductase (E.C. 1.1.1.169), was not found to be present in the genome. This implies that T. erythraeum may have evolved an alternative pathway which has not yet been identified, or that the synthetic route for (R)-pantoate has significant structural differences. Main pathways included in the model along with the several of the newly annotated enzymes are presented in Fig. 9. Confidence levels in the model reflect the strength of prediction of the enzymes, with ones referring to the enzymes discussed above.

Predicting central metabolic fluxes

With a vetted series of metabolic interactions, the model was used to study whole-cell behavior for both diazotrophs and photoautotrophs separately using flux balance analysis (FBA). The two cell types share the same genome but have distinct metabolisms based on differences in regulation due to cellular differentiation: diazotrophic cells provide biologically available nitrogen for the community and photoautotrophic cells provide a reduced carbon source, in the form of glycogen, to the diazotrophs. To model metabolic fluxes for each type of cell, we developed two sets of strict constraints (see Table 6) to reflect the laboratory constraints measured by literature [29]. We fit the predicted growth rate from the model by changing maintenance energy and used metabolite production (of the metabolites exchanged between cells) as the objective equation to ensure satisfaction of both objectives. As expected, flux distributions were specific to cell type. Carbon fixation is central to photoautotrophic function and is evident by high flux through the Calving Benson Bassham cycle. For diazotrophs, the absence of Photosystem II requires a functional TCA cycle for production of cellular energy. Interestingly, the oxidative pentose phosphate pathway was preferred over glycolysis, likely due to minimizing carbon oxidation and/or redox balancing. This same phenomenon explains the activation of the glyoxylate shunt which has evolved to conserve carbon [53–56]. The flux maps envision crucial, but expected, differences in metabolism between cell types that are similar to previous findings.

Table 6 Constraints for each cell type in model simulations for FBA and FVA

Full size table

Gene and reaction essentiality were also performed using the same constraints and agree well with flux balance analysis (see Additional file 6: Table S6) and gene essentiality in related organisms. Essential genes include carbon fixation, biomass synthesis, and central metabolism (gluconeogenesis and lower glycolysis) for photoautotrophs and nitrogen fixation, biomass synthesis, and carbon oxidative cycles for diazotrophic cells. Most notably, ammonium export is required for biomass production in diazotrophs, predicting that ammonium leakage is necessary for redox/energy balancing in these cells. These findings correlated with previous studies which find that photosynthetic pigments and certain amino acid pathways are essential [42] while some carbon metabolism like the later parts of the TCA cycle are dispensable in cyanobacteria [57]. The model is effective, then, in in silico predictions of essential and conserved metabolic motifs.

The flux maps we present here represent only a single possible solution due to the underdetermined system used for optimization. To assess the bounds and possible adaptations of the model, FVA was conducted and visualized for important pathways (Fig. 3). The photoautotrophic cells showed much tighter bounds for central metabolic processes than their diazotrophic counterparts, likely because their metabolites (glycogen) require roughly 64 ATP/mol as opposed to ammonium which requires 8 ATP/mol (16/mol N₂) plus additional maintenance energy. Consistent with gene essentiality and intuition, Ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO) requires a non-zero flux to maintain optimal behavior. Phosphoglycerate kinase (PGK) – which is in lower glycolysis/gluconeogenesis – is a less intuitive non-zero reaction, but the reaction represents an important step in both optimal carbon oxidation and reduction. For both cell types, D-fructose-1,6-bisphosphate D-glyceraldehyde 3-phosphate lyase (FPAL) and sedoheptulose-7-phosphate: D-glyceraldehyde-3-phosphate (TAGSFE), which correspond to reactions in upper glycolysis and the non-oxidative pentose phosphate pathway, show high variability; TAGSFE can even function reversibly while maintaining optimality. Otherwise, the photoautotrophic cells have zero or near-zero flux for all TCA_Cycle enzymes (acetyl-coA synthase: ACCOASYN, malate dehydrogenase: MDH, isocitrate lyase: ICLY, and bifunctional aconitate hydratase 2/2-methylisocitrate dehydratase: ICIT) because of a lack of electron donors available for carbon oxidation. Glutamine synthase (GLNSYN) displays a small flux for ammonium metabolism and incorporation while PEP carboxylase (PCX) can accommodate flux using its role as an alternative to RuBisCO for single-carbon incorporation. In the diazotrophic cells, pyruvate kinase (PYRK) and glutamine synthase (GLNSYN) exhibit high flux capacities due to their substrates being precursors for many essential metabolites in the cell and the energetic latitude given by the reduced carbon supply, but are irreversible. TCA cycle has intermediate flux capacity because the reactions in it represent best-case processing for carbon oxidation, while the glyoxylate shunt (isocitrate lyase, or ICLY) is used for carbon conservation [53–56]. Diazotrophic metabolism only has one significant deviation: the flux through the nitrogenase enzyme is invariable. These data are similar to findings for other photoautotrophic and diazotrophic organisms [36, 58, 59] and illustrate similar conserved energetic and metabolic behaviors.

Modeling equilibrium colony composition

Simulations to this point showed the metabolic differences between the cell types, but had not accommodated their function as a community. For this reason, dynamic FBA (dFBA) was used to expand the study to simulate how gene-predicted metabolism causes higher order community behaviors. Previous studies have used dFBA for similar applications, including diauxic growth in E. coli and the competition between Rhodoferax and Geobacter [32, 60, 61], but this study presents the idea of dynamic allocation of cells based on metabolite production to simulate need-based differentiation. Since all cells are defined by the same genome, differentiate based on the needs of the community and are invested in the viability of surrounding cells, there is a third level of interaction within these communities. To model this, diazotrophs and photoautotrophs were treated like separate, symbiotic, interdependent populations that could achieve perfect diffusion of substrate. Then, based on the deficiency of one metabolite, cells would be allocated from the deficiency generating cell (the consumer) in proportion to the shortage to correct it. If there was only metabolite surplus, all excess cells from the diazotroph would be allocated towards photoautotrophic production which was implied to drive biomass growth. This was run with both this and the opposite assumption (that all excess would be allocated to the diazotroph) with identical results, meaning that this assumption was an artifact of the algorithm rather than a pivotal administration. Importantly, this algorithm requires the photoautotrophs to act by generating biomass and enough glycogen for the prior generation plus a predictive amount based on growth rate before the diazotroph reacts and creates ammonium. Therefore, the model can inappropriately fail since metabolites are not being simultaneously synthesized. The model was “seeded” with initial ammonium and glycogen substrates that served to prevent this from affecting the model with a “grace period” of three time steps to allow the model to correct the initial “loan” from the environment. Once the algorithm was developed, it was used to interrogate population efficiency by determining equilibrium compositions of cells, predicting metabolite excretion fluxes, and to predict the effect of changing environments on the cell (especially for conditions that are not possible experimentally). The laboratory imposed constraints used for FBA were converted to relaxed, optimal constraints to reflect the lower growth rates measured in our lab. When optimized independently, assuming replete nutrients for each cell type, both cells grow much faster than in limited, co-dependent conditions (0.0182 h^-1 for photoautotrophs and 0.0188 h^-1 for diazotrophs versus 0.0146 h^-1 for the population). This is logical: when nutrients are limited overall growth is limited. Another observation is that photoautotrophic growth is slower even ideal conditions, reinforcing the hypothesis that biomass generation is limited more by carbon than nitrogen.

To determine equilibrium behavior by the community and how well the model could predict the behaviors we observed, we conducted dFBA using this algorithm over 360 h with several different starting conditions. First, a 50:50 starter culture was used to model both how a colony might achieve equilibrium through cellular differentiation and the resulting equilibrium. In order to achieve efficiency, three phases govern cell response to metabolite deficiencies as illustrated by Fig. 5. First, photoautotrophs are generated to correct for carbon deficiency over the first 120 h (steeper slopes in Fig. 5a and linear slopes in Fig. 5a and d) and without excretion of metabolites (Fig. 5c). Meanwhile, the diazotrophs are tasked with producing enough fixed nitrogen to support the community, but all biomass generation is diverted towards photoautotrophs because the “seed” carbon or nitrogen is rapidly depleted. In the second phase from hours 120 to 250, where the cells are near but not at equilibrium, the cells redistribute more slowly as the community is able to support itself without reaching a deficiency while producing some metabolites (Fig. 5c). Finally, after about 250 h, equilibrium is reached where more photoautotrophs will deplete the environment of fixed nitrogen and fewer will reduce biomass generation. This is also the stage where metabolite production is most rapid, and ammonium and glycogen are exchanged between cells and the environment exponentially Fig. 5. These results are encouraging because they justify our observation that the model converges to an equilibrium regardless of initial composition, and the equilibrium falls within experimental and in situ levels.

Convergence toward equilibrium was investigated further by incrementally changing initial biomass ratios and plotting growth rate versus time. Rapid convergence by an inoculum of 15:85 diazotroph:photoautotroph is expected and promotes this as ideal behavior by cellular populations (Fig. 4). The convergence by other inoculums reinforces the idea of an equilibrium composition and shows the inefficiency of other starting populations. Finally, since the model assumes no diffusional limitation and similarly structured cells, this convergence to an equilibrium demonstrates that metabolic interactions play a large role in community regulation of T. erythraeum. We repeated this analysis 1000 times to see how the composition of the inoculum affected equilibrium composition of the cells (Fig. 6). In agreement with the single simulation results from above, starting with compositions that were not close to equilibrium resulted in suboptimal growth (for excess diazotrophs) or no growth (for excess photoautotrophs). “Death” (defined as a growth rate of zero for three time steps or more), was strictly contained within the initial phase of growth when the inoculum was 0:100 or 100:0. This is because either nitrogen generation or glycogen generation is too low to support the population before efficient biomass production is achieved and the cells do not have enough exogenous resources to correct this. The equilibrium composition and nitrogen release fraction are both close to observed data: a starter culture closer to the cell composition measured in the ocean (10–20% diazotrophic cells and 80–90% photoautotrophic cells [10, 12–15]) grows nearest the constrained growth rate, while compositions further from that equilibrium result in depressed rates. The full distribution of possible growth rates can be seen in Fig. 7b and illustrates a bimodal distribution. The separation between the two peaks can be explained by the extent of domination by either cell type. When diazotrophs dominate, suboptimal growth occurs; when photoautotrophs dominate, the cells operate optimally or die due to deficient nitrogen production. Figure 7a depicts the range of nitrogen release rates dependent on initial inoculum and display a fairly narrow range. In non-death situations, it is consistent within 25% and 45% of total fixed nitrogen, well within the literature recorded values of 7.7% [62] –52% [22]. Across all growth rates, nitrogen leakage is 29.4% of total fixed nitrogen, but for optimal growth rates ($ \frac{\mu }{\mu_{max}}\ge 0.9 $), nitrogen leakage is 37.5%. Again, nitrogen leakage appears essential to efficiency in biomass generation as evidenced both by observation and simulation. The repetition of the simulation show that an equilibrium composition similar to observation was invariable and is mediated by metabolism while nitrogen is a required side-effect of this optimal growth.

The model was also used to simulate growth conditions that have not been possible to perform in the laboratory. To date, T. erythraeum has not been reported to grow on more reduced nitrogen sources; however, we are able to predict how growth on these other N sources using our model (Fig. 8). With increasingly more reduced nitrogen sources and higher carbon content (N₂ > NH₄ ⁺ > Urea > Glutamate > Glutamine), the growth rate increases from 0.0146 h^-1 to 0.0262 h^-1 with the exception of urea, which has the second lowest growth rate (0.0151 h^-1) due to its requirement of ATP consuming active transport and a byproduct of moderately useful CO₂. In the Boyle laboratory, T. erythraeum only grows on N₂ or nitrate and this general trend is also seen in other laboratory data. T. erythraeum, therefore, lives in a very narrow optimum between two important objectives: carbon and nitrogen fixation. The availability of carbon (or lack thereof) particularly limits growth. In reality, light also has an important role in growth rate because a few centimeters below the surface of the ocean light becomes severely limited. Diffusion, predation, colony shape, and other influential factors can also limit the cell’s ability to grow optimally. Even so, biomass generation predictions correlate well between with laboratory data and predicted equilibrium compositions of cells are close to observed values. In order to more effectively leverage this model for colony modeling more sophisticated methods should be used, however, the model has already enabled genome discovery and investigation into the metabolic regulation of this unique and significant cyanobacterium.

Conclusions

A genome-scale metabolic network reconstruction was performed for the filamentous diazotrophic cyanobacterium, T. erythraeum. This organism has a prominent role in the global nitrogen cycle; it is responsible for 42% of the annual biological nitrogen fixation and secretes between 7.7% [62] and 52% [22] of the nitrogen it fixes into the ocean, providing an important source of bio-available nitrogen to other organisms. This model was then subjected to constraints based modeling, such as FBA, FVA and dFBA to investigate the effect of changing environmental conditions and initial cell composition on equilibrium cell compositions and to predict inter- and intra-cellular fluxes. Our simulations indicate that cells exhibit traditional metabolic diversions to conserve carbon in nitrogen-fixing cells (diazotrophs) and to conserve energy in carbon fixing cells (photoautotrophs). From simulations with unconstrained carbon/nitrogen uptake, it appears that photoautotrophs have a lower optimal growth rate than diazotrophs; however in nature, these conditions would never exist. Diazotrophs rely on photoautotrophs for their required carbon and cannot grow in their absence. The model predicts an optimal growth rate for a trichome made up of 15.4% diaztroph, this agrees well with published reports of 10–20% [10, 12–15]. Thus, the model is capable of accurately predicting the required cellular compositions for optimal growth, implying that the composition of cells within a trichome is largely determined by stoichiometry. The success of this and other correlations between our model and laboratory and in situ measurements infers that the hypothesis that metabolites are the main influence for cell differentiation for T. erythraeum is correct.

The genome-scale metabolic network reconstruction and subsequent model simulations lay the foundation for further interrogation of the metabolism of T. erythraeum. This globally significant diazotroph plays an integral role in the ocean, providing a much needed nitrogen source in a very deplete environment. The modeling techniques we have employed above perform well in terms of predicting the growth of a trichome in an ideal environment; but to fully capture the role of T. erythraeum in the ocean, more advanced modeling techniques must be developed. In its native environment, T. erythraeum interacts with many other species and therefore multiscale models which can capture not only the interaction of the cells with their environment but interactions between cells (identical and other species) must be developed. These types of multi-scale models may also prove useful in modeling and predicting the effect of rising temperatures and carbon dioxide levels in the atmosphere.

Methods

Cell cultivation

Trichodesmium erythraeum IMS101 cells were acquired from the Bigelow Laboratory for Ocean Sciences (East Boothbay, ME, USA). They were grown in an New Brunswick (Hamburg, Germany) Innova 44R incubator at 24 °C with 80 μE/m²/s with 12h light/12h dark cycles. Cells were grown in artificial seawater YBC-II medium [44] at pH 8.15-8.20. Where specified, KNO₃ was added to final concentration of 100 μM. All chemicals were obtained from Sigma-Aldrich (St. Louis, MO). Growth rate was monitored by measuring chlorophyll absorbance [63] from 50 mL of culture every two days.

Biomass composition

Total biomass mass was determined by dry weight analysis, cells were filtered with a Whatman 0.22 μm cellulose-nitrate filter and dried in an oven overnight.

Protein quantitation

Total protein was quantified using the Pierce BCA Protein Assay Kit (Waltham, MS, USA). Cyanophycin and phycoerythrin are both protein-based compounds, so to avoid double counting, their mass fractions were subtracted from the total protein content. Proteins were hydrolyzed, derivatized and analyzed on an Agilent 5973 Mass Detector with an Agilent 6890N Network GC System on an HP-5 column to determine the amino acid composition following the method by Antoniewicz et al. [64].

Carbohydrate quantitation

Carbohydrates were measured colorimetrically using the anthrone method [65] against glycogen as a standard.

Cyanophycin quantitation

Cyanophycin is a primary nitrogen storage molecule for diazotrophs and is polymerized arginine and aspartamine in a 1:1 ratio [66]. Cyanophycin was extracted by disrupting 740 μL of 250 mL cells concentrated to 2 mL via filtration and rinsing with TE buffer with 2.70 mg/mL lysozyme overnight at 37 ºC, centrifuging at 16,100 x G for 5 min, and resuspending the pellet in 1 mL of 0.01 M HCl (in which cyanophycin is soluble) for 2 h. The extraction was repeated on the pellet, the supernatant fractions were combined, and cyanophycin was quantified colorimetrically using the Sakaguchi reaction [67].

Phycoerythrin quantitation

Phycoerythrin is a protein-pigment complex that is abundant in T. erythraeum and absorbs light for photosynthesis. It is soluble in neutral and slightly basic environments and the supernatant resulting from disruption of 740 μL of cells concentrated from 250 mL to 2 mL via 2.70 mg/mL lysozyme was collected and measured spectrophotometrically at 455 nm, 565 nm, and 592 nm, and quantitated using the equation [R − PE] = 0.12 × [(A ₅₆₅ − A ₅₉₂) − 0.20 × (A ₄₅₅ − A ₅₉₂)] [68].

Lipid quantitation

Lipids were extracted using the Bligh and Dyer method [69]. Chlorophyll a and phycocyanin are both lipids, so their masses were subtracted from the total lipid to avoid double counting.

Total lipids were assumed to be made up of the four major subclasses: SQDGs, MGDGs, DGDGs, and PGPs [70]. Fatty acid composition was determined via GC/MS to determine the relative carbon length that constituted lipids in total. The carbon length ratios (C₁₈:C₁₆:C₁₄, etc.) was assumed to be the same distribution for every lipid subclass. Otherwise, lipid pathways were consistent with the methods described in the Cyanothece ATCC51422 model [36] and the relative amounts of lipid subclasses (SQDG:MGDG:DGDG:PGP) were assumed to be consistent with Cyanothece.

Fatty acid methyl esters (FAMEs) were analyzed using GC/MS methods previously described for Synechococcus sp. PCC7002 [71].

Chlorophyll A quantitation

The only chlorophyll pigment T. erythraeum contains is chlorophyll a, which was extracted using an 80% acetone/20% methanol solvent according to Harris [63]. First, 250 mL cells were filtered using a 0.45 μm glass fiber filter and were then washed with 2 mL solvent to yield a 2 mL green solution. The resulting liquid was centrifuged at 4 °C at 13,100 x G for 10 min and the 1 mL of the supernatant was diluted to 2 mL and assayed spectrophotometrically through measurement at 646.6 nm, 663.6 nm, and 750 nm, and using the equation: [ChlA] = [0.01776(A _646.6 − A ₇₅₀) + 0.00734(A _663.6 − A ₇₅₀)].

Phycocyanin quantitation

2.70 mg.mL lysozyme was used to disrupt 740 μL cells concentrated from 250 mL to 2 mL via filtration overnight and phycocyanin was measured in the resulting medium according to established techniques [72, 73].

DNA and RNA quantitation

DNA and RNA were extracted using MoBio UltraClean Microbial Isolation Kits (Carlsbad, CA, USA) with 50 mL cells concentrated to 2 mL, of which 740 μL was disrupted using 2.70 mg/mL lysozyme overnight at 37 °C/proteinase K for 2 h at 55 °C instead of bead disruption. The concentration of DNA and RNA were assessed spectrophotometrically using 1 cm path-length extinction coefficients [74].

Genome scale metabolic network reconstruction

The genome scale network reconstruction was based on the publically available genome sequence [75] and an automated annotation program [33]. This first draft was manually curated using primary literature [76, 77], enzyme/metabolic databases [34, 35, 78, 79], and comparison to similar organisms [36, 38, 59, 80]. Reactions were considered reversible unless indicated by a database, an annotation in a similar organism, or significant thermodynamic infeasibility to be otherwise. For conserved or metabolically necessary reactions that were not contained in a database, NCBI’s BLAST with an acceptable e-score of 1e-6 [81] algorithm was used and compared to related genera like Leptolyngbya, Anabaena, Synechococcus, and Cyanothece to identify unannotated genes. Where these organisms failed to predict genome similarity, proteins displaying the function as the unannotated, or insufficiently annotated, reaction were BLASTed against all available organisms. This resulted in similarities to the organisms in Table 3, and included Nitrosococcus oceani, Pleurocapsa sp. PCC 7327, Zymomonas mobilis subsp. NRRL B-12526, Amborella trichopoda, Dehalococcoides mccartyi VS, Nostoc sp. PCC 7524, and Candidatus Nitrosophaera gargensis. Photosynthesis was modeled similarly to other cyanobacterial genome scale reconstructions [36, 38, 59, 80] by converting the complex protein interactions to a series of redox reactions with photons as an initial substrate and subsequent reduction reactions being passed to energy carriers and ATP synthase. Gene-protein-reaction associations were assumed using sequence data in conjunction with existing models and protein structures to predict conditionality of their assignments (e.g., if a reaction required a protein complex, the GPRs were assigned an and operator; redundant genes were assigned an or operator). These were done with consultation of KEGG [35], BioCyc [34], BRENDA [78], and CyanoBase [79]. Finally, transport reactions included in the model were validated based on proteomic data or diffusion (CO₂, H₂O, N_2, etc.) [16]. Manual curation efforts built the model out to 1035 reactions; closer inspection of the reactions revealed that several were predicted by the SEED algorithm but had no significant homology to the T. erythraeum genome and were non-essential, therefore they were removed. Irreversibility was assumed where databases indicated significant energetic unfavorability (such as in redox reactions) or where other models demonstrated consistent unidirectionality. Reactions were finally elementally and charge balanced using built-in COBRA functionality in conjunction with the PubChem Database [82]. Small molecules with known properties like pyruvate were used to build out into summary biomass reactions and predict summary macromolecule and biomass constitution. Confidence intervals were added to the model to reflect the source of the reported gene linking: fours were assigned to the transport reactions because of their proteomic data, twos were assigned to all reactions with significant sequence similarity to known protein reactions, and ones were assigned to all gap-filling reactions or reactions with insignificant similarity. The model was assessed using built-in COBRA functionality to determine Type III Reactions by closing all exchange reactions and evaluating the flux variability of the internal reactions. This resulted in no variability, meaning that futile cycles were eliminated [83].

Model simulations

Defining constraints

The genome scale metabolic network was used to predict fluxes using FBA [84]. The constraints applied to the model were determined from experimental data or laboratory or in situ data found in literature [22, 29]. Two different sets of constraints were used in order to model the different cell types: diazotrophic and photoautotrophic. Diazotrophic involved Photosystem II deactivation [85], nitrogenase activation, and carbon/nitrogen sources available for uptake. Photoautotrophic cells uptake CO₂ and ammonium while exporting glycogen, diatomic oxygen, and diatomic nitrogen. Available nutrients, trace metals, and ions which are allowed to be used for growth were restricted according to the YBC-II medium formulation. Table 6 details the differences in constraints for each modeled cell type. An upper limit for photon uptake was set to 80 μEinsteins because this represents the total light provided to the cells in the laboratory. Then, biomass related ATP was set to values similar to Cyanothece sp. ATCC 51142, another photosynthetic diazotroph [36], at 544 mmol ATP (g DW h)^-1 and maintenance related ATP was set so that the growth rate matched literature derived physiological data [29]. The model was constrained by the growth rate provided by literature using dFBA to define biomass maintenance energy constraints [29] and then concurrently constraining nitrogenase and carbon dioxide uptake rates from the same study. The maintenance flux for photoautotrophs was determined to be 64.3 mmol (g DW h)^-1 and 67.2 mmol (g DW h)^-1 for diazotrophs by matching the growth rate in the dFBA model to the growth rate measured in literature. This is significantly higher than other reported values, but is justified by the lack of substantial light restrictions. It is assumed that light uptake restrictions and maintenance ATP are both contained within this constraint. All other nitrogen and carbon sources except for those specified in Table 6 were set to zero.

Problem formulation for FVA and FBA

FBA and FVA were conducted using COBRA toolbox functions [43]. The optimization problem was constructed around the steady-state assumption with objectives of biomass generation subject to different cell type constraints specified by Table 6. Table 7 shows the abbreviations and notations for equations.

Table 7 Variables and notations used in simulations

Full size table

The cell-specific uptake and export rules as well as the seawater constraints generated a suite of constraints of the form (where DZ means diazotroph, PA means photoautotroph, and M means macroscopic):

The cell-specific uptake and export rules as well as the seawater constraints generated a suite of constraints of the form:

$$ -1000\le\ {\boldsymbol{v}}_s^{\boldsymbol{i}}\le\ 0\ \forall s\in \mathcal{S},\ \forall i $$

(1)

Diazotroph Cell Bounds:

$$ \begin{array}{c}\hfill -1000\le {\boldsymbol{v}}_n^{\boldsymbol{DZ}}\le 0\ \forall n\in \left\{{N}_2, Glycogen,{O}_2\right\}\hfill \\ {}\hfill 0\le {\boldsymbol{v}}_{\mu}^{\boldsymbol{DZ}}\le 1000\ \forall \mu \in \left\{N{H}_4^{+},C{O}_2\right\}\hfill \\ {}\hfill {\boldsymbol{v}}_{PSII}^{\boldsymbol{DZ}}=0\hfill \\ {}\hfill {\boldsymbol{v}}_{chIL}^{\boldsymbol{DZ}}=0.206\hfill \end{array} $$

(2)

Phtotoautotroph Cell Bounds:

$$ \begin{array}{c}\hfill -1000\le {\boldsymbol{v}}_{N{H}_4^{+}}^{\boldsymbol{PA}}\le 0\hfill \\ {}\hfill 0\le {\boldsymbol{v}}_c^{\mathbf{PA}}\le 1000\ \forall c\in \left\{{N}_2, Glycogen\right\}\hfill \\ {}\hfill 0\le {\boldsymbol{v}}_{PSII}^{\boldsymbol{PA}}\le 1000\hfill \\ {}\hfill \begin{array}{l}{\boldsymbol{\nu}}_{C{O}_2}^{\boldsymbol{PA}}=-0.927\\ {}{\boldsymbol{v}}_{chIL}^{\boldsymbol{PA}}=0\end{array}\hfill \end{array} $$

(3)

It should be noted that seawater ingredients are assumed to be replete in this model.

Steady state was assumed to make the model solvable:

$$ \boldsymbol{S}\cdot \boldsymbol{v}=0 $$

(4)

Essentially, this means that mass is conserved and all inputs exit either through the biomass equation or through export. The second major assumption was that the only outputs would be biomass, glycogen, or ammonium; energy or mass balancing mechanisms, such as through organic acid spilling, were assumed to be negligible because of efficient, exponential growth conditions.

Gene deletions, reaction deletions, and Dead End reaction analysis were run using these constraints. These problem formulations were solved using the COBRA toolbox [43] in conjunction with the Gurobi (Houston, TX, USA) solver in MatLab (Natick, MS, USA).

Dynamic FBA (dFBA)

dFBA was executed with the aid of the Dynamic Multispecies Metabolic Modeling (DyMMM) framework as a template [61]. Both this model and the DyMMM model built on traditional dFBA which estimates solutions to the time-dependent equations governing population growth and substrate utilization. This is done using FBA methods to determine interactive fluxes (exchange, biomass, and objective reaction fluxes) from a genome scale. Metabolites were assumed to perfectly diffuse since the relevant study was assumed to center around metabolism and differentiation of cells in response to shortage. The growth rate is also sufficiently slow as to reduce the effects of transport versus diffusion phenomena in ideal, exponential conditions. To model the differentiation between cell types, dFBA allowed a portion of a period’s growth rate to be proportionally allocated to the underproducing cell. If there was no metabolite deficiency in a period, excess growth of all cells was allocated to photoautotrophic cells. Otherwise, the carbon dioxide uptake, nitrogen uptake, and metabolite export rates were constrained only by the requirements of the population (therefore eliminating the strict carbon dioxide and nitrogenase constraints). Only the biomass maintenance energy was constrained to match laboratory growth rates with initial ideal (equilibrium) concentrations of cells at biomass similar to initial biomass measured in laboratory studies. These maintenance fluxes became 64.3 mmol ATP (g DW)^-1 h^-1 for the photoautotroph and 67.2 mmol ATP (g DW)^-1 h^-1 for the diazotroph.

$$ \begin{array}{c}\hfill { \max}_{{\boldsymbol{C}}_{\boldsymbol{Glycogen}}^{\boldsymbol{i}},\boldsymbol{t},{\boldsymbol{\mu}}^{\boldsymbol{i}},{\boldsymbol{v}}_{\boldsymbol{C}{\boldsymbol{O}}_2}^{\boldsymbol{PA}},{\boldsymbol{v}}_{\boldsymbol{m}}^{\boldsymbol{DZ}},{\boldsymbol{f}}_{\boldsymbol{DZ}}}\left({\boldsymbol{\mu}}^{\boldsymbol{PA}}\ and\ {\boldsymbol{v}}_{\boldsymbol{N}{\boldsymbol{H}}_4^{+}}^{\boldsymbol{DZ}}\right)\hfill \\ {}\hfill \boldsymbol{s}.\boldsymbol{t}.{\left[{\boldsymbol{C}}_{Glycogen}^{\boldsymbol{PA}}\right]}_{\boldsymbol{t}}\ge {\left[\left|{\boldsymbol{C}}_{Glycogen}^{\boldsymbol{DZ}}\right|\right]}_{\boldsymbol{t}-1}\ \hfill \\ {}\hfill {\boldsymbol{v}}_{Glycogen}^{\boldsymbol{DZ}}\le \frac{1}{6}\frac{{\boldsymbol{v}}_{C{O}_2}^{\boldsymbol{PA}}}{{\boldsymbol{f}}_{\boldsymbol{DZ}}}\hfill \end{array} $$

(5)

For the purpose of this program, the final constraint is estimated (knowing that μ and Δ t are non-negative) by:

$$ {\left[{\boldsymbol{C}}_{Gly}^{\boldsymbol{PA}}\right]}_{\boldsymbol{t}}=\left({\boldsymbol{\mu}}^{\boldsymbol{DZ}}\varDelta \boldsymbol{t}+1\right){\left[\left|{\boldsymbol{C}}_{Glycogen}^{\boldsymbol{DZ}}\right|\right]}_{\boldsymbol{t}-1}{\left[\frac{{\boldsymbol{X}}_{\boldsymbol{f}}^{\boldsymbol{DZ}}}{{\boldsymbol{X}}_{\boldsymbol{f}}^{\boldsymbol{PA}}}\right]}_{\boldsymbol{t}-1} $$

(6)

The dFBA model was built assuming Batch Reactor behavior which has the design equation:

$$ \frac{d\boldsymbol{X}}{d\boldsymbol{t}}=\boldsymbol{\mu} \boldsymbol{X} $$

(7)

Assuming constant growth rate:

$$ {\boldsymbol{X}}_{\boldsymbol{f}}^{\boldsymbol{\hbox{'}}}={\boldsymbol{X}}_0{e}^{\boldsymbol{\mu} \varDelta \boldsymbol{t}} $$

(8)

$$ \varDelta {\boldsymbol{X}}^{\boldsymbol{\hbox{'}}}=\boldsymbol{X}{\boldsymbol{\hbox{'}}}_{\boldsymbol{f}}-{\boldsymbol{X}}_0 $$

(9)

Cells were allocated to a pool of differentiated cells from the preliminary Δ X ', meaning that a cell can only lose as much biomass as it gains in a period. The amount of cells was determined according to the substrate shortage in a period.

$$ \begin{array}{c}\hfill {\boldsymbol{C}}_{cons,m}^{\boldsymbol{T}}={\displaystyle \sum_{\gamma }}\left(\varDelta {\boldsymbol{X}}^{\boldsymbol{\gamma}}\cdot {\boldsymbol{v}}_m^{\boldsymbol{\gamma}}\cdot {{\boldsymbol{X}}^{\boldsymbol{\hbox{'}}}}_{\boldsymbol{f}}^{\boldsymbol{\gamma}}\cdot \varDelta \boldsymbol{t}\right)\ \forall \gamma \in {\mathbf{\mathcal{C}}}_m\hfill \\ {}\hfill {\boldsymbol{C}}_{prod,m}^{\boldsymbol{T}}={\displaystyle \sum_{\pi }}\left(\varDelta {\boldsymbol{X}}^{\boldsymbol{\pi}}\cdot {\boldsymbol{v}}_m^{\boldsymbol{\pi}}\cdot {{\boldsymbol{X}}^{\boldsymbol{\hbox{'}}}}_{\boldsymbol{f}}^{\boldsymbol{\pi}}\cdot \varDelta \boldsymbol{t}\right)\forall \pi \in {\mathbf{\mathcal{P}}}_m\hfill \\ {}\hfill {\boldsymbol{\sigma}}^{\boldsymbol{\gamma}}=\frac{{\boldsymbol{C}}_{cons,m}^{\boldsymbol{\gamma}}}{{\boldsymbol{C}}_{cons,m}^{\boldsymbol{T}}}=\frac{\boldsymbol{X}{\boldsymbol{\hbox{'}}}_{\boldsymbol{f}}^{\boldsymbol{\gamma}}\cdot \varDelta \boldsymbol{t}\cdot \left|{\boldsymbol{v}}_m^{\boldsymbol{\gamma}}\right|}{{\displaystyle {\sum}_{\gamma }}\left(\varDelta {\boldsymbol{X}}^{\boldsymbol{\gamma}}\cdot {\boldsymbol{v}}_m^{\boldsymbol{\gamma}}\cdot {{\boldsymbol{X}}^{\boldsymbol{\hbox{'}}}}_{\boldsymbol{f}}^{\boldsymbol{\gamma}}\cdot \varDelta \boldsymbol{t}\right)}\forall \gamma \in {\mathbf{\mathcal{C}}}_m\hfill \\ {}\hfill \begin{array}{l}{\boldsymbol{\sigma}}^{\boldsymbol{\pi}}=\frac{{\boldsymbol{C}}_{cons,m}^{\boldsymbol{\pi}}}{{\boldsymbol{C}}_{cons,m}^{\boldsymbol{T}}}=\frac{\boldsymbol{X}{\boldsymbol{\hbox{'}}}_{\boldsymbol{f}}^{\boldsymbol{\pi}}\cdot \varDelta \boldsymbol{t}\cdot \left|{\boldsymbol{v}}_m^{\boldsymbol{\pi}}\right|}{{\displaystyle {\sum}_{\gamma }}\left(\varDelta {\boldsymbol{X}}^{\boldsymbol{\pi}}\cdot {\boldsymbol{v}}_m^{\boldsymbol{\pi}}\cdot {{\boldsymbol{X}}^{\boldsymbol{\hbox{'}}}}_{\boldsymbol{f}}^{\boldsymbol{\pi}}\cdot \varDelta \boldsymbol{t}\right)}\forall \pi \in {\mathbf{\mathcal{P}}}_m\\ {}\boldsymbol{\alpha} =\frac{{\boldsymbol{S}}_{prod,m}^{\boldsymbol{T}}+{\boldsymbol{S}}_{cons,m}^{\boldsymbol{T}}}{{\boldsymbol{S}}_{cons,m}^{\boldsymbol{T}}}\end{array}\hfill \end{array} $$

(10)

The differentiated cell pool, X _δ, was determined by applying the allocation coefficient to the productive cells according to their relative production coefficient:

$$ {\boldsymbol{X}}_{\boldsymbol{\delta}}={\displaystyle \sum_{\gamma }}\left(\boldsymbol{\alpha} \cdot {\boldsymbol{\sigma}}^{\boldsymbol{\gamma}}\cdot \varDelta {\boldsymbol{X}}^{\boldsymbol{\hbox{'}}\boldsymbol{\gamma}}\right)\ \forall \gamma \in {\mathbf{\mathcal{C}}}_m $$

(11)

$$ {\boldsymbol{X}}_{\boldsymbol{f}}^{\boldsymbol{\gamma}}=\varDelta \boldsymbol{X}{\boldsymbol{\hbox{'}}}^{\boldsymbol{\gamma}}+{\boldsymbol{\sigma}}^{\boldsymbol{\gamma}}\cdot {\boldsymbol{X}}_{\boldsymbol{\delta}}+{\boldsymbol{X}}_0^{\boldsymbol{\gamma}}\ \forall \gamma \in {\mathbf{\mathcal{C}}}_m $$

(12)

And cells are “dealt” according to those same coefficients for consumers:

$$ {\boldsymbol{X}}_{\boldsymbol{f}}^{\boldsymbol{\pi}}=\varDelta {\boldsymbol{X}}^{\boldsymbol{\pi}}+{\boldsymbol{\sigma}}^{\boldsymbol{\pi}}\cdot {\boldsymbol{X}}_{\boldsymbol{\delta}}+{\boldsymbol{X}}_0^{\boldsymbol{\pi}}\ \forall \pi \in {\mathbf{\mathcal{P}}}_m $$

(13)

Adjusted values are calculated for final concentrations and, during the next iteration, the new initial values are the final values from the previous iteration. This ends at a defined maximum time (400 h) and cell fractions are calculated as a function of biomass of cell type i divided by total biomass. Convergence was assessed using randomized initial compositions, assuming the same total initial biomass amount, and comparing the incremental growth rate, or biomass generation at each time step, as it changed over the time period. Incremental growth rate (μ _t), total growth rate (μ _T), and fixed nitrogen yield (Y _{N → Env}) were calculated by:

$$ \begin{array}{c}\hfill {\boldsymbol{\mu}}_{\boldsymbol{t}} = \frac{ \ln \left(\frac{{\displaystyle {\sum}_i}{X}_t^i}{{\displaystyle {\sum}_i}{X}_{t-1}^i}\right)}{1}\hfill \\ {}\hfill {\boldsymbol{\mu}}_{\boldsymbol{T}}=\frac{ \ln \left(\frac{{\displaystyle {\sum}_i}{\boldsymbol{X}}_{\boldsymbol{f}}^{\boldsymbol{i}}}{{\displaystyle {\sum}_i}{\boldsymbol{X}}_0^{\boldsymbol{i}}}\right)}{\varDelta \boldsymbol{t}}\hfill \end{array} $$

(14)

$$ {\boldsymbol{Y}}_{\boldsymbol{N}\to \boldsymbol{E}\boldsymbol{n}\boldsymbol{v}}=\frac{\varDelta {\boldsymbol{C}}_{\boldsymbol{N}{\boldsymbol{H}}_4^{+}}}{{\boldsymbol{C}}_{N{H}_4^{+}, prod}} $$

(15)

Since multiple steady states exist for population composition, growth rate, and substrate release rate, randomized initial cell concentrations (both in terms of composition and initial biomass) were generated using MatLab over 1000 iterations. Also, FBA was run individually on each cell type once more to determine the growth rate of each cell type in replete (independent, unlimited) conditions.

The hypothetical viability of T. erythraeum on different nitrogen sources was assessed by “opening” reaction boundaries for exchange reactions corresponding to the nutrient and assuming the nutrient was in excess. Then, the dFBA simulation was run assuming equilibrium starting composition. The growth profiles and growth rates were recorded in the same way as previous simulations. Where proteomics data did not infer transport of a particular compound, a simple, diffusion style transporter was temporarily added to the model.

Abbreviations

BLAST:: Basic local alignment search tool
COBRA:: Constraint based reconstruction and analysis
dFBA:: Dynamic flux balance analysis
DW:: Dry weight
DyMMM:: Dynamic multispecies metabolic model
FAME:: Fatty acid methyl ester
FBA:: Flux balance analysis
FVA:: Flux variability analysis
GC/MS:: Gas chromatography/mass spectrometry
PSI:: Photosystem I
PSII:: Photosystem II
RAST:: Rapid annotation using subsystems technology
TCA:: Tricarboxylic acid
TAGSFE:: sedoheptulose-7-phosphate: D-glyceraldehyde-3-phosphate (E.C. 2.2.1.2)
PCX:: PEP carboxylase (E.C. 4.1.1.31)
MDH:: Malate dehydrogenase (1.1.1.37)
PYRK:: Pyruvate kinase (E.C. 2.7.1.40)
ICLY:: Isocitrate lyase (E.C. 4.1.3.1)
ACCOASYN:: Acetyl-CoA synthetase (E.C. 6.2.1.1)
PDH:: Pyruvate dehydrogenase (E.C. 1.2.4.1)
RuBisCO:: D-ribulose-5-phosphate 1-phosphotransferase (E.C. 2.7.1.19)
chIL :: nitrogenase (E.C. 1.18.6.1)
GLNSYN:: L-glutamate: ammonium ligase (E.C. 6.3.1.2)
PGK:: 3-phospho-D-glycerate 1-phosphotransferase (E.C. 2.7.2.3)
ATP:: ATP synthase (E.C. 3.6.3.14)
FPAL:: D-fructose-1,6-bisphosphate D-glyceraldehyde 3-phosphate lyase (E.C. 4.1.2.13)
ICIT:: bifunctional aconitate hydratase 2/2-methylisocitrate dehydratase (E.C. 4.2.1.3, E.C. 4.2.1.4)
DGDG:: Digalactosyldiacylglycerol
MGDG:: Monogalactosyldiacylglycerol
PG:: Phosphatidylglycerol
SQDG:: Sulfoquinovosyldiacylglycerol
6PG:: 6-phospho-D-gluconate
6PGDL:: 6-phosph-D-glucono-1,5-lactone
AcCoA:: Acetyl-CoA
AKG/αKG:: α-ketoglutarate/2-oxoglutarate
ALA:: L-alanine
cAMP:: cyclic-AMP
CIT:: Citrate
CoA:: Coenzyme-A
DHAP:: Dihydroxyacetone phosphate
E4P:: Erythrose-4-phosphate
F6P:: Fructose-6-phosphate
FDP:: Fructose 1,6-diphosphate
FOR:: Formate
FUM:: Fumarate
G6P:: Glucose-6-phosphate
GAP:: Glyceraldehyde 3-phosphate
GLX:: Glyoxylate
GLY:: Glycine
GLYR:: Glycerate
GOL:: Glycerol
GP:: 3-phosphoglycerate
ICIT:: Isocitrate
MAL:: Malate
NMP:: Nucleotide monophosphate
OAA:: Oxaloacetate
PEP:: Phosphoenolpyruvate
PGOL:: Phosphoglycolate
PYR:: Pyruvate
R5P:: Ribose-5-phosphate
Ru5P:: Ribulose-5-phosphate
RuBP:: Ribulose 1,5-bisphosphate
S17P:: Sedoheptulose 1,7-bisphosphate
S7P:: Sedoheptulose 7-phosphate
SUCC:: Succinate
SUCSAL:: Succinic semialdehyde
X5P:: Xylulose 5-phosphate
βG6P:: β-glucose-6-phosphate

References

Berman-Frank I, Lundgren P, Falkowski P. Nitrogen fixation and photosynthetic oxygen evolution in cyanobacteria. Res Microbiol. 2003;154:157–64.
Article CAS PubMed Google Scholar
Glibert PM, Bronk DA. Release of dissolved organic nitrogen by marine diazotrophic cyanobacteria, Trichodesmium spp. Appl Environ Microbiol. 1994;60:3996–4000.
CAS PubMed PubMed Central Google Scholar
Spatharis S, Skliris N, Meziti A, Kormas KA, Weyhenmeyer G. First record of a Trichodesmium erythraeum bloom in the Mediterranean Sea. Can J Fish Aquat Sci. 2012;69:1444–55.
Article Google Scholar
Letelier RM, Karl DM. Role of Trichodesmium spp. in the productivity of the subtropical North Pacific Ocean. Mar Ecol Prog Ser. 1996;133:263–73.
Article Google Scholar
Bowman TE, Lancaster L. A bloom of the planktonic blue-green alga, Trichodesmium erythraeum, in the Tonga Islands. Limnol Oceanogr. 1965;10:291–3.
Article Google Scholar
Montoya JP, Holl CM, Zehr JP, Hansen A, Villareal TA, Capone DG. High rates of N₂ fixation by unicellular diazotrophs in the oligotrophic Pacific Ocean. Nature. 2004;430:1027–32.
Article CAS PubMed Google Scholar
Bell PR, Uwins PJ, Elmetri I, Phillips JA, Fu F-X, Yago AJ. Laboratory culture studies of Trichodesmium isolated from the great Barrier Reef Lagoon, Australia. Hydrobiologia. 2005;532:9–21.
Article Google Scholar
van Baalen C, Brown Jr RM. The ultrastructure of the marine blue green alga, Trichodesmium erythraeum, with special reference to the cell wall, gas vacuoles, and cylindrical bodies. Arch Mikrobiol. 1969;69:79–91.
Article PubMed Google Scholar
Ohki K, Taniuchi Y. Detection of nitrogenase in individual cells of a natural population of Trichodesmium using immunocytochemical methods for fluorescent cells. J Oceanogr. 2009;65:427–32.
Article CAS Google Scholar
Berman-Frank I, Lundgren P, Chen Y-B, Küpper H, Kolber Z, Bergman B, Falkowski P. Segregation of Nitrogen Fixation and Oxygenic Photosynthesis in the Marine Cyanobacterium Trichodesmium. Science. 2001;294:1534–7.
Article CAS PubMed Google Scholar
Capone DG, Zehr JP, Paerl HW, Bergman B, Carpenter EJ. Trichodesmium, a Globally Significant Marine Cyanobacterium. Science. 1997;276:1221–9.
Article CAS Google Scholar
Paerl HW. Spatial Segregation of CO₂ Fixation in Trichodesmium spp.: Linkage to N₂ fixation potential. J Phycol. 1994;30:790–9.
Article Google Scholar
Lin S, Henze S, Lundgren P, Bergman B, Carpenter EJ. Whole-cell immunolocalization of nitrogenase in marine diazotrophic cyanobacteria, Trichodesmium spp. Appl Environ Microbiol. 1998;64:3052–8.
CAS PubMed PubMed Central Google Scholar
Janson S, Carpenter EJ, Bergman B. Compartmentalisation of nitrogenase in a non-heterocystous cyanobacterium: Trichodesmium contortum. FEMS Microbiol Lett. 1994;118:9–14.
Article CAS Google Scholar
Fredriksson C, Bergman B. Nitrogenase quantity varies diurnally in a subset of cells within colonies of the non-heterocystous cyanobacteria Trichodesmium spp. Microbiology. 1995;141:2471–8.
Article CAS Google Scholar
Walworth N, Pfreundt U, Nelson WC, Mincer T, Heidelberg JF, Fu F, Waterbury JB, del Rio TG, Goodwin L, Kyrpides NC. Trichodesmium genome maintains abundant, widespread noncoding DNA in situ, despite oligotrophic lifestyle. Proc Natl Acad Sci. 2015;112:4251–6.
Article CAS PubMed PubMed Central Google Scholar
Berman-Frank I, Bidle KD, Haramaty L, Falkowski PG. The demise of the marine cyanobacterium, Trichodesmium spp., via an autocatalyzed cell death pathway. Limnology Oceanography. 2004;49:997–1005.
Article Google Scholar
Sudek S, Haygood MG, Youssef DT, Schmidt EW. Structure of trichamide, a cyclic peptide from the bloom-forming cyanobacterium Trichodesmium erythraeum, predicted from the genome sequence. Appl Environ Microbiol. 2006;72:4382–7.
Article CAS PubMed PubMed Central Google Scholar
Pfreundt U, Kopf M, Belkin N, Berman-Frank I, Hess WR: The primary transcriptome of the marine diazotroph Trichodesmium erythraeum IMS101. Scientific reports 2014, 4.
El-Shehawy R, Lugomela C, Ernst A, Bergman B. Diurnal expression of hetR and diazocyte development in the filamentous non-heterocystous cyanobacterium Trichodesmium erythraeum. Microbiology. 2003;149:1139–46.
Article CAS PubMed Google Scholar
Sandh G, Ran L, Xu L, Sundqvist G, Bulone V, Bergman B. Comparative proteomic profiles of the marine cyanobacterium Trichodesmium erythraeum IMS101 under different nitrogen regimes. Proteomics. 2011;11:406–19.
Article CAS PubMed Google Scholar
Mulholland MR, Capone DG. The nitrogen physiology of the marine N₂-fixing cyanobacteria Trichodesmium spp. Trends Plant Sci. 2000;5:148–53.
Article CAS PubMed Google Scholar
Sandh G, El-Shehawy R, Díez B, Bergman B. Temporal separation of cell division and diazotrophy in the marine diazotrophic cyanobacterium Trichodesmium erythraeum IMS101. FEMS Microbiol Lett. 2009;295:281–8.
Article CAS PubMed Google Scholar
Garcia NS, Fu F, Sedwick PN, Hutchins DA. Iron deficiency increases growth and nitrogen-fixation rates of phosphorus-deficient marine cyanobacteria. ISME J. 2015;9:238–45.
Article CAS PubMed Google Scholar
Dyhrman S, Chappell P, Haley S, Moffett J, Orchard E, Waterbury J, Webb E. Phosphonate utilization by the globally important marine diazotroph Trichodesmium. Nature. 2006;439:68–71.
Article CAS PubMed Google Scholar
Ho T-Y. Nickel limitation of nitrogen fixation in Trichodesmium. Limnol Oceanogr. 2013;58:112–20.
Article CAS Google Scholar
Sohm JA, Mahaffey C, Capone DG. Assessment of relative phosphorus limitation of Trichodesmium spp. in the North Pacific, North Atlantic, and the north coast of Australia. Limnol Oceanogr. 2008;53:2495.
Article CAS Google Scholar
Rodriguez IB, Ho T-Y. Diel nitrogen fixation pattern of Trichodesmium: the interactive control of light and Ni. Sci Rep. 2014;4.
Hutchins D, Fu F-X, Zhang Y, Warner M, Feng Y, Portune K, Bernhardt P, Mulholland M. CO₂ control of Trichodesmium N₂ fixation, photosynthesis, growth rates, and elemental ratios: Implications for past, present, and future ocean biogeochemistry. Limnol Oceanogr. 2007;52:1293–304.
Article CAS Google Scholar
Mulholland MR, Capone DG. Stoichiometry of nitrogen and carbon utilization in cultured populations of Trichodesmium IMS101: Implications for growth. Limnol Oceanogr. 2001;46:436–43.
Article Google Scholar
Mahadevan R, Schilling C. The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab Eng. 2003;5:264–76.
Article CAS PubMed Google Scholar
Mahadevan R, Edwards JS, Doyle FJ. Dynamic flux balance analysis of diauxic growth in Escherichia coli. Biophys J. 2002;83:1331–40.
Article CAS PubMed PubMed Central Google Scholar
Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, Edwards RA, Gerdes S, Parrello B, Shukla M. The SEED and the rapid annotation of microbial genomes using subsystems technology (RAST). Nucleic Acids Res. 2014;42:D206–14.
Article CAS PubMed Google Scholar
Caspi R, Foerster H, Fulcher CA, Kaipa P, Krummenacker M, Latendresse M, Paley S, Rhee SY, Shearer AG, Tissier C. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res. 2008;36:D623–31.
Article CAS PubMed Google Scholar
Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44:D457–62.
Article PubMed Google Scholar
Vu TT, Stolyar SM, Pinchuk GE, Hill EA, Kucek LA, Brown RN, Lipton MS, Osterman A. Fredrickson JK, Konopka AE: Genome-scale modeling of light-driven reductant partitioning and carbon fluxes in diazotrophic unicellular cyanobacterium Cyanothece sp. ATCC 51142. PLoS Comput Biol. 2012;8:e1002460.
Article CAS PubMed PubMed Central Google Scholar
Knoop H, Gründel M, Zilliges Y, Lehmann R, Hoffmann S, Lockau W, Steuer R. Flux Balance Analysis of Cyanobacterial Metabolism: The Metabolic Network of Synechocystis sp. PCC 6803. PLoS Comput Biol. 2013;9:e1003081.
Article CAS PubMed PubMed Central Google Scholar
Vu TT, Hill EA, Kucek LA, Konopka AE, Beliaev AS, Reed JL. Computational evaluation of Synechococcus sp. PCC 7002 metabolism for chemical production. Biotechnol J. 2013;8:619–30.
Article CAS PubMed Google Scholar
de Oliveira Dal'Molin CG, Quek L-E, Palfreyman RW, Brumbley SM, Nielsen LK. AraGEM, a genome-scale reconstruction of the primary metabolic network in Arabidopsis. Plant Physiol. 2010;152:579–89.
Article PubMed PubMed Central Google Scholar
Levering J, Broddrick J, Dupont CL, Peers G, Beeri K, Mayers J, Gallina AA, Allen AE, Palsson BO, Zengler K. Genome-scale model reveals metabolic basis of biomass partitioning in a model diatom. PLoS ONE. 2016;11:e0155038.
Article PubMed PubMed Central Google Scholar
Boyle N, Morgan J. Flux balance analysis of primary metabolism in Chlamydomonas reinhardtii. BMC Syst Biol. 2009;3:4.
Article PubMed PubMed Central Google Scholar
Chang RL, Ghamsari L, Manichaikul A, Hom EF, Balaji S, Fu W, Shen Y, Hao T, Palsson BØ, Salehi‐Ashtiani K. Metabolic network reconstruction of Chlamydomonas offers insight into light‐driven algal metabolism. Mol Syst Biol. 2011;7:518.
Article PubMed PubMed Central Google Scholar
Schellenberger J, Que R, Fleming RM, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NE, Rahmanian S. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2. 0. Nat Protoc. 2011;6:1290–307.
Article CAS PubMed PubMed Central Google Scholar
Chen Y-B, Zehr JP, Mellon M. Growth and Nitrogen Fixation of the Diazotrophic Filamentous Nonheterocytous Cyanobacterium Trichodesmium sp. IMS 101 in Defined Media: Evidence for a Circadian Rhythm. J Phycology. 1996;32:916–23.
Article Google Scholar
Yamaberi K, Takagi M, Yoshida T. Nitrogen depletion for intracellular triglyceride accumulation to enhance liqefaction yield of marine microalgal cells into a fuel oil. J Mar Biotechnol. 1997;6:44–8.
Google Scholar
Li Y, Horsman M, Wang B, Wu N, Lan CQ. Effects of nitrogen sources on cell growth and lipid accumulation of green alga Neochloris oleoabindans. Biotechnol Products Process Eng. 2008;81:629–36.
CAS Google Scholar
Li Y, Han D, Hu G, Dauvillee D, Sommerfeld M, Ball S, Hu Q. Chlamydomonas starchless mutant defective in ADP-glucose pyrophosphorylase hyper-accumulates triacylglycerol. Metab Eng. 2010;12:387–91.
Article PubMed Google Scholar
Boyle NR, Page MD, Liu B, Blaby IK, Casero D, Kropat J, Cokus SJ, Hong-Hermesdorf A, Shaw J, Karpowicz SJ. Three acyltransferases and nitrogen-responsive regulator are implicated in nitrogen starvation-induced triacylglycerol accumulation in Chlamydomonas. J Biol Chem. 2012;287:15811–25.
Article CAS PubMed PubMed Central Google Scholar
Blaby IK, Glaesener AG, Mettler T, Fitz-Gibbon ST, Gallaher SD, Liu B, Boyle NR, Kropat J, Stitt M, Johnson S. Systems-level analysis of nitrogen starvation–induced modifications of carbon metabolism in a Chlamydomonas reinhardtii starchless mutant. Plant Cell. 2013;25:4305–23.
Article CAS PubMed PubMed Central Google Scholar
Miller R, Wu G, Deshpande RR, Vieler A, Gärtner K, Li X, Moellering ER, Zäuner S, Cornish AJ, Liu B, et al. Changes in Transcript Abundance in Chlamydomonas reinhardtii following Nitrogen Deprivation Predict Diversion of Metabolism. Plant Physiol. 2010;154:1737–52.
Article CAS PubMed PubMed Central Google Scholar
Levitan O, Rosenberg G, Setlik I, Setlikova E, Grigel J, Klepetar J, Prasil O, Berman‐Frank I. Elevated CO₂ enhances nitrogen fixation and growth in the marine cyanobacterium Trichodesmium. Glob Chang Biol. 2007;13:531–8.
Article Google Scholar
Förster J, Famili I, Fu P, Palsson BØ, Nielsen J. Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. Genome Res. 2003;13:244–53.
Article PubMed PubMed Central Google Scholar
Kornberg H. The role and control of the glyoxylate cycle in Escherichia coli. Biochem J. 1966;99:1.
Article CAS PubMed PubMed Central Google Scholar
Flavell RB, Woodward DO. Metabolic role, regulation of synthesis, cellular localization, and genetic control of the glyoxylate cycle enzymes in Neurospora crassa. J Bacteriol. 1971;105:200–10.
CAS PubMed PubMed Central Google Scholar
Schwender J, Seemann M, Lichtenthaler HK, Rohmer M. Biosynthesis of isoprenoids (carotenoids, sterols, prenyl side-chains of chlorophylls and plastoquinone) via a novel pyruvate/glyceraldehyde 3-phosphate non-mevalonate pathway in the green alga Scenedesmus obliquus. Biochem J. 1996;316:73–80.
Article CAS PubMed PubMed Central Google Scholar
Eastmond PJ, Germain V, Lange PR, Bryce JH, Smith SM, Graham IA. Postgerminative growth and lipid catabolism in oilseeds lacking the glyoxylate cycle. Proc Natl Acad Sci. 2000;97:5669–74.
Article CAS PubMed PubMed Central Google Scholar
Rubin BE, Wetmore KM, Price MN, Diamond S, Shultzaberger RK, Lowe LC, Curtin G, Arkin AP, Deutschbauer A, Golden SS. The essential gene set of a photosynthetic organism. Proc Natl Acad Sci. 2015;112:E6634–43.
Article CAS PubMed PubMed Central Google Scholar
Boyle NR, Morgan JA. Flux balance analysis of primary metabolism in Chlamydomonas reinhardtii. BMC Syst Biol. 2009;3:1.
Article Google Scholar
Montagud A, Navarro E, de Córdoba PF, Urchueguía JF, Patil KR. Reconstruction and analysis of genome-scale metabolic model of a photosynthetic bacterium. BMC Syst Biol. 2010;4:1.
Article Google Scholar
Zomorrodi AR, Maranas CD. OptCom: a multi-level optimization framework for the metabolic modeling and analysis of microbial communities. PLoS Comput Biol. 2012;8:e1002363.
Article CAS PubMed PubMed Central Google Scholar
Zhuang K, Ma E, Lovley DR, Mahadevan R. The design of long‐term effective uranium bioremediation strategy using a community metabolic model. Biotechnol Bioeng. 2012;109:2475–83.
Article CAS PubMed Google Scholar
Mulholland MR, Bernhardt PW, Heil CA, Bronk DA, O’Neil JM. Nitrogen fixation and release of fixed nitrogen by Trichodesmium spp. in the Gulf of Mexico. Limnol Oceanogr. 2006;51:1762–76.
Article CAS Google Scholar
Harris EH, Stern DB, Witman G. The Chlamydomonas sourcebook. Amsterdam: Elsevier; 2009.
Antoniewicz MR, Kelleher JK, Stephanopoulos G. Accurate assessment of amino acid mass isotopomer distributions for metabolic flux analysis. Anal Chem. 2007;79:7554–9.
Article CAS PubMed Google Scholar
Yemm E, Willis A. The estimation of carbohydrates in plant extracts by anthrone. Biochem J. 1954;57:508.
Article CAS PubMed PubMed Central Google Scholar
Simon RD. Cyanophycin granules from the blue-green alga Anabaena cylindrica: a reserve material consisting of copolymers of aspartic acid and arginine. Proc Natl Acad Sci. 1971;68:265–7.
Article CAS PubMed PubMed Central Google Scholar
Messineo L. Modification of the Sakaguchi reaction: spectrophotometric determination of arginine in proteins without previous hydrolysis. Arch Biochem Biophys. 1966;117:534–40.
Article CAS Google Scholar
Dumay J, Morançais M, Nguyen HPT, Fleurence J. Extraction and purification of R-phycoerythrin from marine red algae. Nat Prod Mar Algae Methods Protoc. 2015;1308:109–17.
Google Scholar
Bligh EG, Dyer WJ. A rapid method of total lipid extraction and purification. Can J Biochem Physiol. 1959;37:911–7.
Article CAS PubMed Google Scholar
Harwood JL. Membrane lipids in algae. In Lipids in photosynthesis: structure, function and genetics. Cardiff: Springer; 1998:53–64.
Work VH, Radakovits R, Jinkerson RE, Meuser JE, Elliott LG, Vinyard DJ, Laurens LM, Dismukes GC, Posewitz MC. Increased lipid accumulation in the Chlamydomonas reinhardtii sta7-10 starchless isoamylase mutant and increased carbohydrate synthesis in complemented strains. Eukaryot Cell. 2010;9:1251–61.
Article CAS PubMed PubMed Central Google Scholar
Boussiba S, Richmond AE. C-phycocyanin as a storage protein in the blue-green alga Spirulina platensis. Arch Microbiol. 1980;125:143–7.
Article CAS Google Scholar
Subramaniam A, Carpenter EJ, Karentz D, Falkowski PG. Bio-optical properties of the marine diazotrophic cyanobacteria Trichodesmium spp. I. Absorption and photosynthetic action spectra. Limnol Oceanogr. 1999;44:608–17.
Article CAS Google Scholar
Barbas CF, Burton DR, Scott JK, Silverman GJ. Quantitation of DNA and RNA. Cold Spring Harb Protoc. 2007;2007:pdb. ip47.
Article Google Scholar
Nordberg H, Cantor M, Dusheyko S, Hua S, Poliakov A, Shabalov I, Smirnova T, Grigoriev IV, Dubchak I. The genome portal of the Department of Energy Joint Genome Institute: 2014 updates. Nucleic Acids Res. 2014;42:D26–31.
Article CAS PubMed Google Scholar
Zhang S, Bryant DA. Biochemical validation of the glyoxylate cycle in the cyanobacterium Chlorogloeopsis fritschii strain PCC 9212. J Biol Chem. 2015;290:14019–30.
Article CAS PubMed PubMed Central Google Scholar
Zhang S, Bryant DA. The tricarboxylic acid cycle in cyanobacteria. Science. 2011;334:1551–3.
Article CAS PubMed Google Scholar
Scheer M, Grote A, Chang A, Schomburg I, Munaretto C, Rother M, Söhngen C, Stelzer M, Thiele J, Schomburg D. BRENDA, the enzyme information system in 2011. Nucleic Acids Res. 2010:doi: doi: 10.1093/nar/gkq1089.
Fujisawa T, Okamoto S, Katayama T, Nakao M, Yoshimura H, Kajiya-Kanegae H, Yamamoto S, Yano C, Yanaka Y, Maita H. CyanoBase and RhizoBase: databases of manually curated annotations for cyanobacterial and rhizobial genomes. Nucleic Acids Res. 2014;42:D666–70.
Article CAS PubMed Google Scholar
Triana J, Montagud A, Siurana M, Fuente D, Urchueguía A, Gamermann D, Torres J, Tena J, de Córdoba PF, Urchueguía JF. Generation and evaluation of a genome-scale metabolic network model of Synechococcus elongatus PCC7942. Metabolites. 2014;4:680–98.
Article PubMed PubMed Central Google Scholar
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.
Article CAS PubMed Google Scholar
Kim S, Thiessen PA, Bolton EE, Chen J, Fu G, Gindulyte A, Han L, He J, He S, Shoemaker BA. PubChem substance and compound databases. Nucleic acids research 2015:doi: 10.1093/nar/gkv951
Thiele I, Palsson BØ. A protocol for generating a high-quality genome-scale metabolic reconstruction. Nat Protoc. 2010;5:93–121.
Article CAS PubMed PubMed Central Google Scholar
Orth JD, Thiele I, Palsson BO. What is flux balance analysis? Nat Biotech. 2010;28:245–8.
Article CAS Google Scholar
Küpper H, Ferimazova N, Šetlík I, Berman-Frank I. Traffic lights in Trichodesmium. Regulation of photosynthesis for nitrogen fixation studied by chlorophyll fluorescence kinetic microscopy. Plant Physiol. 2004;135:2120–33.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to thank Dr. Bri-Mathias Hodge for guidance with computational aspects of the project and Ben Miller for initial experimentation while protocols were first being applied as well as the Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, for helping with T. erythraeum culturing.

Funding

We are grateful to the Coors Foundation and Colorado School of Mines for funding this research. Neither funding body had a role in the design of the study or the collection, analysis or interpretation of results.

Availability of data and materials

The SBML version of the T. erythraeum model as well as the Matlab .m file required to simulate FBA fluxes as described in the text are provided in the supplemental files. After publication, these will also be made available on the PIs website: http://metabolism.mines.edu All other simulation results are also provided in the supplemental files with this manuscript.

Authors’ contributions

NRB conceived the study. JJG reconstructed the metabolic network from genome sequences and wrote/modified code for modeling. JJG also performed all the experiments required for measuring biomass composition. NRB and JJG both participated in troubleshooting, analyzing data, writing and revising the manuscript. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent to publish

Not applicable.

Ethics and consent to participate

Not applicable.

Author information

Authors and Affiliations

Department of Chemical and Biological Engineering, Colorado School of Mines, Golden, CO, 80401, USA
Joseph J. Gardner & Nanette R. Boyle

Authors

Joseph J. Gardner
View author publications
You can also search for this author in PubMed Google Scholar
Nanette R. Boyle
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nanette R. Boyle.

Additional files

Additional file 1:

Genome scale metabolic reconstruction for T. erythraeum. This file contains the full SBML formatted genome scale reconstruction of Trichodesmium erythraeum (iTery101) described by this study without constraints. (XML 1720 kb)

Additional file 2:

List of reactions and metabolites included in the model. This file contains the more readable format of the Genome scale reconstruction with a catalog of reactions in one tab and of metabolites in the other, supplemented by relevant information for both sets. (XLS 442 kb)

Additional file 3:

This file contains the list of dead end reactions generated by the study, not including exchange reactions. This means all reactions which include a metabolite that cannot be otherwise resolved (through reaction to another compound) is included. Dead ends are ignorant to photoautotroph or diazotroph because of the similar genetic background. (XLSX 50 kb)

Additional file 4:

MakeTERY function to establish constraints and create separate models. This file contains all relevant constraints for the two different cell types. By default, it also generates the optimal solutions subject to literature constraints as described in the FBA and FVA sections of the paper. (M 2 kb)

Additional file 5:

Flux reactions for ideal, exponential growth with fixed biomass/growth rate and objective metabolite production. This file contains the entire simulations data used to generate Fig. 2 and based on the flux balance analysis described by the study. It includes the entire catalog of reactions, the corresponding flux for both diazotroph and photoautotroph, and the reaction formula. (XLSX 58 kb)

Additional file 6:

List of effects of gene and reaction deletion identified by in silico knockout analysis. This file contains the list of effects from each gene and reaction deletion in separate tabs. Gene/reaction deletions that result in lethality are colored red, those that reduce function are colored yellow, and those that cause no change are white. The single green reaction (deletion of biomass maintenance, EN_ATP) improves gene function. Reactions have associated genes listed alongside and genes have associated reactions listed alongside. There are reports for both diazotrophs and photoautotrophs. Data is reported as ratio (growth rate of knockout divided by “wild type” growth rate), raw knockout growth rate, and change in growth rate calculated using the following equation: $ \varDelta \mu =\frac{\mu_{WT}-{\mu}_{mutant}}{\mu_{WT}} $ (XLSX 182 kb)

Additional file 7:

List of lower and upper bounds of flux identified by flux variability analysis. Flux Variability for ideal, exponential growth with fixed biomass/growth rate and objective metabolite production with the same bounds as Flux Balance Analysis. Included are lower and upper bounds for acceptable fluxes along with the total range for each reaction. Reactions are organized by the “Euclidean Range”, the square root of the ranges squared, which give an image for the most consistently variable and non-variable reactions between cell types. (XLSX 80 kb)

Additional file 8: Table S1.

Average protein composition of T. erythraeum. Proteins were hydrolyzed and amino acid concentrations were measured using gas chromatography/mass spectrometry. The molar fraction from ambient air (N₂) was used for the protein and lipid assembly equations. Starred quantities were derived from a previous study [1] because our method was not able to detect them. Table S2. Average lipid composition of T. erythraeum. Extracted lipids then analyzed as fatty acid methyl esters on a gas chromatograph/ mass spectrometer. Relative amounts of subclasses of lipids were assumed from previous literature [2]. Table S3: Comparison to related genome-scale metabolic network reconstructions. Summary figures for related genome scale reconstructions for relevant photosynthetic organisms. (DOCX 22 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Gardner, J.J., Boyle, N.R. The use of genome-scale metabolic network reconstruction to predict fluxes and equilibrium composition of N-fixing versus C-fixing cells in a diazotrophic cyanobacterium, Trichodesmium erythraeum . BMC Syst Biol 11, 4 (2017). https://doi.org/10.1186/s12918-016-0383-z

Download citation

Received: 05 July 2016
Accepted: 21 December 2016
Published: 19 January 2017
DOI: https://doi.org/10.1186/s12918-016-0383-z

The use of genome-scale metabolic network reconstruction to predict fluxes and equilibrium composition of N-fixing versus C-fixing cells in a diazotrophic cyanobacterium, Trichodesmium erythraeum

Abstract

Background

Results

Conclusion

Background

Results

Biomass composition and growth rates

Network reconstruction and manual curation

Flux balance analysis

Flux variability analysis (FVA)

Dynamic flux balance analysis

Discussion

Experimental data

Metabolic network reconstruction

Predicting central metabolic fluxes

Modeling equilibrium colony composition

Conclusions

Methods

Cell cultivation

Biomass composition

Protein quantitation

Carbohydrate quantitation

Cyanophycin quantitation

Phycoerythrin quantitation

Lipid quantitation

Chlorophyll A quantitation

Phycocyanin quantitation

DNA and RNA quantitation

Genome scale metabolic network reconstruction

Model simulations

Defining constraints

Problem formulation for FVA and FBA

Dynamic FBA (dFBA)

Abbreviations

References

Acknowledgements

Funding

Availability of data and materials

Authors’ contributions

Competing interests

Consent to publish

Ethics and consent to participate

Author information

Authors and Affiliations

Corresponding author

Additional files

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Systems Biology

Contact us