Understanding key features of bacterial restriction-modification systems through quantitative modeling

Rodic, Andjela; Blagojevic, Bojana; Zdobnov, Evgeny; Djordjevic, Magdalena; Djordjevic, Marko

doi:10.1186/s12918-016-0377-x

Volume 11 Supplement 1

Selected articles from BGRS\SB-2016: systems biology

Research
Open access
Published: 24 February 2017

Understanding key features of bacterial restriction-modification systems through quantitative modeling

Andjela Rodic^1,2,
Bojana Blagojevic³,
Evgeny Zdobnov⁴,
Magdalena Djordjevic³ &
…
Marko Djordjevic¹

BMC Systems Biology volume 11, pages 1–15 (2017)Cite this article

3533 Accesses
12 Citations
4 Altmetric
Metrics details

Abstract

Background

Restriction-modification (R-M) systems are rudimentary bacterial immune systems. The main components include restriction enzyme (R), which cuts specific unmethylated DNA sequences, and the methyltransferase (M), which protects the same DNA sequences. The expression of R-M system components is considered to be tightly regulated, to ensure successful establishment in a naïve bacterial host. R-M systems are organized in different architectures (convergent or divergent) and are characterized by different features, i.e. binding cooperativities, dissociation constants of dimerization, translation rates, which ensure this tight regulation. It has been proposed that R-M systems should exhibit certain dynamical properties during the system establishment, such as: i) a delayed expression of R with respect to M, ii) fast transition of R from “OFF” to “ON” state, iii) increased stability of the toxic molecule (R) steady-state levels. It is however unclear how different R-M system features and architectures ensure these dynamical properties, particularly since it is hard to address this question experimentally.

Results

To understand design of different R-M systems, we computationally analyze two R-M systems, representative of the subset controlled by small regulators called ‘C proteins’, and differing in having convergent or divergent promoter architecture. We show that, in the convergent system, abolishing any of the characteristic system features adversely affects the dynamical properties outlined above. Moreover, an extreme binding cooperativity, accompanied by a very high dissociation constant of dimerization, observed in the convergent system, but absent from other R-M systems, can be explained in terms of the same properties. Furthermore, we develop the first theoretical model for dynamics of a divergent R-M system, which does not share any of the convergent system features, but has overlapping promoters. We show that i) the system dynamics exhibits the same three dynamical properties, ii) introducing any of the convergent system features to the divergent system actually diminishes these properties.

Conclusions

Our results suggest that different R-M architectures and features may be understood in terms of constraints imposed by few simple dynamical properties of the system, providing a unifying framework for understanding these seemingly diverse systems. We also provided predictions for the perturbed R-M systems dynamics, which may in future be tested through increasingly available experimental techniques, such as re-engineering R-M systems and single-cell experiments.

Background

Restriction-modification systems are rudimentary bacterial immune systems, whose main components are the restriction enzyme (R), and the methyltransferase (M). We here consider Type II restriction-modification (R-M) systems [1], where R cuts the same DNA sequences that are protected by M. Consequently, R and M act, respectively, as a toxic molecule and its antidote, and analogies of R-M and toxin-antitoxin systems are often made [2]. R-M present rudimentary “bacterial immune systems”, as they protect the host bacterial cell against infection by foreign DNA, such as viruses (bacteriophages) [3–6]. The protection mechanism is straightforward, as the foreign DNA entering bacterial cell is unmethylated, and is consequently cut (destroyed) by R. On the other hand, the host DNA is methylated due to presence of M, and is therefore not cut by R, which prevents autoimmunity. In fact, many bacteriophages are under pressure from R-M systems with whom they have common hosts [7, 8], and have developed different mechanisms to avoid restriction [9–11]. Consequently, expression of the toxic molecule and its antidote provides an effective protection of the bacterial cell against foreign DNA infection [12].

R-M systems are often mobile [2, 12, 13], spreading from one bacterial host to the other, so that a bacterial host, which initially did not contain the R-M system (a naïve host), can acquire it through horizontal transfer. Expression of R and M was directly observed in single cells only very recently, for the Esp1396I system [14], and it is still unclear how different R-M system features affect this expression. It is however assumed that R-M expression has to be tightly regulated during its establishment in a naïve host [15]. For example, as the naïve host genome is initially unmethylated, R must be, and where tested actually is, expressed after a delay with respect to M, so that the host’s genomic DNA can be protected before the appearance of R [14, 16, 17]. To ensure such tight regulation, a significant subset of R-M systems contains a third gene, which expresses the control protein (C) [5, 6, 18–23]. C is a transcription factor, which regulates expression of genes in R-M system, including its own expression. In fact, C is typically co-transcribed with R from a common promoter (CR promoter), while M is transcribed from a separate promoter (M promoter) [5, 6, 24].

With respect to the organization of the transcription units, two different architectures are exhibited, which correspond to the convergent (Fig. 1a), and the divergent (Fig. 1b) orientation of CR and M promoters [5, 6, 14, 20, 21, 23, 25, 26]. Despite R-M systems being known for few decades now, with numerous biotechnological uses of restriction enzymes, control of expression of these systems has been insufficiently studied. Two relatively well studied examples are AhdI (a representative of the convergent architecture) [6], and EcoRV (a divergent architecture representative) [5]. For both systems, the core promoters (binding sites of RNA polymerase), and the binding sites of C protein, are experimentally mapped. In addition, for AhdI system, the transcription activity of CR promoter was measured as a function of C protein amount. We previously showed that a thermodynamic model of CR promoter regulation provides a good agreement with this measurement [6]. We also recently showed [14] that a similar thermodynamic model, coupled with a dynamical model of transcript and protein synthesis, can reasonably explain the dynamics of the enzyme synthesis measured by single-cell experiments in another convergent R-M system (Esp1396I). This strongly suggests that quantitative modeling presented here can realistically explain R-M system transcription control. Additionally, thermodynamical modeling of transcription regulation was successfully applied to a number of different biological problems [27–30], while dynamical modeling was applied to explain both more and less complex gene circuits including control of other convergent R-M systems [31–33].

As we detail below on the example of AhdI (convergent system), and EcoRV (divergent system), it is experimentally firmly established that R-M systems exhibit both different architectures, and different features that characterize their gene expression regulation [1, 15]. On the other hand, the regulation should yield the same three dynamical properties, so that the host genome is protected, while the system is efficiently established. In particular, as discussed above, there would have to be a significant expression of M before R is expressed, to ensure that the host genome is protected. Furthermore, once the host genome is protected, the system should likely turn to “ON” state as rapidly as possible, so that the host genome becomes “immune” to the virus infections – this would then require that after an initial delay, R is rapidly generated. Finally, we also previously proposed that, once the toxic molecule (R) reaches a steady-state, its fluctuations should be low – otherwise a high fluctuation in the toxic molecule (R) may not be matched by the antidote (M), which could destroy the host genome [34].

It is however unclear how the diverse system features and architectures, relate with the constraints on the dynamical response of the system stated above. Experimentally, one could, in principle, address this issue by mutating the relevant features (or introducing them in the system where they do not exist), and then measuring how the resulting system dynamics is perturbed. This would however be very hard, as the system would have to be extensively experimentally mutated and/or redesigned, and the resulting protein dynamics measured in-vivo during the system establishment. In that respect, note that the in-vivo dynamics of R and M expression were directly observed for only two Type II systems – in PvuII via nearly simultaneous introduction into a culture using bacteriophage M13 [17], and in Esp1396I, via transformation followed by single cell analysis [14]. Even in these cases, the measurements are done only on the wild-type (wt) system, i.e. perturbations were not introduced in the system.

Therefore, the main purpose of this paper is to investigate the relationship between different system features/architectures, and the dynamical properties which the system is expected to exhibit during its establishment. In particular, it is our hypothesis that the diverse features exhibited in R-M systems may largely be explained in terms of the three dynamical properties discussed above. To start testing this hypothesis, we will here biophysically model the control of AhdI and EcoRV, and assess the resulting dynamics when the characteristic system features are either perturbed (in AhdI case) or (artificially) introduced (in EcoRV case) in the system. This is analogous to a classical approach in molecular biology, where the system is analyzed by mutating its main features, or introducing new features in the system where they do not exist, and consequently observing what effect these perturbations have on the presumed system function. The difference is that we here analyze the system computationally instead of experimentally, where we build on the fact that we previously showed that the modeling approach that we employ here can reasonably explain the available equilibrium measurements [6], and the available single cell experiments [14]. Therefore, the ability of the modeling to explain the measured wild-type data in R-M systems provides a reasonable confidence that our predictions for the perturbed system will also be realistic. Moreover, with the advancement of sophisticated experimental approaches, such as single cell experiments, or possibility to reengineer the system, there comes a prospect of directly experimentally testing these predictions in the future.

Specifically, we will here start by reviewing the relevant experimental information for AhdI and EcoRV systems (the structure of their promoter regions and their regulatory features), which will provide a bases for our theoretical modeling. We will then quantify the general principles discussed above, i.e. introduce what we here call the dynamical property observables, which will allow us quantifying the delay between R and M, how fast the system makes the transition from OFF to ON state, and the stability of R steady-state levels. We will then investigate if abolishing the characteristic features of AhdI also diminishes these observables, i.e. negatively affects the dynamical properties discussed above. Furthermore, we will also study if these dynamical properties also apply to the system (EcoRV) where AhdI features are absent, but a new feature is present (the overlapping promoters). We will then ask what happens if the AhdI features are (computationally) introduced in wild-type EcoRV system, where they originally do not exist. That is, we will investigate if introducing these features leads to (at least) some of the three dynamic property observables being diminished – therefore explaining why they are absent from EcoRV. Overall, we will here systematically investigate how perturbing (or introducing new) features in two characteristic R-M systems affects the resulting system dynamics.

Methods

In the first subsection, we provide in detail the experimentally available information on AhdI (the convergent system) and EcoRV (the divergent system), on which we base our quantitative modeling. The main properties of the model, including the observables through which we assess the system dynamical properties, are provided in the second subsection. We note that the model itself is provided in details in Additional files 1 and 2, where all the parameters (including their experimental/theoretical support) are listed.

Experimentally determined configurations of AhdI and EcoRV

For AhdI, the positions of different promoter elements (C protein and RNAP binding sites) were experimentally mapped for both CR and M promoters [6] (see Fig. 1a). In addition, the binding affinities and the transcription activities for both the wild type and mutant systems (where C protein binding sites were mutated) were measured [6]. These measured values, together with the standard literature values for the kinetic parameters (the translation and the degradation rates), were used to parameterize the model, as provided in detail in Additional file 1.

As indicated in Fig. 2a, C binds to CR promoter, regulating both its own transcription and the transcription of R [6, 19]. C binds to promoter DNA as a dimer, where binding to the distal binding site (configuration K₃), when C is present at relatively low concentration, leads to transcription activation, as C dimer bound to this position recruits RNAP binding to the promoter (configuration K₅). On the other hand, when C is present at high concentration, C dimer bound to the distal binding site recruits another C dimer to the proximal binding site (the tetramer configuration, K₄), thus repressing the transcription, as RNAP cannot bind to the promoter. Note that the configuration in which C dimer binds only to the proximal binding site (equivalent to K₃) is not shown, as the binding affinity to the proximal binding site is much lower compared to the distal binding site, making this configuration much less probable. As for M gene, its transcription is controlled by a negative feedback loop, i.e. M methylates specific sites in its own core promoter thereby repressing the transcription (Fig. 2b).

There are three features which characterize control of AhdI expression [6]. First, there is a very high cooperativity in binding of the C protein dimers to the distal and the proximal positions in CR promoter, so that C dimer bound only to the distal site (K₃ configuration) exists only very transiently in the wild-type (wt) AhdI system. That is, in the absence of RNAP, a C dimer bound to the distal position immediately recruits another C dimer to the proximal binding site. Second, the C dissociation constant of dimerization for AhdI is very high, so that almost all C protein in the solution is in the form of monomers. Finally, C protein is translated from a leaderless transcript (i.e. a transcript which does not contain a ribosome binding site), which was in E. coli shown to be associated with lower translation initiation rate [35, 36].

For EcoRV, CR and M promoters are divergently oriented, as schematically shown in Fig. 1b. Consequently, the promoter elements are located in the intergenic region that separates CR and M genes, and these elements are also experimentally mapped [5]. Some of the binding affinities were also measured [5], while the others were eliminated by rescaling the equations (see Additional file 2) – note that we can rescale the equations, as we are interested only in the relative protein amounts. The kinetic parameters (the translation and the degradation rates), correspond to the standard literature values, and are taken to be the same as for AhdI (with the exception of C translation rate, see below).

In contrast to AhdI, the main feature of EcoRV is the partially overlapping CR and M core promoters, as schematically shown in Fig. 3. Consequently, RNAP cannot simultaneously bind to and initiate transcription from both P_M and P_CR. Moreover, the characteristic features of AhdI are not found in EcoRV [5]. In particular, while the transcription control of the CR promoter by C protein is similar as in AhdI, the main difference is that the large cooperativity between the C dimers at the distal and the proximal binding site is now absent, in fact it was found in EcoRV that the two dimers bind to DNA with no cooperativity [5]. Furthermore, the transcription from P_M is not directly influenced by C protein binding, i.e. C binding does not directly affect RNAP binding to P_M. However, the influence of C on P_M transcription is indirect, as the regulation by C of RNAP binding to P_CR, also affects when RNAP can bind to P_M. Consequently, while in AhdI transcription of CR and M was independent from each other, in EcoRV we have a more complex system where their transcription is strongly coupled. Similar regulation through overlapping CR and M core promoters is also found in CfrBI R-M system [26, 37]. Finally, C transcript is not leaderless in EcoRV, so the feature which was associated with lower translation initiation rate in E. coli, and which is present in AhdI, is now absent from EcoRV.

Modeling AhdI and EcoRV dynamics

We model R and M synthesis upon introducing AhdI and EcoRV in naïve bacterial hosts. The models are based on the experimental knowledge of AhdI and EcoRV transcription regulation, which is summarized in Figs. 2 and 3, respectively. The models are provided in detail in Additional files 1 and 2, and are briefly based on:

(i)
A thermodynamic model, which takes into account the activation and the repression of CR promoter by C, and the repression of M gene by its own product (which was experimentally shown in [6]). The model assumes that the promoter transcription activity is proportional to the equilibrium binding probability of RNAP to promoter, which is a general assumption initially proposed by the classical Shea-Ackers approach [38].
(ii)
Equations that predict how the transcription activity of CR and M promoters depends on C-protein concentration, which further allows modeling the dynamics of transcript and protein expression. That is, the modeled transcription activities provide the main input for a kinetic model, which calculates R, C and M transcript and protein synthesis. Also, note that R-M systems are characterized by very high expression of R and M proteins [14] so that on the order of thousands of molecules are present in the cell. Consequently, the system is expected to be well in the limit where deterministic modeling can be used to realistically describe the system.

We previously showed that such modeling can well explain the wild-type measurements for AhdI [6] - in particular the measured dependence of the transcription activity on C protein concentration – as well as the most recent measurements in single-cell experiments allowing directly observing the dynamics of R and M synthesis [14]. Our aim here is to computationally analyze how systematically abolishing individual system features affects the system’s dynamics, focusing on the following properties:

i.
the time delay between R and M accumulation,
ii.
the transition speed of the system from “OFF” to “ON”,
iii.
the stability of R steady-state levels.

For this, we will introduce observables (which we call the dynamical property observables) that can quantify these properties. To reasonably define them, it is useful to visualize the predicted system dynamics, and the stability of R steady-state levels in wild-type AhdI system, which is shown in Fig. 4 and calculated from Eqs. (1.12), (1.22) and (1.24)–(1.27) (see Additional file 1).

The first dynamical property observable (delay)

From Fig. 4a, we see that the system features lead to a significant delay in the expression of R compared to M, in accordance with the first dynamical property. To quantify how the delay changes upon perturbing these features, we introduce the first dynamical property observable, which corresponds to the ratios of the shaded areas in the perturbed system and in wt AhdI, at an initial interval post-system entry.

The second dynamical property observable (OFF to ON transition speed)

Furthermore, in Fig. 4a, we see that R expression curve has a sigmoidal shape. Consequently, the maximal slope of this curve (indicated in the figure) provides a reasonable measure of transition velocity from “OFF” (low R value) to “ON” (high R value) state. Therefore, as the second dynamical property observable, we introduce the maximal slope of this curve. The changes of this slope will allow assessing how the transition velocity – which determines the time window between the host genome being methylated, and the cell being protected against viruses – will be affected when the system features are perturbed.

The third dynamical property observable (R steady-state level stability)

Finally, the third dynamical property relates with fluctuations of the toxic R molecule, which we propose should be small in the steady-state [34]. The fluctuations are directly related with the stability of the steady-state, so that smaller fluctuations imply larger steady-state stability, which we introduce as the third dynamical property observable.

Different (in-silico) perturbations of the wild-type system – i.e. gradually abolishing the existing or introducing new features – will be introduced in either the thermodynamic model, or in the kinetic equations (see Additional files 1 and 2).

Results and discussion

We will start by gradually abolishing the three characteristic AhdI features introduced above, and assess how this will affect the dynamical property observables. We will next model the dynamics of EcoRV establishment in a naïve bacterial host, to see if the proposed dynamical properties also apply to a system with different architecture and transcription regulation features. This will provide, to our knowledge, the first quantitative model of a divergent R-M system control, and an opportunity to assess dynamics of R and M expression, which was up to now not experimentally observed for the divergent systems. Finally, we will in-silico introduce to EcoRV the regulation features that exist in AhdI, but are not found in EcoRV, to investigate how this effects the dynamical property observables, and why these features are not present in EcoRV.

Perturbing AhdI system features

The three characteristic AhdI features are the high C subunit dissociation constant of dimerization, the large cooperativity between C dimers bound at the distal and the proximal position, and the low C transcript translation initiation rate. It was previously discussed that these features serve to limit the amount of the synthesized toxic molecule (R) [6]. However, it is not clear that this amount per-se should be limited, as a too small steady-state amount of R may compromise the immune response – i.e. it can lead to the virus genome being protected by M before it can be destroyed by R [39]. As we discussed above, it would be very hard to experimentally investigate the effect of these AhdI features on the system dynamics, this can be readily predicted from the model that we formulated above.

Decreasing the dissociation constant of dimerization

The dissociation constant of dimerization K ₁ is very high for AhdI, leading to almost all C subunits being present as monomers in solution [6, 40] – e.g. for another convergent R-M system (Esp1396I), the measured dissociation constant of dimerization was found to be significantly (four times) lower [41]. We start by gradually decreasing this high dissociation constant of dimerization, in the range that corresponds to the wild-type (all monomers in the solution) to the opposite limit of lower K ₁, in which only dimers are present in the solution. In Fig. 5a, we see that this perturbation has a significant effect on R synthesis dynamics – note that the M dynamics curve, which is also indicated in the figure for reference, is not affected by perturbing the three characteristic AhdI features. One can observe the three main effects from Fig. 5a: The decrease of the delay between R and M expression, the slower transition from OFF to ON state, and the decrease in the steady-state level of R. The first two effects are further quantified in Fig. 5b and c, as discussed below.

In Fig. 5b, we see that decreasing K ₁ leads to a significant, more than twofold, decrease in the relative delay between R and M expression. This perturbation can then significantly impact the ability of the system to protect the host genome from being cut during R-M establishment, with the necessary lag also depending on the specific activity of the M protein and the propensity for R to nick hemimethylated sites. Furthermore, in Fig. 5c we see that decreasing K ₁ also leads to a significantly slower transition from OFF to ON state, so that the maximal slope is decreased for almost two-fold. Therefore, decreasing the wt dissociation constant of dimerization also significantly impacts the time window in which the host will be protected from foreign DNA infection. However, perturbing K1 has no significant effect on the steady-state stability of R levels (Fig. 5d). Overall, decreasing the high dissociation constant of dimerization characteristic for wt AhdI, has a significant adverse effect on two of the three proposed design principles.

Increasing C protein translation rate

In AhdI C transcript is leaderless [6], which was in E. coli [35, 36] shown to be associated with a significantly smaller translation initiation rate – consequently in [6] a five times smaller C transcript translation rate k _C, compared to R and M was assumed. We now test the effect of perturbing this system feature, i.e. increasing k _C towards those of R and M transcripts, which is shown in Fig. 6. We see that the main effect of this perturbation is on decreasing the steady-state level of R and the delay between R and M expression (for ~ 40%), as shown in Fig. 6a-b. Intuitively, this can be understood that by a more efficient C transcript translation, C accumulates faster, facilitating the formation of the activating and the repressing complexes on the CR promoter, so that R is expressed with a smaller delay, and reaches the lower steady-state level. On the other hand, the effect on the other two design-observables, i.e. on the transition velocity and the stability of R steady-state levels, is rather small (Fig. 6c-d). Consequently, increasing the low C transcript translation rate adversely affects one of the dynamical property observables, i.e. the delayed expression of R with respect to M, which is considered crucial for the protection of the host genome.

Decreasing cooperativity in the dimer binding

A rather drastic feature of AhdI is a very large cooperativity ω in binding of the two dimers to the distal and the proximal position in the promoter [6], which is either not present (EcoRV) [5], or significantly smaller (Esp1396I) [41], in other R-M systems. We therefore investigate how gradually abolishing this high cooperativity affects the system dynamics and the design observables. In Fig. 7a, we see that abolishing ω affects only the late dynamics of R, so that the first two dynamical properties are not affected (and not shown in Fig. 7). On the other hand, we see that the steady-state amount of R significantly increases as the cooperativity ω decreases. This can be intuitively understood by the fact that perturbing the cooperativity affects only the efficiency of forming the repressor tetramer complex. As the probability of forming this complex is proportional to C⁴ (see Additional file 1), it becomes significant only in the later period, when a large enough amount of C is synthesized. Furthermore, in accordance with the perturbation affecting the late dynamics, from Fig. 7b, we see that decreasing the cooperativity significantly impacts the stability of R steady-state levels, leading to its 50% decrease.

Importantly, the first two AhdI features (the large dissociation constant of dimerization, and the small C translation initiation rate) have an opposite effect on the steady-state amount of R, as compared to the large cooperativity in C dimer binding. That is, while we showed that the first two features significantly increase the steady-state R amount, the third feature (the large cooperativity) significantly decreases it. On the other hand, all three features generally have the same effect on the three dynamical properties that we consider, i.e. abolishing these features either decreases the values of the dynamical property observables (making the corresponding dynamical property less optimal), or do not significantly affect them. This can then explain the extremely large binding cooperativity that was experimentally observed, as on the one side it allows controlling the steady-state amount of the toxic protein due to the opposite effect from the other two features, while at the same time working together with the first two features to ensure more optimal dynamical properties. In particular, note that both the large dissociation constant of dimerization and the large binding cooperativity significantly increase the stability of R steady-state levels, while having a significant - but opposite – effects on the steady-state R amounts.

EcoRV wild-type dynamics

EcoRV is an example of R-M system with a divergent organization of CR and M transcription units. Overlapping CR and M promoters is the most distinctive feature of this system (presenting its main difference with respect to AhdI), which is, together with C protein binding, responsible for control of EcoRV transcription. That is, high occupancy of M promoter by RNAP, prevents RNAP binding to CR promoter, leading to lower CR transcription activity, and vice versa. In modeling the gene expression regulation, we consider that CR promoter transcription is controlled by C, while C binding has little to none direct effect on M promoter transcription activity, as shown in [5]. In distinction to AhdI [6], which shows an extremely high cooperativity in C dimer binding, no coperativity was found in EcoRV [5]. We also assume that C dissociation constant of dimerization is significantly lower than the relevant range of C concentration, so that the majority of C molecules in solution exist as dimers. Note that in another R-M system (Esp1396I), which has a much lower cooperativity in C dimer binding compared to AhdI, a significantly lower dissociation constant of dimerization is also observed [41]. Finally, in distinction to AhdI, C transcript in EcoRV is not leaderless, so for EcoRV we assume that C has the same translation initiation rate as R and M.

Consequently, EcoRV does not have the three features that control transcription in AhdI, but has instead another characteristic feature, i.e. the overlapping CR and M promoters. We therefore ask if EcoRV, with different architecture and the regulation features, can also meet the three dynamical properties that we consider. To that end, we modeled the synthesis of R and M during the system establishment in wild-type EcoRV, under the assumptions stated above, and following the scheme of the transcription configurations shown in Fig. 3. The model is provided in detail in Additional file 2, and is based on the same thermodynamics assumptions as the one for AhdI dynamics. To our knowledge, this presents the first model of expression dynamics for a divergent R-M system, which has a more complex regulation due to overlapping nature of their promoters. This model moreover presents the first opportunity to assess the dynamics of R and M synthesis for a divergent R-M system, as, to our knowledge, either their regulation or their expression dynamics was not previously measured.

The predictions for R and M accumulation in wild-type EcoRV are shown by the full black curve (for R) and by the black dashed curve (for M), in Fig. 8 below. From the figure we see that, regardless of lacking the characteristic AhdI regulatory features, the synthesis of R and M is well in accordance with the three dynamical properties. Namely, by comparing Fig. 4 (the dynamics of AhdI) with the EcoRV dynamics, we see that: i) the time delay for EcoRV is even larger compared to AhdI, ii) there is a clear switch-like behavior of R expression in EcoRV, i.e. the speed of transition from “OFF” to “ON” state is comparable to the one in AhdI, iii) the system reaches the steady-state level (Ω² > 0), where the reached stabilities of R steady-state levels are comparable (compare Fig. 5d with Fig. 8c). Therefore, we see that the design principles which we showed are inherent to AhdI R-M system, are retained in EcoRV R-M system, despite the apparent distinction in gene expression regulation.

Introducing AhdI control features to EcoRV

Next, there is a question of why the characteristic AhdI features are absent from EcoRV. That is, could we get even more optimal design-observables if AhdI control features are introduced in wild-type EcoRV? Therefore, we next use our model, to individually introduce each of the three control features of AhdI, on the top of the existing wt EcoRV regulation (i.e. the overlapping promoters). Specifically, in the wild-type EcoRV, we will perturb: i) the dissociation constant of dimerization towards the high values characteristic for AhdI, ii) cooperativity in C dimer binding to the promoter, also towards the high values observed in AhdI, iii) C protein translation rate k _C, towards the low values characteristic for leaderless AhdI C transcripts.

Introducing the high dissociation constant of dimerization to EcoRV

We first perturb the wt EcoRV system by increasing the rescaled equilibrium dissociation constant of dimerization \( {\overline{K}}_1 \) (see Fig.8 and Additional file 2), which corresponds to a gradual transition from the solution containing mostly C dimers to the solution containing mostly C monomers. Note that the dynamics of both R and M expression is now affected by the perturbation, in distinction to AhdI where only R expression is changed. This is because CR and M promoters overlap in EcoRV, so that changing transcription from one promoter, necessarily impacts transcription from the other.

We observe that this perturbation does not significantly affect the early accumulation of R and M (during the first ~10 min), but that the dynamics at later times is significantly affected (see Fig. 8a). In particular, we see that increasing the dissociation constant of dimerization leads to a significantly slower switch from “OFF” to “ON” state, so that the transition velocity decreases as much as four times (Fig. 8b). Furthermore, in Fig. 8c, we see that increasing \( {\overline{K}}_1 \) also significantly decreases the stability of R steady-state levels Ω², which drops almost three times. Consequently, introducing the high dissociation constant of dimerization to EcoRV, which is characteristic for AhdI, has a significant adverse effect on two of the three dynamical properties.

Introducing the high C dimer binding cooperativity

We next modify wt EcoRV by increasing the cooperativity ω of C dimer binding to the proximal and the distal binding site, while keeping the other wt EcoRV features unchanged. Note that the experimental measurements in wt EcoRV show an absence of C dimer binding cooperativity (ω = 1) [5], as opposed to the extremely large binding cooperativity that is observed in AhdI [6]. In Fig. 9, we see that increasing ω has the following effects: i) the time delay remains nearly the same (Fig. 9a), ii) the transition velocity decreases (Fig. 9b), where we see that increasing ω for a relatively moderate factor (2⁴), leads to a significant (somewhat less than twofold) decrease of v_max, iii) stability of R steady-state levels slightly increases. Consequently, we see that perturbing wt EcoRV cooperativity towards the higher values characteristic for AhdI, has a significant adverse effect on one of the dynamical properties (the transition velocity), while not significantly affecting the other two.

Decreasing C translation rate in EcoRV

Finally, we perturb wt EcoRV by decreasing C transcript translation rate k_C, towards the value characteristic for AhdI. Note that C transcript is leaderless in AhdI [6], which is not the case for EcoRV [5], so that we assume the same translation rate for all three transcripts (C, R and M) in EcoRV, while k _C is taken as five times lower in AhdI according to [6]. In Fig. 10a we observe that decreasing k _C does not impact the initial R and M accumulation (during the first ~10 min). On the other hand, at later times the perturbation significantly decreases both the transition velocity that decreases two times (see Fig. 10b), and the stability of R steady-state levels that decreases somewhat less than twofold (see Fig. 10c). Consequently, we see that again two of the three dynamical properties are significantly adversely affected by introducing a control feature from AhdI.

Overall, introducing AhdI characteristic features to EcoRV has a significant adverse effect on at least one of the dynamical properties, which may explain why those features are not found in EcoRV. Additionally, perturbing EcoRV wt parameters towards the AhdI values (Figs. 8a, 9a and 10a) changes M to R ratio in the same direction for each introduced feature (consistently increasing the ratio). This is in distinction to AhdI, where the high cooperativity of C dimer binding has an opposite effect on this ratio, compared to the other two features. Consequently, we argue that another reason for why the characteristic AhdI features are not observed in EcoRV, is because they do not allow balancing the amounts of R and M in the host cell.

Conclusion

R-M systems are characterized by different architectures and control features. We here test a hypothesis that these diverse features can be explained by constraints imposed by few dynamical properties. We started from a relatively well studied AhdI system, and computationally abolished three of its characteristic control features, showing that this has a clear adverse effect on the three dynamical properties. We then modeled a system with different architecture (EcoRV), and showed that its expression dynamics also satisfies the same properties. The EcoRV model has significance in its own right, as the expression dynamics of the divergent R-M systems was, to our knowledge, not studied before, either theoretically or experimentally. Finally, we computationally introduced to EcoRV the control features that exist in AhdI, and showed that this diminishes at least some of the proposed dynamical properties, consistent with the fact that these features do not appear in wt EcoRV. Moreover, increasing the binding cooperativity has the same effect on M to R ratio in EcoRV as increasing the dissociation constant of dimerization, or lowering the translation rate, which prevents balancing M to R ratio upon introducing these perturbations – this then provides another argument for why AhdI control features are absent from wt EcoRV.

Furthermore, dynamical properties proposed here can provide an explanation for a surprisingly large value of the cooperativity in C protein binding, accompanied by the large dissociation constant of dimerization that was observed in wt AhdI. We here showed that these two features have an opposite effect on the steady-state levels of the toxic molecule (R), allowing balancing the steady-state R amount, while at the same time leading to more optimal dynamical properties. In support of this proposal, a similar convergent system with lower binding cooperativity (Esp1396I) was also found to have a lower value of the dissociation constant of dimerization. As a prediction, it will be interesting to test if, in other R-M systems, the value of the dissociation constant of dimerization and the binding cooperativity are also related in this way.

Overall, this work provides an example that the system properties that may appear “random” or even surprising (such as the extremely large binding cooperativity) may be explained by constraints imposed by few general principles (in this case the system dynamical properties). Additionally, some of these system properties may serve other functions, e.g. the leaderless C transcripts might be related with a need for preferential translation under specific physiological conditions [42]. Analyzing other R-M systems can further test relation of the system features with the simple dynamical properties, where the main obstacle is that their transcription regulation is generally not well studied. In particular, investigating up to now poorly understood linear R-M systems, which have different architecture compared to the convergent and the divergent systems studied here, and which do not encode C proteins – but may exhibit control by antisense RNAs or at the level of translation initiation efficiency - may be particularly useful [43, 44]. As a further outlook, it will be interesting investigating if properties of other bacterial immune systems, such as recently discovered CRISPR/Cas systems [45], can also be explained by similar dynamical properties [34]. With that respect note that CRISPR/Cas is more advanced, i.e. adaptive bacterial immune system, which retains a memory of the past infections incorporated as spacers in the CRISPR array [46].

Also, in this work we follow a standard approach in molecular biology, where features of the system are perturbed/mutated (which is here done in-silico), and the effect of these perturbations on the presumed system function is assessed. In addition to such “single mutations”, a computational equivalent of “double” or “triple” mutations can be exhibited, where more than one system feature would be simultaneously perturbed. This would address the question if perturbations in one feature, can be rescued by also perturbing the other feature(s), which is related to the system robustness. While this question is out of the scope of this work, it also provides an interesting outlook for future research.

Finally, the recent advancement of experimental techniques, such as single-cell experiments, allows directly observing the protein dynamics during the system establishment. While in principle arduous, it would be interesting to experimentally observe how the relevant dynamics is perturbed when some of the key system features are abolished. This would then directly put to test some of the prediction from the computational modelling, which we provided here.

Abbreviations

C:: Control protein
M:: Methyltransferase
R:: Restriction enzyme
R-M:: Restriction-modification
RNAP:: RNA polymerase
wt:: Wild-type

References

Pingoud A, Wilson GG, Wende W. Type II restriction endonucleases—a historical perspective and more. Nucleic Acids Res. 2014;42(12):7489–527.
Mruk I, Kobayashi I. To be or not to be: regulation of restriction–modification systems and other toxin–antitoxin systems. Nucleic Acids Res. 2014;42(1):70-86.
Gingeras TR, Brooks JE. Cloned restriction/modification system from Pseudomonas aeruginosa. Proc Natl Acad Sci. 1983;80(2):402–6.
Article CAS PubMed PubMed Central Google Scholar
Kiss A, Posfai G, Keller CC, Venetianer P, Roberts RJ. Nudeotide sequence of the BsuRI restriction-modification system. Nucleic Acids Res. 1985;13(18):6403–21.
Article CAS PubMed PubMed Central Google Scholar
Semenova E, Minakhin L, Bogdanova E, Nagornykh M, Vasilov A, Heyduk T, Solonin A, Zakharova M, Severinov K. Transcription regulation of the EcoRV restriction-modification system. Nucleic Acids Res. 2005;33(21):6942–51.
Article CAS PubMed PubMed Central Google Scholar
Bogdanova E, Djordjevic M, Papapanagiotou I, Heyduk T, Kneale G, Severinov K. Transcription regulation of the type II restriction-modification system AhdI. Nucleic Acids Res. 2008;36(5):1429–42.
Article CAS PubMed PubMed Central Google Scholar
Krüger D, Bickle TA. Bacteriophage survival: multiple mechanisms for avoiding the deoxyribonucleic acid restriction systems of their hosts. Microbiol Rev. 1983;47(3):345.
PubMed PubMed Central Google Scholar
Tock MR, Dryden DT. The biology of restriction and anti-restriction. Curr Opin Microbiol. 2005;8(4):466–72.
Article CAS PubMed Google Scholar
Korona R, Korona B, Levin BR. Sensitivity of naturally occurring coliphages to type I and type II restriction and modification. Microbiology. 1993;139(6):1283–90.
CAS Google Scholar
Makino O, Saito H, Ando T. Bacillus subtilis-phage φ1 overcomes host-controlled restriction by producing BamNx inhibitor protein. Mol Gen Genet MGG. 1980;179(3):463–8.
Article CAS PubMed Google Scholar
Takahashi I, Marmur J. Replacement of thymidylic acid by deoxyuridylic acid in the deoxyribonucleic acid of a transducing phage for Bacillus subtilis. 1963.
Google Scholar
Kobayashi I. Behavior of restriction-modification systems as selfish mobile elements and their impact on genome evolution. Nucleic Acids Res. 2001;29(18):3742–56.
Article CAS PubMed PubMed Central Google Scholar
Jeltsch A, Pingoud A. Horizontal gene transfer contributes to the wide distribution and evolution of type II restriction-modification systems. J Mol Evol. 1996;42(2):91–6.
Article CAS PubMed Google Scholar
Morozova N, Sabantsev A, Bogdanova E, Fedorova Y, Maikova A, Vedyaykin A, Rodic A, Djordjevic M, Khodorkovskii M, Severinov K. Temporal dynamics of methyltransferase and restriction endonuclease accumulation in individual cells after introducing a restriction-modification system. Nucleic Acids Res. 2016;44(2):790–800.
Nagornykh M, Bogdanova E, Protsenko A, Solonin A, Zakharova M, Severinov K. Regulation of gene expression in a type II restriction-modification system. Russ J Genet. 2008;44(5):523–32.
Article CAS Google Scholar
Ichige A, Kobayashi I. Stability of EcoRI restriction-modification enzymes in vivo differentiates the EcoRI restriction-modification system from other postsegregational cell killing systems. J Bacteriol. 2005;187(19):6612–21.
Article CAS PubMed PubMed Central Google Scholar
Mruk I, Blumenthal RM. Real-time kinetics of restriction–modification gene expression after entry into a new host cell. Nucleic Acids Res. 2008;36(8):2581–93.
Article CAS PubMed PubMed Central Google Scholar
Ball NJ, McGeehan J, Streeter S, Thresh S-J, Kneale G. The structural basis of differential DNA sequence recognition by restriction–modification controller proteins. Nucleic acids research 2012;40(20):10532–42.
McGeehan JE, Papapanagiotou I, Streeter SD, Kneale GG. Cooperative binding of the C.AhdI controller protein to the C/R promoter and its role in endonuclease gene expression. J Mol Biol. 2006;358(2):523–31.
Article CAS PubMed Google Scholar
Sohail A, Ives CL, Brooks JE. Purification and characterization of C · BamHI, a regulator of the BamHI restriction-modification system. Gene. 1995;157(1):227–8.
Article CAS PubMed Google Scholar
Sorokin V, Severinov K, Gelfand MS. Systematic prediction of control proteins and their DNA binding sites. Nucleic Acids Res. 2009;37(2):441–51.
Article CAS PubMed Google Scholar
Ives CL, Nathan PD, Brooks JE. Regulation of the BamHI restriction-modification system by a small intergenic open reading frame, bamHIC, in both Escherichia coli and Bacillus subtilis. J Bacteriol. 1992;174(22):7194–201.
Article CAS PubMed PubMed Central Google Scholar
Tao T, Bourne J, Blumenthal R. A family of regulatory genes associated with type II restriction-modification systems. J Bacteriol. 1991;173(4):1367–75.
Article CAS PubMed PubMed Central Google Scholar
Česnavičienė E, Mitkaitė G, Stankevičius K, Janulaitis A, Lubys A. Esp1396I restriction–modification system: structural organization and mode of regulation. Nucleic Acids Res. 2003;31(2):743–9.
Article PubMed PubMed Central Google Scholar
Karyagina A, Shilov I, Tashlitskii V, Khodoun M, Vasil’ev S, Lau PC, Nikolskaya I. Specific binding of SsoII DNA methyltransferase to its promoter region provides the regulation of SsoII restriction-modification gene expression. Nucleic Acids Res. 1997;25(11):2114–20.
Article CAS PubMed PubMed Central Google Scholar
Zakharova M, Minakhin L, Solonin A, Severinov K. Regulation of RNA polymerase promoter selectivity by covalent modification of DNA. J Mol Biol. 2004;335(1):103–11.
Article CAS PubMed Google Scholar
Vilar JM, Saiz L. Systems biophysics of gene expression. Biophys J. 2013;104(12):2574–85.
Article CAS PubMed PubMed Central Google Scholar
Tolkunov D, Morozov AV. Genomic studies and computational predictions of nucleosome positions and formation energies. Adv Protein Chem Struct Biol. 2010;79:1–57.
Article CAS PubMed Google Scholar
Gertz J, Siggia ED, Cohen BA. Analysis of combinatorial cis-regulation in synthetic and genomic promoters. Nature. 2009;457(7226):215–8.
Article CAS PubMed Google Scholar
Seo Y-J, Chen S, Nilsen-Hamilton M, Levine HA. A mathematical analysis of multiple-target SELEX. Bull Math Biol. 2010;72(7):1623–65.
Article PubMed Google Scholar
Aguda B, Friedman A. Models of cellular regulation. New York: Oxford University Press; 2008.
Alon U. An introduction to systems biology: design principles of biological circuits. London: CRC press; 2006.
Williams K, Savageau MA, Blumenthal RM. A bistable hysteretic switch in an activator–repressor regulated restriction–modification system. Nucleic Acids Res. 2013;41(12);6045–57.
Djordjevic M. Modeling bacterial immune systems: strategies for expression of toxic - but useful - molecules. Biosystems. 2013;112(2):139–44.
Article CAS PubMed Google Scholar
O’Donnell SM, Janssen GR. The initiation codon affects ribosome binding and translational efficiency in Escherichia coli of cI mRNA with or without the 5′ untranslated leader. J Bacteriol. 2001;183(4):1277–83.
Article PubMed PubMed Central Google Scholar
Shell SS, Wang J, Lapierre P, Mir M, Chase MR, Pyle MM, Gawande R, Ahmad R, Sarracino DA, Ioerger TR. Leaderless transcripts and small proteins are common features of the mycobacterial translational landscape. PLoS Genet. 2015;11(11):e1005641.
Article PubMed PubMed Central Google Scholar
Beletskaya IV, Zakharova MV, Shlyapnikov MG, Semenova LM, Solonin AS. DNA methylation at the CfrBI site is involved in expression control in the CfrBI restriction–modification system. Nucleic Acids Res. 2000;28(19):3817–22.
Article CAS PubMed PubMed Central Google Scholar
Shea MA, Ackers GK. The OR control system of bacteriophage lambda. A physical-chemical model for gene regulation. J Mol Biol. 1985;181(2):211–30.
Article CAS PubMed Google Scholar
Enikeeva FN, Severinov KV, Gelfand MS. Restriction–modification systems and bacteriophage invasion: Who wins? J Theor Biol. 2010;266(4):550–9.
Article CAS PubMed PubMed Central Google Scholar
Streeter SD, Papapanagiotou I, McGeehan JE, Kneale GG. DNA footprinting and biophysical characterization of the controller protein C.AhdI suggests the basis of a genetic switch. Nucleic Acids Res. 2004;32(21):6445–53.
Article CAS PubMed PubMed Central Google Scholar
Bogdanova E, Zakharova M, Streeter S, Taylor J, Heyduk T, Kneale G, Severinov K. Transcription regulation of restriction-modification system Esp1396I. Nucleic Acids Res. 2009;37(10):3354–66.
Article CAS PubMed PubMed Central Google Scholar
Vesper O, Amitai S, Belitsky M, Byrgazov K, Kaberdina AC, Engelberg-Kulka H, Moll I. Selective translation of leaderless mRNAs by specialized ribosomes generated by MazF in Escherichia coli. Cell. 2011;147(1):147–57.
Article CAS PubMed PubMed Central Google Scholar
Mruk I, Liu Y, Ge L, Kobayashi I. Antisense RNA associated with biological regulation of a restriction–modification system. Nucleic Acids Res. 2011;39(13):5622–32.
Nagornykh M, Zakharova M, Protsenko A, Bogdanova E, Solonin AS, Severinov K. Regulation of gene expression in restriction-modification system Eco29kI. Nucleic Acids Res. 2011;39(11):4653–63.
Article CAS PubMed PubMed Central Google Scholar
Barrangou R, Marraffini LA. CRISPR-Cas systems: prokaryotes upgrade to adaptive immunity. Mol Cell. 2014;54(2):234–44.
Article CAS PubMed PubMed Central Google Scholar
Al-Attar S, Westra ER, van der Oost J, Brouns SJ. Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes. Biol Chem. 2011;392(4):277–89.
Article CAS PubMed Google Scholar
Rezulak M, Borsuk I, Mruk I: Natural C-independent expression of restriction endonuclease in a C protein-associated restriction-modification system. Nucleic Acids Res. 2016;44(6):2646–60.
Lubys A, Jurenaite S, Janulaitis A. Structural organization and regulation of the plasmid-borne type II restriction-modification system Kpn2l from Klebsiella pneumoniae RFL2. Nucleic Acids Res. 1999;27(21):4228–34.
Article CAS PubMed PubMed Central Google Scholar
Knowle D, Lintner RE, Touma YM, Blumenthal RM. Nature of the promoter activated by C. PvuII, an unusual regulatory protein conserved among restriction-modification systems. J Bacteriol. 2005;187(2):488–97.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank K. Severinov for useful discussions.

Declaration

This article has been published as part of BMC Systems Biology Vol 11 Suppl 1, 2017: Selected articles from BGRS\SB-2016: systems biology. The full contents of the supplement are available online at http://bmcsystbiol.biomedcentral.com/articles/supplements/volume-11-supplement-1.

Funding

This work (including the publication costs) was funded by the Swiss National Science foundation under SCOPES project number IZ73Z0_152297, by Marie Curie International Reintegration Grant within the 7^th European community Framework Programme (PIRG08-GA-2010-276996) and by the Ministry of Education and Science of the Republic of Serbia under project number ON173052.

Availability of data and materials

The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.

Authors’ contributions

MJD, MRD and EZ conceived the work. AR and BB performed the analysis. All the authors interpreted the results. MJD, AR and BB wrote the paper, with the help of MRD and EZ. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable

Ethics approval and consent to participate

Not applicable

Author information

Authors and Affiliations

Institute of Physiology and Biochemistry, Faculty of Biology, University of Belgrade, Studentski trg 16, 11000, Belgrade, Serbia
Andjela Rodic & Marko Djordjevic
Multidisciplinary PhD program in Biophysics, University of Belgrade, Belgrade, Serbia
Andjela Rodic
Institute of Physics Belgrade, University of Belgrade, Belgrade, Serbia
Bojana Blagojevic & Magdalena Djordjevic
Department of Genetic Medicine and Development, University of Geneva and Swiss Institute of Bioinformatics, Geneva, Switzerland
Evgeny Zdobnov

Authors

Andjela Rodic
View author publications
You can also search for this author in PubMed Google Scholar
Bojana Blagojevic
View author publications
You can also search for this author in PubMed Google Scholar
Evgeny Zdobnov
View author publications
You can also search for this author in PubMed Google Scholar
Magdalena Djordjevic
View author publications
You can also search for this author in PubMed Google Scholar
Marko Djordjevic
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marko Djordjevic.

Additional files

Additional file 1:

Model of AhdI regulation and dynamics. (PDF 415 kb)

Additional file 2:

Model of EcoRV regulation and dynamics. (DOCX 198 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Rodic, A., Blagojevic, B., Zdobnov, E. et al. Understanding key features of bacterial restriction-modification systems through quantitative modeling. BMC Syst Biol 11 (Suppl 1), 1–15 (2017). https://doi.org/10.1186/s12918-016-0377-x

Download citation

Published: 24 February 2017
Issue Date: February 2017
DOI: https://doi.org/10.1186/s12918-016-0377-x

Selected articles from BGRS\SB-2016: systems biology

Understanding key features of bacterial restriction-modification systems through quantitative modeling

Abstract

Background

Results

Conclusions

Background

Methods

Experimentally determined configurations of AhdI and EcoRV

Modeling AhdI and EcoRV dynamics

The first dynamical property observable (delay)

The second dynamical property observable (OFF to ON transition speed)

The third dynamical property observable (R steady-state level stability)

Results and discussion

Perturbing AhdI system features

Decreasing the dissociation constant of dimerization

Increasing C protein translation rate

Decreasing cooperativity in the dimer binding

EcoRV wild-type dynamics

Introducing AhdI control features to EcoRV

Introducing the high dissociation constant of dimerization to EcoRV

Introducing the high C dimer binding cooperativity

Decreasing C translation rate in EcoRV

Conclusion

Abbreviations

References

Acknowledgements

Declaration

Funding

Availability of data and materials

Authors’ contributions

Competing interests

Consent for publication

Ethics approval and consent to participate

Author information

Authors and Affiliations

Corresponding author

Additional files

Additional file 1:

Additional file 2:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Systems Biology

Contact us