Identifying reaction modules in metabolic pathways: bioinformatic deduction and experimental validation of a new putative route in purine catabolism
© Barba et al.; licensee BioMed Central Ltd. 2013
Received: 20 June 2013
Accepted: 25 September 2013
Published: 5 October 2013
Enzymes belonging to mechanistically diverse superfamilies often display similar catalytic mechanisms. We previously observed such an association in the case of the cyclic amidohydrolase superfamily whose members play a role in related steps of purine and pyrimidine metabolic pathways. To establish a possible link between enzyme homology and chemical similarity, we investigated further the neighbouring steps in the respective pathways.
We identified that successive reactions of the purine and pyrimidine pathways display similar chemistry. These mechanistically-related reactions are often catalyzed by homologous enzymes. Detection of series of similar catalysis made by succeeding enzyme families suggested some modularity in the architecture of the central metabolism. Accordingly, we introduce the concept of a reaction module to define at least two successive steps catalyzed by homologous enzymes in pathways alignable by similar chemical reactions. Applying such a concept allowed us to propose new function for misannotated paralogues. In particular, we discovered a putative ureidoglycine carbamoyltransferase (UGTCase) activity. Finally, we present experimental data supporting the conclusion that this UGTCase is likely to be involved in a new route in purine catabolism.
Using the reaction module concept should be of great value. It will help us to trace how the primordial promiscuous enzymes were assembled progressively in functional modules, as the present pathways diverged from ancestral pathways to give birth to the present-day mechanistically diversified superfamilies. In addition, the concept allows the determination of the actual function of misannotated proteins.
KeywordsDihydroorotase Cyclic amidohydrolases Dihydroorotase dehydrogenase Pyrimidine metabolism Purine metabolism Reaction module Functional annotation Rubrobacter xylanophilus
Investigating the evolution of metabolic pathways requires tracing back how the enzymes that catalyze successive steps have evolved to perform specific chemical reactions [1–3]. Enzyme families are grouping all homologous gene products descending from a common ancestor by speciation and/or gene duplication. An increasingly prevailing model  postulates that present-day enzyme families and superfamilies are the result of the progressive divergence of ancestral proteins endowed with a promiscuous function. Contrary to the classical model proposed by Ohno , it is anticipated that innovation (enzyme promiscuity) preceded gene duplication and functional divergence of the paralogous copies by descent with modification . To explain the appearance of many closely related families which group into mechanistically diverse superfamilies, Glasner et al.  have proposed to distinguish two degrees of promiscuity: shared chemistry (substrate ambiguity) and substrate binding (catalytic promiscuity). More and more data suggest that substrate ambiguity, first defined in the classical patchwork model of Jensen , rather than catalytic promiscuity , is the main road which facilitates divergence of most enzyme families [10, 11].
To examine further the observed coupling of enzyme homology and chemical similarity, we investigated the neighbouring steps in the respective pathways in purine and pyrimidine metabolism. In this paper, we identify that successive reactions display similar chemistry. These mechanistically-related reactions are often catalyzed by homologous enzymes. These homologues diverge in their molecular and cellular functions while maintaining a similar chemical mechanism in their catalytic process. This detection of series of similar catalysis made by succeeding enzyme families suggested some modularity [17, 18] in the architecture of central metabolism.
This led us to propose the term reaction module to describe such related suites of catalyses found in parallel pathways that are alignable at the level of their chemically similar steps.
We demonstrate further the importance of this concept in the characterization of a new route in purine catabolism. After a bioinformatic discovery of a putative ureidoglycine carbamoyltransferase (UGTCase) activity, we present experimental data supporting the idea that UGTCase is likely involved in such an alternative metabolic route.
Results and discussion
Comparing dihydroorotate dehydrogenase and dihydropyrimidine dehydrogenase and finding new putative dehydrogenase families
Figure 1, Box 2 illustrates that DHOase, involved in pyrimidine biosynthesis (Figure 1 arrow C), is homologous both to hydantoinase/dihydropyrimidinase (HYDase/DHPase) involved in degradation of pyrimidines (Figure 1 arrow B), and to allantoinase (ALLase) a major enzyme of purine catabolism (Figure 1 arrow A). Figure 1, Box 2 underlines the similar chemical reactions performed by these different homologues on similar substrates . For instance, carbamoyl-aspartate (substrate of DHOase) has a similar structure to N-carbamoyl-beta-aminoisobutyrate (the product of thymine degradation) and N-carbamoyl-beta -alanine (the product of uracil degradation). Interestingly, Figure 1, Box 3 shows further similarities in the chemical reactions carried out by the enzymes that are active in the subsequent step of pyrimidine metabolism in both anabolic (arrow C) and catabolic (arrow B) directions. Indeed, the dihydroorotate is transformed anabolically into orotate by the dihydroorotate dehydrogenase (DHODase, EC 18.104.22.168), in a process similar to the catabolic transformation (EC 22.214.171.124 and EC 126.96.36.199) of uracil or thymine to dihydrouracil or dihydrothymine by their respective dihydropyrimidine dehydrogenases (DHPDases). To improve our knowledge of the evolutionary mechanisms leading to the establishment of such related adjacent reactions (arrows B and C, Box 3), we looked further at the evolutionary relationships between DHODases and DHPDases.
A gene coding for X2 was found in 69 bacterial species (belonging to nearly all phyla of the Domain Bacteria) as a close neighbour of a gene annotated as encoding a pyruvate-ferredoxin oxidoreductase. Moreover, in 13 out of these 69 species, the immediate neighbour to this pyruvate-ferredoxin oxidoreductase is a gene homologous to preT, encoding the ferredoxin part of the complex PreA-PreT of the Escherichia coli DHPDase . In the remaining species defining the X2 subtree, this preT-like gene is present but is not in the same transcriptional unit as the gene for dehydrogenase X2. Since E. coli is found to contain four preT paralogues but only one copy of preA, one may guess by analogy that X2 could be the partner of one of these preT paralogues. This should form a complex with the pyruvate-ferredoxin oxidoreductase in order to dehydrogenate an uncharacterized component that may be similar to dihydropyrimidines or hydantoin.
Updating the carbamoyltransferase phylogenetic tree
The methodological approach previously described  and summarized above and in Methods, was used to reconstruct an accurate evolutionary tree of the carbamoyltransferase superfamily. Figure 4 shows a simplified view (see Additional files 5, 6, 7 and 8 for complete views of the MSA and tree, respectively) of this updated tree. As in the trees we reconstructed previously, with far less sequences [25–27], there is a clear separation between the ornithine carbamoyltransferases (OTCases) and the ATCases (Figure 4). However, the huge increase in the number of sequences brings with it three notable features. (i) The previously described OTCase alpha and beta subfamilies appear now to be partially intermingled. (ii) The putrescine carbamoyltransferases (PTCases) form a monophyletic group that currently branches inside this OTCase subtree. This evolutionary location appears to be biologically significant since it has been recently demonstrated that the PTCase synthesized by Listeria monocytogenes is actually a bifunctional enzyme, catalyzing the decarbamoylation of either citrulline or carbamoylputrescine . This depends on growth conditions at low pH and when expressed as a virulence factor . In addition, the recent determination of the 3D structure of PTCases confirms the evolutionary inclusion of PTCases among OTCases [29, 30]. (iii) The two families of ATCases (ATC I and ATC II) we described previously [25–27] still form two monophyletic subtrees corresponding to different quaternary structures .
Conserved residues among carbamoyltransferases
Common TCase motifs
(Positions in Escherichia coli ATCase)
ATCase (I & II)
AIA or AIS or SIA
Characterizing the pseudo-ATCases
Figure 5 shows that pseudo-ATCases can be differentiated into three evolutionarily-defined subgroups using their closest homologue, the Pyrococcus abyssi authentic ATCase, as the outgroup. Table 1 shows that this phylogenetic differentiation is also well supported by major differences, defined using the E. coli ATCase sequence as a reference, in the two structural motifs located in the C-terminal part (in bold). The first motif, corresponding to H265-P268 (column 3 in Table 1) distinguishes each group of pseudo-ATCases from all the other carbamoyltransferases, while the second motif R230-Q232 (column 4 in Table 1) distinguishes each group of pseudo-ATCases from the genuine ATCases. Moreover, pseudo-ATCases contain, at their N-terminal region, two structural motifs (corresponding to S53-T56 (column 1), and H135-Q138 (column 2) in the E. coli ATCase sequence) that are highly conserved in the whole superfamily as a carbamoyltransferase signature.
Additionally, the phylogenetic differentiation of each of the three subgroups of pseudo-ATCases is confirmed by a distinctive gene context (Figure 5). In Subgroup 2 (composed of only one organism, the actinobacterium Nocardioides sp. JS614), the gene encoding the pseudo-ATCase is adjacent to a homologue of the E. coli ygeW gene. YgeW  is a carbamoyltransferase which belongs to a group sharing a common ancestor with the AOTCases  and SOTCases  (see Figure 4 and Table 1), but its true physiological role remains elusive . In Nocardioides, the adjacent downstream genes are xdhA, xdhB and xdhC, together encoding a putative heterotrimeric xanthine dehydrogenase (involved in purine degradation ), and then a gene encoding a putative phenylhydantoinase HyuA . Such gene association is reminiscent of the specific context of X1 family (Figure 3) even if there is no detectable X1 homologue in Nocardioides sp. JS614.
Members of the Subgroup 3 pseudo-ATCases are found in marine bacterial species. The encoding gene is part of a conserved syntenic block containing on one side a gene encoding a Na+ symporter (seawater milieu?) and on the other side a gene encoding a putative asparaginase. In nearly all cases, this cluster is adjacent to a gene annotated as encoding a putative deiminase, which is most probably an N-carbamoyl-L-amino acid amidohydrolase (HyuC) involved in hydantoin metabolism .
Genomic context of the Rubrobacter xylanophilus pseudo-ATCase
Deducing a novel carbamoyltransferase activity in a reaction module involved in purine degradation
The degradation process of allantoin to glyoxylate may involve one of several possible enzymes, namely, allantoicase, allantoate amidohydrolase, ureidoglycine amidohydrolase, ureidoglycolatase, and ureidoglycolate amidohydrolase [35–38]. Importantly, however, we could not detect in the R. xylanophilus genome (RefSeq: NC_008148) any gene encoding the catalytic step corresponding to any of these enzymes. This suggested the possibility that Rxyl_2847 and its neighbours (Table 2) could play a role in this pathway. Accordingly, we searched for possible reaction modules composed of chemically similar substrates/products through the comparison of purine catabolism (Figure 1A), pyrimidine catabolism (Figure 1B), and pyrimidine biosynthesis (Figure 1C).
Indirect evidence that Rxyl_2847 has a ureidoglycine carbamoyltransferase activity
Experimental validation that Rxyl_2847 has a ureidoglycine carbamoyltransferase activity
To ascertain the bioinformatic deduction that Rxyl_2847 is really an UGTCase, the Rxyl_2847 gene was cloned into a pBAD expression vector (see Methods and Additional file 9). Unexpectedly, the sequencing of plasmid pCEC53 revealed several mismatches between the cloned ORF sequence and the published genomic sequence of R. xylanophilus DSM9941 (RefSeq: NC_008148). An extended DNA fragment encompassing the ORF Rxyl_2847 was generated by two independent PCR reactions, sequenced, and this confirmed the sequence of the cloned Rxyl_2847 (GenBank : JX289826).
Characterization of the Rubrobacter xylanophilus UGTCase activity
Reaction mixture composition
Specific activityc(μmol min-1 mg-1)
25 mM potassium arsenate pH 6.8, 20 mM allantoate
0.046 ± 0.007 a
25 mM Hepes pH 6.8, 20 mM allantoate
50 mM potassium phosphate pH 6.8, 20 mM allantoate, 5 mM ornithine, EcOTCase (100 units)
0.050 ± 0.002 b
25 mM Hepes pH 6.8, 20 mM allantoate, 5 mM ornithine, EcOTCase (100 units)
50 mM potassium phosphate pH 6.8, 5 mM ornithine, EcOTCase (100 units)
50 mM potassium phosphate pH 6.8, 20 mM allantoate, 5 mM ornithine, EcOTCase (100 units)
5.913 ± 0.584 b
50 mM potassium phosphate pH 6.8, 5 mM ornithine, EcOTCase (100 units)
To assay if the purified recombinant protein had a carbamoyltransferase activity, we first examined its ability to catalyze the arsenolytic cleavage of allantoate (Table 3). Importantly, since ureidoglycine, the putative product of the reaction, is thermally unstable, assays were performed with an incubation time not exceeding 5 min. (see Methods for details). The amino content in the reaction mixture was immediately analyzed by reverse phase HPLC after pre-column derivatization with o-phtaldialdehyde (see Methods). One prominent peak, corresponding to ureidoglycine (retention time: 11.6 min), was obtained after the enzyme was incubated at 37°C with allantoate and arsenate. After incubation at 60°C (the optimal growth temperature of R. xylanophilus), only a small amount of ureidoglycine was observed, probably due to the lability of this product. Ureidoglycine formation was observed when UGTCase was incubated with allantoate and arsenate, but not in the absence of arsenate, excluding the possibility of enzymatic hydrolysis of allantoate (Table 3).
The physiological reaction catalyzed by UGTCase, namely the phosphorolysis of allantoate, was then analyzed by coupling with purified E. coli OTCase to prove that carbamoyl phosphate was effectively formed in the reaction. The citrulline produced in the coupled assay was quantified by reverse phase HPLC after pre-column derivatization with o-phtaldialdehyde. Table 3 demonstrates that carbamoyl phosphate was indeed produced. Comparison of the activities at both 37° and at 60°C showed that UGTCase was much more active at 60°C, as expected for a thermophilic enzyme (Table 3).
In this paper, we have substantiated the potency of the concept of the reaction module to unravel undisclosed functional relationships in central metabolism and to discover the actual function of misannotated proteins [46, 47], especially when coupled with an informative gene context. The so-called pseudo-ATCases (Figure 4) were found to be such an example of uncharacterized paralogues when we demonstrated they are unrelated to authentic ATCases (Figure 5 and Table 1). Using our conceptual approach, we have deduced and experimentally proved that the Rxyl_2847 protein, the pseudo-ATCase detected in the extremophile R. xylanophilus, is an UGTCase involved in the degradation of allantoin to ureidoglycine (Figure 6).
By analogy with steps observed in other species [37, 48, 49], we further suggest that this reaction is part of a new route of purine catabolism, where ureidoglycine is then degraded by the carbamate kinase Rxyl_2848 to produce carbamoyl-phosphate (Figure 6). Moreover, spontaneous degradation of ureidoglycine at 60°C (growth temperature of R. xylanophilus) would provide eventually glyoxylate (as well as ammonia and urea). We propose that Rxyl_2849, adjacent to Rxyl_2847 and Rxyl_2848, also plays a crucial role in this newly described pathway (Figure 6). In fact, Rxyl_2849 has been annotated as an asparaginase-like enzyme in the R. xylanophilus genome (RefSeq: NC_008148). In contrast, Gravenmade et al.  claimed that allantoate amidohydrolase isolated from Streptococcus allantoicus could hydrolyze allantoate to ureidoglycolate with the release of CO2 and NH3+. In this case, AAHase is expected to produce ureidoglycine, which is later transformed into ureidoglycolate by an ureidoglycine aminohydrolase. Thus, in R. xylanophilus, Rxyl_2849 may also catalyze this deamination of ureidoglycine to ureidoglycolate, as the logical step following the action of the UGTCase Rxyl_2847 (Figure 6). Although this prediction remains to be experimentally validated, it would introduce another category of reaction module. In that case, while substrates and products are chemically similar, the enzymes catalyzing analogous reactions could be evolutionarily unrelated. Consequently, automated detection of such modules with the currently available bioinformatic tools would be less simple than in the case of enzyme homology. However, it remains possible when using tools like ChemMine  to compare various substrates and products to detect similar chemical functions in alignable pathways.
Use of the reaction module concept should be of great value when studying mechanistically diversified superfamilies [7, 11, 13, 14, 46, 47]. Reaction modules could be viewed as the elementary bricks used to assemble functional modules [17, 18]. More generally, defining progressively these different elements will help to trace how the present pathways progressively diverged from ancestral pathways where the successive primordial enzymes  were promiscuous and gave birth to the present-day superfamilies.
Building a reference multiple sequence alignment (MSA) of superfamilies
We developed a two-step approach to obtain an MSA reflecting the structural and functional diversity of enzyme superfamilies. In a first step, we collected the limited set of homologues that have been both crystallized (published in the Protein Data Bank ) and experimentally studied, as indicated in UniProtKB/SwissProt .These sequences were multiply aligned using the Expresso update of the 3D-Coffee program  that has been benchmarked as optimal when sequence identity between target and template falls below 50% . The automated alignment was further improved by hand to define a seed MSA. In a second step, an HMM profile of this seed was created to screen UniProtKB using HMMsearch . This allows the identification of suitable (threshold of E-value = 10E-15) homologues that were further clustered using Cd-hit . For each cluster, an automated MSA was built with MUSCLE  and an HMM profile (HMM_cluster) was computed. In parallel, another HMM profile was computed for the closest homologous sequences present in the seed alignment (HMM_seed). Then, the two profiles, HMM_cluster and HMM_seed, were aligned using the HHalign program . A stepwise approach allows progressive addition of each aligned cluster to the seed alignment. To make this step-up more efficient and safer, we started with highly matching sequences (at least 70% identity), and the whole process was repeated while the identity threshold was progressively decreased 60, 55, 50, 45, and 40%. This allowed us to exclude a few unreliable distant sequences and to assort the individual tribes that are part of each aligned cluster.
A script was designed to detect the emergence of new homologues each time a new version of UniProtKB  was published. These presumptive homologues were assessed and added to the reference alignment using the HMM stepwise approach described above. Accordingly, we worked at any one time with a reliable reference MSA that was always up to date.
Reconstructing phylogenetic trees
Seed and reference MSA were used to derive phylogenetic trees with approximate maximum likelihood approaches (FastTree version 2.1 ). Robustness of the reconstructed tree topologies was assessed using a bootstrap approach or a much faster alternative, the approximate likelihood-ratio test (aLRT ). The trees obtained (written in Newick format) were visualized using MEGA 5.1  or Dendroscope 3.2.2  programs.
Functional annotation by monophyly
With the deluge of new genome sequences, phylogenetic trees contain more and more functionally unknown sequences branching together with a few experimentally characterized proteins. We used topological information of elementary subtrees to annotate uncharacterized leaves as follows. When two monophyletic subtrees, sharing a common ancestor, each contain at least one of their leaves with the same experimentally assessed functional annotation, then this function is transferred to their whole subtree, on the assumption that this shared feature comes from their common ancestor. If this is not the case, each monophyletic subtree is considered independently, tentatively divided in two more elementary subtrees and the analysis continued until the most distal subtrees coincide with leaves. Such a cautious approach prevents the introduction of damaging overinterpretation of functional proximity.
Chemical and structural comparison of potential substrates
We used ChemMine tools  to compare systematically potential substrates and products of putative enzymes. Their hierarchical clustering was calculated by all-against-all comparisons of chemically related compounds using atom pair similarity measures. We used the Tanimoto coefficient, which is defined as c/(a + b + c), where c is the number of features common in both compounds, while a and b are the number of features that are unique in one or the other compound, respectively. For each cluster, the similarity scores generated were transformed into distance values, allowing creation of a dendrogram and then a heatmap that highlights the hierarchical clustering of the analyzed compounds.
Cloning and heterologous expression of Rxyl_2847
Rubrobacter xylanophilus strain DSM9941 obtained from Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ) was grown aerobically at 60°C on a rotary shaker in complex medium (9 g of tryptic soy broth, 4 g of yeast extract, 3 g of NaCl, H2O to 1 L, adjusted to pH 7.5). Genomic DNA was extracted according to Magarvey et al.  and we used Pfu DNA polymerase (ThermoScientific) to amplify the open reading frame (ORF) Rxyl_2847 with primers ocej475 (5’- tttaactttaagaaggagatatacatacccatgcagaaagaggcggtaaggga -3’) and ocej476 (5’- atccgccaaaacagccaagctggagaccgtctaatgatgatgatgatgatgcgcccccacgatagcggcgac -3’). The PCR product was inserted into the pBAD vector (Life Technologies) by homologous recombination in E. coli MC1061  after growth on LB broth in the presence of 100 μg/mL ampicillin. The resulting pCEC53 plasmid was verified by sequencing (Genetic Service Facility, University of Antwerp, Belgium). For additional sequencing of the ORF Rxyl_2847, a PCR fragment extending from nucleotide 2853037 to nucleotide 2854149 of the published genome of R. xylanophilus DSM9941 (accession number GI:108764099) was generated with Pfu DNA polymerase and primers ocej483 (5’- ctcttcgagaaggcctgagaatag -3’) and ocej484 (5’- tcgtcctttatgagggagttgc- 3’). The PCR product was cloned subsequently with the CloneJet PCR cloning kit (ThermoScientific) and sequenced.
Production and purification of recombinant Rxyl_2847 protein
E. coli MC1061 transformed with the expression vector pCEC53 was grown at 37°C in two litres of LB broth supplemented with 100 μg/mL ampicillin. Expression of Rxyl_2847 was induced in mid-exponential phase by adding 0.2% arabinose, followed by overnight growth at 18°C. Cells were harvested by centrifugation, suspended in 50 ml 0.05 M potassium phosphate buffer pH 7.5, containing Complete EDTA-free protease inhibitor cocktail (Roche Applied Science) and disrupted by sonication (Ultrasonic Inc., W-225R). Insoluble particles were pelleted at 17,500 g for 30 min (Sorvall RC-6, SS34 rotor). The cell extract was heated at 60°C for 15 min and coagulated proteins were removed by centrifugation for 30 min at 17,500 g. The cleared lysate was submitted to ion metal affinity chromatography on Ni-nitrilotriacetic acid agarose resin (Qiagen) in 0.05 M potassium phosphate buffer, pH 7.5, containing 0.3 M NaCl. Elution was performed in three steps with increasing concentrations (0.1, 0.25, and 0.5 M) of imidazole. Fractions corresponding to the elution peak at 0.25 M imidazole were pooled and applied to a Superdex 200 (GE Healthcare, 16/70 column) gel filtration resin in 0.02 M Hepes buffer, pH 7.5, containing 0.15 M NaCl and 10% glycerol. Fractions containing the protein of interest were pooled and concentrated using Vivaspin 15R 30 kDa (Sartorius) membranes. The presence and purity of the recombinant enzyme was checked throughout the purification procedure by SDS-PAGE and its identity verified by Western blot. Western blot analysis was carried out as previously described , Rxyl_2847 enzyme was detected using PentaHis antibodies (Qiagen) and Amersham ECL Prime western blotting reagents (GE Healthcare).
Enzymatic synthesis of ureidoglycine
Ureidoglycine is not commercially available and was generated by enzymatic hydrolysis of allantoate, catalyzed by purified recombinant E. coli allantoate amidohydrolase as referred to in French and Ealick . Plasmid EcCD00311947 carrying the E. coli allC gene under the control of a T7 promoter was obtained from DNASU Plasmid Repository (The Biodesign Institute/ Arizona State University, USA). E. coli strain BL21(DE3) was transformed with this plasmid and grown in LB broth supplemented with 50 μg/mL kanamycin to allow expression of recombinant E. coli AAHase with a 6xHis tag fused to its N-terminus. Expression of AAHase was induced in mid-exponential growth phase by adding 1 mM IPTG, followed by growth for 4 h at 37°C. All purification steps were performed as described above except that the thermal treatment was omitted.
Enzyme activities were measured in 200-μl assay mixtures whose composition is detailed in the Results section. After incubation, the reaction was stopped by freezing on ice and the enzyme was removed from the reaction mixture by ultrafiltration on a Vivaspin 500 3 kDa (Sartorius) membrane. The products of the reaction were immediately analyzed by reverse phase HPLC after pre-column derivatization with o-phtaldialdehyde. The fluorescent derivatives of amino compounds were prepared according to Hill et al.  and analyzed by reverse phase HPLC on an Alltech Altima C18 5 μm column (150/4.6) as referred to in Jones et al.. Initial conditions were 75% solvent A (tetrahydrofuran - methanol - 0.05 M sodium acetate (1:19:80) pH 5.9), 25% solvent B (methanol - 0.05 M sodium acetate (80:20) pH 5.9). The gradient program (flow rate of 1 ml min-1) was as follows: 75% solvent A + 25% solvent B for 1 min from the initiation step of the program; linear step to 80% solvent B in 14 min; isocratic step at 80% solvent B for 3 min; linear step to 100% solvent B for 7 min; isocratic step to 100% solvent B for 5 min. One unit of activity is defined as the amount of enzyme that converts 1 μmol of substrate to product per min under the assay conditions. Specific activity is defined in units per mg protein and activity was considered as not detectable when less than 0.001 μmol per min per mg. Protein concentration was determined by measurement of the UV absorbance at 280 nm and by the Bradford method, with bovine serum albumin as the standard.
Multiple sequence alignment
Hidden Markov model.
We are very indebted to Barry Holland for his invaluable help in critically reading our manuscript and improving the English. We thank M. Demarez for technical assistance. This work was funded by the CNRS (UMR 8621) and by the Commission Communautaire Française (Belgium). It is part of the PhD thesis of Matthieu Barba who was supported by a doctoral grant from the French Ministère de la Recherche. Publication fees were supported by Meurice R&D asbl (http://www.meurice.org/).
- Schmidt S, Sunyaev S, Bork P, Dandekar T: Metabolites: a helping hand for pathway evolution?. Trends Biochem Sci. 2003, 28: 336-341. 10.1016/S0968-0004(03)00114-2.PubMedView ArticleGoogle Scholar
- Alves R, Chaleil RAG, Sternberg MJE: Evolution of enzymes in metabolism: a network perspective. J Mol Biol. 2002, 320: 751-770. 10.1016/S0022-2836(02)00546-6.PubMedView ArticleGoogle Scholar
- Fani R, Fondi M: Origin and evolution of metabolic pathways. Phys Life Rev. 2009, 6: 23-52. 10.1016/j.plrev.2008.12.003.PubMedView ArticleGoogle Scholar
- Khersonsky O, Tawfik DS: Enzyme promiscuity: a mechanistic and evolutionary perspective. Annu Rev Biochem. 2010, 79: 471-505. 10.1146/annurev-biochem-030409-143718.PubMedView ArticleGoogle Scholar
- Ohno S: Evolution by Gene duplication. 1970, New-York: Springer-VerlagView ArticleGoogle Scholar
- Hughes AL: Gene duplication and the origin of novel proteins. Proc Natl Acad Sci U S A. 2005, 102: 8791-8792. 10.1073/pnas.0503922102.PubMedPubMed CentralView ArticleGoogle Scholar
- Glasner ME, Gerlt JA, Babbitt PC: Evolution of enzyme superfamilies. Curr Opin Chem Biol. 2006, 10: 492-497. 10.1016/j.cbpa.2006.08.012.PubMedView ArticleGoogle Scholar
- Jensen RA: Enzyme recruitment in evolution of new function. Annu Rev Microbiol. 1976, 30: 409-425. 10.1146/annurev.mi.30.100176.002205.PubMedView ArticleGoogle Scholar
- O’Brien PJ, Herschlag D: Catalytic promiscuity and the evolution of new enzymatic activities. Chem Biol. 1999, 6: R91-R105.PubMedView ArticleGoogle Scholar
- Khersonsky O, Roodveldt C, Tawfik DS: Enzyme promiscuity: evolutionary and mechanistic aspects. Curr Opin Chem Biol. 2006, 10: 498-508. 10.1016/j.cbpa.2006.08.011.PubMedView ArticleGoogle Scholar
- Khersonsky O, Malitsky S, Rogachev I, Tawfik DS: Role of chemistry versus substrate binding in recruiting promiscuous enzyme functions. Biochemistry. 2011, 50: 2683-2690. 10.1021/bi101763c.PubMedView ArticleGoogle Scholar
- Barba M, Glansdorff N, Labedan B: Evolution of cyclic amidohydrolases: a highly diversified superfamily. J Mol Evol. 2013, 77: 70-80. 10.1007/s00239-013-9580-1.PubMedView ArticleGoogle Scholar
- Seibert CM, Raushel FM: Structural and catalytic diversity within the amidohydrolase superfamily. Biochemistry. 2005, 44: 6383-6391. 10.1021/bi047326v.PubMedView ArticleGoogle Scholar
- Nam SH, Park HS, Kim HS: Evolutionary relationship and application of a superfamily of cyclic amidohydrolase enzymes. Chem Rec. 2005, 5: 298-307. 10.1002/tcr.20057.PubMedView ArticleGoogle Scholar
- Lohkamp B, Andersen B, Piskur J, Dobritzsch D: The crystal structures of dihydropyrimidinases reaffirm the close relationship between cyclic amidohydrolases and explain their substrate specificity. J Biol Chem. 2006, 281: 13762-13776. 10.1074/jbc.M513266200.PubMedView ArticleGoogle Scholar
- Kim GJ, Lee DE, Kim HS: Functional expression and characterization of the two cyclic amidohydrolase enzymes, allantoinase and a novel phenylhydantoinase, from Escherichia coli. J Bacteriol. 2000, 182: 7021-7028. 10.1128/JB.182.24.7021-7028.2000.PubMedPubMed CentralView ArticleGoogle Scholar
- Hartwell LH, Hopfield JJ, Leibler S, Murray AW: From molecular to modular cell biology. Nature. 1999, 402: C47-C52. 10.1038/35011540.PubMedView ArticleGoogle Scholar
- Ravasz E, Somera AL, Mongru DA, Oltvai ZN, Barabási AL: Hierarchical organization of modularity in metabolic networks. Science. 2002, 297: 1551-1555. 10.1126/science.1073374.PubMedView ArticleGoogle Scholar
- The UniProt Consortium: Ongoing and future developments at the universal protein resource. Nucleic Acids Res. 2011, 39: D214-D219.http://www.uniprot.org/,PubMed CentralView ArticleGoogle Scholar
- Björnberg O, Rowland P, Larsen S, Jensen KF: Active site of dihydroorotate dehydrogenase a from lactococcus lactis investigated by chemical modification and mutagenesis. Biochemistry. 1997, 36: 16197-16205. 10.1021/bi971628y.PubMedView ArticleGoogle Scholar
- Sørensen G, Dandanell G: A new type of dihydroorotate dehydrogenase, type 1S, from the thermoacidophilic archaeon Sulfolobus solfataricus. Extremophiles. 2002, 6: 245-251. 10.1007/s00792-001-0249-0.PubMedView ArticleGoogle Scholar
- Hidese R, Mihara H, Kurihara T, Esaki N: Escherichia coli dihydropyrimidine dehydrogenase is a novel NAD-dependent heterotetramer essential for the production of 5,6-dihydrouracil. J Bacteriol. 2011, 193: 989-993. 10.1128/JB.01178-10.PubMedPubMed CentralView ArticleGoogle Scholar
- Franceschini A, Szklarczyk D, Frankild S, Kuhn M, Simonovic M, Roth A, Lin J, Minguez P, Bork P, von Mering C, Jensen LJ: STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res. 2013, 41: D808-D815. 10.1093/nar/gks1094.http://string-db.org/, 10.1093/nar/gks1094PubMedPubMed CentralView ArticleGoogle Scholar
- Xi H, Schneider BL, Reitzer L: Purine catabolism in Escherichia coli and function of xanthine dehydrogenase in purine salvage. J Bacteriol. 2000, 182: 5332-5341. 10.1128/JB.182.19.5332-5341.2000.PubMedPubMed CentralView ArticleGoogle Scholar
- Labedan B, Boyen A, Baetens M, Charlier D, Chen P, Cunin R, Durbeco V, Glansdorff N, Herve G, Legrain C, Liang Z, Purcarea C, Roovers M, Sanchez R, Toong TL, Van de Casteele M, van Vliet F, Xu Y, Zhang YF: The evolutionary history of carbamoyltransferases: a complex set of paralogous genes was already present in the last universal common ancestor. J Mol Evol. 1999, 49: 461-473. 10.1007/PL00006569.PubMedView ArticleGoogle Scholar
- Labedan B, Xu Y, Naumoff DG, Glansdorff N: Using quaternary structures to assess the evolutionary history of proteins: the case of the aspartate carbamoyltransferase. Mol Biol Evol. 2004, 21: 364-373.PubMedView ArticleGoogle Scholar
- Naumoff DG, Xu Y, Glansdorff N, Labedan B: Retrieving sequences of enzymes experimentally characterized but erroneously annotated: the case of the putrescine carbamoyltransferase. BMC Genomics. 2004, 5: 52-10.1186/1471-2164-5-52.PubMedPubMed CentralView ArticleGoogle Scholar
- Chen J, Cheng C, Xia Y, Zhao H, Fang C, Shan Y, Wu B, Fang W: Lmo0036, An ornithine and putrescine carbamoyltransferase in Listeria monocytogenes, participates in arginine deiminase and agmatine deiminase pathways and mediates acid tolerance. Microbiology. 2011, 157: 3150-3161. 10.1099/mic.0.049619-0.PubMedView ArticleGoogle Scholar
- Polo LM, Gil-Ortiz F, Cantín A, Rubio V: New insight into the transcarbamylase family: the structure of putrescine transcarbamylase, a key catalyst for fermentative utilization of agmatine. PLoS One. 2012, 7: e31528-10.1371/journal.pone.0031528.PubMedPubMed CentralView ArticleGoogle Scholar
- Shi D, Yu X, Zhao G, Ho J, Lu S, Allewell NM, Tuchman M: Crystal structure and biochemical properties of putrescine carbamoyltransferase from Enterococcus faecalis: assembly, active site, and allosteric regulation. Proteins. 2012, 80: 1436-1447. 10.1002/prot.24042.PubMedPubMed CentralView ArticleGoogle Scholar
- Li Y, Jin Z, Yu X, Allewell NM, Tuchman M, Shi D: The ygeW encoded protein from Escherichia coli is a knotted ancestral catabolic transcarbamylase. Proteins. 2011, 79: 2327-2334. 10.1002/prot.23043.PubMedPubMed CentralView ArticleGoogle Scholar
- Shi D, Yu X, Roth L, Morizono H, Tuchman M, Allewell NM: Structures of N-acetylornithine transcarbamoylase from xanthomonas campestris complexed with substrates and substrate analogs imply mechanisms for substrate binding and catalysis. Proteins. 2006, 64: 532-542. 10.1002/prot.21013.PubMedView ArticleGoogle Scholar
- Shi D, Morizono H, Cabrera-Luque J, Yu X, Roth L, Malamy MH, Allewell NM, Tuchman M: Structure and catalytic mechanism of a novel N-succinyl-L-ornithine transcarbamylase in arginine biosynthesis of Bacteroides fragilis. J Biol Chem. 2006, 281: 20623-20631. 10.1074/jbc.M601229200.PubMedView ArticleGoogle Scholar
- Martínez-Rodríguez S, Clemente-Jiménez JM, Rodríguez-Vico F, Las Heras-Vázquez FJ: Molecular cloning and biochemical characterization of L-N-carbamoylase from sinorhizobium meliloti CECT4114. J Mol Microbiol Biotechnol. 2005, 9: 16-25. 10.1159/000088142.PubMedView ArticleGoogle Scholar
- Cusa E, Obradors N, Baldoma L, Badia J, Aguilar J: Genetic analysis of a chromosomal region containing genes required for assimilation of allantoin nitrogen and linked glyoxylate metabolism in Escherichia coli. J Bacteriol. 1999, 181: 7479-7484.PubMedPubMed CentralGoogle Scholar
- Schultz AC, Nygaard P, Saxild HH: Functional analysis of 14 genes that consitute the purine catabolic pathway in bacillus subtilis and evidence for a novel regulon controlled by the PucR transcription activator. J Bacteriol. 2001, 183: 3293-3302. 10.1128/JB.183.11.3293-3302.2001.PubMedPubMed CentralView ArticleGoogle Scholar
- Vogels GD, Van der Drift C: Degradation of purines and pyrimidines by microorganisms. Bacteriol Rev. 1976, 40: 403-468.PubMedPubMed CentralGoogle Scholar
- Yoo HS, Genbauffe FS, Cooper TG: Identification of the ureidoglycolate hydrolase gene in the DAL gene cluster of Saccharomyces cerevisiae. Mol Cell Biol. 1985, 5: 2279-2288.PubMedPubMed CentralView ArticleGoogle Scholar
- Serventi F, Ramazzina I, Lamberto I, Puggioni V, Gatti R, Percudani R: Chemical basis of nitrogen recovery through the ureide pathway: formation and hydrolysis of S-ureidoglycine in plants and bacteria. ACS Chem Biol. 2010, 5: 203-214. 10.1021/cb900248n.PubMedView ArticleGoogle Scholar
- Backman TWH, Cao Y, Girke T: ChemMine tools: an online service for analyzing and clustering small molecules. Nucleic Acids Res. 2011, 39: W486-W491. 10.1093/nar/gkr320.http://chemmine.ucr.edu/, 10.1093/nar/gkr320PubMedPubMed CentralView ArticleGoogle Scholar
- Legrain C, Villeret V, Roovers M, Gigot D, Dideberg O, Pierard A, Glansdorff N: Biochemical characterization of ornithine carbamoyltransferase from pyrococcus furiosus. Eur J Biochem. 1997, 247: 1046-1055. 10.1111/j.1432-1033.1997.01046.x.PubMedView ArticleGoogle Scholar
- Dutoit R, De Ruyck J, Durisotti V, Legrain C, Jacobs E, Wouters J: Overexpression, physicochemical characterization, and modeling of a hyperthermophilic pyrococcus furiosus type 2 IPP isomerase. Proteins. 2008, 71: 1699-1707.PubMedView ArticleGoogle Scholar
- Legrain C, Demarez M, Glansdorff N, Piérard A: Ammonia-dependent synthesis and metabolic channelling of carbamoyl phosphate in the hyperthermophilic archaeon pyrococcus furiosus. Microbiology. 1995, 141: 1093-1099. 10.1099/13500872-141-5-1093.View ArticleGoogle Scholar
- Legrain C, Stalon V: Ornithine carbamoyltransferase from Escherichia coli W. Purification, structure and steady-state kinetic analysis. Eur J Biochem. 1976, 63: 289-301. 10.1111/j.1432-1033.1976.tb10230.x.PubMedView ArticleGoogle Scholar
- Ferreira AC, Nobre MF, Moore E, Rainey FA, Battista JR, da Costa MS: Characterization and radiation resistance of new isolates of rubrobacter radiotolerans and rubrobacter xylanophilus. Extremophiles. 1999, 3: 235-238. 10.1007/s007920050121.PubMedView ArticleGoogle Scholar
- Furnham N, Garavelli JS, Apweiler R, Thornton JM: Missing in action: enzyme functional annotations in biological databases. Nat Chem Biol. 2009, 5: 521-525. 10.1038/nchembio0809-521.PubMedView ArticleGoogle Scholar
- Schnoes AM, Brown SD, Dodevski I, Babbitt PC: Annotation error in public databases: misannotation of molecular function in enzyme superfamilies. PLoS Comput Biol. 2009, 5: e1000605-10.1371/journal.pcbi.1000605.PubMedPubMed CentralView ArticleGoogle Scholar
- French JB, Ealick SE: Biochemical and structural characterization of a ureidoglycine aminotransferase in the Klebsiella pneumoniae uric acid catabolic pathway. Biochemistry. 2010, 49: 5975-5977. 10.1021/bi1006755.PubMedPubMed CentralView ArticleGoogle Scholar
- Ramazzina I, Costa R, Cendron L, Berni R, Peracchi A, Zanotti G, Percudani R: An aminotransferase branch point connects purine catabolism to amino acid recycling. Nat Chem Biol. 2010, 6: 801-806. 10.1038/nchembio.445.PubMedView ArticleGoogle Scholar
- Gravenmade EJ, Vogels GD, der Drift CV: Hydrolysis, racemization and absolute configuration of ureidoglycolate, a substrate of allantoicase. Biochim Biophys Acta. 1970, 198: 569-582. 10.1016/0005-2744(70)90134-8.PubMedView ArticleGoogle Scholar
- Berman HM, Henrick K, Kleywegt G, Nakamura H, Markley J: The worldwide protein data bank. International Tables for Crystallography. 2012, Vol F ch.24.1: 827-832.View ArticleGoogle Scholar
- Armougom F, Moretti S, Poirot O, Audic S, Dumas P, Schaeli B, Keduas V, Notredame C: Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee. Nucleic Acids Res. 2006, 34: W604-W608. 10.1093/nar/gkl092.PubMedPubMed CentralView ArticleGoogle Scholar
- Dalton JAR, Jackson RM: An evaluation of automated homology modelling methods at low target template sequence similarity. Bioinformatics. 2007, 23: 1901-1908. 10.1093/bioinformatics/btm262.PubMedView ArticleGoogle Scholar
- Eddy SR: Profile hidden Markov models. Bioinformatics. 1998, 14: 755-763. 10.1093/bioinformatics/14.9.755.PubMedView ArticleGoogle Scholar
- Li W, Godzik A: Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006, 22: 1658-1659. 10.1093/bioinformatics/btl158.http://weizhong-lab.ucsd.edu/cd-hit/, 10.1093/bioinformatics/btl158PubMedView ArticleGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.http://www.drive5.com/muscle/, 10.1093/nar/gkh340PubMedPubMed CentralView ArticleGoogle Scholar
- Söding J: Protein homology detection by HMM-HMM comparison. Bioinformatics. 2005, 21: 951-960. 10.1093/bioinformatics/bti125.PubMedView ArticleGoogle Scholar
- Price MN, Dehal PS, Arkin AP: FastTree 2 – approximately maximum-likelihood trees for large alignments. PLoS One. 2010, 5: e9490-10.1371/journal.pone.0009490.https://wiki.gacrc.uga.edu/wiki/Fasttree, 10.1371/journal.pone.0009490PubMedPubMed CentralView ArticleGoogle Scholar
- Anisimova M, Gascuel O: Approximate likelihood-ratio test for branches: a fast, accurate, and powerful alternative. Syst Biol. 2006, 55: 539-552. 10.1080/10635150600755453.http://www.atgc-montpellier.fr/phyml/paper.php, 10.1080/10635150600755453PubMedView ArticleGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.http://www.megasoftware.net/, 10.1093/molbev/msr121PubMedPubMed CentralView ArticleGoogle Scholar
- Huson DH, Richter DC, Rausch C, Dezulian T, Franz M, Rupp R: Dendroscope: an interactive viewer for large phylogenetic trees. BMC Bioinformatics. 2007, 8: 460-10.1186/1471-2105-8-460.http://ab.inf.uni-tuebingen.de/software/dendroscope/, 10.1186/1471-2105-8-460PubMedPubMed CentralView ArticleGoogle Scholar
- Magarvey NA, Keller JM, Bernan V, Dworkin M, Sherman DH: Isolation and characterization of novel marine-derived actinomycete taxa rich in bioactive metabolites. Appl Environ Microbiol. 2004, 70: 7520-7529. 10.1128/AEM.70.12.7520-7529.2004.PubMedPubMed CentralView ArticleGoogle Scholar
- Casadaban MJ, Cohen SN: Analysis of gene control signals by DNA fusion and cloning in Escherichia coli. J Mol Biol. 1980, 138: 179-207. 10.1016/0022-2836(80)90283-1.PubMedView ArticleGoogle Scholar
- Hill DW, Walters FH, Wilson TD, Stuart JD: High performance liquid chromatographic determination of amino acids in the picomole range. Anal Chem. 1979, 51: 1338-1341. 10.1021/ac50044a055.PubMedView ArticleGoogle Scholar
- Jones BN, Pääbo S, Stein S: Amino acid analysis and enzymatic sequence determination of peptides by an improved o-phthaldialdehyde precolumn labeling procedure. J Liq Chromatogr. 1981, 4: 565-586. 10.1080/01483918108059956.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.