In silico pathway reconstruction: Iron-sulfur cluster biogenesis in Saccharomyces cerevisiae
© Alves and Sorribas; licensee BioMed Central Ltd. 2007
Received: 25 September 2006
Accepted: 31 January 2007
Published: 31 January 2007
Current advances in genomics, proteomics and other areas of molecular biology make the identification and reconstruction of novel pathways an emerging area of great interest. One such class of pathways is involved in the biogenesis of Iron-Sulfur Clusters (ISC).
Our goal is the development of a new approach based on the use and combination of mathematical, theoretical and computational methods to identify the topology of a target network. In this approach, mathematical models play a central role for the evaluation of the alternative network structures that arise from literature data-mining, phylogenetic profiling, structural methods, and human curation. As a test case, we reconstruct the topology of the reaction and regulatory network for the mitochondrial ISC biogenesis pathway in S. cerevisiae. Predictions regarding how proteins act in ISC biogenesis are validated by comparison with published experimental results. For example, the predicted role of Arh1 and Yah1 and some of the interactions we predict for Grx5 both matches experimental evidence. A putative role for frataxin in directly regulating mitochondrial iron import is discarded from our analysis, which agrees with also published experimental results. Additionally, we propose a number of experiments for testing other predictions and further improve the identification of the network structure.
We propose and apply an iterative in silico procedure for predictive reconstruction of the network topology of metabolic pathways. The procedure combines structural bioinformatics tools and mathematical modeling techniques that allow the reconstruction of biochemical networks. Using the Iron Sulfur cluster biogenesis in S. cerevisiae as a test case we indicate how this procedure can be used to analyze and validate the network model against experimental results. Critical evaluation of the obtained results through this procedure allows devising new wet lab experiments to confirm its predictions or provide alternative explanations for further improving the models.
Increasing amounts of data that can be mined for information about how proteins in cells assemble as metabolic pathways, signal transduction pathways, and gene circuits, are generated each day. Datasets available for such tasks include the primary literature, large scale micro array experiments, whole genome two hybrid screenings, full genome sequences, and the patterns of conserved/non-conserved homologues and orthologues in them. Theoretical and computational methods are being developed and used to analyze these different types of data and infer networks of proteins or genes that are involved in the same cellular process(es) (e.g. [1–10]).
In general, the networks derived by the computational analysis of these data are static, in the sense that they provide little information, if any, about the flow of causality and events in the process and no information about the dynamics of the processes and its regulation (however, see ). For example, the involvement of proteins X, Y and Z in a process does not elucidate if X catalyzes a reaction that produces a substrate for another reaction catalyzed by Z or by Y, or if X modulates Y or Z activity. This can be an important problem while assembling the network structure of either novel pathways (e.g. Iron-Sulfur Cluster biogenesis) or complex pathways with an unclear reaction and regulation network, (e. g. cell cycle). Thus, it is a challenge to transform the network of interactions inferred from the analysis of static data into a causal network that allows for the creation of mathematical models whose dynamic behavior can be analyzed and tested against experimental observations.
To achieve such a goal, strategies that combine the different theoretical and computational methods to identify proteins and generate a set of plausible alternative network topologies for the process of interest are needed. Such networks can then be translated into mathematical models whose dynamic behavior can be analyzed and compared to that of the real system, thus discriminating against some of the proposed topologies when they do not reproduce the expected behavior. Such an analytical process integrates omics data and provides testable predictions and information about systemic behavior.
The more than likely absence of known mechanistic and kinetic data for each of the individual proteins in a novel pathway hinders the process of translating network topology into a mathematical model. A way around the problem is by using approximation theory . This well-established methodology approximates the continuous functions that typically describe the kinetics of protein processes by using, for example, truncated Taylor series, either in linear or non-linear spaces (see e.g. [13–19]). Among the non-linear approximations, the power-law formalism provides a useful representation that comes associated with powerful and eclectic analytical methods (see e.g. [20–24]).
In this paper, we shall focus on defining and applying a global strategy combining bioinformatics tools and mathematical modeling to reconstruct the network structure of a pathway. Computational tools will be used for a) obtaining relevant information on genes and proteins that are identified as playing a role in the target pathway, b) checking putative interactions between proteins, c) testing the co-evolution of different proteins, and d) for setting-up alternative networks that accommodate all this information. Then, expert knowledge is used to curate the set of alternative network structures. Finally, mathematical models are used to explore the systemic behavior of each alternative network and comparing it with existing experimental data.
As a benchmark problem we shall focus on the Iron-Sulfur Cluster (ISC) biogenesis pathway. ISC are widespread cofactors of proteins that work as catalytic mediators, as electron transport mediators, and as sensors for the oxidation state of the cell and of its environment [25–32]. Although ISC have been known to assemble autonomously in proteins, in recent years, an evolutionarily conserved set of proteins that controls this assembly has been identified [29, 33, 34]. In eukaryotes, initial ISC biogenesis is mitochondrial . Deregulation of ISC biogenesis in humans can create different pathological effects, leading to diseases such as Friedreich's ataxia, X-linked sideroblastic anemia, or hypochromic anemia. In yeast, deleting one ISC biogenesis gene creates cells that accumulate iron and have a decrease/deregulation in the activity of ISC dependent proteins. The extent of the phenotype ranges from mild (e.g. ΔGRX5 strains ) to lethal (e.g. ΔARH1 strains ), depending on the protein that is mutated. Friedreich's ataxia is linked to mutations in one of the ISC biogenesis proteins (Frataxin) which has as a homologous protein in yeast the proteinYfh1 (Yeast Frataxin Homologue 1). Additionally, iron accumulation can lead to cellular aging and its associated diseases.
Proteins involved in ISC synthesis in Saccharomyces cerevisiae.
Type of evidence for Protein Function
Suggested Function In ISC Assembly
Type of evidence for suggested systemic role
Homology to known ferredoxin/adrenodoxin reductases
Reduces Yah1/Provides electrons for ISC assembly/transfer/repair
Homology, Physiological, Genetic
Homology to known ferredoxin/adrenodoxin
Provides electrons for ISC assembly/transfer/repair
Homology, Biochemical; Structural
Stores/Provides Fe directly to ISC assembly and Heme synthesis
Physiological, Structural, Genetic
Regulates glutathionylation state of protein cysteinyl residues
As dimer scaffolds initial ISC assembly between two monomers and then transfers it to apo-proteins
Biochemical, Spectroscopic, Direct transfer observation
As dimer scaffolds initial ISC assembly between two monomers and then transfers it to apo-proteins
Biochemical, Spectroscopic, Direct transfer observation
As dimer scaffolds initial ISC assembly between two monomers and then transfers it to apo-proteins
Biochemical, Spectroscopic, Direct transfer observation
As dimer scaffolds initial ISC assembly between two monomers and then transfers it to apo-proteins
Biochemical, Spectroscopic, Direct transfer observation
As dimer scaffolds initial ISC assembly between two monomers and then transfers it to apo-proteins
Homology, Biochemical, Genetic, Structural
Assists in proper folding of ISC biosynthetic proteins, namely Yfh1 and Isa-Isu proteins/Assists in maintaining ISC assembled in scaffold dimer for proper transfer
Hsp40 Co- chaperone
Homology, Biochemical, Genetic, Structural
Assists Ssq1 in interacting with Isu/Isa proteins
Biochemical, Genetic, Structural
Co- chaperone/Nucleotide exchange factor
Homology, Biochemical, Genetic, Structural
Biochemical, Genetic, Structural
Provides sulphur for ISC assembly in scaffold dimers or in situ ISC assembly/repair
Physiological, Biochemical, Genetics
Involved in ISC export for cytoplasm and nuclear proteins
Fundamental for Nfs1 action
As mentioned in the previous paragraph, there is enough information to attribute a function to some of the proteins involved in ISC biogenesis in S. cerevisiae. This is the case for example of Nfs1, Isu1, Isu2, or Atm1. However, the role of other proteins is still not clear. For example, what do Isa1, Isa2 or Nfu1 do in the process? What is the role of the chaperones Ssq1-Jac1-Mge1 or of Grx5 in ISC biogenesis? Thus ISC biogenesis is a good benchmark problem for the application of the methodology we describe, as it will provide the chance to validate some of the prediction with published experimental results. Simultaneously, the methodology will generate biological insight regarding some of the proteins with an unclear role, thus creating an added value from the methodological and from the biological point of view. In previous papers [34, 44, 45] we combined structural bioinformatics with experiments and kinetic modeling to investigate the possible role of proteins Arh1, Yah1 and Grx5 in mitochondrial ISC biogenesis. In this paper we present a structured computational approach that is used to infer and analyze probable topologies for the global network of mitochondrial ISC biogenesis. We analyze seven of the proteins involved in the process (Arh1, Yah1, Yfh1, Grx5, Nfs1, Ssq1 and Jac1), proposing likely systemic roles for their action in ISC biogenesis.
The proteins that are known to be involved in ISC mitochondrial biogenesis in S. cerevisiae are shown in Table 1. We shall first describe how the different large scale datasets are analyzed and combined, using different computational tools in order to infer initial alternatives for the network assembly of the pathway. Then, we use mathematical modeling strategies to further analyze the different alternative and identify the most reliable network structure, based on comparing the dynamic behavior of the models to experimental observations.
Pathway Reconstruction using automated literature analysis
Pathway Reconstruction using Phylogenetic profiling
It is not possible to infer probable network structure(s) for the ISC biogenesis pathway just from the co-occurrence of genes in papers. Additional data is necessary in order to translate the group of proteins that are identified to be involved in the biological process of interest into a structured network of causal and regulatory relationships. Phylogenetic profiling [52, 53] of the genes assists in adding some structure to the network. In this type of analysis, one searches for patterns of co-occurrence of sets of orthologues over a large number of fully sequenced genomes. The assumption behind the use of this approach is that, if the pattern of presence and absence of two or more proteins in a number of genomes is very close, this is an indication of co-evolution among the proteins. Such co-evolution can be taken as an indication that they are likely to be involved in the same cellular process(es).
Results from phylogenetic co-evolution analysis of the different genes involved in ISC biogenesis are summarized in Figure 1. This additional information already assists in inferring some form of causality to the network structure of the pathway. For example, Grx5 homologues coincide with Isa1 and Isa2 homologues in more genomes than with any other of the identified proteins. This suggests that Grx5 is more likely to act directly on these proteins than with other proteins that are also involved in ISC biogenesis. As can be seen in Table 1, Grx5 is a glutaredoxin that is likely to regulate (de)glutathionylation of protein cysteine residues. The phylogenetic analysis suggests that the scaffold proteins are likely targets for that regulation. Similarly, Atm1 homologues coincide with Isu1 and Isu2 homologues in more genomes than with homologues of other ISC biogenesis proteins. Because Atm1 is a membrane transporter, this result suggests that Isu1 and Isu2 are more likely to be involved in ISC transfer to the cytoplasm than other scaffold proteins. Sqq1, Jac1 and Mge1 are a chaperone, a co-chaperone, and a nucleotide exchange factor respectively. Homologues of the three proteins are present together in many genomes. Finally, Yah1 and Yfh1 homologues coincide in a high number of genomes, suggesting a common role for these two proteins.
Pathway Reconstruction using low definition protein docking and in vitro protein interaction data
Alternatives for a partial network structure of the ISC biogenesis pathway have emerged from the phylogenetic analysis in the previous section. Additional information regarding this structure can be obtained through the use of protein docking methods [54–60].
Using protein docking to determine how proteins assemble in a pathway is not a standard procedure and may present various types of problems. First, proteins in the same pathway need not physically interact. Products from an early step may be released into solution and used by a protein involved in a later step of the process. However, this problem is not relevant in the current case. Because ISC are labile, a direct physical interaction between proteins is required for synthesis and transfer of the clusters. Second, protein modeling and/or docking is computationally very expensive. However, in the current case a limited number of proteins are involved in the target pathway. Thus, this problem is reduced to manageable levels. Third, docking any two proteins will always give a result. For example, consider proteins A and B that co-occur in the same cellular compartment and are involved in the same process, while protein C is located in another cellular compartment and has no role in the process of interest. If we dock protein A to protein B and then dock protein A to protein C, the score of the later docking may be much larger than that of the former. Choosing A and C as the preferred docking partners would be an erroneous result because A and C never meet in vivo. This problem is avoided in the present case because the proteins we are docking co-exist in the same cellular compartment and are involved in the same pathway. Fourth, docking a large aggregate of proteins that works together to form a pathway is still beyond the scope of protein docking. This is an important problem but we have made the assumption that, because physical interaction between proteins that catalyze a step is required, the individual docking results would be indicative of actual physical interactions.
In silico protein docking methods require protein structures. Of the proteins involved in ISC biogenesis in S. cerevisiae only Yfh1 has had its structure experimentally determined . We built models for the structures of the remaining proteins. These models were obtained by homology modeling as described in the methods section.
Extending our previous work with this system [34, 44, 45], we use GRAMM [54–60] and Hex  to perform all against all in silico protein docking studies. The GRAMM methodology is specially suited to dock protein models because it averages over possible errors in the conformation that are unquestionably larger than in the case of crystallographic structures [55, 63, 64]. Figure 1 shows the four highest scores of interaction for each of the relevant genes. Several possibilities for the structure of the network can be inferred from these results. On one hand, both Nfs1 and Yfh1 show stronger docking scores with the four scaffold proteins. This suggests that both proteins act in the processes that assemble the cluster in the scaffolds. Arh1 and Yah1 are likely to be a pair that acts together, given that their mutual docking score is high. Grx5 docking scores are higher for Nfs1 and Arh1, suggesting that the later proteins are targets for the regulation of cysteine (de)glutathionylation by Grx5.
We have complemented the network of interactions inferred from the docking experiments with the interactions reported in two-hybrid screenings of yeast proteins from the published literature [34, 65–69]. Within these data sets, we have only found evidence for interaction between Nfs1 and Isu1, Nfs1 and Isu2, Isa1 and Isa2, and between Isa1 and Grx5. The Nfs1-scaffold interactions are predicted by protein docking, thus suggesting that using docking methods to resolve pathway structure might be an adequate method, under similar circumstances. The Grx5-Isa1 interaction is consistent with the phylogenetic analysis reported in a previous section.
Generating a curated pathway model using expert knowledge
The previous analysis suggests a skeletal structure for the ISC Biogenesis pathway. However, this structure needs to be further refined before it can be analyzed using mathematical modeling. A more detailed topology requires a deeper analysis of the data available in the literature. Additionally, non-genetic regulatory influences to the pathway are hard to identify automatically, and one must include such information using expert knowledge. At this stage, human curation is needed to turn the automated analysis of the previous sections into a network of reactions and to add further detail to the reaction topology and regulation. This curation is described in detail in the Supplementary Appendix [see Additional File 1].
Alternative Mathematical Models
We use the power-law formalism in its Generalized Mass (GMA) Action form for building-up a mathematical model that can accommodate the different alternatives in Figure 2. Details on the kinetic equations for our models, which are automatically generated from the network of reactions, are given in the supplementary appendix. The power-law description in GMA form of the network from Figure 2 is given in SBML format as supplementary material (see appendix for complete list of reactions and technical details of the approximation and parameter scanning procedure).
Normalization and parameter scanning
A power law model has two types of parameters. Multiplicative parameters are analogous to rate constants (γi). They can be appropriately normalized and roughly determine the time scale of the processes. Apparent kinetic order f ij measures the direct influence (sensitivity) of metabolite j on flux i. Their values are typically smaller than the number of binding sites for the metabolite in the reaction, in absolute value [20, 72]. Kinetic orders are positive if the flux is activated by the metabolite, negative if the flux is inhibited by the metabolite and zero if the considered metabolite has no direct influence upon the flux. The alternative networks we consider in order to evaluate the role of different proteins on ISC biogenesis can be derived as special cases of a general model that incorporates all the possible reactions and regulatory effects. Each alternative network is generated by setting to zero specific kinetic orders corresponding to proteins that do not directly modulate the flux of a given process in a specific network alternative. For example, when we consider that Grx5 may act upon recovery of Nfs1 activity by deglutathionylation, the kinetic order for Grx5 in the deglutathionylation flux is positive; otherwise this kinetic order is zero.
To analyze the behavior of each alternative network, we scan the relevant parameter values and evaluate the resulting dynamic behavior of the corresponding mathematical model. Because values for the kinetic orders have a limited range of permissible values, after normalization of the rate constants, one can do a fairly comprehensive scan of the parameter values for the alternative networks and characterize the most relevant behaviors associated with each topology.
1- The gene of interest is knocked out from the yeast genome and replaced with the same gene in a plasmid with expression controlled by either a repressible or an inducible promoter.
2- The phenotype of the mutated strain is compared to the wild type. This is done under conditions when the mutated strain synthesizes the protein of interest and repeated under conditions where the synthesis of that specific protein is shut off.
The experiments described in the previous paragraph can be reproduced in the following way, using our in silico models:
1- Introduce a sink flux for the protein of interest with the purpose of mimicking the shutting off of gene expression. For example, consider Grx5. When we are scanning to predict the role of Grx5, we define an auxiliary first-order reaction for Grx5 consumption. By changing the value of the rate-constant for this auxiliary reaction, we cause the amount of Grx5 to decrease as the simulation time increases. This reproduces the effect of the dilution and destruction of the protein in strains in which the gene has been knocked out .
2- Scan the parameter values and compute the resulting activity of ISC dependent enzymes, represented by P1 and P2 levels, and the mitochondrial iron levels. Rate constants are scanned for 3 orders of magnitude about each side of their normalized values. Details on the exact values for the scanning parameters are given in the supplementary appendix.
3- Save the simulation data. Plot it as is done in Figure 3. Verify if the concentration of Fe increases and the concentration of holo P1, holo P2, and holo Yah1 decreases upon the knock out of a specific gene, as observed experimentally.
As a result of this strategy, we obtain for each of the alternative networks a qualitative picture of the possible dynamic response regimes that the network may have. Following this methodology, the analysis of the in silico dynamic behavior of each network can have the following alternative outcomes: (i) A network reproduces the experimental depletion of ISC dependent protein activity and Fe accumulation that is observed in experiments upon knock out of a given protein, independent of parameter values; (ii) A network reproduces the experimental results for particular subsets of parameter values; (iii) A network does not reproduce the experimental results, independent of parameter values. Clearly, the last case means that the considered network is not a plausible representation of the target system. Designs that lead to reproducing observed results independently of the parameter choice are the most likely to be good approximations to the real network structure. Networks that reproduce experimental results only for particular parameter values are in between both extreme cases.
Possible alternative regulatory roles for the different proteins in the scheme shown in Figure 2. A '+' sign in a row indicates that the protein could participate in the regulation of reactions in the corresponding block of reactions in Figure 2.
Sites in Figure 2
The Role of Arh1-Yah1
The analysis from the bioinformatics section leads us to consider that Arh1-Yah1 can act in regulating S(ynthesis), T(ransfer) or R(epair) of the clusters (Table 2) or in any combination of the three roles. When comparing the simulations for the alternative roles of Arh1-Yah1 to the experimental results, we see that only the curves where Arh1-Yah1 participate in S or ST reproduce experimental results. Only for those roles does the activity of ISC dependent proteins decreases and Fe accumulates in response to a depletion in Arh1-Yah1 levels (Figure 3A and 3B). These results refine our previous predictions  and are in agreement with previous experimental results that have been interpreted as indicating that Arh1-Yah1 do not regulate ISC in situ repair [73, 74].
It is interesting to note that the results from Arh1 depletion studies  show a slight increase in the activity of ISC-dependent proteins upon initial depletion of Arh1, followed by a decrease in ISC-dependent protein activity upon further depletion of Arh1. This behavior is not reproduced in Yah1 depletion studies . Within the range of our parameter scanning, only the networks where Arh1-Yah1 acts on ISC synthesis shows the same behavior for the ISC-dependent proteins as that observed in Li et al. . In our models this effect can be explained as follows. The simulations start with an overexpression of Arh1-Yah1. At high concentrations, much of the ISC biosynthesis takes place to keep Arh1-Yah1 active, as this protein requires ISC for activity. When the levels of Arh1-Yah1 drop bellow saturation levels the ISC biogenesis machinery can upload more ISC into the other ISC-dependent proteins, thus increasing their activity. As Arh1-Yah1 levels continue to drop there is not enough Arh1-Yah1 around to keep the full fraction of other ISC-dependent proteins in active form.
There is also the possibility that Arh1 and Yah1 act on ISC biogenesis in an unknown way that is not represented in the network. To look for additional evidence to support this hypothesis, we reanalyzed the phylogenetic profile of both proteins. This analysis reveals that Arh1 and Yah1 have a strong co-evolution with the proteins Sco1/Sco2 and Hol1, which are involved in cation transport. There are other connections between ISC biosynthesis and divalent ion mitochondrial transport, which suggests that the connection between Arh1/Yah1 and Sco1/Sco2 may be worth investigating in the future. Co-affinity purification experiments would provide evidence for this interaction. Additionally, measuring accumulation of iron and an impaired ISC dependent protein activity in mutants in which Sco1, Sco2 or Hol1 have been knocked out would provide evidence for the involvement of these proteins in ISC biogenesis.
We have also investigated the unsolved problem of the mutual functional relationship between Arh1 and Yah1. Homology with other ferredoxin/ferredoxin reductase pairs suggests that Arh1 is the reductase for Yah1. However, two-hybrid experiments have failed to recover an interaction between the two proteins , (Herrero, E. & Vilella, F. unpublished results). This negative result could be explained by the fact that the two-hybrid essay attempts to recover the interactions in a nuclear environment while the relevant interactions, if they exist, occur at the mitochondrial matrix.
Our structural modeling and protein docking experiments suggest that the two proteins interact in a way that is similar to their bovine homologues. We have repeated all the relevant analysis reported previously  for new structural models derived from updated structure information. We found that, although detailed numbers change, the general pictures remains the same. We have identified three arginine residues (Arg268, Arg272 and Arg273) in Arh1 that, upon mutation are likely to disturb Arh1-Yah1 interaction and thus reproduce both Δarh1 and Δyah1 mutant phenotypes . This suggests a way of experimentally testing the predictions. If these proteins act together, disrupting this docking would affect their role in ISC biogenesis. This can be experimentally tested by creating point mutants of the two proteins, changing specific Arginine and Aspartate residues into alanine. Then, one could create mutant strains with different combinations of these two proteins. If the mutant strains accumulate Fe and show an impaired ISC dependent protein activity, this would be indirect evidence that the mutated residues are important for the function. Strains with tagged Arh1 and with tagged Yah1 could be used to over-express and purify the proteins by tag affinity purification (TAP). TAP analysis and co-purification experiments with the different point mutants of Arh1 and Yah1 should help in clarifying if Arh1 and Yah1 act in tandem during ISC biogenesis.
The Role of Yfh1
The analysis from the bioinformatics section leads us to consider that Yfh1 can act in regulating Fe I(mport) and ISC S(ynthesis), T(ransfer) or R(epair) of the clusters (Table 2) or in any combination of the four roles. Only the curves for the simulations where Yfh1 participates in S, T or ST reproduce experimental results (Figure 3C and 3D). The activity of ISC dependent proteins decreases and Fe accumulates in response to the depletion of Yfh1, when Yfh1 acts in any of those roles. The phylogenetic and docking analysis supports that Yfh1 acts in S, T or ST because that analysis suggests that Yfh1 functionally cooperates with Arh1-Yah1. Recent experimental evidence detected physical and functional interactions between Yfh1, the scaffold proteins and Nfs1 [76–79] which provides further support for our interpretation. Other modes of action also cause accumulation of Fe and ISC dependent protein activity depletion; however either the Fe accumulation is small (R) or the protein activity depletion is biphasic (RS, RT, RST). Therefore we do not consider these as likely alternatives.
A speculative mechanistic model of how Arh1, Yah1 and Yfh1 may collaborate in ISC biogenesis is as follows. Yfh1 can regulate the oxidation state of iron ions, independently of the protein's oligomerization state [61, 80, 81]. Thus, Yfh1 could be responsible for supplying iron in the appropriate oxidation state for ISC biogenesis and repair. This has been previously suggested by Gakh et al. . Arh1 and Yah1 can act as alternative electron donors/acceptors for Yfh1, ensuring Yfh1's ability to provide iron in the appropriate state of oxidation for synthesis and repair of ISC. The other donor/acceptor could be a mitochondrial respiratory chain. This alternative model could explain why some experiments detect independence of Yfh1 function from mitochondrial respiration  while others detect functional interaction between Yfh1 function and mitochondrial respiration . One way to test this model would be the following. First, one would inhibit respiratory chain electron transfer (while still providing ATP for the cell) and Arh1-Yah1 independently, under the same experimental conditions. Yfh1 should remain active to some level in either case. Then, one would inhibit both simultaneously. If the model is correct then Yfh1 activity should be severely hindered in this double mutant cell lines.
Experimental evidence shows that, in vitro, Yfh1 can form large homo-multimers and store great amounts of iron that can be mobilized for later usage [81, 84, 85]. This is mimicked in our models by the role of Yfh1 in Section I of Figure 2. The analysis of these mathematical models shows that, if Yfh1 had such a function, Δyfh1 cells would have impaired ISC dependent protein activity. However, if the pool was as large as in vitro studies suggest it can be mitochondrial iron levels would be lower in Δyfh1 cells than in wild type cells. Thus, our simulations, together with recent experimental work , suggest that iron storage by Yfh1 is dispensable for its in vivo function.
The Role of Grx5
The analysis from the bioinformatics section leads us to consider that Grx5 can act in regulating the D(ead end complex recycling), A(rh1-Yah1 deglutathionylation), I(SS) and N(fs1 deglutathionylation) blocks in Figure 2 (Table 2) or in any combination of the four roles. Only the simulation curves where Grx5 participates in regulating at least D, I or N are able to reproduce published experimental results (Figure 3E and 3F). Only the A mode of action for Grx5 can be excluded by our simulations.
In silico protein docking of Grx5 with each of the ISC proteins indicates that Nfs1 is the protein that has a better surface complementary with Grx5. Furthermore, the docking of Grx5 and Nfs1 suggests that Grx5 could be regulating the glutathionylation state of the Cys421 residue in Nfs1, which is the putative active site cysteine. This prediction could be tested by purifying and isolating the different proteins involved in ISC biogenesis, together with appropriate control proteins. Then one would incubate each protein with reduced glutathione, to create protein-GS adducts. In a different experiment, one would put the proteins under strong oxidizing conditions that lead to disulfide bridge formation. After that, one would mix each protein that has been submitted to oxidative stress with Grx5 and follow the reduction of the oxidized proteins. The targets for Grx5 should be the proteins that become reduced at a quicker rate. This experiment should clarify the role of Grx5 in ISC biogenesis. In parallel one could also perform co-affinity purification studies with Grx5 and identify which proteins interact with Grx5.
The Role of Nfs1
Nfs1 catalyzes the removal of sulfur from cysteine and provides sulfur for ISC formation [29, 87, 88]. Nevertheless, it is not clear how it does so. It has been shown that Nfs1 transfers this sulfur to ISS proteins for ISC assembly [89, 90]. It has also been shown in vitro that Nfs1 can directly repair ISC in situ upon oxidative damage, if iron is available in the ISC dependent protein [91–93]. However, it is unclear whether this later role is important in generating the phenotype of ISC deletion mutants. Thus we consider that Nfs1 can act in S(ynthesis) or R(epair) of the ISC (Figure 2, Table 2) or in both. The simulations show that the models can reproduce the experimental results if Nfs1 does not work on R (Figures 3G and 3H). Occam's razor suggests that participation of Nfs1 in S is sufficient to justify the phenotype. Nevertheless, the network where Nfs1 acts on SR can also reproduce the experimental phenotype observed upon Nfs1 depletion (Figure 3).
The Role of the Chaperones and Co-chaperones Ssq1-Jac1-Mge1
We consider that Ssq1-Jac1-Mge1 can act in regulating F(olding of the proteins), R(epair of the clusters) and St(abilization of the clusters in the scaffolds) (Figure 2, Table 2) or in any combination of the three roles. When comparing the simulations for the alternative roles of the chaperones to published experimental results, only the curves where Ssq1-Jac1-Mge1 participate in regulating R can be excluded (Figure 3I and 3J). However, again following Occam's razor, there is no need to consider any role for the chaperones other than the folding of ISC proteins. This is in agreement with recently published work . One way to test whether the chaperones are involved in the stabilization of the clusters in the scaffold proteins would be to create the following two in vitro systems. One system would contain scaffold proteins and Yah1. Then one would create favorable conditions so that the scaffolds would be loaded with ISC and follow the kinetics of both ISC assembly in the scaffolds and ISC transfer to apo-Yah1. This is an experiment that has been repeated and published using homologues from different species (see for example reference ). Finally, one would repeat the experiment adding chaperones to the medium. Faster kinetics of ISC assembly would suggest that the chaperones assist in stabilizing the clusters on the scaffolds.
Roles for the proteins in ISC biogenesis. Likely roles for the different proteins in the scheme shown in Figure 2 are indicated by a '+' sign. The letter codes refer to the blocks in Figure 2. Entries not in bold are not likely due to Occam's razor rule, although they reproduce the experimental results.
Experimental validation of the predictions and suggested experiments. See text for details on the suggested experiments.
Grx5 acts in recovering the activity of Nfs1 or of the scaffold proteins
Experimentally detected interaction with the scaffolds 
Use of kinetic assays and TAP assays to pinpoint the most likely targets for Grx5
Arh1-Yah1 act on S or ST
Yes [73, 74]
Arh1-Yah1 interact in a similar way to their bovine homologues
Point Mutants and use of TAP assays combined with measurements of ISC dependent protein activity and Fe accumulation in mutant strains
Yfh1 acts on S, T or ST
Yfh1 storage of Fe is not important for its ISC biogenesis role
Nfs1 role in synthesis is sufficient to justify Δnfs1 phenotype
Chaperone role in F or St
Yes for F 
In vitro systems with and without chaperones to compare rate of assembly of ISC onto the scaffold would provide some evidence for or against St.
As shown is this paper, bioinformatics tools, expert knowledge, and modeling techniques can thus be used to assist in the predictive analysis of the system, and in suggesting experimental test for the predictions. Another useful aspect of our approach is that it can be used in reassessing the interpretation of experimental data. For example, if Yfh1 had an important role in accumulating a mitochondrial Fe pool, then our model predicts that Δyfh1 mutants would have less mitochondrial iron, which is opposite to what is observed. It is our goal to continue to refine our model and to apply this analysis to other organisms, thus assisting in the understanding of how the ISC biogenesis pathways have evolved.
1) Choose the biological process of interest (in our benchmark example the proteins involved in ISC assembly in Saccharomyces cerevisiae)
2) Determine the proteins and metabolites of interest that are thought to be involved in the process (Bibliomic analysis and Phylogenetic profiling). Use phylogenetic profiling as a first indicator of which proteins may be acting together in the process of interest.
3) Interrogate available databases for physical interactions between the proteins of interest.
4) If possible, obtain structures for the proteins. If no structure is available, obtain structural models from homologue proteins (Structural bioinformatics)
5) Use all against all protein docking to derive the most likely interactions (Structural bioinformatics). Analyze those interactions (Protein-protein data sets).
6) Derive a degenerate/incomplete set of possible network structures based on the interactions obtained from 4).
7) Eliminate from/add to 5) any interactions that are eliminated/suggested from known data (Expert knowledge)
8) Identify alternative models corresponding to different hypothesis for the component elements and processes.
9) Derive mathematical models for the selected schemes using the GMA approach (Mathematical modeling)
10) Normalize the models and scan parameters over large permissible ranges to determine which alternative networks are able to reproduce known experimental behavior of the system.
11) If some of the alternatives reproduce known experimental results, devise thought experiments that can also be executed experimentally and that could differentiate between the behavior of the alternatives. Go back to step 6).
12) If none of the alternative networks is able to reproduce the known experimental behavior use Phylogenetic profiling to determine new potential components of the system of interest. Go back to step 3).
In certain cases, some of the steps proposed here can be avoided, as they would add no extra information. It is likely that, by applying this procedure to the reconstruction of different metabolic processes, the various methods will contribute differently for the reconstruction. In some cases, there will be overlap of the network of interactions that are predicted by each data set, while in others the analysis of the different datasets will provide different and sometimes contradictory networks of interactions. This emphasizes the need for expert curation of the networks at this stage of the methodology's development. Any program/server/software package that would allow the implementation of such a procedure would have to be sufficiently flexible so that the use of only certain parts of the procedure would be possible for non expert users. It is also critical that such an application is flexible enough to allow the researcher to modify the pathway topology interactively, using expert knowledge. Once the alternative pathways are identified, the use of GMA models greatly facilitates the mathematical modeling. These models can automatically be derived from the reaction scheme and dynamic simulations can be performed by scanning parameters of the normalized models. When it comes to ill-defined pathways with almost no parameter determination studies available, the scanning procedure is a limiting step, as tens of hundreds to a few millions of simulation curves may be needed even for fairly small pathways. The scanning of the parameter space can be bounded by the availability of good measurements for at least some of the parameter values. This is for example what happens in reference .
The ISC biogenesis pathway of S. cerevisiae is partially reconstructed using a flexible in silico methodology that combines sequence analysis, literature analysis and structural bioinformatics methods. The role of different S. cerevisiae proteins in ISC biogenesis is predicted. Some predictions are validated by published experimental results. Other predictions need further experimental work to validate. The methodology proposed here is flexible and is applicable to the reconstruction of other systems. This methodology could be a step forward in integrating different types of data to a) obtain systemic knowledge about novel pathways, and b) clarify how current models of known pathways would work, generating rational hypothesis for testing.
Pathway Reconstruction from Automated Literature Analysis
Bibliometric methods consider that the co-occurrence of the names of two or more genes or proteins in the same paper is an indication that both proteins are likely to be involved in common cellular processes. Under this assumption one can identify proteins that are know to be involved in a given cellular process. iHOP is a resource that allows this type of literature meta-analysis. We used this tool to identify possible additional genes involved in mitochondrial ISC biogenesis and as a first predictor of likely interactions between these different yeast proteins.
This technique proposes that if the homologues of two proteins are equally present or absent in a set of fully sequenced genomes, then the two proteins are likely to be involved in some common cellular function [52, 53, 96]. The rationale for this assumption is that, given two proteins, they will co-evolve if some functional requirement of the organism is dependent on both in a similar way. To perform phylogenetic profiling of the proteins involved in mitochondrial ISC biogenesis we have downloaded the proteome of all fully sequenced organisms described in the KEGG database (Version 36.0). Homology searches for each of the S. cerevisiae proteins in each of the other proteomes was done by running version 2.2.4 of PSI-BLAST  locally, with e-value ≤ 0.0001 and three iterations. A vector of present or absent homologues in each genome was built for each yeast protein. Then, taking as reference the vector for the relevant protein, a co-occurrence index (CI) is calculated for each of the other proteins in the yeast genome: CI = Σ δij-iPR/Total number of genomes = 1-Normalized Hamming Distance between two genes. In this formula δij-PR is the Kronecker delta function, taken to be 1 if the reference protein PR and protein j both have (or do not have) homologues in the proteome of organism i and 0 otherwise. The Hamming distance has been used before and benchmarked both in S. cerevisiae and in other organisms . We calculate the p value for each coefficient using the hypergeometric distribution as described in . All CI values between any pair of ISC proteins have a p values smaller than10-7. Due to this, the p-value provides no discrimination for the likelihood of interaction between any pair of ISC biogenesis proteins. Thus, we selected as having a significant phylogenetic co-evolution any other ISC protein with a CI higher than that for 95% of all the proteins in the genome. Results are shown in Figure 1.
Large Scale Protein Interaction Datasets
We have analyzed large scale protein interaction datasets downloaded from BIND , DIP [101–103], GRID [104, 105], PathCalling [68, 106] and YRC  to determine which proteins have been found to physically interact within the set of proteins involved in mitochondrial ISC biogenesis.
Protein Modeling and Docking
Because the structure of the proteins involved in mitochondrial ISC biogenesis in S. cerevisiae has not been determined we had to create structural models to perform low resolution docking. These model have been obtained either by homology modeling, using 3DJIGSAW [108–114] and SWISSMODEL [115–118] or by ab initio modeling using ROSETTA [119–123]. Models were optimized locally by selected loop reconstruction using DEEPVIEW [115–118]. When more than one model was obtained for the same protein we used a genetic algorithm to optimize the structures , followed by loop reconstruction. Model optimization was finalized with a full energy minimization using the GROMACS97 force field  as implemented in DEEPVIEW. The protein models are given as supplementary material [See Additional File 2]
Given the atomic coordinates of two proteins, docking methods search for the best complex between them in which the shape of the two surfaces fit best [63, 64], while also considering electrostatic interactions and hydrogen bonds. Protein docking experiments were done using GRAMM [54, 55, 57–60] and Hex . The ability of these methods to recover existing complexes has been benchmarked by different authors [54, 62] and, on a limited scale, by us for the ISC biogenesis pathway.
To calculate the docking score for a given pair of proteins, we used the list of the 20 highest docking scores from GRAMM. Within that list, we separated the clusters of docking solutions. The cluster score was obtained by adding the score of each individual solution within that cluster. Hex was then used to perform the same docking experiment. If the a solution from the cluster with the highest GRAMM score was not present in the best twenty Hex docking solutions, the GRAMM result was discarded and the score of the next best cluster that was also found by HEX was used instead. The results are shown in Figure 1. The scores are the product of the best fit solution within a cluster, predicted using GRAMM, by the size of the cluster within the 20 highest score predictions. In previous work, we performed several tests with positive and negative controls to ensure that this usage of the methods could indeed provide reasonable guesses. For negative controls of the docking we chose Grx5. We obtained structures for the proteins Rex2, Cyt1, Lip2, Atp16 and Cox 11. These are all proteins that either are not located in the mitochondria or, if located in the mitochondria, are not involved in ISC biogenesis. Then, we performed docking of Grx5 against these proteins as well as against the ISC biogenesis proteins. The results rank the five negative controls as the 5 smallest docking scores . As positive control we performed pair wise docking of Arh1, Yah1 and their bovine homologues. The test scores are higher for the appropriate pairs (Arh1-Yah1 and Bovine Ferredoxin Reductase-Bovine Ferredoxin; data not shown). Furthermore, the best docking for the bovine pair had a RMSD of less than three Angstroms with respect to the crystallized structure of the complex. This suggests that the docking methods used here are useful for the purpose we describe.
In this paper the generalized mass action (GMA) canonical representation of the power-law formalism is used for the set of ordinary differential equations that describes the dynamical behavior of the metabolic network of interest. The derivation and normalization of the models using this formalism is explained in detail in the supplementary appendix. The complete model is provided as supplementary material [see Additional File 3]. The GMA models are used to predict the dynamic behavior of the alternative networks. This behavior is then compared to that observed in published experiments and network alternatives are validated or invalidated on the basis of being able or unable (respectively) to reproduce published those experimental results.
The experimental data regarding changes in concentration for this system are mostly qualitative or semi quantitative and incomplete. For example when there are measurements at several time points for the accumulation of iron or for the decrease of ISC dependent protein activity, there are no measurements regarding the decrease in concentration of the protein being knocked out of the genome. Also, in most cases, there are only two measurements for Fe concentrations and ISC dependent protein activity. One measurement is made in wild type cells and the other measurement in the mutated strains at a suitable time point. This precludes using any current fitting algorithm with precision to determine goodness of fit between the dynamic behavior of a given topology and the experimental results. If accurate quantitative data regarding the concentrations of the different proteins, the variation of Fe concentration and ISC dependent protein activity were available, we would apply such a fitting algorithm to find the set(s) of parameter values that would best fit those data. For each alternative topology, the least square deviation between the observed data and the best fit set(s) of parameter values could then be used to determine which topology would be more likely to explain the experimental data. Ranking the more likely models in the absence of accurate experimental data is less straight forward. If no topology can reproduce experimental results over the entire range of scanned parameter values one can calculate the percentage of the parameter space in which the experimental behavior is qualitatively reproduced. The higher that percentage, the more likely it is that a given topology is able to explain the experimental results. However if more than one topology reproduces the qualitative experimental observations over the entire range of parameter values that are scanned, it is difficult to define a measure for the goodness of fit of these topologies to the experimental data.
This work has been partially supported by grant BFU2005-0234BMC of the Spanish Ministerio de Educación y Ciencia. RA was supported by a Ramon y Cajal award from the Spanish Ministerio de Educacion y Ciencia. We thank the anonymous reviewers for suggestions that greatly improved the paper.
- Alves R, Antunes F, Salvador A: Tools for kinetic modeling of biochemical networks. Nat Biotechnol. 2006, 24 (6): 667-672.PubMedGoogle Scholar
- Francke C, Siezen RJ, Teusink B: Reconstructing the metabolic network of a bacterium from its genome. Trends Microbiol. 2005, 13 (11): 550-558.PubMedGoogle Scholar
- Ideker T: A systems approach to discovering signaling and regulatory pathways – or, how to digest large interaction networks into relevant pieces. Adv Exp Med Biol. 2004, 547: 21-30.PubMedGoogle Scholar
- Ideker T, Winslow LR, Lauffenburger AD: Bioengineering and systems biology. Ann Biomed Eng. 2006, 34 (2): 257-264.PubMedGoogle Scholar
- Karp PD: Call for an enzyme genomics initiative. Genome Biol. 2004, 5 (8): 401-PubMed CentralPubMedGoogle Scholar
- Karp PD, Paley S, Krieger CJ, Zhang P: An evidence ontology for use in pathway/genome databases. Pac Symp Biocomput. 2004, 190-201.Google Scholar
- Karp PD, Paley S, Romero P: The Pathway Tools software. Bioinformatics. 2002, 18 (Suppl 1): S225-232.PubMedGoogle Scholar
- Nakao M, Bono H, Kawashima S, Kamiya T, Sato K, Goto S, Kanehisa M: Genome-scale Gene Expression Analysis and Pathway Reconstruction in KEGG. Genome Inform Ser Workshop Genome Inform. 1999, 10: 94-103.PubMedGoogle Scholar
- Nikitin F, Rance B, Itoh M, Kanehisa M, Lisacek F: Using Protein Motif Combinations to Update KEGG Pathway Maps and Orthologue Tables. Genome Inform. 2004, 15 (2): 266-75.PubMedGoogle Scholar
- Ogata H, Goto S, Fujibuchi W, Kanehisa M: Computation with the KEGG pathway database. Biosystems. 1998, 47 (1–2): 119-128.PubMedGoogle Scholar
- de Lichtenberg U, Jensen LJ, Brunak S, Bork P: Dynamic complex formation during the yeast cell cycle. Science. 2005, 307 (5710): 724-727.PubMedGoogle Scholar
- Christensen O, Christensen KL, Birkhauser : Approximation Theory : From Taylor Polynomials to Wavelets. 2005Google Scholar
- Heinrich R, Rapoport TA: Linear Theory of Enzymatic Chains – Its Application for Analysis of Crossover Theorem and of Glycolysis of Human Erythrocytes. Acta Biologica et Medica Germanica. 1973, 31 (4): 479-494.PubMedGoogle Scholar
- Hernandez-Bermejo B, Fairen V, Sorribas A: Power-law modeling based on least-squares minimization criteria. Mathematical Biosciences. 1999, 161 (1–2): 83-94.PubMedGoogle Scholar
- Hernandez-Bermejo B, Fairen V, Sorribas A: Power-law modeling based on least-squares criteria: consequences for system analysis and simulation. Mathematical Biosciences. 2000, 167 (2): 87-107.PubMedGoogle Scholar
- Savageau MA: Biochemical systems analysis. I. Some mathematical properties of the rate law for the component enzymatic reactions. J Theor Biol. 1969, 25 (3): 365-369.PubMedGoogle Scholar
- Savageau MA: Biochemical systems analysis. II. The steady-state solutions for an n-pool system using a power-law approximation. J Theor Biol. 1969, 25 (3): 370-379.PubMedGoogle Scholar
- Savageau MA: Biochemical systems analysis. 3. Dynamic solutions using a power-law approximation. J Theor Biol. 1970, 26 (2): 215-226.PubMedGoogle Scholar
- Sorribas A, Savageau MA: A comparison of variant theories of intact biochemical systems. II. Flux-oriented and metabolic control theories. Math Biosci. 1989, 94 (2): 195-238.PubMedGoogle Scholar
- Alves R, Savageau MA: Systemic properties of ensembles of metabolic networks: application of graphical and statistical methods to simple unbranched pathways. Bioinformatics. 2000, 16 (6): 534-547.PubMedGoogle Scholar
- Alves R, Savageau MA: Extending the method of mathematically controlled comparison to include numerical comparisons. Bioinformatics. 2000, 16 (9): 786-798.PubMedGoogle Scholar
- Alves R, Savageau MA: Comparing systemic properties of ensembles of biological networks by graphical and statistical methods. Bioinformatics. 2000, 16 (6): 527-533.PubMedGoogle Scholar
- Salvador A: Synergism analysis of biochemical systems. I. Conceptual framework. Mathematical Biosciences. 2000, 163 (2): 105-129.PubMedGoogle Scholar
- Voit EO, Voit EO: Canonical Nonlinear Modeling: S-System Approach to Understanding Complexity. 1991, New York: Chapman & HallGoogle Scholar
- Beinert H: Iron-sulfur proteins: ancient structures, still full of surprises. J Biol Inorg Chem. 2000, 5 (1): 2-15.PubMedGoogle Scholar
- Beinert H: A tribute to sulfur. Eur J Biochem. 2000, 267 (18): 5657-5664.PubMedGoogle Scholar
- Frazzon J, Dean DR: Feedback regulation of iron-sulfur cluster biosynthesis. Proc Natl Acad Sci USA. 2001, 98 (26): 14751-14753.PubMed CentralPubMedGoogle Scholar
- Frazzon J, Dean DR: Formation of iron-sulfur clusters in bacteria: an emerging field in bioinorganic chemistry. Curr Opin Chem Biol. 2003, 7 (2): 166-173.PubMedGoogle Scholar
- Frazzon J, Fick JR, Dean DR: Biosynthesis of iron-sulphur clusters is a complex and highly conserved process. Biochem Soc Trans. 2002, 30 (4): 680-685.PubMedGoogle Scholar
- Kiley PJ, Beinert H: The role of Fe-S proteins in sensing and regulation in bacteria. Curr Opin Microbiol. 2003, 6 (2): 181-185.PubMedGoogle Scholar
- Rees DC: Great metalloclusters in enzymology. Annu Rev Biochem. 2002, 71: 221-246.PubMedGoogle Scholar
- Rees DC, Howard JB: The interface between the biological and inorganic worlds: iron-sulfur metalloclusters. Science. 2003, 300 (5621): 929-931.PubMedGoogle Scholar
- Schilke B, Voisine C, Beinert H, Craig E: Evidence for a conserved system for iron metabolism in the mitochondria of Saccharomyces cerevisiae. Proc Natl Acad Sci USA. 1999, 96 (18): 10206-10211.PubMed CentralPubMedGoogle Scholar
- Vilella F, Alves R, Rodriguez-Manzaneque MT, Belli G, Swaminathan S, Sunnerhagen P, Herrero E: Evolution and cellular function of monothiol glutaredoxins: involvement in iron-sulphur cluster assembly. Comp Funct Genomics. 2004, 5 (4): 328-341.PubMed CentralPubMedGoogle Scholar
- Lill R, Diekert K, Kaut A, Lange H, Pelzer W, Prohl C, Kispal G: The essential role of mitochondria in the biogenesis of cellular iron-sulfur proteins. Biol Chem. 1999, 380 (10): 1157-66.PubMedGoogle Scholar
- Rodriguez-Manzaneque MT, Tamarit J, Belli G, Ros J, Herrero E: Grx5 is a mitochondrial glutaredoxin required for the activity of iron/sulfur enzymes. Mol Biol Cell. 2002, 13 (4): 1109-1121.PubMed CentralPubMedGoogle Scholar
- Manzella L, Barros MH, Nobrega FG: ARH1 of Saccharomyces cerevisiae: a new essential gene that codes for a protein homologous to the human adrenodoxin reductase. Yeast. 1998, 14 (9): 839-846.PubMedGoogle Scholar
- Barras F, Loiseau L, Py B: How Escherichia coli and Saccharomyces cerevisiae build Fe/S proteins. Adv Microb Physiol. 2005, 50: 41-101.PubMedGoogle Scholar
- Lill R, Dutkiewicz R, Elsasser HP, Hausmann A, Netz DJ, Pierik AJ, Stehling O, Urzica E, Muhlenhoff U: Mechanisms of iron-sulfur protein maturation in mitochondria, cytosol and nucleus of eukaryotes. Biochim Biophys Acta. 2006, 1763 (7): 652-667.PubMedGoogle Scholar
- Lill R, Muhlenhoff U: Iron-sulfur-protein biogenesis in eukaryotes. Trends Biochem Sci. 2005, 30 (3): 133-141.PubMedGoogle Scholar
- Lill R, Muhlenhoff U: Iron-Sulfur Protein Biogenesis in Eukaryotes: Components and Mechanisms. Annu Rev Cell Dev Biol. 2006Google Scholar
- Ye H, Pilon M, Pilon-Smits EA: CpNifS-dependent iron-sulfur cluster biogenesis in chloroplasts. New Phytol. 2006, 171 (2): 285-292.PubMedGoogle Scholar
- Fontecave M, Choudens SO, Py B, Barras F: Mechanisms of iron-sulfur cluster assembly: the SUF machinery. J Biol Inorg Chem. 2005, 10 (7): 713-721.PubMedGoogle Scholar
- Alves R, Herrero E, Sorribas A: Predictive reconstruction of the mitochondrial iron-sulfur cluster assembly metabolism: I. The role of the protein pair ferredoxin-ferredoxin reductase (Yah1-Arh1). Proteins. 2004, 56 (2): 354-366.PubMedGoogle Scholar
- Alves R, Herrero E, Sorribas A: Predictive reconstruction of the mitochondrial iron-sulfur cluster assembly metabolism. II. Role of glutaredoxin Grx5. Proteins. 2004, 57 (3): 481-492.PubMedGoogle Scholar
- Stapley BJ, Benoit G: Biobibliometrics: information retrieval and visualization from co-occurrences of gene names in Medline abstracts. Pac Symp Biocomput. 2000, 529-40.Google Scholar
- Hoffmann R, Krallinger M, Andres E, Tamames J, Blaschke C, Valencia A: Text mining for metabolic pathways, signaling cascades, and protein networks. SciSTKE. 2005, 2005 (283): e21-Google Scholar
- Hoffmann R, Valencia A: A gene network for navigating the literature. Nat Genet. 2004, 36 (7): 664-PubMedGoogle Scholar
- Hoffmann R, Valencia A: Implementing the iHOP concept for navigation of biomedical literature. Bioinformatics. 2005, 21 (Suppl 2): ii252-ii258.PubMedGoogle Scholar
- Adam AC, Bornhovd C, Prokisch H, Neupert W, Hell K: The Nfs1 interacting protein Isd11 has an essential role in Fe/S cluster biogenesis in mitochondria. EMBO J. 2006, 25 (1): 174-183.PubMed CentralPubMedGoogle Scholar
- Wiedemann N, Urzica E, Guiard B, Muller H, Lohaus C, Meyer HE, Ryan MT, Meisinger C, Muhlenhoff U, Lill R: Essential role of Isd11 in mitochondrial iron-sulfur cluster synthesis on Isu scaffold proteins. EMBO J. 2006, 25 (1): 184-195.PubMed CentralPubMedGoogle Scholar
- Li H, Pellegrini M, Eisenberg D: Detection of parallel functional modules by comparative analysis of genome sequences. Nat Biotechnol. 2005, 23 (2): 253-260.PubMedGoogle Scholar
- Pellegrini M, Marcotte EM, Thompson MJ, Eisenberg D, Yeates TO: Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proc Natl Acad Sci USA. 1999, 96 (8): 4285-4288.PubMed CentralPubMedGoogle Scholar
- Tovchigrechko A, Vakser IA: Development and testing of an automated approach to protein docking. Proteins. 2005, 60 (2): 296-301.PubMedGoogle Scholar
- Tovchigrechko A, Wells CA, Vakser IA: Docking of protein models. Protein Sci. 2002, 11 (8): 1888-1896.PubMed CentralPubMedGoogle Scholar
- Vajda S, Vakser IA, Sternberg MJ, Janin J: Modeling of protein interactions in genomes. Proteins. 2002, 47 (4): 444-446.PubMedGoogle Scholar
- Vakser IA: Protein docking for low-resolution structures. Protein Eng. 1995, 8 (4): 371-377.PubMedGoogle Scholar
- Vakser IA: Low-resolution docking: prediction of complexes for underdetermined structures. Biopolymers. 1996, 39 (3): 455-464.PubMedGoogle Scholar
- Vakser IA: Evaluation of GRAMM low-resolution docking methodology on the hemagglutinin-antibody complex. Proteins. 1997, 226-230. Suppl 1Google Scholar
- Vakser IA, Matar OG, Lam CF: A systematic study of low-resolution recognition in protein – protein complexes. Proc Natl Acad Sci USA. 1999, 96 (15): 8477-8482.PubMed CentralPubMedGoogle Scholar
- He Y, Alam SL, Proteasa SV, Zhang Y, Lesuisse E, Dancis A, Stemmler TL: Yeast frataxin solution structure, iron binding, and ferrochelatase interaction. Biochemistry. 2004, 43 (51): 16254-16262.PubMed CentralPubMedGoogle Scholar
- Ritchie DW: Evaluation of protein docking predictions using Hex 3.1 in CAPRI rounds 1 and 2. Proteins. 2003, 52 (1): 98-106.PubMedGoogle Scholar
- Halperin I, Ma B, Wolfson H, Nussinov R: Principles of docking: An overview of search algorithms and a guide to scoring functions. Proteins. 2002, 47 (4): 409-443.PubMedGoogle Scholar
- Smith GR, Sternberg MJ: Prediction of protein-protein interactions by docking methods. Curr Opin Struct Biol. 2002, 12 (1): 28-35.PubMedGoogle Scholar
- Ho Y, Gruhler A, Heilbut A, Bader GD, Moore L, Adams SL, Millar A, Taylor P, Bennett K, Boutilier K: Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature. 2002, 415 (6868): 180-183.PubMedGoogle Scholar
- Ross-Macdonald P: Functional analysis of the yeast genome. Funct Integr Genomics. 2000, 1 (2): 99-113.PubMedGoogle Scholar
- Uetz P, Giot L, Cagney G, Mansfield TA, Judson RS, Knight JR, Lockshon D, Narayan V, Srinivasan M, Pochart P: A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature. 2000, 403 (6770): 623-627.PubMedGoogle Scholar
- Uetz P, Hughes RE: Systematic and large-scale two-hybrid screens. Curr Opin Microbiol. 2000, 3 (3): 303-308.PubMedGoogle Scholar
- Walhout AJ, Vidal M: High-throughput yeast two-hybrid assays for large-scale protein interaction mapping. Methods. 2001, 24 (3): 297-306.PubMedGoogle Scholar
- Ni TC, Savageau MA: Model assessment and refinement using strategies from biochemical systems theory: Application to metabolism in human red blood cells. Journal of Theoretical Biology. 1996, 179 (4): 329-368.PubMedGoogle Scholar
- Sorribas A, Savageau MA: Strategies for representing metabolic pathways within biochemical systems theory: reversible pathways. Math Biosci. 1989, 94 (2): 239-269.PubMedGoogle Scholar
- Voit EO, Savageau MA: Accuracy of alternative representations for integrated biochemical systems. Biochemistry. 1987, 26 (21): 6869-6880.PubMedGoogle Scholar
- Lange H, Kaut A, Kispal G, Lill R: A mitochondrial ferredoxin is essential for biogenesis of cellular iron-sulfur proteins. Proc Natl Acad Sci USA. 2000, 97 (3): 1050-1055.PubMed CentralPubMedGoogle Scholar
- Li J, Saxena S, Pain D, Dancis A: Adrenodoxin reductase homolog (Arh1p) of yeast mitochondria required for iron homeostasis. J Biol Chem. 2001, 276 (2): 1503-1509.PubMedGoogle Scholar
- Barros MH, Nobrega FG: YAH1 of Saccharomyces cerevisiae: a new essential gene that codes for a protein homologous to human adrenodoxin. Gene. 1999, 233 (1–2): 197-203.PubMedGoogle Scholar
- Bulteau AL, O'Neill HA, Kennedy MC, Ikeda-Saito M, Isaya G, Szweda LI: Frataxin acts as an iron chaperone protein to modulate mitochondrial aconitase activity. Science. 2004, 305 (5681): 242-245.PubMedGoogle Scholar
- Gerber J, Muhlenhoff U, Lill R: An interaction between frataxin and Isu1/Nfs1 that is crucial for Fe/S cluster synthesis on Isu1. EMBO Rep. 2003, 4 (9): 906-911.PubMed CentralPubMedGoogle Scholar
- Irazusta V, Cabiscol E, Reverter B, Ros J, Tamarit J: Manganese is the link between frataxin and iron-sulfur deficiency in the yeast model of Friedreich ataxia. J Biol Chem. 2006, 281 (18): 12227-12232.PubMedGoogle Scholar
- Ramazzotti A, Vanmansart V, Foury F: Mitochondrial functional interactions between frataxin and Isu1p, the iron-sulfur cluster scaffold protein, in Saccharomyces cerevisiae. FEBS Lett. 2004, 557 (1–3): 215-220.PubMedGoogle Scholar
- Gakh O, Park S, Liu G, Macomber L, Imlay JA, Ferreira GC, Isaya G: Mitochondrial iron detoxification is a primary function of frataxin that limits oxidative damage and preserves cell longevity. Hum Mol Genet. 2006, 15 (3): 467-479.PubMedGoogle Scholar
- Park S, Gakh O, O'Neill HA, Mangravita A, Nichol H, Ferreira GC, Isaya G: Yeast frataxin sequentially chaperones and stores iron by coupling protein assembly with iron oxidation. J Biol Chem. 2003, 278 (33): 31340-31351.PubMedGoogle Scholar
- Chen OS, Kaplan J: YFH1-mediated iron homeostasis is independent of mitochondrial respiration. FEBS Lett. 2001, 509 (1): 131-134.PubMedGoogle Scholar
- Gonzalez-Cabo P, Vazquez-Manrique RP, Garcia-Gimeno MA, Sanz P, Palau F: Frataxin interacts functionally with mitochondrial electron transport chain proteins. Hum Mol Genet. 2005, 14 (15): 2091-2098.PubMedGoogle Scholar
- Isaya G, O'Neill HA, Gakh O, Park S, Mantcheva R, Mooney SM: Functional studies of frataxin. Acta Paediatr Suppl. 2004, 93 (445): 68-71.PubMedGoogle Scholar
- Nichol H, Gakh O, O'Neill HA, Pickering IJ, Isaya G, George GN: Structure of frataxin iron cores: an X-ray absorption spectroscopic study. Biochemistry. 2003, 42 (20): 5971-5976.PubMedGoogle Scholar
- Aloria K, Schilke B, Andrew A, Craig EA: Iron-induced oligomerization of yeast frataxin homologue Yfh1 is dispensable in vivo. EMBO Rep. 2004, 5 (11): 1096-1101.PubMed CentralPubMedGoogle Scholar
- Kispal G, Csere P, Prohl C, Lill R: The mitochondrial proteins Atm1p and Nfs1p are essential for biogenesis of cytosolic Fe/S proteins. EMBO J. 1999, 18 (14): 3981-3989.PubMed CentralPubMedGoogle Scholar
- Li J, Kogan M, Knight SA, Pain D, Dancis A: Yeast mitochondrial protein, Nfs1p, coordinately regulates iron-sulfur cluster proteins, cellular iron uptake, and iron distribution. J Biol Chem. 1999, 274 (46): 33025-33034.PubMedGoogle Scholar
- Nishio K, Nakai M: Transfer of iron-sulfur cluster from NifU to apoferredoxin. J Biol Chem. 2000, 275 (30): 22615-22618.PubMedGoogle Scholar
- Urbina HD, Silberg JJ, Hoff KG, Vickery LE: Transfer of sulfur from IscS to IscU during Fe/S cluster assembly. J Biol Chem. 2001, 276 (48): 44521-44526.PubMedGoogle Scholar
- Bui BT, Escalettes F, Chottard G, Florentin D, Marquet A: Enzyme-mediated sulfide production for the reconstitution of [2Fe-2S] clusters into apo-biotin synthase of Escherichia coli. Sulfide transfer from cysteine to biotin. Eur J Biochem. 2000, 267 (9): 2688-2694.PubMedGoogle Scholar
- Bui BT, Florentin D, Fournier F, Ploux O, Mejean A, Marquet A: Biotin synthase mechanism: on the origin of sulphur. FEBS Lett. 1998, 440 (1–2): 226-230.PubMedGoogle Scholar
- Yang W, Rogers PA, Ding H: Repair of nitric oxide-modified ferredoxin [2Fe-2S] cluster by cysteine desulfurase (IscS). J Biol Chem. 2002, 277 (15): 12868-12873.PubMedGoogle Scholar
- Dutkiewicz R, Marszalek J, Schilke B, Craig EA, Lill R, Muhlenhoff U: The Hsp70 chaperone Ssq1p is dispensable for iron-sulfur cluster formation on the scaffold protein Isu1p. J Biol Chem. 2006, 281 (12): 7801-7808.PubMedGoogle Scholar
- Chandramouli K, Johnson MK: HscA and HscB stimulate [2Fe-2S] cluster transfer from IscU to apoferredoxin in an ATP-dependent reaction. Biochemistry. 2006, 45 (37): 11087-11095.PubMed CentralPubMedGoogle Scholar
- Pellegrini M, Thompson M, Fierro J, Bowers P: Computational method to assign microbial genes to pathways. J Cell Biochem Suppl. 2001, 106-109. Suppl 37Google Scholar
- McGinnis S, Madden TL: BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res. 2004, W20-W25. 32 Web ServerGoogle Scholar
- Zheng Y, Szustakowski JD, Fortnow L, Roberts RJ, Kasif S: Computational identification of operons in microbial genomes. Genome Research. 2002, 12 (8): 1221-1230.PubMed CentralPubMedGoogle Scholar
- Wu J, Mellor JC, DeLisi C: Deciphering protein network organization using phylogenetic profile groups. Genome Inform. 2005, 16 (1): 142-149.PubMedGoogle Scholar
- Bader GD, Betel D, Hogue CW: BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res. 2003, 31 (1): 248-250.PubMed CentralPubMedGoogle Scholar
- Deane CM, Salwinski L, Xenarios I, Eisenberg D: Protein interactions: two methods for assessment of the reliability of high throughput observations. Mol Cell Proteomics. 2002, 1 (5): 349-356.PubMedGoogle Scholar
- Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D: The Database of Interacting Proteins: 2004 update. Nucleic Acids Res. 2004, D449-D451. 32 DatabaseGoogle Scholar
- Xenarios I, Salwinski L, Duan XJ, Higney P, Kim SM, Eisenberg D: DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions. Nucleic Acids Res. 2002, 30 (1): 303-305.PubMed CentralPubMedGoogle Scholar
- Breitkreutz BJ, Stark C, Tyers M: The GRID: the General Repository for Interaction Datasets. Genome Biol. 2003, 4 (3): R23-PubMed CentralPubMedGoogle Scholar
- Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006, D535-D539. 34 DatabaseGoogle Scholar
- Uetz P, Pankratz MJ: Protein interaction maps on the fly. Nat Biotechnol. 2004, 22 (1): 43-44.PubMedGoogle Scholar
- Hazbun TR, Fields S: Networking proteins in yeast. Proc Natl Acad Sci USA. 2001, 98 (8): 4277-4278.PubMed CentralPubMedGoogle Scholar
- Bates PA, Kelley LA, MacCallum RM, Sternberg MJ: Enhancement of protein modeling by human intervention in applying the automatic programs 3D-JIGSAW and 3D-PSSM. Proteins. 2001, 39-46. Suppl 5Google Scholar
- Bates PA, Sternberg MJ: Model building by comparison at CASP3: using expert knowledge and computer automation. Proteins. 1999, 47-54. Suppl 3Google Scholar
- Contreras-Moreira B, Bates PA: Domain fishing: a first step in protein comparative modelling. Bioinformatics. 2002, 18 (8): 1141-1142.PubMedGoogle Scholar
- Contreras-Moreira B, Fitzjohn PW, Bates PA: Comparative modelling: an essential methodology for protein structure prediction in the post-genomic era. Appl Bioinformatics. 2002, 1 (4): 177-190.PubMedGoogle Scholar
- Contreras-Moreira B, Fitzjohn PW, Bates PA: In silico protein recombination: enhancing template and sequence alignment selection for comparative protein modelling. J Mol Biol. 2003, 328 (3): 593-608.PubMedGoogle Scholar
- Contreras-Moreira B, Fitzjohn PW, Offman M, Smith GR, Bates PA: Novel use of a genetic algorithm for protein structure prediction: searching template and sequence alignment space. Proteins. 2003, 53 (Suppl 6): 424-429.PubMedGoogle Scholar
- Contreras-Moreira B, Jonsson PF, Bates PA: Structural context of exons in protein domains: implications for protein modelling and design. J Mol Biol. 2003, 333 (5): 1045-1059.PubMedGoogle Scholar
- Guex N, Diemand A, Peitsch MC: Protein modelling for all. Trends in Biochemical Sciences. 1999, 24 (9): 364-367.PubMedGoogle Scholar
- Guex N, Peitsch MC: SWISS-MODEL and the Swiss-PdbViewer: An environment for comparative protein modeling. Electrophoresis. 1997, 18 (15): 2714-2723.PubMedGoogle Scholar
- Schwede T, Diemand A, Guex N, Peitsch MC: Protein structure computing in the genomic era. Res Microbiol. 2000, 151 (2): 107-112.PubMedGoogle Scholar
- Schwede T, Kopp J, Guex N, Peitsch MC: SWISS-MODEL: An automated protein homology-modeling server. Nucleic Acids Res. 2003, 31 (13): 3381-3385.PubMed CentralPubMedGoogle Scholar
- Bradley P, Malmstrom L, Qian B, Schonbrun J, Chivian D, Kim DE, Meiler J, Misura KM, Baker D: Free modeling with Rosetta in CASP6. Proteins. 2005, 61 (Suppl 7): 128-134.PubMedGoogle Scholar
- Chivian D, Kim DE, Malmstrom L, Bradley P, Robertson T, Murphy P, Strauss CE, Bonneau R, Rohl CA, Baker D: Automated prediction of CASP-5 structures using the Robetta server. Proteins. 2003, 53 (Suppl 6): 524-533.PubMedGoogle Scholar
- Chivian D, Kim DE, Malmstrom L, Schonbrun J, Rohl CA, Baker D: Prediction of CASP6 structures using automated Robetta protocols. Proteins. 2005, 61 (Suppl 7): 157-166.PubMedGoogle Scholar
- Kim DE, Chivian D, Baker D: Protein structure prediction and analysis using the Robetta server. Nucleic Acids Res. 2004, W526-W531. 32 Web ServerGoogle Scholar
- Misura KM, Chivian D, Rohl CA, Kim DE, Baker D: Physically realistic homology models built with ROSETTA can be more accurate than their templates. Proc Natl Acad Sci USA. 2006, 103 (14): 5361-5366.PubMed CentralPubMedGoogle Scholar
- Weiner B, Deumens E, Ohrn Y: Coherent-State Approach to Electron-Nuclear Dynamics with an Antisymmetrized Geminal Power State. Journal of Mathematical Physics. 1994, 35 (3): 1139-1170.Google Scholar
- Voisine C, Cheng YC, Ohlson M, Schilke B, Hoff K, Beinert H, Marszalek J, Craig EA: Jac1, a mitochondrial J-type chaperone, is involved in the biogenesis of Fe/S clusters in Saccharomyces cerevisiae. Proc Natl Acad Sci USA. 2001, 98 (4): 1483-1488.PubMed CentralPubMedGoogle Scholar
- Lutz T, Westermann B, Neupert W, Herrmann JM: The mitochondrial proteins Ssq1 and Jac1 are required for the assembly of iron sulfur clusters in mitochondria. J Mol Biol. 2001, 307 (3): 815-825.PubMedGoogle Scholar
- Duby G, Foury F, Ramazzotti A, Herrmann J, Lutz T: A non-essential function for yeast frataxin in iron-sulfur cluster assembly. Hum Mol Genet. 2002, 11 (21): 2635-2643.PubMedGoogle Scholar
- Muhlenhoff U, Richhardt N, Ristow M, Kispal G, Lill R: The yeast frataxin homolog Yfh1p plays a specific role in the maturation of cellular Fe/S proteins. Hum Mol Genet. 2002, 11 (17): 2025-2036.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.