 Research article
 Open Access
 Published:
Exploring metabolic pathway disruption in the subchronic phencyclidine model of schizophrenia with the Generalized Singular Value Decomposition
BMC Systems Biology volume 5, Article number: 72 (2011)
Abstract
Background
The quantification of experimentallyinduced alterations in biological pathways remains a major challenge in systems biology. One example of this is the quantitative characterization of alterations in defined, established metabolic pathways from complex metabolomic data. At present, the disruption of a given metabolic pathway is inferred from metabolomic data by observing an alteration in the level of one or more individual metabolites present within that pathway. Not only is this approach open to subjectivity, as metabolites participate in multiple pathways, but it also ignores useful information available through the pairwise correlations between metabolites. This extra information may be incorporated using a higherlevel approach that looks for alterations between a pair of correlation networks. In this way experimentallyinduced alterations in metabolic pathways can be quantitatively defined by characterizing group differences in metabolite clustering. Taking this approach increases the objectivity of interpreting alterations in metabolic pathways from metabolomic data.
Results
We present and justify a new technique for comparing pairs of networksin our case these networks are based on the same set of nodes and there are two distinct types of weighted edges. The algorithm is based on the Generalized Singular Value Decomposition (GSVD), which may be regarded as an extension of Principle Components Analysis to the case of two data sets. We show how the GSVD can be interpreted as a technique for reordering the two networks in order to reveal clusters that are exclusive to only one. Here we apply this algorithm to a new set of metabolomic data from the prefrontal cortex (PFC) of a translational model relevant to schizophrenia, rats treated subchronically with the NmethylDAspartic acid (NMDA) receptor antagonist phencyclidine (PCP). This provides us with a means to quantify which predefined metabolic pathways (Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolite pathway database) were altered in the PFC of PCPtreated rats. Several significant changes were discovered, notably: 1) neuroactive ligands active at glutamate and GABA receptors are disrupted in the PFC of PCPtreated animals, 2) glutamate dysfunction in these animals was not limited to compromised glutamatergic neurotransmission but also involves the disruption of metabolic pathways linked to glutamate; and 3) a specific series of purine reactions Xanthine ← Hypoxyanthine ↔ Inosine ← IMP → adenylosuccinate is also disrupted in the PFC of PCPtreated animals.
Conclusions
Network reordering via the GSVD provides a means to discover statistically validated differences in clustering between a pair of networks. In practice this analytical approach, when applied to metabolomic data, allows us to quantify the alterations in metabolic pathways between two experimental groups. With this new computational technique we identified metabolic pathway alterations that are consistent with known results. Furthermore, we discovered disruption in a novel series of purine reactions that may contribute to the PFC dysfunction and cognitive deficits seen in schizophrenia.
Background
Background in neuroscience and metabolomics
Schizophrenia is characterized by deficits in cognition known to be dependent upon the functional integrity of the prefrontal cortex (PFC). Furthermore, compromised PFC function in schizophrenia is supported by a multitude of neuroimaging studies reporting hypometabolism ('hypofrontality'), as evidenced by decreased blood flow or glucose utilization [1, 2]. While the pathophysiological basis of PFC dysfunction in schizophrenia is not completely understood, a central role for NMDA receptor hypofunction is widely supported. For example, subchronic exposure to the NMDA receptor antagonist phencyclidine (PCP) induces cognitive deficits and a 'hypofrontality' which directly parallels that seen in schizophrenia [3–5]. Furthermore, subchronic PCP exposure induces alterations in GABAergic cell markers and 5HT receptor expression in the PFC similar to those seen in this disorder [3, 6, 7]. While this evidence places NMDA receptor hypofunction central to the pathophysiology of PFC dysfunction in schizophrenia, the mechanisms through which NMDA hypofunction promotes PFC dysfunction are poorly understood.
Metabolomics is the comprehensive analysis of small molecule metabolites in biological systems [8]. It involves the study of the metabolome which is defined as all of the small molecular weight compounds within a sample that are required for metabolism, whose roles include growth and functionality [9–11]. Sample sources include bacteria, parasites, animals and humans and sample types can include biofluids, cells or tissue extracts. Metabolomics can be utilized as a tool for the characterization and quantification of all of the metabolites in a biological system. Its applications include profiling disease biomarkers [12, 13], monitoring disease progression [14], investigating xenobiotic metabolism [15], investigating druginduced toxicity [16, 17] and investigating metabolism in genetically modified animals [18]. Mass spectrometry (MS) has been employed extensively as an analytical platform for metabolomics studies [19–21]. The popularity of this approach has increased over the last decade, in part due to the advent of high resolution Fourier transform mass spectrometers which offer improved reproducibility, accuracy and sensitivity. This makes mass spectrometry suitable for high throughput metabolomics studies [22]. In addition, the Orbitrap mass spectrometers that are now available offer similar performance to FTMS systems without the need for a high strength magnetic field [23]. HILIC chromatography has been utilized as a separation technique prior to MS detection of polar metabolites in aqueous biofluids such as urine, serum and plasma [24–30].
Additionally, it has also been used for the detection of multiple neurotransmitters in primate cerebral cortex [31]. HILIC chromatography has been chosen for metabolomic studies as it is useful for the analysis of highly polar metabolites which are poorly retained on reverse phase columns [32]. Detailed reviews of the principles and applications of HILIC have been previously outlined [25, 33]. Here, HILICchromatography is utilized in combination with an LTQOrbitrap for metabolic profiling of metabolite extracts from the PFC of control and PCPtreated rats.
Metabolomics represents a robust approach through which alterations in diverse metabolic pathways may be determined at a biological systems level. In this way a metabolomics approach may prove useful in further elucidating the pathophysiological mechanisms contributing to PFC dysfunction in schizophrenia. Furthermore, this approach may also allow for the identification of PFC metabolic biomarkers for the cognitive deficits in this disorder. While the metabolomics approach can provide a rich and comprehensive set of data, the appropriate quantitative analysis of this data has not been adequately developed. In particular, the identification of statistical differences in metabolic pathways between experimental groups rather than the identification of statistical differences in individual metabolites alone represents a major challenge to quantitatively identifying metabolic alterations at a systems level from metabolomic data. One method through which statistical differences in metabolic pathways can be identified from metabolomic data involves the representation of this data as a large, complex network of nodes (single metabolites) connected by realvalue edges (the correlation coefficient between two metabolites). This form of representation has high face validity as the relationship between two metabolites, in a given pathway, is governed by a single or series of enzymatic reactions that can be viewed as being represented by the correlation between the concentrations of the two metabolites. Another advantage is that metabolomic data consist of a range of metabolites detected in both of the experimental groups of interest meaning that these data can be expressed as two complex networks based upon the same set of nodes. This data structure is amenable to analysis through the application of the Generalized Singular Value Decomposition (GSVD) algorithm.
Background in network science and spectral methods
Large, complex interaction networks arise across many applications in science and technology [34–36]. Spectral methods, based on information computed from eigenvectors or singular vectors, have been used successfully to reveal fundamental network properties. For example, we may wish to cluster objects into groups [37], put objects into order [38] or discover specific patterns of connectivity within subgroups [39–42]. In this work, we look at the case where two interaction data sets are available and the aim is discover differences between the two sets in the form of mutually exclusive clusters. For example, a given group of biologically defined entities, such as genes, proteins, metabolites or brain regions, may contain a subgroup that behaves in a coordinated manner under one condition, or in one organism, but not in anotherthe network with respect to one type of interaction contains a cluster that is not present in the other. We will show that the Generalized Singular Value Decomposition, which is becoming more widely used in computational biology [43, 44] can be justified as the basis of a network reordering approach. We also consider how to quantify the statistical significance of network patterns that are uncovered.
Overall, this work develops and applies a novel algorithm in network science and shows that it reveals meaningful insights when applied to cuttingedge metabolomic data.
Results
Derivation of new algorithm
Suppose that the square, symmetric, realvalued matrices A and B in ℝ^{N×N}represent two different types of interaction between a set of N nodes. We have in mind the case where the weights play the role of correlation coefficients. Our aim is to discover clusters, in the sense of subsets of nodes that are mutually, pairwise, strongly connected through positive weights. The algorithm will also discover clusters of strong negative connectivity, although in practice this type of pattern is less likely to be present. However, we note that the arguments given below and the resulting algorithm remain valid in the case where the weights are nonnegative, with zero representing the minimal level of similarity. The novelty of our approach is that in order to reveal interesting differences between the two types of connectivity data, we look for a set of nodes that form a good cluster with respect to A and a poor cluster with respect to B, or vice versa. As a starting point for a computational algorithm, we consider the identity
for x ∈ ℝ ^{N} . Here ·_{2} denotes the Euclidean norm and is one way to generalize the concept of outdegree to the case of a weighted network. Suppose we wish to split the nodes into two groups such that nodes within each group are wellconnected but nodes across different groups are poorly connected. We could use an indicator vector x ∈ ℝ ^{N} to denote such a partition, with x_{ s } = 1 if node s is placed in group 1 and x_{ s } = 1 if node s is placed in group 2.
Fixing on two nodes, k and l, we could argue that the existence of a third node, i, such that a_{ ik } and a_{ il } are both large and positive or both large and negative is evidence in favor of placing k and l in the same group (since they have in common a strong similarity or dissimilarity with node i). On the other hand small or oppositely signed values for a_{ ik } and a_{ il } is evidence in favor of placing k and l in different groups. In terms of the indicator vector, this translates to

1.
1. a_{ ik }a_{ il } large and positive ⇒ try to choose x_{ k }x_{ l } = +1,

2.
2. a_{ ik }a_{ il } small or negative ⇒ try to choose x_{ k }x_{ l } = 1.
Returning to the righthand side of (1), we see that is independent of the choice of indicator vector, and gives a measure of how successfully we have incorporated the (possibly conflicting) desiderata in points 1 and 2 over all pairs k, l and third parties i. So we could judge the quality of an indicator vector by its ability to produce a large value of , provided other constraints, such as balanced group sizes, were satisfied.
Analogously, we can argue that making as negative as possible is a good way to avoid forming wellconnected subgroups, and so the problem
is a good basis for picking out strong clusters in A that are not present in B.
In general, optimizing over a large, discrete set of possibilities is computationally intractable, and hence we will follow the widely used practice of relaxing to an optimization over ℝ ^{N} [37, 45]. This approach goes back as far as the pioneering work of Fiedler [46] and has some theoretical underpinning in the case where a single network is analyzed [47, 48]. So, instead of (2) we have
At this stage we recall that a general pair of matrices A ∈ ℝ^{m×n}and B ∈ ℝ^{p×n}can be simultaneously factorized using the Generalized Singular Value Decomposition (GSVD) into
where U ∈ ℝ^{m×m}and V ∈ ℝ^{p×p}are both orthogonal, X ∈ ℝ^{n×n}is invertible, C ∈ ℝ^{m×n}and S ∈ ℝ^{p×n}are diagonal with nonnegative entries such that C = diag(c_{1}, c_{2},..,c_{ n } ) and S = diag(s_{1}, s_{2},..., s_{ q } ) with q = min(p, n), and 0 ≤ c_{1} ≤ c_{2} ≤ ··· ≤ c_{ n } and s_{1} ≥ s_{2} ≥ ··· ≥ s_{ q } ≥ 0 [49]. The ratios λ_{ i } = c_{ i } /s_{ i } are the generalized singular values of A and B.
A key property of the GSVD is that the columns of X are stationary points of the function f :ℝ ^{n} ↦ ℝ given by f(x) =  Ax _{2} /Bx _{2}, with the generalized singular values λ_{ i } giving the corresponding stationary values. Hence, we may tackle the problem (3) through the GSVD. Columns 1, 2, 3,... of X are candidates for finding good clusters in B that are poor clusters in A and, analogously, columns N, N  1, N  2,... of X are candidates for finding good clusters in A that are poor clusters in B.
To transform back from real to discrete domains, we may use the ordering of the elements in x to define a new ordering for the two networks. More precisely, we relabel row and column i of A and B as row and column p_{ i } , where
In this way, the existence or lack of clusters in each network becomes apparent from inspection of the heat map of the reordered matrix. This is the approach that we use. We will also show that pvalues can be computed to quantify the statistical significance of the results. The issue of fully automating the choice of cluster size is left as future work.
A variant of the algorithm
In our context, the matrices A and B are square, with m = n = p = N. In this case, if we make the additional assumption that A and B are invertible it is known that the GSVD is closely related to the standard Singular Value Decompositions (SVD) of AB^{1} and BA^{1}. To see this, we could rearrange (4) into
Alternatively, we may let z = Ax or y = Bx in (3), to obtain the quadratic problems
which can be solved through the standard SVD.
It is known from spectral graph theory that the dominant singular vectors give good directions in which to look for clusters [37, 50]. Inverting the weight matrix reverses their importance (the singular valueσ becomes σ ^{1}) and hence a spectral clustering approach applied to A^{1} will typically find the opposite of good clusterspoorly connected nodes will be grouped together [51]. So, intuitively, forming AB^{1} in (5) should produce a data matrix for which the SVD approach finds good clusters for A and poor clusters for B. Analogously, the opposite holds for BA^{1}.
Having interpreted the algorithm this way, it is then natural to consider the reverse products, A^{1}B and B^{1}A, or, equivalently, to form the optimization problem
We may interpret (6) from the point of view that making B^{1}x large encourages poor clusters for B, while making A^{1}x small encourages good clusters for A. In this case, we would base our algorithm on the GSVD of A^{1} and B^{1}.
In the situation where A and B are both symmetric, corresponding to undirected networks, we have, from (4),
and
Then we may appeal to the arguments given previously and use columns from the inverse of the third factor in the GSVD as the basis for reordering. With this approach we use columns of X ^{T}rather than columns of X. We emphasize that although this heuristic derivation used an assumption that A and B are invertible, the GSVD, and hence the final algorithm, applies in the noninvertible case. Also, the algorithms that we use do not require the computation of matrix inverses.
In tests on both synthetic and real network pairs, we found that this version of the algorithm was more effective, [52]. Hence, in this work we focus on the approach of reordering networks pairs via columns of X ^{T}. In summary, the first few columns of X ^{T}should give orderings that favor clusters in B rather than A and vice versa for the final few columns. In our computational examples, we used the gsvd routine built in to MATLAB http://www.mathworks.com/.
Synthetic test on binary networks
In this section we illustrate the algorithm in a simple, controlled case where we know the "correct" answer. We begin by considering binary networks, where results can be clearly visualized. We generated binary adjacency matrices A and B as shown in Figure 1. Here we have 20 nodes. In both networks, nodes 15 are well connected. In A there is a well connected cluster consisting of nodes 615, whereas in B there is a well connected cluster consisting of nodes 1520. To make the test more realistic, the clusters are not perfect; there are both missing edges (false negatives) within the clusters and spurious edges (false positives) outside the clusters. Our aim is to test whether the algorithms can identify the clusters that are particular to each data set. We then show how statistical significance can be quantified.
We emphasize that the node labeling in Figure 1 was chosen purely to make the inherent structure visually apparent. Any spectral reordering algorithm should be invariant to a relabeling of the input data. In our context, this follows from the fact that for any permutation matrix P, the factorizations A = UCX ^{1} and B = V SX ^{1} are equivalent to PAP^{T} = (PU)C(PX) ^{1} and PBP^{T} = (PU)S(PX) ^{1}. So, on the relabeled data matrices, (PX) plays the role that was played by X, and our algorithm reorders based on the appropriately permuted columns of X ^{T}, as required. In Figure 2 we show the same two data sets with an arbitrary relabeling in order to illustrate that the inherent structure is no longer apparent. In essence, we are hoping that the algorithm will find the structure that has been buried in Figure 2.
In Figure 3 we display the two adjacency matrices reordered with the algorithm; we show reordering with eight different columns of X ^{T}, four from each end of the spectrum. We see that mutually exclusive structures have been uncovered. The reordering from the first column begins with nodes 18, 20, 16, 15, 19, 17, which form a cluster in B, but not A. The final column begins by picking out nodes 7, 9, 10, 15, 14, 11, 6, 13, which form the bulk of the 615 cluster in A. Nodes 8 and 12, which are missing from this sequential ordering, are placed at the head of the ordering in the penultimate column, which begins 12, 8, 7, 10, 15, 14, 9, 11. So in summary, the 19th and 20th columns of X ^{T}each reveal almost complete information about the exclusive cluster in A, and between them they capture the full cluster.
Cluster validation
Suppose we find τ nodes giving a good cluster s for B but a poor cluster for A when the graphs are reordered by column v from X ^{T}. Is this type of substructure likely to arise "by chance"? The following general approach can be used in order to determine a p value, where we will regard a value below 0.05 as indicating statistical significance.
Initialization: Compute a measure of cluster quality, c(A, B), for the promising substructure consisting of those τ nodes in networks A and B reordered by column v.
Step 1: Randomize the networks and obtain new data sets and .
Step 2: Compute the GSVD for the randomized networks and and obtain a matrix .
Step 3: Compute the measure for the τ node 'cluster' in and reordered by column v from .
p value After performing M loops over Steps 1 to 3, compute a p value as the proportion of samples that exceed c(A, B).
For our cluster quality measure c(A, B) we used
For these binary graphs, the density f (s) of cluster s was defined as
Here, E(s) represents the actual number of edges in the object block s, and s is the maximum possible number of edges.
For weighted graphs, in the case where the cluster is dominated by positive weights, we will generalize this to
Here, w(s) denotes the average weight in block s. We note the denominator s cancels when ratios are computed in the pvalue algorithm.
In Figure 3, we see that eight nodes 7,9,10,15,14,11,6,13 form a cluster in A, but not in B, when the synthetic data is reordered with the final column of X ^{T}. Applying the procedure above, using permutation to randomize the networks M = 1000 times as described below, we obtained a pvalue of 0.007. Applying the same procedure, we also obtained a pvalue of 0.029 for the first 6 nodes 18, 20, 16, 15, 19, 17 when the synthetic data is reordered with the first column of X ^{T}, which visually form a cluster in B, but not in A. These pvalues (< 0.05) both indicate that the results are statistically significant. As a further test, we arbitrarily selected the subnetworks of A and B composed of nodes 2,4,12,16,1,3,18, which correspond to the 12th to 18th components of the sorted final column from X ^{T}. In this case, we would not expect to find a significant result. This is reflected in the large pvalue of 0.844. In more exhaustive experiments, three randomization methods were tested [52]:
• ErdösRényi: generate a classical random graph with the appropriate number of edges.
• Redistribution: redistribute the entries in each row and each column of A, and perform the same operations on B.
• Permutation: reorder the nodes in A and B and choose the first τ nodes in this new ordering. In this case, recomputation of the GSVD in Step 2 is not necessary, due to the permutation invariance of the factorization.
Of those three approaches, ErdösRényi may be the most commonly used method to randomize a binary network, whereas permutation extends most naturally to the case of weighted edges, so we used permutation in the test shown here. We also tested another simple cluster quality measure which is the ratio of density of edges within the cluster in one graph and that in the other graph.
These variations were studied within this general methodology on both real and synthetic data sets [52]. In all cases, comparable p values were produced.
Synthetic test on correlation networks
Having tested the algorithm on binary networks, we now consider the case where weighted edges arise as correlation coefficients.
First, we generate two correlation matrices A and B as shown in Figure 4. Here, each graph has 20 nodes, and each entry is real valued, representing the correlation coefficient between the corresponding nodes. The same cluster patterns given for the synthetic binary matrices in Figure 1 were built in to the synthetic correlation data: nodes 15 are well connected in both networks; in A there is a well connected cluster consisting of nodes 615, whereas in B there is a well connected cluster consisting of nodes 1520. Some noise was added to the clusters to make this test more realistic.
More precisely, in our computation, the value of each entry (the correlation coefficient) in A and B as shown in Figure 4 is generated from a pair of 20 × 50 rectangular matrices D_{ a } and D_{ b } . The corresponding cluster patterns are built from signals. Figure 5 shows the nine signals that take part in the data. These are row vectors with 50 elements. We use v^{[1]}, v^{[2]}, v^{[3]},..., v^{[9]} to denote them. From these signals, we set up two matrices

D_{ a }∈ ℝ^{20×50}: the first 5 rows are linear combinations of v^{[1]}, v^{[2]}, v^{[3]}, v^{[4]}, v^{[5]}, v^{[6]} and v^{[7]}. Rows 6 to 15 are combinations of v^{[7]} and v^{[8]}. The remaining rows (rows 16 to 20) are Gaussian pseudorandom numbers.

D_{ b }∈ ℝ^{20×50}: the first 5 rows are linear combinations of v^{[1]}, v^{[2]}, v^{[3]}, v^{[4]}, v^{[5]}, v^{[6]} and v^{[7]}. Rows 6 to 14 are Gaussian pseudorandom numbers. The remaining rows (rows 15 to 20) are combinations of v^{[4]} and v^{[9]}.
Building up the rows from the underlying signals in this manner allowed us to construct the correlation patterns seen in Figure 4.
Although the algorithm is invariant to permutation, for visual clarity, we also shuffled the synthetic correlation data sets A and B before applying our algorithm to them. Figure 6 shows the same synthetic correlation data sets with an arbitrary relabeling.
We present the results from our algorithm in Figures 7 and 8. We show the relabeled A and B reordered with two extreme columns of X ^{T}, one from each of the two ends of the spectrum. The reorderings reveal the mutually exclusive cluster structures of A and B. We also applied the cluster validation method to the structures uncovered by the reorderings using random permutation. In Figure 7 we see that the first column of X ^{T}picks out the continuous nodes 17, 15, 20, 18, 16, 19, which form a good cluster in B but not in A (p < 0.001). The reordering from the final column of X ^{T}shown in Figure 8 reveals that the 615 cluster in A but not in B was completely uncovered by the nodes 10, 14, 12, 9, 8, 7, 6, 13, 15, 11 (at the top left hand side of the heatmaps, p < 0.001).
In summary, this additional synthetic test illustrates that our GSVD based algorithm can be extended to reveal the pattern difference between two relative correlation matrices in terms of clustering.
Quantitative determination of metabolic pathways disrupted in the prefrontal cortex of PCPtreated animals
SIEVE analysis (ThermoFisher Scientific) revealed significant PCPinduced alterations in the level of specific metabolites in the PFC of PCPtreated rats (Table 1 Additional File 1). These changes were evident in multiple metabolic pathways as defined by the Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolite pathways database. Significant changes were evident in (i) glutamate metabolism (3 metabolites [m, n]), (ii) the alanine, aspartate and glutamate pathway (2 metabolites [n]), (iii) phenylalanine, tyrosine and tryptophan metabolism (3 metabolites [a]), (iv) purine metabolism (2 metabolites [o]) and (v) butanoate metabolism (2 metabolites [k]). This suggested that these metabolic pathways are disrupted in the PFC of PCPtreated animals. However, this simple level of analysis prevents any quantitative and statistically rigorous determination of the predefined (KEGG) metabolic pathways disrupted in the PFC of PCPtreated animals.
In the context of this study the aim of applying the GSVD algorithm to metabolomic data from control and PCPtreated animals was to quantitatively determine which predefined metabolic pathways were altered in PCPtreated animals. The intermetabolite Pearson's correlation coefficient (partial correlation) was used as the metric of the functional association between each pair of metabolites and was generated from the metabolite peak intensities, as determined by Liquid Chromatography Mass Spectrometry (LCMS), across all animals within the same experimental group (i.e. either control or PCPtreated). These correlations were Fisher transformed to give the correlation data a normal distribution. This resulted in a pair of symmetric, square, realvalued {98 × 98} partial correlation matrices (Control animals: Additional File 2 PCPtreated animals: Additional File 3). Each withingroup matrix represents the specific association strength between each of the 9506 possible pairs of metabolites in that experimental group. In the simplest biological case the correlation coefficient between two metabolites (nodes) in the matrix represents the series of enzymatic reactions responsible for converting one metabolite into another. However, it should be noted that this simple interpretation does not account for the complex relationships that may influence the correlation between two metabolites, such as the involvement of metabolites in alternative, often parallel, metabolic pathways. There are important limitations that must be recognized when modeling metabolomic data as a complex network of interactions between metabolites (as defined by the correlation that exists between them) such as the potential for correlations to exist between metabolites that are not biologically relevant. The impact of such erroneous associations on the interpretation of the data as outlined in this paper will be limited by the approach of characterizing alterations at the level of metabolic pathways, involving multiple metabolites (the approach taken in this study), rather than considering the disruption of single correlation coefficient between two metabolites.
Our network treats interactions between molecules as bidirectional, and so the set of interactions between molecules forms an undirected weighted network. In essence the GSVD algorithm allows the reordering of the two experimental matrices A (control animals) and B (PCPtreated animals) with the aim of discovering a new node (metabolite) ordering that reveals clusters of nodes that exhibit strong connectivity (mutual weights) in one network but not the other. In the context of this data the GSVD algorithm was used to identify clusters of metabolites present in one experimental group that are not present in the other with the aim of identifying those metabolic pathways in the PFC disrupted by PCP treatment. Once the matrices had been reordered through the GSVD algorithm the significant presence of a cluster in the given network was statistically tested by comparison of the cluster quality measure in the real networks relative to that in 1000 random permutations of the initial matrices. The original metabolomic networks are shown in Figure 9, where matrix A represents control animals and B represents PCPtreated animals. Figures 10 and 11 show the networks reordered by the first and the final column of X ^{T} , respectively. The original position of each metabolite detected by LCMS (Figure 9) and its new position in each of the reordered matrices (Figures 10 and 11) are shown in Additional File 4. Visually, in Figure 10 there was no obvious pattern of clustering that would identify significant clusters of metabolites present in PCPtreated animals that were not present in controls. In contrast, in Figure 11 there appeared to be clusters of metabolites present in the PFC of control animals that were not present in PCPtreated animals (top left and bottom right hand side of the heatmap). For Figure 11 the significance of the top cluster (first 22 nodes in the reordering, p < 0.001) and the bottom cluster (last 18 nodes in the reordering, p < 0.001) was confirmed, indicating that there were clusters of metabolites significantly present in control (A) animals that were not present in PCPtreated (B) animals. The identity of the metabolites, the KEGG pathways in which each metabolite is involved, and the PCPinduced alteration in the overt level of each metabolite (as determined by SIEVE analysis) are shown in Tables 2 and 3 for the top and bottom cluster, respectively. In contrast to the metabolite clustering shown in Figure 11 there was no evidence in Figure 10 for any significant cluster of metabolites present in PCPtreated animals (B) that was not present in control (A) animals: (i) potential top cluster [first 10 nodes] p = 0.421; (ii) potential middle cluster [nodes 1825] p = 0.494.
Rigorous significance testing, involving multiple potential metabolite clusters, confirmed that there were no significant clusters of metabolites in PCPtreated animals that were not present in controls (Figure 10). Following significance testing of potential metabolite clusters in the GSVD reordered matrices, hypergeometric probability (described in the Methods section) was applied to test the significance of KEGG defined metabolite pathway overrepresentation in these clusters. The results for hypergeometric probability testing are shown in Tables 4 and 5.
Discussion
Through its application to metabolomic data we have clearly demonstrated the added value that can be gained from applying the GSVD algorithm to two sets of complex, network data based upon the same set of nodes. In particular, we have demonstrated that the combined application of the GSVD algorithm with hypergeometric probability analysis provides an analytical framework by which statistical alterations in predefined metabolic pathways between experimental groups can be defined from complex metabolomic data. There is a great unmet need for this type of analytical approach in metabolomics, as well as in the other omics fields (e.g. transcriptomics), which allows the quantification of alterations at the biological systems (pathways) level rather than simply identifying significant alterations of discrete measures (i.e. single metabolites).
Through the application of this analytical approach we identified statistically significant alterations in specific, predefined metabolic pathways (KEGG database pathways) that may contribute to PFC dysfunction in PCPtreated animals, and so in schizophrenia. This included the disruption of the (1) Alanine, Aspartate and Glutamate [ko00250], (2) Arginine and Proline [ko00330], (3) Butanoate [ko00650], (4) Nicotinate and Nicotinamide [ko00760], (5) Glycine, Serine and Threonine metabolic pathways as well as an imbalance in (6) metabolites active as neurotransmitter ligands [ko04080]. The disruption of metabolic pathways involving glutamate in the PFC of PCPtreated rats seems particularly pertinent given the reported alterations in extracellular glutamate availability in the PFC following repeated PCP treatment [53] and the central hypothesis of hypofunctional glutamatergic PFC neurotransmission in schizophrenia [54, 55]. In addition to altered glutamate metabolism there was also evidence to support an imbalance in multiple metabolites known to be active at glutamate receptors. This included an imbalance in the relationship between glutamate, Laspartate and Tauring (Table 2) which are all known to be active at glutamate receptors. Furthermore, evidence for the disruption of glycine, serine and threonine metabolism may suggest that glycine and serine activity as coagonists at the NMDA receptors may be disrupted in the PFC of PCPtreated animals. However, it is important to note that we failed to detect glycine levels in our samples and serine levels appear to be overtly unchanged. The possibility of altered glycine levels in the PFC of PCPtreated rats warrants further investigation given the ability of glycine and NMDA receptor glycine site agonists to reverse subchronic PCPinduced alterations in PFC dopaminergic neurotransmission [56, 57], which may be central to the impact of subchronic PCP treatment on cognition. Altered glycine, serine and threonine metabolism in the PFC of PCPtreated animals is also consistent with the hypothesis that glycine and serine represent potential therapeutic targets for the treatment of schizophrenia [58]. In addition, we found evidence to suggest that GABA neurotransmission was also significantly decreased in the PFC of PCPtreated rats, which may relate to the compromised integrity of GABAergic interneurones in these animals [3, 6], which closely resemble the GABAergic interneuron alterations seen in schizophrenia. The imbalance in glutamate, glutamine and GABA levels identified in the PFC of PCPtreated rats may directly contribute to the hypofrontality (glucose hypometabolism) seen in these animals, as detected using the ^{14}C2deoxyglucose imaging technique [4], as all of these metabolites are intimately linked through metabolic pathways and have a central role in regulating the coupling of neuronal activity to cerebral glucose metabolism [59, 60].
Our results also suggest that glutamatergic dysfunction in the PFC of PCPtreated rats is not limited to the disruption of glutamatergic neurotransmission but also involves the disruption of the metabolic pathways in which glutatmate is involved. For example, altered glutamate metabolism may directly contribute to the disruption of the ArginineProline metabolic pathway in the PFC of PCPtreated animals. The significant disruption of the Arginine pathway in PCPtreated animals suggests that prolonged NMDA receptor hypofunction may result in disrupted nitric oxide (NO) signalling in the PFC. There is increasing evidence that NO signalling is directly linked to NMDA receptor activity through regulation of the enzyme nitric oxide synthase (NOS) [61] and that NO signaling contributes to the deficits in cognition that arise from acute NMDA receptor blockade [62, 63]. The finding that Citrulline levels, a metabolite in the ArginineProline pathway, are significantly decreased in the PFC of the PCPtreated rats in this study further supports the suggestion that NOS activity is altered in the PFC of these animals, as this metabolite is formed by NOS when it releases NO from Larginine. This suggests that NMDA receptor hypofunction may underlie the decreased NOS activity and protein expression levels reported in the PFC of schizophrenia patients [64] and may contribute to the cognitive deficits seen in this disorder.
In addition to quantitatively defining the specific metabolic pathways altered by experimental manipulation, our results suggest that the GSVD algorithm can identify discrete series of metabolic reactions altered by experimental manipulation. In this way, while we found no significant evidence to support the widespread disruption of purine metabolism, or the significant disruption of any other KEGG defined metabolic pathway in the bottom cluster as detected using the GSVD, we did find evidence in this cluster to suggest that a specific series of purine reactions were significantly disrupted in the PFC of PCPtreated animals. These disrupted purinergic reactions in the PFC of PCPtreated animals were:
This result suggests that the activity of adenylosuccinate synthase (ADSS), the enzyme responsible for the conversion of IMP to adenylosuccinate, may be significantly increased in the PFC of PCPtreated animals. An increase in the functional activity of this enzyme could result in both the increased level of adenylosuccinate and the altered balance in the enzyme's downstream metabolites (IMP, Inosine, Hypoxanthine, Xanthine) seen in the PFC of PCPtreated animals. While the influence of prolonged NMDA receptor hypofunction on the functional activity of this specific enzyme remains to be confirmed, and clearly warrants further systematic investigation, the recent finding of altered ADSS gene expression in schizophrenia [65] and the association of ADSS gene polymorphisms with schizophrenia [66] further highlights a potential role for this metabolic pathway in this disorder. In addition, a role for this metabolic pathway in cognition and schizophrenia is supported by the observation that inherited deficiency in the enzyme responsible for the breakdown of adenylosuccinate (ASL) results in mental retardation and autistic features [67, 68]. Furthermore, the ASL gene maps to chromosome 22q13.1q13.2 in humans [69] and these chromosomal loci have been repeatedly linked to schizophrenia [70–72]. The disruption of this metabolic pathway may also contribute to the reduced rate of cerebral glucose metabolism in the PFC of PCPtreated animals [3, 4] as ASL deficiency results in hypometabolism in frontal cortical structures [73]. Overall, these results suggest that the potential role of this specific series of metabolic reactions and its enzymes in cognition and schizophrenia warrants further investigation.
Conclusions
This work addresses the scenario where a pair of networks describes two different patterns of connection between a common set of nodes. We argued from first principles that the Generalized Singular Value Decomposition (equation (4)) can form the basis of a very useful computational tool. In practice, we have shown that this new computational network reordering technique was able to identify alterations in metabolic pathways in the PFC of rats treated subchronically with PCP that may contribute to the PFC dysfunction and cognitive deficits seen in these animals. Furthermore, the metabolic pathways identified as being disrupted in the PFC of PCPtreated rats trough the application of this new computation technique clearly overlap with those metabolic species known to be disrupted in schizophrenia. Applying this new algorithm in this way also identified novel pathways that may also be relevant to schizophrenia. In this way we identified alterations in glutamate metabolism and metabolic pathways central to glutamatergic neurotransmission, alterations in arginine and proline metabolism and the disruption of a novel series of purine reactions that may contribute to the PFC dysfunction and cognitive deficits seen in schizophrenia.
Methods
Chemicals
The solvents used for the study were purchased from the following sources: Acetonitrile, methanol and chloroform (Fisher Scientific, Leicestershire, UK) and formic acid (VWR, Poole, UK). All chemicals used were of analytical reagent grade. A Direct Q3^{®} water purification system (Millipore, Watford, UK) was used to produce HPLC grade water which was used in all analysis. Standards for 90 common biomolecules were also purchased which were used to characterize the ZICHILIC column (Sigma Aldrich, Dorset UK).
Animals
All experiments were completed using male Lister Hooded rats (HarlanOlac, UK) housed under standard conditions (21°C, 4565% humidity, 12h dark/light cycle (lights on 0600h) with food and drinking water available ad libitum). All manipulations were carried out at least 1 week after entry into the facility and all experiments were carried out under the Animals (Scientific Procedures) Act 1986. Animals received either subchronic treatment with vehicle (0.9% saline, i.p., n = 5) or 2.58mg.kg^{1} PCP.HCl (i.p., Sigma Aldrich, UK) once daily for five consecutive days (n = 5). At 72 hours after the final drug treatment dose animals were sacrificed and the brain rapidly dissected out and frozen in isopentane (40°C) and stored at 80°C until sectioning. Frozen brains were sectioned (20 μM) in the coronal plane in a cryostat (20°C). Tissue sections from the prefrontal cortex (PFC, Bregma +4.70mm to Bregma +3.20mm) were collected in 4ml glass vials with reference to a stereotactic rat brain atlas [74] and stored at 80°C until further preparation for LCMS analysis.
Extraction of polar metabolites from brain samples for LCMS analysis
Extraction of polar metabolites from brain tissue was carried out using the twostep extraction method described previously [75], using methanol, water and chloroform for the optimal extraction of polar metabolites. A hand held homogenizer was used to homogenize the samples once in solution. For preparation of samples for LCMS analysis 200 μl of the collected polar extract was added to 600 μl of 1 : 1 acetonitrile:water solution to produce a final solvent:sample ratio of 3 : 1. The samples were then filtered using Acrodisc 13mm syringe filters with 0.2 μm nylon membrane (Sigma Aldrich) before LCMS analysis.
LCMS analysis of polar metabolites
Experiments were carried out using a Finnigan LTQ Orbitrap (Thermo Fisher, Hemel Hempstead, UK) using 30000 resolution. Analysis was carried out in positive mode over a mass range of 601000 m/z. The capillary temperature was set at 250°C and in positive ionization mode the ion spray voltage was 4.5 kV , the capillary voltage 30 V and the tube lens voltage 105 V . The sheath and auxiliary gas flow rates were 45 and 15, respectively (units not specified by manufacturer). A ZICHILIC column (5 μm, 150 × 4.6 mm; HiChrom, Reading, UK) was used in all analysis and a binary gradient method was developed which produced good polar metabolite separation. Solvent A was 0.1% v/v formic acid in HPLC grade water and solvent B was 0.1% v/v formic acid in acetonitrile. A flow rate of 0.3 ml/min. was used and the injection volume was 10 μl. The gradient programme used was 80% B at 0 min. to 50% B at 12 min. to 20% B at 28 min. to 80% B at 37 min., with total run time of 45 minutes. The instrument was externally calibrated before analysis and internally calibrated using lock masses at m/z 83.06037 and m/z 195.08625. Samples were analysed sequentially and the vial tray temperature was set at a constant temperature of 4°C.
Data preparation and analysis
Determination of overt alterations in metabolite levels between experimental groups
The software program Xcalibur (version 2.0) was used to acquire the LCMS data. The raw Xcalibur data files from version 1.2 (Thermo Fisher, Hemel Hempstead, UK). SIEVE software (ThermoFisher Scientific) was used to identify all metabolites affected by drug treatment by calculating a pvalue and ratio based on the difference in average intensities of individual peaks, which correspond to different metabolites, between PCPtreated and control animals. A significant difference in the level of each metabolite between groups was set at p value < 0.05 and/or ratio less than 0.5 for downregulated metabolites and greater than 2 for upregulated metabolites. The ratio is the fold change in average peak intensities from control and treatment groups. For metabolite identification the masses of the polar metabolites were compared to the exact masses of 6000 biomolecules using an inhouse developed macro (Excel, Microsoft 2007).
Hypergeometric probability testing
The hypergeometric probability test was used to calculate the probability of finding at least the observed number of metabolites of a given predefined metabolic pathway (as defined on the KEGG pathway database) in the clusters identified through the GSVD algorithm, with knowledge of the total number of metabolites present in that pathway detected by LCMS in these samples. The hypergeometric probability test was used to identify whether any of the KEGG defined metabolic pathways were significantly overrepresented in any of the GSVD identified clusters. In its general form hypergeometric probability allows the calculation of the probability of observing at least (k) metabolites from a given defined KEGG pathway in a defined cluster of metabolites (n) given the total number of metabolites (N) and the total number of metabolites from the pathway in question (m). The probability mass function of hypergeometric distribution is:
So here the probability is calculated using the formula
Significant overrepresentation of a given functional group in any GSVD defined significant cluster was set by a hypergeometric probability threshold of 0.05.
References
 1.
Davidson LL, Heinrichs RW: Quantification of frontal and temporal lobe brainimaging findings in schizophrenia: a metaanalysis. Psychiatry Research: Neuroimaging. 2003, 122: 6987. 10.1016/S09254927(02)00118X.
 2.
Hill K, Mann L, Laws KR, Stephenson CME, NimmoSmith I, McKenna PJ: Hypofrontality in schizophrenia: a metaanalysis of functional imaging studies. Acta Psychiatrica Scandinavica. 2004, 110: 243256. 10.1111/j.16000447.2004.00376.x.
 3.
Cochran SM, Kennedy M, McKerchar CE, Steward LJ, Pratt JA, Morris BJ: Induction of Metabolic Hypofunction and Neurochemical Deficits after Chronic Intermittent Exposure to Phencyclidine: Differential Modulation by Antipsychotic Drugs. Neuropsychopharmacology. 2003, 28: 265275. 10.1038/sj.npp.1300031.
 4.
Dawson N, Thompson RJ, McVie A, Thomson DM, Morris BJ, Pratt JA: Modafinil reverses phencyclidine (PCP)induced deficits in cognitive flexibility, cerebral metabolism and functional brain connectivity. Schizophrenia Bulletin.
 5.
Egerton A, Reid L, McGregor S, Cochran SM, Morris BJ, Pratt JA: Subchronic and chronic PCP treatment produces temporally distinct deficits in attentional set shifting and prepulse inhibition in rats. Psychopharmacology. 2008, 198: 3749. 10.1007/s0021300810715.
 6.
Egerton A, Reid L, McKerchar CE, Morris BJ, Pratt JA: Impairment in perceptual attentional setshifting following PCP administration: a rodent model of setshifting deficits in schizophrenia. Psychopharmacology. 2005, 179: 7784. 10.1007/s002130042109y.
 7.
Steward LJ, Kennedy MD, Morris BJ, Pratt JA: The atypical antipsychotic drug clozapine enhances chronic PCPinduced regulation of prefrontal cortex 5HT2A receptors. Neuropharmacology. 2004, 47: 527537. 10.1016/j.neuropharm.2004.04.020.
 8.
Fiehn O: Combining genomics, metabolome analysis, and biochemical modelling to understand metabolic networks. Comparative and Functional Genomics. 2001, 2: 155168. 10.1002/cfg.82.
 9.
Goodacre R, Vaidyanathan S, Dunn WB, Harrigan GG, Kell DB: Metabolomics by numbers: acquiring and understanding global metabolite data. Trends in Biotechnology. 2004, 22: 245252. 10.1016/j.tibtech.2004.03.007.
 10.
Goodacre R, York EV, Heald JK, Scott IM: Chemometric discrimination of unfractionated plant extracts analyzed by electrospray mass spectrometry. Phytochemistry. 2003, 62: 859863. 10.1016/S00319422(02)007185.
 11.
Oliver SG, Winson MK, Kell DB, Baganz F: Systematic functional analysis of the yeast genome. Trends in Biotechnology. 1998, 16: 373378. 10.1016/S01677799(98)012141.
 12.
KaddurahDaouk R, Krishnan KRR: Metabolomics: A Global Biochemical Approach to the Study of Central Nervous System Diseases. Neuropsychopharmacology. 2009, 34: 173186. 10.1038/npp.2008.174.
 13.
Quinones MP, KaddurahDaouk R: Metabolomics tools for identifying biomarkers for neuropsychiatric diseases. Neurobiology of Disease. 2009, 35: 165176. 10.1016/j.nbd.2009.02.019.
 14.
Greenberg N, Grassano A, Thambisetty M, Lovestone S, LegidoQuigley C: A proposed metabolic strategy for monitoring disease progression in Alzheimer's disease. Electrophoresis. 2009, 30: 12351239. 10.1002/elps.200800589.
 15.
Chen C, Gonzalez FJ, Idle JR: LCMSbased metabolomics in drug metabolism. Drug Metab Rev. 2007, 39: 581597. 10.1080/03602530701497804.
 16.
Lindon JC, Holmes E, Nicholson JK: Metabonomics in pharmaceutical research and development. FEBS Journal. 2007, 274: 11401151. 10.1111/j.17424658.2007.05673.x.
 17.
Robertson DG: Metabonomics in Toxicology: A Review. Toxicological Sciences. 2005, 85: 809822. 10.1093/toxsci/kfi102.
 18.
Cheng KK, Benson GM, Grimsditch DC, Reid DG, Connor SC, Griffin JL: Metabolomic study of the LDL receptor null mouse fed a highfat diet reveals profound perturbations in choline metabolism that are shared with ApoE null mice. Physiological Genomics. 2010, 41: 224231. 10.1152/physiolgenomics.00188.2009.
 19.
Dettmer K, Aronov PA, Hammock BD: Mass spectrometrybased metabolomics. Mass Spectrometry Reviews. 2007, 26: 5178. 10.1002/mas.20108.
 20.
Lu W, Bennett BD, Rabinowitz JD: Analytical strategies for LCMSbased targeted metabolomics. Journal of Chromatography B. 2008, 871: 236242. 10.1016/j.jchromb.2008.04.031.
 21.
Wilson ID, Plumb R, Granger J, Major H, Williams R, Lenz EM: HPLCMSbased methods for the study of metabonomics. Journal of Chromatography B. 2005, 817: 6776. 10.1016/j.jchromb.2004.07.045.
 22.
Ohta D, Kanaya S, Suzuki H: Application of Fouriertransform ion cyclotron resonance mass spectrometry to metabolic profiling and metabolite identification. Current Opinion in Biotechnology. 2010, 21: 3544. 10.1016/j.copbio.2010.01.012.
 23.
Hu Q, Noll RJ, Li H, Makarov A, Hardman M, Graham Cooks R: The Orbitrap: a new mass spectrometer. Journal of Mass Spectrometry. 2005, 40: 430443. 10.1002/jms.856.
 24.
Calvano CD, Zambonin CG, Jensen ON: Assessment of lectin and HILIC based enrichment protocols for characterization of serum glycoproteins by mass spectrometry. Journal of Proteomics. 2008, 71: 304317. 10.1016/j.jprot.2008.06.013.
 25.
Cubbon S, Bradbury T, Wilson J, ThomasOates J: Hydrophilic Interaction Chromatography for Mass Spectrometric Metabonomic Studies of Urine. Analytical Chemistry. 2007, 79: 89118918. 10.1021/ac071008v.
 26.
Gika HG, Theodoridis GA, Wilson ID: Hydrophilic interaction and reversedphase ultraperformance liquid chromatography TOFMS for metabonomic analysis of Zucker rat urine. Journal of Separation Science. 2008, 31 (9): 15981608. 10.1002/jssc.200700644.
 27.
Idborg H, Zamani L, Edlund PO, SchuppeKoistinen I, Jacobsson SP: Metabolic fingerprinting of rat urine by LC/MS: Part 1. Analysis by hydrophilic interaction liquid chromatographyelectrospray ionization mass spectrometry. Journal of Chromatography B. 2005, 828: 913. 10.1016/j.jchromb.2005.07.031.
 28.
Kind T, Tolstikov V, Fiehn O, Weiss RH: A comprehensive urinary metabolomic approach for identifying kidney cancer. Analytical Biochemistry. 2007, 363: 185195. 10.1016/j.ab.2007.01.028.
 29.
Nordstrom A, Want E, Northen T, Lehtio J, Siuzdak G: Multiple Ionization Mass Spectrometry Strategy Used To Reveal the Complexity of Metabolomics. Analytical Chemistry. 2008, 80: 421429. 10.1021/ac701982e.
 30.
Paek IB, Moon Y, Ji HY, Kim HH, Lee HW, Lee YB, Lee HS: Hydrophilic interaction liquid chromatographytandem mass spectrometry for the determination of levosulpiride in human plasma. Journal of Chromatography B. 2004, 809: 345350.
 31.
Zhang X, Rauch A, Lee H, Xiao H, Rainer G, Logothetis NK: Capillary hydrophilic interaction chromatography/mass spectrometry for simultaneous determination of multiple neurotransmitters in primate cerebral cortex. Rapid Communications in Mass Spectrometry. 2007, 21: 36213628. 10.1002/rcm.3251.
 32.
Dunn WB, Bailey NJC, Johnson HE: Measuring the metabolome: current analytical technologies. The Analyst. 2005, 130: 606625. 10.1039/b418288j.
 33.
Spagou K, Tsoukali H, Raikos N, Gika H, Wilson ID, Theodoridis G: Hydrophilic interaction chromatography coupled to MS for metabonomic/metabolomic studies. Journal of Separation Science. 2010, 33: 716727. 10.1002/jssc.200900803.
 34.
Albert R, Barabási AL: Statistical mechanics of complex networks. Reviews of Modern Physics. 2002, 74: 4797. 10.1103/RevModPhys.74.47.
 35.
Newman MEJ: The structure and function of complex networks. SIAM Review. 2003, 45: 167256. 10.1137/S003614450342480.
 36.
Strogatz SH: Exploring complex networks. Nature. 2001, 410: 268276. 10.1038/35065725.
 37.
Shi J, Malik J: Normalized Cuts and Image Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2000, 22: 888905. 10.1109/34.868688.
 38.
Higham DJ: Spectral reordering of a rangedependent weighted random graph. IMA J. Numer. Anal. 2005, 25: 443457. 10.1093/imanum/dri003.
 39.
Estrada E: Protein bipartivity and essentiality in the yeast proteinprotein interaction network. J. Proteome Res. 2006, 5: 21772184. 10.1021/pr060106e.
 40.
Estrada E, Higham DJ, Hatano N: Communicability and multipartite structures in complex networks at negative absolute temperatures. Phys. Rev. E. 2008, 77: 026102
 41.
Morrison JL, Breitling R, Higham DJ, Gilbert DR: A LockandKey Model for ProteinProtein Interactions. Bioinformatics. 2006, 2: 20122019.
 42.
Thomas A, Cannings R, Monk NAM, Cannings C: On the structure of proteinprotein interaction networks. Biochemical Soc. Trans. 2003, 31: 14911496. 10.1042/BST0311491.
 43.
Alter O, Brown PO, Botstein D: Generalized Singular Value Decomposition for Comparative Analysis of GenomeScale Expression Datasets of Two Different Organisms. Proceedings of the National Academy of Sciences. 2003, 100: 33513356. 10.1073/pnas.0530258100.
 44.
Schreiber A, Shirley NJ, Burton RA, Fincher GB: Combining transcriptional datasets using the generalized singular value decomposition. BMC Bioinformatics. 2008, 9: 33510.1186/147121059335.
 45.
Higham DJ, Kalna G, Kibble M: Spectral clustering and its use in bioinformatics. J. Computational and Applied Math. 2007, 204: 2537. 10.1016/j.cam.2006.04.026.
 46.
Fiedler M: A property of eigenvectors of nonnegative symmetric matrices and its application to graph theory. Czechoslovak Mathematical Journal. 1975, 25: 619633.
 47.
Atkins JE, Boman EG, Hendrickson B: A spectral algorithm for seriation and the consecutive ones problem. SIAM Journal on Computing. 1998, 28: 297310. 10.1137/S0097539795285771.
 48.
RoblesKelly A, Hancock ER: Graph Edit Distance from Spectral Seriation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2005, 27: 365378.
 49.
Golub GH, Van Loan CF: Matrix Computations. 1996, Baltimore: Johns Hopkins University Press
 50.
Strang G: Computational Science and Engineering. 2008, WellesleyCambridge Press
 51.
Estrada E: Topological structural classes of complex networks. Physical Review E. 2007, 75: 016103
 52.
Xiao X: Complex Networks and the Generalized Singular Value Decomposition. PhD thesis. 2010, University of Strathclyde, Department of Mathematics and Statistics
 53.
Murai R, Noda Y, Matsui K, Kamei H, Mouri A, Matsuba K, Nitta A, Furukawa H, Nabeshima T: Hypofunctional glutamatergic neurotransmission in the prefrontal cortex is involved in the emotional deficit induced by repeated treatment with phencyclidine in mice: Implications for abnormalities of glutamate release and NMDACaMKII signaling. Behavioural Brain Research. 2007, 180: 152160. 10.1016/j.bbr.2007.03.003.
 54.
Hahn CG, Wang HY, Cho DS, Talbot K, Gur RE, Berrettini WH, Bakshi K, Kamins J, BorgmannWinter KE, Siegel SJ, Gallop RJ, Arnold SE: Altered neuregulin 1erbB4 signaling contributes to NMDA receptor hypofunction in schizophrenia. Nature Medicine. 2006, 12: 824828. 10.1038/nm1418.
 55.
Lewis DA, Moghaddam B: Cognitive Dysfunction in Schizophrenia: Convergence of gammaAminobutyric Acid and Glutamate Alterations. Arch Neurol. 2006, 63: 13721376. 10.1001/archneur.63.10.1372.
 56.
Javitt DC, Balla A, Burch S, Suckow R, Xie S, Sershen H: Reversal of PhencyclidineInduced Dopaminergic Dysregulation by NMethylDAspartate Receptor//Glycinesite Agonists. Neuropsychopharmacology. 2003, 29: 300307.
 57.
Sershen H, Balla A, Aspromonte JM, Xie S, Cooper TB, Javitt DC: Characterization of interactions between phencyclidine and amphetamine in rodent prefrontal cortex and striatum: Implications in NMDA/glycinesitemediated dopaminergic dysregulation and dopamine transporter function. Neurochemistry International. 2008, 52: 119129. 10.1016/j.neuint.2007.07.011.
 58.
Yang CR, Svensson KA: Allosteric modulation of NMDA receptor via elevation of brain glycine and dserine: The therapeutic potentials for schizophrenia. Pharmacology & Therapeutics. 2008, 120: 317332. 10.1016/j.pharmthera.2008.08.004.
 59.
Magistretti PJ: Role of glutamate in neuronglia metabolic coupling. Am J Clin Nutr. 2009, 90: 875S880S. 10.3945/ajcn.2009.27462CC.
 60.
Patel AB, de Graaf RA, Mason GF, Rothman DL, Shulman RG, Behar KL: The contribution of GABA to glutamate/glutamine cycling and energy metabolism in the rat cortex in vivo. Proceedings of the National Academy of Sciences. 2005, 102: 55885593. 10.1073/pnas.0501703102.
 61.
Rameau GA, Tukey DS, GarcinHosfield ED, Titcombe RF, Misra C, Khatri L, Getzoff ED, Ziff EB: Biphasic Coupling of Neuronal Nitric Oxide Synthase Phosphorylation to the NMDA Receptor Regulates AMPA Receptor Trafficking and Neuronal Cell Death. The Journal of Neuroscience. 2007, 27: 34453455. 10.1523/JNEUROSCI.479906.2007.
 62.
Fejgin K, Palsson E, Wass C, Svensson L, Klamer D: Nitric Oxide Signaling in the Medial Prefrontal Cortex is Involved in the Biochemical and Behavioral Effects of Phencyclidine. Neuropsychopharmacology. 2008, 33: 18741833. 10.1038/sj.npp.1301587.
 63.
Wass C, Svensson L, Fejgin K, Pålsson E, Archer T, Engel JA, Klamer D: Nitric oxide synthase inhibition attenuates phencyclidineinduced disruption of cognitive flexibility. Pharmacology Biochemistry and Behavior. 2008, 89: 352359. 10.1016/j.pbb.2008.01.011.
 64.
Xing G, Chavko M, Zhang LX, Yang S, Post RM: Decreased calciumdependent constitutive nitric oxide synthase (cNOS) activity in prefrontal cortex in schizophrenia and depression. Schizophrenia Research. 2002, 58: 2130. 10.1016/S09209964(01)003887.
 65.
Tsuang MT, Nossova N, Yager T, Tsuang MM, Guo SC, Shyu KG, Glatt SJ, Liew C: Assessing the validity of bloodbased gene expression profiles for the classification of schizophrenia and bipolar disorder: A preliminary report. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics. 2005, 133B: 15. 10.1002/ajmg.b.30161.
 66.
Zhang F, Xu Y, Liu P, Fan H, Huang X, Sun G, Song Y, Sham P: Association analyses of the interaction between the ADSS and ATM genes with schizophrenia in a Chinese population. BMC Medical Genetics. 2008, 9: 119125. 10.1186/147123509119.
 67.
Spiegel EK, Colman RF, Patterson D: Adenylosuccinate lyase deficiency. Molecular Genetics and Metabolism. 2006, 89 (12): 1931. 10.1016/j.ymgme.2006.04.018.
 68.
Stone RL, Aimi J, Barshop BA, Jaeken J, Van den Berghe G, Zalkin H, Dixon JE: A mutation in adenylosuccinate lyase associated with mental retardation and autistic features. Nature Genetics. 1992, 1: 5963. 10.1038/ng049259.
 69.
Fon E, Demczuk S, Delattre O, Thomas G, Rouleau G: Mapping of the human adenylosuccinate lyase (ADSL) gene to chromosome 22q13.1→q13.2. Cytogenetics and Cell Genetics. 1993, 64: 201203. 10.1159/000133575.
 70.
Jorgensen T, Børglum A, Mors O, Wang A, Pinaud M, Flint T, Dahl H, Vang M, Kruse T, Ewald H: Search for common haplotypes on chromosome 22q in patients with schizophrenia or bipolar disorder from the Faroe Islands. American Journal of Medical Genetics. 2002, 114: 245252. 10.1002/ajmg.10191.
 71.
Pulver AE, Karayiorgou M, Wolyniec PS, Lasseter VK, Kasch L, Nestadt G, Antonarakis S, Housman D, Kazazian HH, Meyers D, Ott J, Lamacz M, Liang KY, Hanfelt J, Ullrich G, DeMarchi N, Ramu E, McHugh PR, Adler L, Thomas M, Carpenter WT, Manschreck T, Gordon CT, Kimberland M, Babb R, Puck J, Childs B: Sequential strategy to identify a susceptibility gene for schizophrenia: Report of potential linkage on chromosome 22q12q13.1: Part 1. American Journal of Medical Genetics. 1994, 54: 3643. 10.1002/ajmg.1320540108.
 72.
Saleem Q, Dash D, Gandhi C, Kishore A, Benegal V, Sherrin T, Mukherjee O, Jain S, Brahmachari SK: Association of CAG repeat loci on chromosome 22 with schizophrenia and bipolar disorder. Molecular Psychiatry. 2001, 6: 694700. 10.1038/sj.mp.4000924.
 73.
De Volder AG, Jaeken J, Den Berghe GV, Bol A, Michel C, Cogneau M, Goffinet AM: Regional Brain Glucose Utilization in AdenylosuccinaseDeficient Patients Measured by Positron Emission Tomography. Pediatric Research. 1998, 24: 238242.
 74.
Paxinos G, Watson C: The Rat Brain in Stereotaxic Coordinates. 1998, Academic Press
 75.
Wu H, Southam AD, Hines A, Viant MR: Highthroughput tissue extraction protocol for NMR and MSbased metabolomics. Analytical Biochemistry. 2008, 372: 204212. 10.1016/j.ab.2007.10.002.
Acknowledgements
XX is supported by Engineering and Physical Sciences Research Council grant EP/E049370/1.
ND is supported by Psychiatric Research Institute in Glasgow (PsyRING), a joint initiative between the Universities of Glasgow and Strathclyde and the National Health Service of Greater Glasgow and Clyde. LM and DGW are supported by the Scottish Universities Life Sciences Alliance (SULSA).
DJH is supported by Engineering and Physical Sciences Research Council grant EP/E049370/1 and Medical Research Council grant G0601353.
Author information
Affiliations
Corresponding author
Additional information
Authors' contributions
All authors contributed extensively to the work presented in this paper. DJH and XX conceived and analyzed the computational algorithm, designed and performed the synthetic tests and wrote the description of this material. XX applied the algorithm to metabolic networks. ND performed the in vivo experiments, analyzed the metabolomic data and wrote the description of this material. BJM and JAP conceived the subchronic PCP model. LM and DGW conducted the metabolomics experiments. All authors discussed the results, interpreted the data and have read and approved the final version of the manuscript.
Electronic supplementary material
Table S1  List of all metabolites detected by LCMS in the PFC of Control and PCPtreated animals
Additional file 1: . Table S1 Legend: The molecular formula and tentative molecular identity for each metabolite detected in the PFC of control and PCPtreated animals is shown. In addition, the KEGG molecular identity and the KEGG metabolic pathways in which a metabolite is involved are also shown. The ratio difference in metabolite concentration and the significance of this change (pvalue), as determined by SIEVE analysis (see Methods section), are also shown. Those metabolites found to be significantly different between the two groups are highlighted in bold. The most prominent alterations in KEGG defined metabolic pathways appeared to be in (i) alanine, aspartate and glutamate metabolism (3 metabolites [ko00250]), (ii) phenylalanine, tyrosine and tryptophan metabolism (3 metabolites [ko00360]), (iii) purine metabolism (2 metabolites [ko00230]) and (iv) butanoate metabolism (2 metabolites [ko00650]). KEGG defined metabolic pathways; ko00250: Alanine, Aspartate and Glutamate metabolism; ko00627: Aminobenzoate degradation; ko00330: Arginine and Proline metabolism; ko00410: betaAlanine metabolism; ko00780: Biotin metabolism; map00650: Butanoate metabolism; ko04973: Carbohydrate metabolism; ko00270: Cysteine and Methionine metabolism; ko00071: Fatty acid metabolism; ko00051: Fructose and Manose metabolism; ko00052: Galactose metabolism; ko00471: Glutamine and Glutamate metabolism; k00480: Glutathione metabolism; ko00561: Glycerolipid metabolism; ko00564: Glycerophospholipid metabolism; ko00260: Glycine, Serine and Threonine metabolism; ko00010: Glycolysis/Gluconeogenesis; ko00340: Histidine metabolism; ko00562: Inositol Phosphate metabolism; map00300: Lysine biosynthesis; ko00310: Lysine degradation; ko00430: Methionine metabolism; ko04080; Neuroactive ligandreceptor interaction; ko00760: Nicotinate and Nicotinamide metabolism; ko00190: Oxidative phosphorylation; ko00770: Pantoate and CoA biosynthesis; ko00550: Peptidoglycan biosynthesis; ko00360: Phenylalanine metabolism; ko00400: Phenylalanine, Tyrosine and Tryptophan biosynthesis; ko00440: Phosphonate and Phosphinate metabolism; ko00640: Propanoate metabolism; ko00230: Purine metabolism; ko00240: Pyrimidine metabolism; ko00620: Pyruvate metabolism; ko00500: Starch and Sucrose metabolism; ko00600: Sphingolipid metabolism; ko00920: Sulphur metabolism; ko00430: Taurine and Hypotaurine metabolism; ko00730: Thiamine metabolism; ko00380: Tryptophan metabolism; ko00350: Tyrosine metabolism; ko00400: Tyrosine and Tryptophan biosynthesis; ko00290; Valine, Leucine and Isoleucine biosynthesis; ko00280: Valine, Leucine and Isoleucine degradation. NA denotes a metabolite not associated with a KEGG compound ID or KEGG pathway. (XLS 34 KB)
12918_2010_657_MOESM2_ESM.XLS
Additional file 2: 98 × 98 matrix of between metabolite correlations in the PFC of control animals. The 98 × 98 matrix of the Pearson's correlation coefficients (Fisher ztransformed) between all metabolites detected in the prefrontal cortex of control (salinetreated) animals by LCMS analysis is shown. (XLS 194 KB)
12918_2010_657_MOESM3_ESM.XLS
Additional file 3: 98 × 98 matrix of between metabolite correlations in the PFC of PCPtreated animals. The 98 × 98 matrix of the Pearson's correlation coefficient (Fisher ztransformed) between all metabolites detected in the prefrontal cortex of PCPtreated animals by LCMS analysis is shown. (XLS 192 KB)
Table S2  Table showing the axes labels in Figures
Additional file 4: 9, 10and 11. In Table S2 the position of each metabolite in the original ordering (Figure 4) is shown. In the columns for Figures 5 and 6, the corresponding numbers indicating the new position of each metabolite (node) in the matrix when reordered by the first column of X ^{T}and the final column of X ^{T}, respectively, is shown. (XLS 31 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Xiao, X., Dawson, N., MacIntyre, L. et al. Exploring metabolic pathway disruption in the subchronic phencyclidine model of schizophrenia with the Generalized Singular Value Decomposition. BMC Syst Biol 5, 72 (2011). https://doi.org/10.1186/17520509572
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/17520509572
Keywords
 Schizophrenia
 Metabolomic Data
 Polar Metabolite
 Generalize Singular Value Decomposition
 Binary Network