Is the transcript from the overlapping region of two converging genes part of either gene?
© Spence; licensee BioMed Central Ltd. 2007
Published: 8 May 2007
Microarrays continue to generate complex data for gene-expression. Clustering of both genes and samples is one of the most common analytical objectives – often achieved using spectral analysis of a matrix associated with the bipartite graph generated by the genes and samples and their corresponding links. Specifically, we first represent the activity of the ith gene in the jth sample as a positive value wij and then store these values in a rectangular matrix W. Then clustering of both genes and samples may be achieved using the singular value decomposition (SVD) of the matrix W, with the singular vectors corresponding to the second largest singular values providing the information to implement the clustering. These clustering techniques are heuristic and it is natural to ask how reliable they are. Using techniques from numerical linear algebra and probability analysis, it is possible to provide a sensitivity measure of the robustness of clustering using SVD. We use this sensitivity analysis to provide an answer to the above question about the expression of MYH11 and NDE1.
The advent of microarrays for all exons leads to new possibilities in identifying alternative transcripts and changes in the composition of mRNA and proteins. With these possibilities comes the challenge of reliably identifying candidates for alternative splicing and where possible suggesting "clusters" of co-expressed exons which can then be tested in the laboratory. The mathematical techniques used in the above work can help in this process.
This article is published under license to BioMed Central Ltd.