Skip to main content

Advertisement

Is the transcript from the overlapping region of two converging genes part of either gene?

Article metrics

MYH11 (myosin heavy polypeptide 11) and NDE1 (Nude1) are transcribed from opposite strands of human chromosome 16 (see Figure 1). The null hypothesis is that microarray features representing a single gene will behave in the same way; estimating the confidence of our belief is an important step to identification of deviations from this model. One Affymetrix microarray probeset "227249_at", annotated as detecting NDE1, is designed for the region of chromosome 16 where the two genes converge and is complementary to an identified part of MYH11 mRNA. The expression detected by this probeset is negatively correlated with some of the probesets for MYH11. We test the hypothesis that this region at the 3' end of NDE1 has a negative role in the expression of MYH11 and ask: "Is the RNA transcribed from here really part of the NDE1 or could it be an independent transcript?"

Figure 1
figure1

Transcription of MYH11 and NDE1

Microarrays continue to generate complex data for gene-expression. Clustering of both genes and samples is one of the most common analytical objectives – often achieved using spectral analysis of a matrix associated with the bipartite graph generated by the genes and samples and their corresponding links. Specifically, we first represent the activity of the ith gene in the jth sample as a positive value wij and then store these values in a rectangular matrix W. Then clustering of both genes and samples may be achieved using the singular value decomposition (SVD) of the matrix W, with the singular vectors corresponding to the second largest singular values providing the information to implement the clustering. These clustering techniques are heuristic and it is natural to ask how reliable they are. Using techniques from numerical linear algebra and probability analysis, it is possible to provide a sensitivity measure of the robustness of clustering using SVD. We use this sensitivity analysis to provide an answer to the above question about the expression of MYH11 and NDE1.

The advent of microarrays for all exons leads to new possibilities in identifying alternative transcripts and changes in the composition of mRNA and proteins. With these possibilities comes the challenge of reliably identifying candidates for alternative splicing and where possible suggesting "clusters" of co-expressed exons which can then be tested in the laboratory. The mathematical techniques used in the above work can help in this process.

Author information

Correspondence to Alastair Spence.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Spence, A. Is the transcript from the overlapping region of two converging genes part of either gene?. BMC Syst Biol 1, P11 (2007) doi:10.1186/1752-0509-1-S1-P11

Download citation

Keywords

  • Alternative Splice
  • Bipartite Graph
  • Singular Value Decomposition
  • Cluster Technique
  • Singular Vector