Frame check and domain annotation. Figure 4
(a) The length of the fusion transcript, from the start to the stop codon, must be multiple of three (three nucleotides per single encoded codon). If the length of the sequence module three is non-zero, the fusion sequence if frame-shifted. A premature stop codon can be introduced in the protein sequence. Figure 4
(b) The nucleotide sequence resulting from the fusion of the 5’ and 3’ gene is translated into amino acid sequence. Similarly, the genomic breakpoint coordinates are translated into protein amino acid coordinates. UniProt Web Service is queried and the list of the available domains for both the gene is retrieved. On the basis of the protein domain sequence and protein breakpoint, the list of both conserved and lost domains is reported.