Post-transcriptional exon shuffling events in humans can be evolutionarily conserved and abundant

  1. Michael S. Jackson1,8
  1. 1Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom;
  2. 2NewGene Limited, Bioscience Building, International Centre for Life, Newcastle upon Tyne NE1 4EP, United Kingdom;
  3. 3Northern Institute for Cancer Research, Paul O'Gorman Building, Newcastle University, Newcastle upon Tyne NE2 4HH, United Kingdom
    1. 7 These authors contributed equally to this work.

    • Present addresses: 4Department of Developmental Biochemistry Biocenter, University of Wuerzburg, 97074 Wuerzburg, Germany;

    • 5 Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA;

    • 6 Department of Human Genetics, Guru Nanak Dev University, Amritsar 143005, Punjab, India.

    Abstract

    In silico analyses have established that transcripts from some genes can be processed into RNAs with rearranged exon order relative to genomic structure (post-transcriptional exon shuffling, or PTES). Although known to contribute to transcriptome diversity in some species, to date the structure, distribution, abundance, and functional significance of human PTES transcripts remains largely unknown. Here, using high-throughput transcriptome sequencing, we identify 205 putative human PTES products from 176 genes. We validate 72 out of 112 products analyzed using RT-PCR, and identify additional PTES products structurally related to 61% of validated targets. Sequencing of these additional products reveals GT-AG dinucleotides at >95% of the splice junctions, confirming that they are processed by the spliceosome. We show that most PTES transcripts are expressed in a wide variety of human tissues, that they can be polyadenylated, and that some are conserved in mouse. We also show that they can extend into 5′ and 3′ UTRs, consistent with formation via trans-splicing of independent pre-mRNA molecules. Finally, we use real-time PCR to compare the abundance of PTES exon junctions relative to canonical exon junctions within the transcripts from seven genes. PTES exon junctions are present at <0.01% to >90% of the levels of canonical junctions, with transcripts from MAN1A2, PHC3, TLE4, and CDK13 exhibiting the highest levels. This is the first systematic experimental analysis of PTES in human, and it suggests both that the phenomenon is much more widespread than previously thought and that some PTES transcripts could be functional.

    Footnotes

    • Received October 11, 2010.
    • Accepted July 28, 2011.

    Freely available online through the Genome Research Open Access option.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server