Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts

  1. Jeremy R. Sanford1,6,
  2. Xin Wang2,
  3. Matthew Mort3,
  4. Natalia VanDuyn4,
  5. David N. Cooper3,
  6. Sean D. Mooney5,
  7. Howard J. Edenberg4,5 and
  8. Yunlong Liu2
  1. 1 Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA;
  2. 2 Department of Medicine, Division of Biostatistics, Indiana University School of Medicine, Indianapolis, Indiana 46202, USA;
  3. 3 Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, United Kingdom;
  4. 4 Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana 46202, USA;
  5. 5 Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, Indiana 46202, USA

    Abstract

    Metazoan genes are encrypted with at least two superimposed codes: the genetic code to specify the primary structure of proteins and the splicing code to expand their proteomic output via alternative splicing. Here, we define the specificity of a central regulator of pre-mRNA splicing, the conserved, essential splicing factor SFRS1. Cross-linking immunoprecipitation and high-throughput sequencing (CLIP-seq) identified 23,632 binding sites for SFRS1 in the transcriptome of cultured human embryonic kidney cells. SFRS1 was found to engage many different classes of functionally distinct transcripts including mRNA, miRNA, snoRNAs, ncRNAs, and conserved intergenic transcripts of unknown function. The majority of these diverse transcripts share a purine-rich consensus motif corresponding to the canonical SFRS1 binding site. The consensus site was not only enriched in exons cross-linked to SFRS1 in vivo, but was also enriched in close proximity to splice sites. mRNAs encoding RNA processing factors were significantly overrepresented, suggesting that SFRS1 may broadly influence the post-transcriptional control of gene expression in vivo. Finally, a search for the SFRS1 consensus motif within the Human Gene Mutation Database identified 181 mutations in 82 different genes that disrupt predicted SFRS1 binding sites. This comprehensive analysis substantially expands the known roles of human SR proteins in the regulation of a diverse array of RNA transcripts.

    Footnotes

    • 6 Corresponding author.

      E-mail sanford{at}biology.ucsc.edu; fax (831) 459-3139.

    • [Supplemental material is available online at www.genome.org.]

    • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.082503.108.

      • Received June 23, 2008.
      • Accepted December 15, 2008.
    | Table of Contents

    Preprint Server