De novo discovery of a tissue-specific gene regulatory module in a chordate

  1. David S. Johnson1,
  2. Qing Zhou2,
  3. Kasumi Yagi3,
  4. Nori Satoh3,
  5. Wing Wong2, and
  6. Arend Sidow1,4
  1. 1 Departments of Pathology and Genetics, Stanford University Medical Center, Stanford, California 94305-5324, USA
  2. 2 Departments of Statistics and Biostatistics, Stanford University, Stanford, California 94305, USA
  3. 3 Department of Zoology, Graduate School of Science, Kyoto University, Sakyo-ku, Kyoto 606-8502 Japan

Abstract

We engage the experimental and computational challenges of de novo regulatory module discovery in a complex and largely unstudied metazoan genome. Our analysis is based on the comprehensive characterization of regulatory elements of 20 muscle genes in the chordate, Ciona savignyi. Three independent types of data we generate contribute to the characterization of a muscle-specific regulatory module: (1) Positive elements (PEs), short sequences sufficient for strong muscle expression that are identified in a high-resolution in vivo analysis; (2) CisModules (CMs), candidate regulatory modules defined by clusters of overrepresented motifs predicted de novo; and (3) Conserved elements (CEs), short noncoding sequences of strong conservation between C. savignyi and C. intestinalis. We estimate the accuracy of the computational predictions by an analysis of the intersection of these data. As final biological validation of the discovered muscle regulatory module, we implement a novel algorithm to search the genome for instances of the module and identify seven novel enhancers.

Footnotes

  • [Supplemental material is available online at www.genome.org. The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: K. Small and P. Lemaire.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.4062605. Article published online before print in September 2005.

  • 4 Corresponding author. E-mail arend{at}stanford.edu; fax (650) 725-4905.

    • Accepted August 8, 2005.
    • Received April 22, 2005.
| Table of Contents

Preprint Server