Theories and Applications for Sequencing Randomly Selected Clones

  1. Michael C. Wendl1,3,
  2. Marco A. Marra2,
  3. LaDeana W. Hillier1,
  4. Asif T. Chinwalla1,
  5. Richard K. Wilson1, and
  6. Robert H. Waterston1
  1. 1Genome Sequencing Center, Washington University, St. Louis, Missouri 63108, USA; 2British Columbia Cancer Research Centre, Vancouver, British Columbia V5Z 1L3, Canada

Abstract

Theory is developed for the process of sequencing randomly selected large-insert clones. Genome size, library depth, clone size, and clone distribution are considered relevant properties and perfect overlap detection for contig assembly is assumed. Genome-specific and nonrandom effects are neglected. Order of magnitude analysis indicates library depth is of secondary importance compared to the other variables, especially as clone size diminishes. In such cases, the well-known Poisson coverage law is a good approximation. Parameters derived from these models are used to examine performance for the specific case of sequencing random human BAC clones. We compare coverage and redundancy rates for libraries possessing uniform and nonuniform clone distributions. Results are measured against data from map-based human-chromosome-2 sequencing. We conclude that the map-based approach outperforms random clone sequencing, except early in a project. However, simultaneous use of both strategies can be beneficial if a performance-based estimate for halting random clone sequencing is made. Results further show that the random approach yields maximum effectiveness using nonbiased rather than biased libraries.

Footnotes

  • 3 Corresponding author.

  • E-MAIL mwendl{at}watson.wustl.edu; FAX (314) 286-1810.

  • Article published online before print: Genome Res.,10.1101.gr.133901.

  • Article and publication are at www.genome.org/cgi/doi/10.1101/gr.133901.

    • Received February 2, 2000.
    • Accepted November 21, 2000.
| Table of Contents

Preprint Server