A GeneTrek analysis of the maize genome

  1. Renyi Liu*,
  2. Clémentine Vitte*,
  3. Jianxin Ma*,
  4. A. Assibi Mahama,
  5. Thanda Dhliwayo,
  6. Michael Lee, and
  7. Jeffrey L. Bennetzen*,
  1. *Department of Genetics, University of Georgia, Athens, GA 30602; and
  2. Department of Agronomy, Iowa State University, Ames, IA 50011
  1. Contributed by Jeffrey L. Bennetzen, May 7, 2007 (received for review March 19, 2007)

Abstract

Analysis of the sequences of 74 randomly selected BACs demonstrated that the maize nuclear genome contains ≈37,000 candidate genes with homologues in other plant species. An additional ≈5,500 predicted genes are severely truncated and probably pseudogenes. The distribution of genes is uneven, with ≈30% of BACs containing no genes. BAC gene density varies from 0 to 7.9 per 100 kb, whereas most gene islands contain only one gene. The average number of genes per gene island is 1.7. Only 72% of these genes show collinearity with the rice genome. Particular LTR retrotransposon families (e.g., Gyma) are enriched on gene-free BACs, most of which do not come from pericentromeres or other large heterochromatic regions. Gene-containing BACs are relatively enriched in different families of LTR retrotransposons (e.g., Ji). Two major bursts of LTR retrotransposon activity in the last 2 million years are responsible for the large size of the maize genome, but only the more recent of these is well represented in gene-containing BACs, suggesting that LTR retrotransposons are more efficiently removed in these domains. The results demonstrate that sample sequencing and careful annotation of a few randomly selected BACs can provide a robust description of a complex plant genome.

Footnotes

  • To whom correspondence should be addressed. E-mail: maize{at}uga.edu
  • Author contributions: R.L. and C.V. contributed equally to this work; R.L., C.V., J.M., and J.L.B. designed research; R.L., C.V., J.M., and A.A.M. performed research; R.L., C.V., J.M., A.A.M., T.D., M.L., and J.L.B. analyzed data; and R.L., C.V., M.L., and J.L.B. wrote the paper.

  • The authors declare no conflict of interest.

  • This article contains supporting information online at www.pnas.org/cgi/content/full/0704258104/DC1.

  • Abbreviations:
    TSD,
    target site duplication;
    My,
    million years;
    RJM,
    repeat junction marker;
    TE,
    transposable element;
    IBM,
    Intermated B73 × Mo17.
« Previous | Next Article »Table of Contents