Elsevier

Gene

Volume 594, Issue 2, 15 December 2016, Pages 197-202
Gene

Research paper
Mitogenomics of Hesperelaea, an extinct genus of Oleaceae

https://doi.org/10.1016/j.gene.2016.09.007Get rights and content

Abstract

The recent developments in high-throughput DNA sequencing allowed major advances in organelle genomics. Assembly of mitochondrial genomes (hereafter mitogenomes) in higher plants however remains a challenge due to their large size and the presence of plastid-derived regions and repetitive sequences. In this study, we reconstructed the first mitogenome of Oleaceae using a herbarium specimen of the extinct genus Hesperelaea collected in 1875. Paired-end reads produced with the HiSeq technology (shotgun) in a previous study were re-used. With an approach combining reference-guided and de novo assembly, we obtained a circular molecule of 658,522 bp with a mean coverage depth of 35 ×. We found one large repeat (ca. 8 kb) and annotated 46 protein-coding genes, 3 rRNA genes and 19 tRNA genes. A phylogeny of Lamiales mitogenomes confirms Oleaceae as sister to a group comprising Lamiaceae, Phyrmaceae and Gesneriaceae. The Hesperelaea mitogenome has lower rates of synonymous and non-synonymous substitution compared to Nicotiana tabacum than other available mitogenomes of Lamiales. To conclude, we show that mitogenome reconstruction in higher plants is possible with shotgun data, even from poorly preserved DNA extracted from old specimens. This approach offers new perspectives to reconstruct plant phylogenies from mitochondrial markers, and to develop functional mitogenomics in Oleaceae.

Introduction

Recent advances in high-throughput DNA sequencing, so-called Next Generation Sequencing (NGS), have revolutionized phylogenetics by generating massive genomic resources (Harrison and Kidner, 2011). In particular, the genome skimming approach (i.e. low coverage shotgun sequencing) allowed the assembly of the most represented genomic regions such as the nuclear ribosomal cluster and cytoplasmic genomes (e.g. Cronn et al., 2008, Straub et al., 2012, Besnard et al., 2013, Malé et al., 2014). In plants, genome skimming has been mainly used to generate chloroplast genomes (hereafter plastomes) and nearly complete nuclear ribosomal sequences (Straub et al., 2012), but the method can also be used to assemble sequences from the mitochondrial genome (hereafter mitogenome) (e.g. Ma et al., 2012, Bock et al., 2014, Malé et al., 2014). In addition, the genome skimming method was successfully applied to low-quality DNA extracted from herbarium specimens, which are generally not amenable to classical PCR (e.g. Staats et al., 2013, Besnard et al., 2014, Bakker et al., 2016, Zedane et al., 2016). This approach thus offers the possibility to include in phylogenetic analyses rare and extinct species from natural history collections.

Both plastomes and mitogenomes are maternally inherited in numerous plant groups, in particular in angiosperms, but there are a few exceptions (e.g. Corriveau and Coleman, 1988, Birky, 1995). Polymorphism of the plastome has been extensively used in phylogeographic studies, in contrast with the mitogenome (e.g. Ruhlman and Jansen, 2014). The general lack of interest in plant mitogenomes for phylogenetic analyses mainly stems from their supposed low sequence variation (Wolfe et al., 1987, Smith and Keeling, 2015). Consequently, the assembly of plant mitochondrial sequences from shotgun data was reported only in a few studies (e.g. Ma et al., 2012, Bock et al., 2014, Malé et al., 2014). In addition, reconstructing full mitogenome sequences is challenging due to the presence of both long duplications and plastid-derived regions (Mower et al., 2012b, Smith and Keeling, 2015). These characteristics are lineage-specific and make the assembly of plant mitogenome from short sequence reads very complex, with no standard procedure (as proposed for reconstructing plastomes; Straub et al., 2012, Bakker et al., 2016). Yet, mitogenomic variation in plants may be valuable in investigating not only phylogenetic relationships among species, but also in studying patterns of substitution (e.g. Zhu et al., 2014) or the genetic determinism of maternally inherited traits such as the cytoplasmic male sterility (e.g. Liu et al., 2011, Mower et al., 2012a, Hiroshi et al., 2014, Touzet and Meyer, 2014).

Here, we describe the assembly of the first mitogenome of Oleaceae (Lamiales). We applied our approach to degraded DNA extracted from an old herbarium specimen collected 141 years ago on Guadalupe Island, Mexico. This specimen represents the only known remain of the genus Hesperelaea, which is now considered extinct (Watson, 1876, Zedane et al., 2016). Our study demonstrates that shotgun sequencing data, even from poorly preserved DNA, can be used to generate mitochondrial DNA datasets. Molecular analysis of available mitogenomes of Lamiales reveals deep structural reorganizations and heterogeneous evolutionary rates between mitochondrial genes. Finally, we discuss the implication of our results for further studies on the Oleaceae functional mitogenomics and phylogenetics.

Section snippets

Study model and DNA sequencing

Hesperelaea palmeri A. Gray belongs to the tribe Oleeae (Wallander and Albert, 2000). A recent phylogenetic study indicates that it was a paleoendemic lineage of an American group that includes Forestiera and Priogymnanthus (Zedane et al., 2016). The complete plastome of this species has been sequenced previously using a genome skimming approach (Zedane et al., 2016). Here, we reused these shotgun data produced with the HiSeq technology (Illumina Inc., 2010), consisting of 10,694,511 paired-end

Mitogenome assembly, content and annotation

Our approach allowed us to reconstruct a master sequence of the H. palmeri mitogenome (EMBL accession no: KX545367; mitogenome length: 658,522 bp; GC content: 44.50%). About 5% of this genome is composed of duplicated regions, with one large repeat of 7975 bp and 151 small repeats ranging from 40 to 496 bp (Tables 1 and S1). We identified 21 plastid-derived regions with a length > 100 bp and with a minimum identity of 80% with the H. palmeri plastome [covering in total 57,617 bp (8.75%) of the

Conclusions and perspectives

Our approach shows that the assembly of mitogenomes or long mitochondrial sequences is feasible with shotgun data, even with low-quality DNA extracted from herbarium samples. The assembly of mitochondrial genomes from Illumina short sequence reads remains, however, a challenge due to the length of the mitogenome, plastid-derived regions and repetitive sequences. Indeed, the reconstruction of a master circle is not fully automatized and thus time-consuming (e.g. Cahill et al., 2010, Malé et al.,

Acknowledgments

This work was funded by the Regional Council Midi-Pyrenees (AAP 13053637, 2014-EDB-UT3-DOCT) and the LABEX entitled TULIP managed by Agence Nationale de la Recherche (ANR-10-LABX-0041). We also thank Jan Hackel for helpful comments, and the Missouri Botanical Garden (MO) for permission to extract DNA from a leaf of Hesperelaea.

References (55)

  • G. Besnard et al.

    From museums to genomics: old herbarium specimens shed light on a C3 to C4 transition

    J. Exp. Bot.

    (2014)
  • Biomatters Ltd.

    Geneious v. 9.0.5

  • C.W. Birky

    Uniparental inheritance of mitochondrial and chloroplast genes: mechanisms and evolution

    Proc. Natl. Acad. Sci. U. S. A.

    (1995)
  • D.G. Bock et al.

    Genome skimming reveals the origin of the Jerusalem Artichoke tuber crop species: neither from Jerusalem nor an artichoke

    New Phytol.

    (2014)
  • M.J. Cahill et al.

    Read length and repeat resolution: exploring prokaryote genomes using next-generation sequencing technologies

    PLoS ONE

    (2010)
  • J.L. Corriveau et al.

    Rapid screening method to detect potential biparental inheritance of plastid DNA and results for over 200 angiosperms

    Am. J. Bot.

    (1988)
  • R. Cronn et al.

    Multiplex sequencing of plant chloroplast genomes using Solexa sequencing-by-synthesis technology

    Nucleic Acids Res.

    (2008)
  • R.C. Edgar

    Muscle: a multiple sequence alignment method with reduced time and space complexity

    BMC Bioinformatics

    (2004)
  • R. Govindarajulu et al.

    Comparison of nuclear, plastid, and mitochondrial phylogenies and the origin of wild octoploid strawberry species

    Am. J. Bot.

    (2015)
  • S. Greiner et al.

    Why are most organelle genomes transmitted maternally?

    BioEssays

    (2015)
  • W. Hao et al.

    Fine-scale mergers of chloroplast and mitochondrial genes create functional, transcompartmentally chimeric mitochondrial genes

    Proc. Natl. Acad. Sci. U. S. A.

    (2009)
  • N. Harrison et al.

    Next-generation sequencing and systematics: what can a billion base pairs of DNA sequence data do for you?

    Taxon

    (2011)
  • Y. Hiroshi et al.

    Complete mitochondrial genome sequence of black mustard (Brassica nigra; BB) and comparison with Brassica oleracea (CC) and Brassica carinata (BBCC)

    Genome

    (2014)
  • C. Hong-Wa et al.

    Species limits and diversification in the Madagascar olive (Noronhia, Oleaceae)

    Bot. J. Linn. Soc.

    (2014)
  • Illumina Inc.

    HiSeq™ 2000 sequencing system

  • M. Krzywinski et al.

    Circos: an information aesthetic for comparative genomics

    Genome Res.

    (2009)
  • S. Kurtz

    The Vmatch large scale sequence analysis software

  • Cited by (24)

    View all citing articles on Scopus
    1

    Present address: INRA-CNRGV, INRA Auzeville, 31326 Castanet-Tolosan, France.

    View full text