Abstract
American elm, Ulmus americana L., was widely cultivated in the USA and Canada as a landscape tree. Despite its importance in landscaping and horticulture, its genome is poorly characterized. We assembled the chloroplast genomes of two American elm genotypes (RV16 and Am. 57845); to our knowledge, this is the first description of sequencing and assembly of this species. The complete chloroplast genome of U. americana ranged from 158,935 to 158,993 bp and it contains 127 genes, namely 85 protein-coding genes, 34 tRNA genes, and 8 rRNA genes. Between the two American elm chloroplasts we sequenced, we identified 240 high-quality sequence variants (SNPs and indels). To evaluate the phylogeny of American elm, we compared the chloroplast genomes of the two American elms with seven Asian elm species and twelve other chloroplast genomes available through the NCBI database. As expected, Ulmus was closely related to Morus and Cannabis, as all three genera are assigned to the Urticales. We clarified the timing of the divergence of American elm from the available Asian elms, the divergence within these Asian elms, and all the species’ relative ages. Comparison of the chloroplasts of American elm with the available Asian elms revealed that trnH was absent from American elm but not most Asian elms; conversely, petB, petD, psbL, trnK, and rps16 are present in the American elm but absent from all Asian elms analyzed. ycf15 was present in both American and Asian elms but absent from members of closely related genera. The complete chloroplast genome of U. americana will provide useful genetic resources for characterizing the genetic diversity of U. americana and potentially help to conserve natural populations of American elm.
Similar content being viewed by others
References
Alexander LW, Woeste KE (2014) Pyrosequencing of the northern red oak (Quercus rubra L.) chloroplast genome reveals high quality polymorphisms for population management. Tree Genet Genomes 10:803–812. https://doi.org/10.1007/s11295-013-0681-1
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410. https://doi.org/10.1016/S0022-2836(05)80360-2
Amiryousefi A, Hyvönen J, Poczai P (2018) IRscope: an online program to visualize the junction sites of chloroplast genomes. Bioinformatics 34:3030–3031
Beck N, Lang B (2010) MFannot, organelle genome annotation webserver. University of Montreal. http://megasun.bch.umontreal.ca/cgi-bin/dev_mfa/mfannotInterface.pl. 2019
Bey CF (1990) Ulmus americana L. American elm vol 2. Hardwoods, Silvics of North America. United States Forest Service, United States Department of Agriculture, Washington, D.C
Birchler JA, Veitia RA (2012) Gene balance hypothesis: connecting issues of dosage sensitivity across biological disciplines. Proc Natl Acad Sci 109:14746–14753
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. https://doi.org/10.1093/bioinformatics/btu170
Bouzat JL (2010) Conservation genetics of population bottlenecks: the role of chance, selection, and history. Conserv Genet 11:463–478. https://doi.org/10.1007/s10592-010-0049-0
Brasier C (1983) The future of Dutch elm disease in Europe vol 60. https://www.forestresearch.gov.uk/
Brasier CM, Buck KW (2001) Rapid evolutionary changes in a globally invading fungal pathogen (Dutch Elm Disease). Biol Invasions 3:223–233. https://doi.org/10.1023/a:1015248819864
Brudno M et al (2003) LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic. DNA. Genome Res 13:721–731
Brunet J, Guries RP (2016) Elm genetic diversity and hybridization in the presence of Dutch elm disease. Paper presented at the The American elm restoration workshop. Lewis Center, OH
Bushnell B (2014) BBMap: a fast, accurate, splice-aware aligner. Lawrence Berkeley National Lab, Berkeley
Cai J, Ma PF, Li HT, Li DZ (2015) Complete plastid genome sequencing of four Tilia species (Malvaceae): a comparative analysis and phylogenetic implications. PLoS One 10:e0142705. https://doi.org/10.1371/journal.pone.0142705
Cingolani P et al. (2012) A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3 Fly 6:80–92
Flower CE, Hayes-Plazolles N, Slavicek JM, Rosa C (2017) First report of ‘Candidatus Phytoplasma trifolii’-related strain of 16SrVI-A phytoplasma subgroup, associated with elm yellows disease in American elm (Ulmus americana L.) in Ohio, USA. Plant Dis 102(2):438. https://doi.org/10.1094/PDIS-08-17-1154-PDN
Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I (2004) VISTA: computational tools for comparative genomics. Nucleic Acids Res 32:W273–W279
Greiner S, Lehwark P, Bock R (2019) OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res 47:W59–W64. https://doi.org/10.1093/nar/gkz238
Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O (2010) New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol 59:307–321. https://doi.org/10.1093/sysbio/syq010
Hoang DT, Chernomor O, von Haeseler A, Minh BQ, Vinh LS (2018) UFBoot2: improving the ultrafast bootstrap approximation. Mol Biol Evol 35:518–522. https://doi.org/10.1093/molbev/msx281
Huang Y, Wang J, Yang Y, Fan C, Chen J (2017) Phylogenomic analysis and dynamic evolution of chloroplast genomes in Salicaceae. Front Plant Sci 8(1050). https://doi.org/10.3389/fpls.2017.01050
Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS (2017) ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods 14:587–589. https://doi.org/10.1038/nmeth.4285
Kane N, Sveinsson S, Dempewolf H, Yang JY, Zhang D, Engels JM, Cronk Q (2012) Ultra-barcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal. DNA Am J Bot 99:320–329. https://doi.org/10.3732/ajb.1100570
Karnosky DF (1979) Dutch elm disease - review of the history, environmental implications, control, and research needs. Environ Conserv 6:311–322. https://doi.org/10.1017/S037689290000357x
Katoh K, Misawa K, Ki K, Miyata T (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 30:3059–3066
Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30:772–780. https://doi.org/10.1093/molbev/mst010
Kent WJ (2002) BLAT—the BLAST-like alignment tool. Genome Res 12:656–664
Lang BF, Laforest M-J, Burger G (2007) Mitochondrial introns: a critical view. Trends Genet 23:119–125
Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25. https://doi.org/10.1186/gb-2009-10-3-r25
Laslett D, Canback B (2004) ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. https://doi.org/10.1093/nar/gkh152
Lin C-S et al (2015) The location and translocation of ndh genes of chloroplast origin in the Orchidaceae family. Sci Rep 5:9040
Madeira F et al (2019) The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res 47:W636–W641. https://doi.org/10.1093/nar/gkz268
Marcone C (2016) Elm yellows: a phytoplasma disease of concern in forest and landscape ecosystems. For Pathol 47:e12324. https://doi.org/10.1111/efp.12324
McKenna A et al (2010) The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20:1297–1303. https://doi.org/10.1101/gr.107524.110
NCBI (2016) Sequin. https://www.ncbi.nlm.nih.gov/projects/Sequin/. Accessed 2019
Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ (2015) IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 32:268–274. https://doi.org/10.1093/molbev/msu300
NRCS (2017) The Plants Database National Plant Data Team. http://plants.usda.gov. 2019
Pinchot CC, Knight KS, Haugen LM, Flower CE, Slavicek JM (2017) Proceedings of the American elm restoration workshop 2016 Gen Tech Rep NRS-P-174 Newtown Square, PA: US Department of Agriculture, Forest Service, Northern Research Station 148 p 174:1–148
Rochaix JD, Kuchka M, Mayfield S, Schirmerrahire M, Girardbascou J, Bennoun P (1989) Nuclear and chloroplast mutations affect the synthesis or stability of the chloroplast Psbc gene-product in Chlamydomonas-Reinhardtii. EMBO J 8:1013–1021. https://doi.org/10.1002/j.1460-2075.1989.tb03468.x
Ruhfel BR, Gitzendanner MA, Soltis PS, Soltis DE, Burleigh JG (2014) From algae to angiosperms-inferring the phylogeny of green plants (Viridiplantae) from 360 plastid genomes. Bmc Evol Biol 14 Artn 23 https://doi.org/10.1186/1471-2148-14-23
Sabir J et al (2014) Evolutionary and biotechnology implications of plastid genome variation in the inverted-repeat-lacking clade of legumes. Plant Biotechnol J 12:743–754. https://doi.org/10.1111/pbi.12179
Salinas-Giegé T, Giegé R, Giegé P (2015) tRNA biology in mitochondria. Int J Mol Sci 16:4518–4559
Sanchez R et al (2011) Phylemon 2.0: a suite of web-tools for molecular evolution, phylogenetics, phylogenomics and hypotheses testing. Nucleic Acids Res 39:W470–W474. https://doi.org/10.1093/nar/gkr408
Schwarz MB (1922) Das Zweigensterben der Olmen, Trauenveiden und Pfirschbaurne vol 5. Mededelingen wit het Phytopathologisch laboratorium ‘Willie Commelin Scholten’
Setohigashi Y, Hamaji T, Hayama M, Matsuzaki R, Nozaki H (2011) Uniparental inheritance of chloroplast DNA is strict in the isogamous Volvocalean Gonium. Plos One 6 ARTN e19545 https://doi.org/10.1371/journal.pone.0019545
Shimodaira H, Hasegawa M (1999) Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol Biol Evol 16:1114–1114
Slater GSC, Birney E (2005) Automated generation of heuristics for biological sequence comparison. BMC Bioinforma 6:31
Soltis D, Soltis P, Doyle JJ (1998) Molecular systematics of plants II: DNA sequencing vol 2. Springer Science & Business Media. https://doi.org/10.1007/978-1-4615-5419-6_1
Soubrier J, Steel M, Lee MSY, Der Sarkissian C, Guindon S, Ho SYW, Cooper A (2012) The influence of rate heterogeneity among sites on the time dependence of molecular rates. Mol Biol Evol 29:3345–3358. https://doi.org/10.1093/molbev/mss140
Stoppel R, Meurer J (2013) Complex RNA metabolism in the chloroplast: an update on the psbB operon. Planta 237:441–449. https://doi.org/10.1007/s00425-012-1782-z
Stothard P (2000) The sequence manipulation suite: JavaScript programs for analyzing and formatting protein and DNA sequences. Biotechniques 28:1102, 1104. https://doi.org/10.2144/00286ir01
Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, Greiner S (2017) GeSeq - versatile and accurate annotation of organelle genomes. Nucleic Acids Res 45:W6–W11. https://doi.org/10.1093/nar/gkx391
Whittemore AT, Olsen RT (2011) Ulmus americana (Ulmaceae) is a polyploid complex. Am J Bot 98:754–760. https://doi.org/10.3732/ajb.1000372
Whittemore AT, Xia Z-L (2017) Genome size variation in elms (Ulmus spp.) and related genera. HortScience 52:547–553
Wiegrefe SJ, Sytsma KJ, Guries RP (1994) Phylogeny of elms (Ulmus, Ulmaceae) - molecular evidence for a sectional classification. Syst Bot 19:590–612. https://doi.org/10.2307/2419779
Wysoker A, Tibbetts K, Fennell T (2019) Picard Tools vol 2.18.2. Broad Institute
Yang Z (1995) A space-time process model for the evolution of DNA sequences. Genetics 139:993–1005
Zhang Q, Zhang H, Li Q, Bai R, Ning E, Cai X (2019) Characterization of the complete chloroplast genome sequence of an endangered elm species, Ulmus gaussenii (Ulmaceae). Conserv Genet Resour 11:71–74
Zhang Y, Zhuang X, Li J, Li P, Wang Z (2019) Characterization of the complete chloroplast genome sequence of Ulmus chenmoui (Ulmaceae), an endangered plant endemic to China. Mitochondrial DNA Part B 4:482–484
Zhao P, Woeste KE (2011) DNA markers identify hybrids between butternut (Juglans cinerea L.) and Japanese walnut (Juglans ailantifolia Carr.). Tree Genet Genomes 7:511–533. https://doi.org/10.1007/s11295-010-0352-4
Zuo LH, Shang AQ, Zhang S, Yu XY, Ren YC, Yang MS, Wang JM (2017) The first complete chloroplast genome sequences of Ulmus species by de novo sequencing: genome comparative and taxonomic position analysis. PLoS One 12:e0171264. https://doi.org/10.1371/journal.pone.0171264
Acknowledgments
DNA sequencing was performed at the Purdue University Agricultural Genomics Center, Philip San Miguel, Director.
Author contribution statement
AE: conceived and designed the project, assembled the genomes, analyzed the data, and wrote the original manuscript; JDA: designed portions of the methods, analyzed the data, and revised and edited the final manuscript; CCP, JMS, and CEF: provided leaf samples and edited the draft; KEW: supervised the project and edited and revised the manuscript. All authors contributed to the editing of the final manuscript.
Data archiving statement
The whole chloroplast genome data are deposited as MH324448 and MN043961 in the NCBI database.
Funding
Funding was provided in part by a grant from the Manton Family Trust, the Hardwood Tree Improvement and Regeneration Center (United States Department of Agriculture Forest Service Northern Research Station), and the Department of Forestry and Natural Resources at Purdue University.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Contribution to the field statement
Genetic diversity within the chloroplast is considered a fundamental tool for understanding population genetic structure and species evolution. We report the assembly and public deposit of the full chloroplast DNA sequences of two genotypes of American elm, and their comparison with Asian elms. These sequences enabled us to identify highly polymorphic regions within the American elm chloroplast that can be used for future studies of genetic diversity, genetic structure, gene flow, hybridization, and genome evolution of American elm.
Disclaimer
Mention of a trademark, proprietary product, or vendor does not constitute a guarantee or warranty of the product by the US Dept. of Agriculture and does not imply its approval to the exclusion of other products or vendors that also may be suitable.
Additional information
Communicated by A.M. Dandekar
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary Figure 1
Map of the chloroplast genome of Ulmus americana (Am. 57845). The direction of transcription is indicated by arrows. Genes inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. Gene function is color-coded as shown in the legend. The darker gray in the inner circle shows the GC content, while the lighter gray shows the AT content. LSC (Large Single Copy region), SSC (Small Single Copy region), IRA, IRB (Inverted Repeat A and B, respectively). The bold black arrow indicates the start position of the chloroplast assembly; the numbering proceeds counter-clockwise. (PNG 5872 kb)
Supplementary Figure 2
Pairwise global nucleotide sequence alignment of Ulmus americana genotypes RV16 and Am. 57845. (PDF 2021 kb)
Supplementary Table 1
The vcf file containing all high-quality variants (SNPs and indels) discovered in the U. americana chloroplast, including functional annotations. (TXT 112 kb)
Supplementary Table 2
The genes in the U. americana chloroplast affected by variants (SNPs and indels). (TXT 3 kb)
Rights and permissions
About this article
Cite this article
Ebrahimi, A., Antonides, J.D., Pinchot, C.C. et al. The complete chloroplast genome sequence of American elm (Ulmus americana) and comparative genomics of related species. Tree Genetics & Genomes 17, 5 (2021). https://doi.org/10.1007/s11295-020-01487-3
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11295-020-01487-3