Abstract
Lomas formations or “fog oases” are islands of vegetation in the desert belt of the west coast of South America, with a unique vegetation composition among the world’s deserts. However, plant diversity and conservation studies have long been neglected, and there exists a severe gap in plant DNA sequence information. To address the lack of DNA information, we conducted field collections and laboratory DNA sequencing to establish a DNA barcode reference library of Lomas plants from Peru. This database provides 1,207 plant specimens and 3,129 DNA barcodes data corresponding with collections from 16 Lomas locations in Peru, during 2017 and 2018. This database will facilitate both rapid species identification and basic studies on plant diversity, thereby enhancing our understanding of Lomas flora’s composition and temporal variation, and providing valuable resources for conserving plant diversity and maintaining the stability of the fragile Lomas ecosystems.
Similar content being viewed by others
Background & Summary
Along the Pacific coast from 5°S (northern Peru) to 30°S (northern Chile), a narrow arid belt, of nearly 3,000 km long, is formed at the foot of the Andes1. This belt is typified by a tropical desert climate, with annual precipitation of less than 50 mm (arid) or 5 mm (hyper-arid), making it one of the driest areas in the world. Despite the drought, “fog oasis” like islands with lush vegetation and flowers appear in the desert and are known as “blooming deserts”2,3. There is a tenuous balance between Pacific Ocean currents, onshore winds, the height of the Andes mountains, and cyclic El Niño events that shape such particular geographic and unique climatic conditions4,5,6. From July to November, ocean-generated fog is intercepted by isolated mountains or steep coastal slopes, starting at sea level and extending to around 1,000 meters asl. This is the main water source sustaining plant communities from arid and hyper-arid zones, locally known as Lomas formations (Peru) or “fog oases” (Chile)4,6,7.
Lomas ecosystems, due to their seasonal and ephemeral nature, are one of the most climate-responsive ecosystems on Earth. Human activities, such as urban sprawl, mining, off-road vehicle, dumping, overgrazing, and the introduction of invasive plants and animals, have had a significant impact on them4,5,8. More than 58% of the Peruvian population lives in the narrow arid belt of the Pacific coast and relies on the Lomas for their livelihood5. However, only 3.3% of Peruvian Lomas are currently included in formally protected areas5.
The importance of Lomas plant diversity has been overlooked in comparison to the adjacent Andean-Amazonian World Biodiversity Center9. This diversity is both unique and fragile, with more than 40% of the species, within individual Lomas communities, being endemic, such as species in genera Calceolaria, Mathewsia, Nolana, Palaua, Tiquilia, Weberbauerella, etc.1. The Lomas are a valuable genetic resource, containing wild relatives of many crops (e.g., Andean potato, tomato, and papaya), as well as medicinal plants (e.g., Nasa urens, Alternanthera halimifolia), and fuel plants (e.g., Caesalpinia spinosa, Vachellia macracantha), among others8.
The composition of each Lomas plant community is highly diverse, resulting from a combination of climate, geography, and the eco-physiology1. Investigating the formation of plant diversity in Lomas is a challenge, because of annual inconsistencies in desert precipitation, and often very high inter-annual climate variability10. The number of species in the Lomas flora fluctuates dynamically11, with rare species potentially appearing only during or after El Niño events, and then not being found for several years4. Several regional floristic checklists in Peru have been published, for example, for La Libertad12, Ancash13, Lima14,15,16, Ica17,18 and Arequipa19.
However, many taxa remain unnamed at species or genus levels. This transient and heterogeneous appearance of Lomas vegetation has limited the opportunities for plant collections or occurrence records1,20, making species taxonomic revision in the Lomas flora difficult. The number of specimens registered for Peruvian Lomas is limited. The Field Museum in Chicago has the most comprehensive collection of Lomas plants from Peru and Chile, with ca. 2,800 specimens from Peruvian Lomas retrievable from its online botanical collections database (https://collections-botany.fieldmuseum.org/project/6657), covering 723 species from 369 genera in 92 families. Additionally, about 3,800 Lomas plant specimens of 438 species from 253 genera in 79 families can be retrieved from the Global Biodiversity Information Facility (GBIF, https://www.gbif.org/) database.
DNA barcoding techniques and specific reference libraries offer new opportunities for rapid, accurate, and automated species identification. They compensate for the disadvantages of morphology-based taxonomic methods which are time-consuming, specialized, and the identification results are subject to the integrity and life stage of the specimen. Consequently, this decreases the efficiency of field species survey and identification processes21,22,23,24,25,26. To accelerate the species investigation and conservation process in the Lomas flora, we (1) made a catalogue of the flora of Peruvian Lomas based on historical plant species lists from the published literature, and online specimen databases; (2) conducted a species survey and collection in Peruvian Lomas formations and established a plant DNA barcode reference library. Voucher specimen have been deposited in scientific collections. Images and DNA barcode sequences has been uploaded to the web-based information workbench, called the Barcode of Life Data System (BOLD)27. A publicly available DNA reference library of Lomas plants will provide a valuable resources to all scholars interested in Lomas vegetation. Although it is a small-scale focus, the diversity of unique taxa will be expected to have broad utility in plant biogeography, biodiversity conservation, environmental DNA detection, biological invasion detection, and the food industry.
Methods
Generation of the flora list for Lomas
The floristic list of Lomas was obtained from previously published studies (see Supplementary File 1: Reference) and specimen records from the Field Museum’s online Botanical Collections Database (https://collections-botany.fieldmuseum.org/project/6657). A total of 4,653 records were retrieved for the Peruvian Lomas, each of which was retained in Supplementary File 1 (Literature_record: 1,816; Specimen_record: 2,837), with the specific source in the ‘REFERENCE’ column (as of March 2023). Species names and taxonomic authorities were verified using the ‘status’ function of the R package ‘plantlist’ version 0.8.028, and the World Checklist of Vascular Plants (WCVP)29. The original species names and the updated species names are presented in Supplementary File 1, with different columns for ‘ORIGINAL_TAXON_NAME’ and ‘ACCEPTED_SPECIES_NAME’, respectively. The family classification of angiosperms follows APG IV30 and its updated website APW (http://www.mobot.org/MOBOT/Research/APweb), gymnosperms follows Yang et al. in 202231, and ferns and lycopods follows Christenhusz & Chase32.
Specimen collection and identification
In 2017 and 2018, two field trips were organized to collect specimens in the arid belt of the west coast of Peru, ranging from 9.37°S to 18.00°S latitude, 70.29°W to 78.17°W longitude, and 20 to 1,270 meters elevation. The collection campaign extended from September to November during the wet season in Peru, from north to south, with 1,207 specimens collected from 16 main Lomas locations (Table 1; Fig. 1). The environments varied from place to place (see Table 1 for details). Due to the widespread climatic variations of El Niño–Southern Oscillation (ENSO), Lomas vegetation is characterized by its transient and disparate nature of occurrence, which limits the opportunities for collection and plant occurrence records4,20,33. For each sampling locality, we: (1) collected all the plant species found; (2) recorded GPS spatial location, elevation, and habitat characteristics; (3) preserved plant leaf tissue in silica gel; (4) recorded the key characteristics of each specimen and made a preliminary identification of them. All collections were conducted in compliance with national and local regulations under permits N° 310-2017-SERFOR/DGGSPFFS and N° 429-2018-MINAGRI-SERFOR-DGGSPFFS granted by the Peruvian National Forestry and Fauna Service. All the specimens were deposited in the herbarium of South China Botanical Garden (IBSC) and Herbario San Marcos (USM) of the Natural History Museum (UNMSM) in Lima. Following the inclusion in the scientific collections, the specimens were further examined and identified by professional taxonomists from the Herbario San Marcos (USM) to confirm the initial identification made in the field.
DNA barcode generation
Fresh leaves were collected and stored in silica gel during fieldwork. Total genomic DNA was extracted using the CTAB method34. We used two plastid protein-coding DNA regions and a ribosomal DNA region, corresponding to the most widely used plant DNA barcodes35,36: the ribulose-bisphosphate/carboxylase Large-subunit gene (rbcL), the maturase-K gene (matK), and the internal transcribed spacer 2 (ITS2). Sequencing was performed using universal DNA barcode primers for rbcL37 (rbcLa_For: ATGTCACCACAAACAGAGACTAAAGC; rbcLa_Rev: GTAAAATCAAGTCCACCRCG), matK (K. J. Kim, unpublished, Kim-3F: CGTACAGTACTTTTGTGTTTACGAG; Kim-1R: ACCCAGTCCATCTGGAAATCTTGGTTC), ITS238 (ITS2_S2F: ATGCGATACTTGGTGTGAAT; ITS2_S3R: GACGCTTCTCCAGACTACAAT). The PCR reaction mixture was made up of 25 μl (2.5 μl of 10 × PCR buffer (Tris-HCl, 100 mM; KCl, 500 mM; MgCl2, 15 mM), primer pairs 0.5 μl (10 μM) each, 2.0 μl of dNTPs (2.5 µM), 0.5 μl of DNA template (about 20~30 ng), 0.2 μl of rTaq polymerase (5 U µl−1), and 18.8 μl of ddH2O. The PCR amplification was performed under the following conditions: 3 min at 94 °C; 35 cycles of 30 secs at 94 °C, 45 secs at 50 °C for rbcL, 48 °C for matK, and 55 °C for ITS2, 1 min at 72 °C; and 10 min at 72 °C. All PCR products were visualized on a 1.0% agarose gel and sequenced on an ABI3730 DNA analyzer. ITS2 and matK DNA fragments were sequenced in bi-direction to ensure the accuracy of sequence data, and rbcL was sequenced in the forward direction. Raw sequences were trimmed and forward and reverse reads were assembled in Geneious 11.0.239. The newly obtained sequences were initially verified by BLASTn method: All sequences were confirmed as the correct target fragments using BLASTn search of the GenBank database (https://blast.ncbi.nlm.nih.gov/Blast.c-gi). Considering the absence of some species in the GenBank database, the target fragments with the highest bit-scores for sequence accessions from the same genus or family were regarded as reliable40,41. Ultimately, 1,157 samples had at least one sequence matching them, and 858 samples had sequence data for all three DNA barcode sequences (rbcL, matK and ITS2).
Data verification
The accuracy of DNA barcodes is tested by (1) The barcoding gap method: Comparing the K2P (Kimura 2 Parameter) intra- and inter-specific distances for each barcode, where distances were calculated by MEGA-X42, and species were identified to have barcode gaps when the maximum intra-specific distance is lower than the minimum inter-specific distance43; (2) The Best Close Match (BCM) method: Identification efficiency of each DNA barcode was calculated using taxonDNA44; (3) The tree-based method: The sequence obtained was considered correctly identified if all sequences from the same species, genus or family form a monophyletic clade in a phylogenetic tree. Briefly, the obtained sequences were first aligned using MAFFT v7.47145 and manually adjusted in Genious 11.0.239. All obtained rbcL and matK sequences were aligned simultaneously, while ITS2 sequences were aligned by order level. The rbcL, matK, and ITS2 alignments were then combined to create a super-matrix alignment (RMI2). Maximum-likelihood (ML) trees were generated using RAxML-HPC2 (8.2.12)46 with the GTRGAMMA model, and the support for individual nodes in the phylogenetic tree was assessed with 1,000 bootstrap replicates. The sequence obtained was considered correctly identified if all sequences from the same species, genus, or family form a monophyletic clade with bootstrap support (BS) of not less than 50%.
Data Records
To date, 4,653 Peruvian Lomas plant records have been accumulated and summarized in Supplementary File 1. Excluding 276 records not assigned to the species level, there are 1,092 vascular plant species belonging to 468 genera and 96 families in the Peruvian Lomas flora. The dominant families are Solanaceae (17 genera/116 species), Asteraceae (52/103), Poaceae (43/96), Fabaceae (37/79), and Malvaceae (24/67) (Table 2). Each record in Supplementary File 1 represents a species in the literature and specimen databases, with associated information, including (1) Literature_record: ORIGINAL_TAXON_NAME, ACCEPTED_SPECIES_NAME, ACCEPTED_AUTHOR, TAXONOMIC_STATUS, GENUS, FAMILY (APG IV), ORDER, GROUP, REFERENCE, NATIVE_VS_CULTIVATED; (2) Specimen_record: ORIGINAL_TAXON_NAME, ACCEPTED_SPECIES_NAME, ACCEPTED_AUTHOR, TAXONOMIC_STATUS, GENUS, FAMILY (APG IV), ORDER, GROUP, REFERENCE, Collector, IRN, occurrence ID, Catalog Number.
The distribution of main Lomas5,47 and the Lomas visited for this work are shown in Fig. 1. After two years of surveys, 1,207 specimens were collected, of which, 870 were identified to species level (289 species from 192 genera in 67 families). The remaining 337 specimens were identified to genus level (336 individuals in 134 genera) and one to family level (Supplementary File 2), due to missing critical features based on morphological identification, the lack of reference barcode data, barcode sequencing failures or insufficient barcode resolution. This yielded a total of 238 genera from 78 families (Table 3). The dominant families are Asteraceae (28/34, genera/species), Solanaceae (10/32), Malvaceae (9/23), Poaceae (15/21), Boraginaceae (7/15), Fabaceae (11/13), Lamiaceae (6/11), and Amaranthaceae (4/10) (Table 2). The 38 taxa (33 genera and 22 families) are new records (Supplementary File 2) compared with the catalogue of Peruvian Lomas flora in this study.
We generated 3,129 plant DNA barcodes for 1,157 collected specimens, including 1,127 rbcL, 1,038 matK, and 964 ITS2 sequences (Table 3), with sequencing success rates of 93%, 86%, and 80%, respectively. Fifty specimens in this study failed in the end to obtain DNA barcodes. Complete coverage for three core barcodes with a 71% sequencing success rate was achieved for 858 individuals. All specimen details, including species identifiers, voucher information, GPS coordinates, elevation, collection information, accession number, corresponding images, and DNA barcode sequences, were uploaded to the BOLD systems to establish a DNA barcode reference library of Lomas plants in Peru (Fig. 2). It is publicly available as “ DNA barcode for Lomas plants in Peru (DS-PLOMAS, https://doi.org/10.5883/DS-PLOMAS)”. In the BOLD systems, each record in a project represents a voucher specimen with its images, voucher collection numbers, associated sequences, and collection data associated with (1) Lab Sheet: Project Code, Process ID, Sample ID, Field ID, Catalog Num, ITS2 Seq. Length, ITS2 Trace Count, rbcL Seq. Length, rbcL Trace Count, matK Seq. Length, matK Trace Count, Image Count, Barcode Compliant, Stop Codon, Collection Date, Identification, Institution; (2) Voucher Info: Sample ID, Field ID, Museum ID, Institution Storing; (3) Taxonomy: Phylum, Class, Order, Family, Subfamily, Genus, Species, Identifier, Taxonomy Notes; (4) Collection Data: Collectors, Collection Date, Country/Ocean, State/Province, Region, Sector, Exact Site, GPS Coordinates, Elevation, and Collection Notes. All DNA barcodes are available in GenBank, with accession numbers OQ391231-OQ392357 for rbcL, OQ390168-OQ391205 for matK, and OQ411630-OQ412593 for ITS2 (Supplementary File 2). All Supplementary Files for this study are stored at Figshare48.
Technical Validation
For this project, we conducted a review of all voucher specimens and repeated experiments for sequences that were identified as abnormal in BLASTn and Tree-based methods (Fig. 2). The results revealed 19 specimens that were inconsistent with morphological identification, and of the 12 specimens not identified to genus, eleven could be inferred as possible genera (Supplementary File 2). To ensure accuracy, these samples were re-sampled and sequenced, and the results were sent back to collaborating taxonomists for rechecking the specimens. The validated results have been noted in the corresponding specimens of the BOLD system, and the original records are retained for verification.
We evaluated the species identification ability of DNA barcodes using 125 species with multiple individuals (a total of 515 samples covering 125 species, 88 genera, and 34 families), and the results are shown in Table 4. The three-barcode combination (RMI2) had the highest correct identification capacity, ranging from 80% (BCM method) to 72% (Barcoding gap method), which is similar to results in previous DNA barcode studies (i.e., Fazekas et al.49; Hollingsworth et al.50; Naciri & Linder51; Wirta et al.52; Yan et al.53). Additionally, although 39% of the nodes in the 858-sample RMI2-ML tree (The RMI2-ML tree is available in Figshare48) had weak bootstrap support values (BS < 50%), there are still 15% of nodes with moderate support (BS from 50% to 70%), and 45% with high support (BS > 70%), which were sufficient to discriminate most genera and families. Of the 134 genera with multiple individuals, 17 (13%) were detected as not monophyletic (Supplementary File 2). These non-monophyletic relationships are partially due to unclear inter-generic boundaries (e.g., Nassella vs. Jarava and Poa vs. Rostraria in Poaceae, Cyclanthera vs. Sicyos in Cucurbitaceae, etc.), and insufficient barcode discrimination (e.g., Convolvulus in Convolvulaceae, Villanova in Asteraceae, Croton in Euphorbiaceae, etc.), and some were mixed with single individuals. At the family level, two of the 47 families with multiple individuals (4%) were not clustered into monophyletic groups (Asparagaceae and Amaryllidaceae).
Usage Notes
We provide freely accessible and downloadable historical plant lists and a DNA barcode reference library of Peruvian Lomas plants. Data as of the publication date are available for indexed search and free download from BOLD and GenBank. This library is anticipated to be of benefit to a wide range of applications, from species taxonomic revisions to specimen identification, including biogeographic and evolutionary studies in the region, to biodiversity surveys and monitoring, and environmental assessments.
Code availability
The code used to check species names and taxonomic authorities is available in the R package ‘plantlist’ version 0.8.0 (Zhang28).
References
Dillon, M. O., Nakazawa, M. & González, S. L. The Lomas formations of coastal Peru: Composition and biogeographic history. Vol. 43 1–9 (Botany New Series, 2003).
Chávez, R. O. et al. GIMMS NDVI time series reveal the extent, duration, and intensity of “blooming desert” events in the hyper-arid Atacama Desert, Northern Chile. Int J Appl Earth Obs Geoinf 76, 193–203 (2019).
Martínez-Harms, J. et al. Mechanisms of flower coloring and eco-evolutionary implications of massive blooming events in the Atacama Desert. Front. Ecol. Evol. 10, 957318 (2022).
Dillon, M. O., González, S. L., Cruz, M. Z., Asencio, P. L. & Silvestre, V. Q. Floristic checklist of the Peruvian Lomas Formations. Arnaldoa 18, 7–32 (2011).
Moat, J. et al. Seeing through the clouds–Mapping desert fog oasis ecosystems using 20 years of MODIS imagery over Peru and Chile. Int J Appl Earth Obs Geoinf 103, 102468 (2021).
Rundel, P. W., Villagra, P. E., Dillon, M. O., Roig, S. & Debandi, G. in The Physical Geography of South America (eds Veblin T., Young, K. & Orme, A.) Ch. Arid and Semi-Arid Ecosystems, 158–183 (Oxford University Press, 2007).
Eichler, T. P. & Londoño, A. C. South American climatology and impacts of El Niño in NCEP’s CFSR data. Adv. Meteorol. 2013, 1–15 (2013).
Beresford-Jones, D. et al. Re-evaluating the resource potential of lomas fog oasis environments for Preceramic hunter–gatherers under past ENSO modes on the south coast of Peru. Quat Sci Rev 129, 196–215 (2015).
Perez-Escobar, O. A. et al. The Andes through time: evolution and distribution of Andean floras. Trends Plant Sci. 27, 364–378 (2022).
Tchakerian, V. P. in Encyclopedia of Atmospheric Sciences (Second Edition) (eds North, G. R., Pyle, J. & Zhang, F.) 185–192 (Academic Press, 2015).
Péfaur, J. E. Dynamics of plant communities in the Lomas of southern Peru. Vegetatio 49, 163–171 (1982).
González, S. L., Cruz, M. Z., Bazán, G. G. & Chávez, L. C. Floristic diversity of the Loma Cerro Campana, Province, Department La Libertad Trujillo-Peru. Arnaldoa 21, 187–220 (2014).
González, S. L. et al. Diversidad florística de la Loma Mongón, Provincia Casma, Departamento Ancash, Perú. Arnaldoa 15, 45–62 (2008).
Cuba-Melly, N. & Odar, J. Diversity of vascular flora of lomas granados and possible threats to its conservation, province of huaral, lima–peru. The Biologist 16, 237–250 (2018).
Tovar, C. Sanchez Infantas, E. & Teixeira Roth, V. Plant community dynamics of lomas fog oasis of Central Peru after the extreme precipitation caused by the 1997–98 El Nino event. PLoS One 13, e0190572 (2018).
Trinidad, H., Huamán-Melo, E., Delgado, A. & Cano, A. Vascular flora from Villa María and Amancaes Lomas, Lima, Peru. Rev. peru biol. 19, 149–158 (2012).
Montenegro-Hoyos, A., Vega, N. & Linares-Palomino, R. Plant diversity and structure in desert communities of the Andean piedmont in Ica, Peru. Vegetation Classification and Survey 3, 53–66 (2022).
Whaley, O. Q., Orellana-Garcia, A. & Pecho-Quispe, J. O. An Annotated Checklist to Vascular Flora of the Ica Region, Peru–with notes on endemic species, habitat, climate and agrobiodiversity. Phytotaxa 389, 1–125 (2019).
Quipuscoa Silvestre, V. et al. Diversidad de plantas vasculares de las lomas de Yuta, provincia Islay, Arequipa, Perú, 2016. Arnaldoa 23, 517–546 (2016).
Whaley, O. Q. et al. An ecosystem approach to restoration and sustainable management of dry forest in southern Peru. Kew Bulletin 65, 613Y641 (2010).
Alex Smith, M. et al. DNA barcoding and the taxonomy of Microgastrinae wasps (Hymenoptera, Braconidae): impacts after 8 years and nearly 20 000 sequences. Mol Ecol Resour 13, 168–176 (2013).
Dasmahapatra, K. K. & Mallet, J. Taxonomy: DNA barcodes: recent successes and future prospects. Heredity 97, 254–255 (2006).
DeSalle, R. & Goldstein, P. Review and Interpretation of Trends in DNA Barcoding. Front. Ecol. Evol. 7, 302 (2019).
Hubert, N. & Hanner, R. DNA Barcoding, species delineation and taxonomy: a historical perspective. DNA Barcodes 3, 44–58 (2015).
Kress, W. J. Plant DNA barcodes: Applications today and in the future. J Syst Evol 55, 291–307 (2017).
Telfer, A. C. et al. Biodiversity inventories in high gear: DNA barcoding facilitates a rapid biotic survey of a temperate nature reserve. Biodivers Data J 3, e6313 (2015).
Ratnasingham, S. & Hebert, P. D. N. bold: The Barcode of Life Data System (http://www.barcodinglife.org). Mol Ecol Notes 7, 355–364 (2007).
Zhang, J. Plantlist: Looking Up the Status of Plant Scientific Names based on The Plant List Database. R package version (0.8.0), https://github.com/helixcn/plantlist (2022).
Govaerts, R., Nic Lughadha, E., Black, N., Turner, R. & Paton, A. The World Checklist of Vascular Plants, a continuously updated resource for exploring global plant diversity. Sci. Data 8, 215 (2021).
The Angiosperm Phylogeny Group. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV. Bot. J. Linn. Soc. 181, 1–20 (2016).
Yang, Y. et al. Recent advances on phylogenomics of gymnosperms and a new classification. Plant Divers 44, 340–350 (2022).
Christenhusz, M. J. & Chase, M. W. Trends and concepts in fern classification. Ann. Bot. 113, 571–594 (2014).
Dillon, M. Origins and diversity of the lomas formations in the Atacama and Peruvian Deserts of western South America. Abstract Am. J. Bot. 76, 212 (1989).
Doyle, J. J. & Doyle, J. L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical Bulletin 19, 11–15 (1987).
CBOL Plant Working Group. A DNA barcode for land plants. Proc Natl Acad Sci USA 106, 12794–12797 (2009).
China Plant BOL Group. et al. Comparative analysis of a large dataset indicates that internal transcribed spacer (ITS) should be incorporated into the core barcode for seed plants. Proc Natl Acad Sci USA 108, 19641–19646 (2011).
Kress, W. J. et al. Plant DNA barcodes and a community phylogeny of a tropical forest dynamics plot in Panama. Proc Natl Acad Sci USA 106, 18621–18626 (2009).
Chen, S. et al. Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species. PLoS One 5, e8613 (2010).
Ripma, L. A., Simpson, M. G. & Kristen, H. L. Geneious! Simplified genome skimming methods for phylogenetic systematic studies: A case study in Oreocarya (Boraginaceae). Appl Plant Sci 2, 1400062 (2014).
Kelly, L. J. et al. DNA barcoding of lichenized fungi demonstrates high identification success in a floristic context. New Phytol 191, 288–300 (2011).
Liu, J. et al. The use of DNA barcoding as a tool for the conservation biogeography of subtropical forests in China. Diversity Distrib. 21, 188–199 (2015).
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol Biol Evol 35, 1547–1549 (2018).
Hebert, P. D. N., Cywinska, A., Ball, S. L. & deWaard, J. R. Biological identifications through DNA barcodes. Proc R Soc Lond B Biol Sci. 270, 313–321 (2003).
Meier, R., Shiyang, K., Vaidya, G. & Ng, P. K. DNA barcoding and taxonomy in Diptera: a tale of high intraspecific variability and low identification success. Syst Biol 55, 715–728 (2006).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30, 772–780 (2013).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Ruhm, J. et al. Two Sides of the Same Desert: Floristic Connectivity and Isolation Along the Hyperarid Coast and Precordillera in Peru and Chile. Front. Ecol. Evol. 10, 862846 (2022).
Song, F. et al. Flora diversity survey and establishment of a plant DNA barcode database of Lomas ecosystems in Peru. figshare https://doi.org/10.6084/m9.figshare.22032629.v7 (2023).
Fazekas, A. J. et al. Are plant species inherently harder to discriminate than animal species using DNA barcoding markers? Mol Ecol Resour 9, 130–139 (2009).
Hollingsworth, P. M., Li, D. Z., van der Bank, M. & Twyford, A. D. Telling plant species apart with DNA: from barcodes to genomes. Philos Trans R Soc Lond B Biol Sci B 371, 20150338 (2016).
Naciri, Y. & Linder, H. P. Species delimitation and relationships: The dance of the seven veils. Taxon 64, 3–16 (2015).
Wirta, H. et al. Establishing a community-wide DNA barcode library as a new tool for arctic research. Mol Ecol Resour 16, 809–822 (2016).
Yan, L. J. et al. DNA barcoding of Rhododendron (Ericaceae), the largest Chinese plant genus in biodiversity hotspots of the Himalaya-Hengduan Mountains. Mol Ecol Resour 15, 932–944 (2015).
Acknowledgements
We are grateful to the professional taxonomists who helped with species identification, particularly María Isabel La Torre and Hamilton Beltrán. We also thank Yu-Ying Zhou for help with laboratory work and database establishment. This study was financially supported by the International Partnership Program of the Chinese Academy of Sciences (Grant No. 151853KYSB20190027) and the CAS President’s International Fellowship Initiative (Grant No. 2022VBA0003).
Author information
Authors and Affiliations
Contributions
X.-J.G. conceived and designed the study; X.-J.G., Y.-F.D., H.-F.Y., Z.-L.L., A.D., H.T., P.G.A., S.R., A.C.-E. collected and identified plant materials; E.R., M.A., Y.P.A., S.R., processed specimens and identified plants; F.S. gathered the historical species list data, performed the laboratory works, analyzed data, and wrote the manuscript. X.-J.G., revised and finalized the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Song, F., Deng, YF., Yan, HF. et al. Flora diversity survey and establishment of a plant DNA barcode database of Lomas ecosystems in Peru. Sci Data 10, 294 (2023). https://doi.org/10.1038/s41597-023-02206-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02206-y