Trends in Ecology & Evolution
ReviewSequencing our way towards understanding global eukaryotic biodiversity
Section snippets
Microscopic eukaryotes: global dominance, scant knowledge
Microscopic eukaryotic taxa are abundant and diverse, playing a globally important role in the functioning of ecosystems 1, 2 and host-associated habitats [3]. Here, we consider taxa generally represented by individuals <1 mm in size; the term ‘microscopic eukaryotes’ thus encompasses meiofaunal metazoans (e.g. Nematoda, Platyhelminthes, Gastrotricha and Kinorhyncha; see Glossary), microbial representatives of fungi and deep protist lineages (Alveolata, Rhizaria, Amoebozoa, algal taxa in the
Emerging insight from environmental data
Following earlier 16S rRNA (reference GenBank accession X80721.1 for Escherichia coli) investigations of archaeal and bacterial communities 14, 22, high-throughput marker gene approaches were developed for different groups of microscopic eukaryote using the 18S nuclear small subunit rRNA gene (nSSU; reference GenBank accession X03680.1 for Caenorhabditis elegans), focusing on protists 11, 12, 23, 24, 25, 26 and meiofauna 9, 10, 27. Similar to 16S investigations, these early 18S studies
Analyzing high-throughput data
Over the past few years, high-throughput sequencing techniques have been informed by rapid progress in sequencing technology, bioinformatics tools and analytical pipelines. Here, we present an overview of the analytical considerations for high-throughput studies. Following sample collection, extraction of environmental DNA, PCR and sequencing (Figure 1), large data sets can be processed using many existing tools (Table 1, Figure 2).
The need for robust guide trees and reference databases
Limited eukaryote reference databases and inconsistent taxonomic levels currently hinder the development of robust computational pipelines for marker gene data (e.g. reference-based OTU picking and confident taxonomy assignments [60]), and limit the use of tree-based methods and deeper sequencing technologies with shorter sequence reads (such as those derived from the Illumina platforms). Microscopic eukaryotic taxa have been historically underrepresented in public repositories, with some
Future outlook and challenges
Although substantial progress is being made with high-throughput eukaryotic studies, many challenges lie ahead. A strong emphasis on morphological and environmental data collection, guide trees and reference sequence databases, and open-access repositories for high-throughput data sets is urgently needed. Large-scale sequencing methods offer substantial promise for basic and applied biodiversity research, yet the wider adoption of these approaches will probably hinge on the ease-of-use and
Concluding remarks
The promise and accessibility of high-throughput sequencing is now poised to attract increasing numbers of non-computationally trained researchers. With ongoing declines in the price of sequencing, deep sequencing will inevitably represent the most cost-effective approach for elucidating ecological and functional roles of complex communities. However, exploiting the data will require the continued refinement of bioinformatics pipelines and database resources, which will in turn require an
Acknowledgments
The authors would like to thank the anonymous reviewers for their insightful comments that significantly helped to improve an earlier version of the manuscript. Development of this manuscript was made possible by a Catalysis Meeting award (HB and WKT) from the National Evolutionary Synthesis Center. HB and WKT supported through NSF (DEB-1058458 and NIH (NIH-1P20RR030360-01)). SC supported by a Natural Environment Research Council (NERC) New Investigator Grant (NE/E001505/1), a Post Genomic and
Glossary
- 454
- common term for the Roche GS platforms that use bead emulsion methods and typically return approximately 1.2 million sequences per full plate run (reads currently averaging 350–450 bp).
- Illumina
- company producing the newest Hi-Seq and MiSeq platforms, which uses bridge amplification to produce 1.6 billion sequences per eight-lane Hi-Seq flow cell (current max length for paired-end reads is 300 bp).
- Marker gene surveys
- high-throughput environmental sequencing utilizing homologous genetic loci (e.g.
References (100)
- et al.
Molecular screening of free-living microbial eukaryotes: diversity and distribution using a meta-analysis
Curr. Opin. Microbiol.
(2005) Mapping of picoeukaryotes in marine ecosystems with a quantitative PCR of the 18S rRNA gene
FEMS Microbiol. Ecol.
(2005)Systems biology for ecology: from molecules to ecosystems
Adv. Ecol. Res.
(2010)- et al.
Applications of PCR-based tools for detection and identification of animal trypanosomes: a review and perspectives
Vet. Parasitol.
(2002) Reducing chimera formation during PCR amplification to ensure accurate genotyping
Gene
(2010)Molecular phylogeny of Banza (Orthoptera: Tettigoniidae), the endemic katydids of the Hawaiian Archipelago
Mol. Phylogenet. Evol.
(2006)Exponential decline of deep-sea ecosystem functioning linked to benthic biodiversity loss
Curr. Biol.
(2008)The influence of biotic interactions on soil biodiversity
Ecol. Lett.
(2006)Microbial eukaryotes in the human microbiome: ecology, evolution, and future directions
Front. Microbiol.
(2011)Microeukaryote community patterns along an O2/H2S Gradient in a supersulfidic Anoxic Fjord (Framvaren, Norway)
Appl. Environ. Microbiol.
(2006)
Genetic diversity and habitats of two enigmatic marine alveolate lineages
Aquat. Microb. Ecol.
Archaeorhizomycetes: unearthing an ancient class of ubiqutous soil fungi
Science
Discovery of novel intermediate forms redefines the fungal tree of life
Nature
Ultrasequencing of the meiofaunal biosphere: practice, pitfalls, and promises
Mol. Ecol.
Evaulating high-throughput sequencing as a method for metagenomic analysis of nematode diversity
Mol. Ecol. Resour.
Multiple marker parallel tag enviornmental DNA sequencing reveals a highly complex eukaryotic community in marine anoxic water
Mol. Ecol.
Massively parallel tag sequencing reveals the complexity of anaerobic marine protistan communities
BMC Biol.
A molecular view of microbial diversity and the biosphere
Science
Microbial diversity in the deep sea and the unexplored ‘rare biosphere’
Proc. Natl. Acad. Sci. U.S.A.
A global census of marine microbes
The influence of sex, handedness, and washing on the diversity of hand surface bacteria
Proc. Natl. Acad. Sci. U.S.A.
A core gut microbiome in obese and lean twins
Nature
The correlation between rDNA copy number and genome size in eukaryotes
Genome
Diversity of 16S rRNA genes within individual prokaryotic genomes
Appl. Environ. Microbiol.
Lateral gene transfer in eukaryotes
Cell. Mol. Life Sci.
Microbial population structures in the deep marine biosphere
Science
A method for studying protistan diversity using massively parallel sequencing of V9 hypervariable regions of small-subunit ribosomal RNA genes
PLoS ONE
Diversity in a hidden world: potential an limitation of next-generation sequencing for surveys of molecular diversity of eukaryotic microorganisms
Mol. Ecol.
Protistan microbial observatory in the Cariaco Basin, Caribbean. II. Habitat specialization
ISME J.
Protistan microbial observatory in the Cariaco Basin, Caribbean. I. Pyrosequencing vs Sanger insights into species richness
ISME J.
Metagenetic community analysis of microbial eukaryotes illuminates biogeographic patterns in deep-sea and shallow water sediments
Mol. Ecol.
Contrasting seasonal niche separation between rare and abundant taxa conceals the extent of protist diversity
Mol. Ecol.
Rapidly denoising pyrosequencing amplicon reads by exploiting rank-abundance distributions
Nat. Methods
Second-generation environmental sequencing unmasks marine metazoan biodiversity
Nat. Commun.
Depicting more accurate pictures of protistan community complexity using pyrosequencing of hypervariable SSU rRNA gene regions
Environ. Microbiol.
Ultra-deep sequencing of foraminiferal microbarcodes unveils hidden richness of early monothalamous lineages in deep-sea sediments
Proc. Natl. Acad. Sci. U.S.A.
Ecological assessment of estuarine sediments by pyrosequencing eukaryotic ribosomal DNA
Front. Ecol. Environ.
Environmental barcoding: a next-generation sequencing approach for biomonitoring applications using river benthos
PLoS ONE
Assessing macroinvertebrate biodiversity in freshwater ecosystems: advances and challenges in DNA-based approaches
Q. Rev. Biol.
Phylogeny, phylogeography, phylobetadiversity and the molecular analysis of biological communities
Philos. Trans. R. Soc. B: Biol. Sci.
Removing noise from pyrosequenced amplicons
BMC Bioinform.
pplacer: linear time maximum-likelihood Bayesian phyogenetic placement of sequences onto a fixed reference tree
BMC Bioinform.
Aligning short reads to reference alignments and trees
Bioinformatics
Search and clustering orders of magnitude faster than BLAST
Bioinformatics
Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities
Appl. Environ. Microbiol.
Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences
Bioinformatics
ESPRIT-Tree: hierarchial clustring analysis of millions of 16S rRNA pyrosequences in quasilinear computational time
Nucleic Acids Res.
The Ribosomal Database Project: improved alignments and new tools for rRNA analysis
Nucleic Acids Res.
Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB
Appl. Environ. Microbiol.
QIIME allows analysis of high-throughput community sequencing data
Nat. Methods
Cited by (341)
DNA metabarcoding reveals spatial and temporal variation of fish eye fluke communities in lake ecosystems
2024, International Journal for ParasitologyTen years of green manuring and biofumigation alters soil characteristics and microbiota
2023, Applied Soil EcologyResponse mechanism of meiofaunal communities to multi-type of artificial reef habitats from the perspective of high-throughput sequencing technology
2023, Science of the Total Environment