A method for simultaneous measurement of soil bacterial abundances and community composition via 16S rRNA gene sequencing
Introduction
High-throughput sequencing has revolutionized the field of soil microbial ecology. In particular, 16S rRNA gene sequencing has provided unprecedented insight into the diversity of soil microbial communities, and it is now commonly used to characterize bacterial and archaeal communities in soil or related environments. However, one important limitation of this approach is that it only provides estimates of the proportional abundances of taxa – it provides no information on how the total amounts of microbial DNA may vary across samples. This can cause problems in the interpretation of results. For example, consider two samples that both have 20% of 16S rRNA sequences assigned to the phylum Acidobacteria. Such information can be used to assess how the abundance of Acidobacteria compares to other taxa, but this does not tell us anything about the total numbers of bacteria in these two samples and one sample could have far more Acidobacteria than another if it had a higher total number of bacterial cells. Likewise, two samples could appear to have very different proportional abundances of 16S rRNA reads assigned to a bacterial taxon of interest (e.g., a bacterial pathogen), but these two samples could actually have the same number of cells belonging to that taxon. Changes in bacterial community composition can also be difficult to interpret when relying on proportional abundances since it is often difficult to differentiate between one taxon increasing in abundance and another decreasing in abundance. Thus, the proportional nature of marker gene datasets makes it impossible to compare how the total amounts of microbes (or their marker genes) vary across environmental samples that could have very different total cell numbers.
While there are a myriad of non-nucleic acid techniques for estimating total microbial biomass in soil or other environments (including phospholipid fatty acid analysis, substrate induced respiration, chloroform fumigation, and direct counting) – these methods are difficult to relate directly to the relative abundances as determined by 16S rRNA gene sequencing. This is due to the fact that these methods do not provide direct information on the amounts of 16S rRNA gene copies in a given sample, but instead represent other metrics of cell abundances (e.g., amounts of phospholipid fatty acids, amounts of chloroform-extractable microbial biomass carbon, or numbers of visible cells). Perhaps more importantly, these approaches for estimating soil microbial biomass can require significant added effort, they may require analyzing fresh (unfrozen) samples, and they often do not discriminate between prokaryotes and other organisms that can be abundant in soil (like fungi, protists, plants, or soil fauna). Likewise, total DNA yield is not typically considered a useful estimate of bacterial biomass as much of the DNA in soil comes from soil organisms that are not bacteria (Leckie et al., 2004) and DNA extraction efficiencies can vary dramatically across soil types (Frostegård et al., 1999, Cruaud et al., 2014). While quantitative PCR is frequently used to determine the number of 16S rRNA genes in a given sample (e.g. Fierer et al., 2005), this requires an additional step in the analyses, and the quantitative PCR analyses are also subject to biases due to differences in DNA extraction efficiencies and varying DNA amplification efficiencies across samples (Martin-Laurent et al., 2001, Fierer et al., 2005).
Clearly it is useful to have a method in place to directly relate 16S rRNA gene sequence data to estimates of the total amount of 16S rRNA genes found in a given sample to improve our understanding of the changes in taxon abundances underlying spatial or temporal differences in community composition. Here, we describe a method that couples sequencing-based analyses of the 16S rRNA gene for the assessment of microbial diversity with the determination of the variability in 16S rRNA gene abundances across samples, thus allowing the comparison of the actual abundances of different microbial taxa across samples, not just their proportional abundances. The approach involves adding DNA from a bacterium unlikely to be found in soil to the soil sample at the DNA extraction step. We then use the proportional representation of 16S rRNA reads from this organism to calculate how the total abundances of 16S rRNA genes vary across samples (Fig. 1). By adding this simple step to pipelines for 16S rRNA gene analyses, one can simultaneously compare how the diversity and abundances of soil bacteria vary across samples.
Section snippets
General description of the approach
Our general approach is outlined in Fig. 1. For convenience, we define cell abundances in this manuscript as 16S rRNA gene abundances, although we recognize that the conversion from 16S rRNA gene numbers to bacterial cell abundances should be done with caution as the number of 16S rRNA gene copies per cell can vary from one to fifteen (Lee et al., 2009). Prior to DNA extraction, we add an internal standard to permit estimation of 16S rRNA gene abundances across soil samples. This internal
Selection of internal standards to add to soil
We confirmed that our sources of internal standard DNA, from A. fischeri and T. thermophilus, do not commonly occur in soil by looking for these taxa (or closely related taxa) in previously published datasets that span a wide range of soil types. No sequences >97% similar to 16S rRNA gene sequences from A. fischeri and T. thermophilus were found in any of these collections of soil samples. In addition, when we screened the 116 soil samples used to test this method (Table S1), 16S rRNA sequences
Conclusions
Whereas until now, 16S rRNA gene sequencing only gave information on the relative abundances of taxa within a sample, with the internal standard it is now possible to estimate how the total amounts of 16S rRNA genes may vary across samples. The described method is likely to be broadly useful when conducting analyses of microbial studies in soil, and potentially other environments, where the ability to compare the total amounts of bacteria and archaea between samples is critical. Further testing
Acknowledgments
We thank Jessica Henley for her assistance with sample processing and Tess Brewer for her assistance with lab work and graphic design.
This work was supported by grants to N.F., R.L. M., and M.A.B. from the U.S. National Science Foundation (DEB-1021222, DEB-1021098, DEB-0953331, and DEB-1021112) with funding to W.S. and S.L. provided from the University of Antwerp.
References (29)
- et al.
Relationships between soil microbial biomass determined by chloroform fumigation–extraction, substrate-induced respiration, and phospholipid fatty acid analysis
Soil Biology & Biochemistry
(2002) - et al.
Decreased mass specific respiration under experimental warming is robust to the microbial biomass method employed
Ecology Letters
(2009) - et al.
Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms
ISME Journal
(2012) - et al.
Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample
- et al.
Predicting the responsiveness of soil biodiversity to deforestation: a cross-biome study
Global Change Biology
(2014) - et al.
Influence of DNA extraction method, 16S rRNA targeted hypervariable regions, and sample origin on microbial diversity detected by 454 pyrosequencing in marine chemosynthetic ecosystems
Applied and Environmental Microbiology
(2014) - et al.
Nuclear DNA content and genome size of trout and human
Cytometry Part A
(2003) UPARSE: highly accurate OTU sequences from microbial amplicon reads
Nature Methods
(2013)- et al.
Assessment of bias associated with incomplete extraction of microbial DNA from soil
Applied & Environmental Microbiology
(2009) - et al.
Assessment of soil microbial community structure by use of taxon-specific quantitative PCR assays
Applied and Environmental Microbiology
(2005)
Variations in microbial community composition through two soil depth profiles
Soil Biology & Biochemistry
Quantification of bias related to the extraction of DNA directly from soils
Applied and Environmental Microbiology
Use and misuse of PLFA measurements in soils
Soil Biology & Biochemistry
Conducting a microbiome study
Cell
Cited by (150)
Evaluation of lipopeptides biosurfactant from Raoultella planticola for bioremediation in n-hexadecane-contaminated soil
2024, Journal of Environmental Chemical EngineeringEffect of forest soil viruses on bacterial community succession and the implication for soil carbon sequestration
2023, Science of the Total EnvironmentMolecular and associated approaches for studying soil biota and their functioning
2023, Soil Microbiology, Ecology and Biochemistry, Fifth Edition