Improving assessment accuracy for lake biological condition by classifying lakes with diatom typology, varying metrics and modeling multimetric indices

doi:10.1016/j.scitotenv.2017.07.152

Science of The Total Environment

Volume 609, 31 December 2017, Pages 263-271

https://doi.org/10.1016/j.scitotenv.2017.07.152 Get rights and content

Highlights

•
Hierarchical modeling improved multimetric indices (MMI) performance.
•
Modeled MMI performances were different when evaluated at different spatial scales.
•
Varying metrics among site groups did not improve MMI performance.

Abstract

Site grouping by regions or typologies, site-specific modeling and varying metrics among site groups are four approaches that account for natural variation, which can be a major source of error in ecological assessments. Using a data set from the 2007 National Lakes Assessment project of the USEPA, we compared performances of multimetric indices (MMI) of biological condition that were developed: (1) with different lake grouping methods, ecoregions or diatom typologies; (2) by varying or not varying metrics among site groups; and (3) with different statistical techniques for modeling diatom metric values expected for minimally disturbed condition for each lake. Hierarchical modeling of MMIs, i.e. grouping sites by ecoregions or typologies and then modeling natural variability in metrics among lakes within groups, substantially improved MMI performance compared to using either ecoregions or site-specific modeling alone. Compared with MMIs based on ecoregion site groups, MMI precision and sensitivity to human disturbance were better when sites were grouped by diatom typologies and assessing performance nationwide. However, when MMI performance was evaluated at site group levels, as some government agencies often do, there was little difference in MMI performance between the two site grouping methods. Low numbers of reference and highly impacted sites in some typology groups likely limited MMI performance at the group level of analysis. Varying metrics among site groups did not improve MMI performance. Random forest models for site-specific expected metric values performed better than classification and regression tree and multiple linear regression, except when numbers of reference sites were small in site groups. Then classification and regression tree models were most precise. Based on our results, we recommend hierarchical modeling in future large scale lake assessments where lakes are grouped by ecoregions or diatom typologies and site-specific metric models are used to establish expected metric values.

Graphical abstract

Introduction

Assessments of biological condition are important for managing freshwater resources (European Union, 2000, USEPA (U.S. Environmental Protection Agency), 2007a, USEPA (U.S. Environmental Protection Agency), 2007b). In lakes, diatoms have a long history of use in paleoecological studies that document lake responses to a wide variety of human disturbances, because diatoms are sensitive to many environmental changes and current as well as past assemblages are preserved in lake sediments (Smol and Stoermer, 2010). Diatoms are also important primary producers, elements of food webs, and sources of biodiversity in lakes (Mann and Droop, 1996); thus diatoms are important elements of biological condition in lakes (sensu Davies and Jackson, 2006). Diatom assemblages may play a unique role for understanding biological integrity, because they likely respond to different types of disturbances compared to lake invertebrates and fish, as they do in streams (O'Connor et al., 2000, Hering et al., 2006, Carlisle et al., 2008, Beck and Hatch, 2009). As a result, diatoms should be particularly valuable in the assessment of current lake conditions as well as paleoecological studies.

Relationships among natural environment factors, human disturbance and metrics are complicated. Relationships between human disturbance and metrics can be influenced by the effects of natural environment on both metrics and disturbance (Stoddard et al., 2008, Hawkins et al., 2010, Schoolmaster et al., 2013). Thus, one of the challenges with assessing ecological condition across large spatial scales is distinguishing effects of human disturbance from natural variation (Stevenson et al., 2013). Natural variability in diatom assemblage composition is great at continental spatial scales and may be related to species biogeographies and the high sensitivity of diatoms species composition to naturally varying environmental factors. Stevenson et al. (2009) showed that a diatom metric for trophic status was affected as much by natural variability among streams as human disturbance. A priori classification of sites by regions or typologies, site-specific modeling of expected reference condition, and varying metrics in site groups are four approaches that have been used to control natural variation in ecological assessments (Whittier et al., 2007, Hawkins et al., 2010).

Landscape regionalizations and aquatic biota assemblage composition have been used to group sites into classes to account for natural variation among sites (Hawkins et al., 2010). Regionalization scheme, such as Omernik's ecoregions (Omernik, 1987), has been extensively used in freshwater assessment, particularly in the US (USEPA, 2010). Ecoregions and EDUs are assumed to capture a significant amount of the natural variation in metrics or multimetric indices (MMIs) caused by differences in climate, geology, hydrology, soils, and surrounding vegetation. Regionalization schemes, however, cannot account for biotic response to natural variation within an ecoregion (Hawkins et al., 2010). Biological typologies assign sites to groups (i.e. typologies) by similarity in species composition of assemblages at reference sites. Biological typologies are not spatially constrained, so they can account for natural variation within and across regions. Biological typologies are used to account for natural variation in species composition among habitats in RIVPACS (Wright et al., 2000), a widely used approach for stream bioassessment in Europe and Australia.

Site-specific modeling of expected reference condition enables adjusting individual metrics for natural variation among sites. The adjusted metric values are the difference between the unadjusted metric values and the modeled expected reference value of each metric for that site. Models for expected reference metric value for a site are calculated using reference site data including unadjusted metric values and a suite of environmental variables that are affected relatively little by humans. Up to now, a variety of statistical techniques have been used to model relationships between individual metrics or MMIs and natural gradients, such as multiple linear regression (MLR) (Stevenson et al., 2013), classification and regression trees (CART, Cao et al., 2007), and random forest (RF, Hollister et al., 2016). Linear regression and CART have advantages over other techniques, because they are easier to understand by stakeholders. But more advanced modeling techniques that involve machine learning may perform better. For example, both RF and CART can model nonlinear relationships with interactions better than MLR. Moreover, RF is less susceptible to overfitting than CART and would therefore provide more accurate predictions when used with new data than CART (Breiman, 2001, Cutler et al., 2007). The choice of technique might depend on sample size and non-linear interaction of multiple variables (Smith et al., 2013), because machine learning statistical techniques usually require larger sample sizes for precise models.

Performance of MMIs could be increased if different metrics are used in different ecoregions, because human activities and the stressors they produce vary greatly among ecoregions (Ellis and Ramankutty, 2008) and sensitivity of metrics differs among stressors (Whittier et al., 2007). Both the types and intensity of human activities vary among ecoregions, with extensive agriculture in some ecoregions and more patchy urban and agricultural activities in others (USEPA, 2013). Responses of stream diatom metrics to a nutrient dominated agricultural gradient likely differ compared to a multistressor gradient with both urban and agricultural activities (Tang et al., 2016). Whittier et al. (2007) found that using different metrics in different ecoregions provided the best MMI performance, which indicates that some metrics did not respond to human disturbance as much in some ecoregions as others. Performance of MMIs could also be increased if different metrics were used in different groups of sites defined by biological typology. For example, fish and invertebrate species richness differed in cold and warm water habitats (Mebane et al., 2003, Hughes et al., 2004). However, a trade-off exists between consistency and sensitivity when deciding whether to use different biotic metrics for MMIs in different ecoregions. MMIs might become more sensitive to human disturbance if different metrics are used among ecoregions (or site groups defined by biological typology), but changing metrics also changes what we are assessing and therefore reduces consistency in assessments across groups.

In the present study, we evaluated different methods for improving the performance of a nationwide diatom MMI for lakes with the US Environmental Protection Agency's dataset from the 2007 National Lakes Assessment (NLA). We evaluated three hypotheses: (1) performance of MMIs will be greater when grouping sites by diatom typology than by ecoregions; (2) MMIs generated by selecting metrics for each site group (typologies or ecoregions) will perform better than by using the same set of metrics in all site groups; and (3) different statistical techniques (e.g. MLR, CART, RF) for adjusting metrics for natural variability will perform best in different situations. To do this, we grouped sites by ecoregions and diatom typology and calculated site-specific models of expected reference condition for each group of sites by ecoregion or typology. We then compared metric and MMI performance using a standard set of statistics that have been used in other evaluations of ecological assessment methods.

Section snippets

Data sets

The NLA was conducted by the United States Environment Protection Agency (USEPA). The NLA provides a nationwide dataset and analysis in which the same standardized field and laboratory protocols and the same data analyses were used for individual biological assemblages (http://water.epa.gov/type/lakes/lakessurvey_index.cfm). 1031 lakes were sampled for the NLA. 909 were selected randomly with a probabilistic sampling design from the pool of all USA lakes. 122 lakes were hand-picked to serve as

Site grouping by diatom typology

We used 5 centers to group the 144 reference sites with K-means clustering into 5 diatom typologies of reference lakes. The among group variation explained with 5 centers was 60.1% of all variation. A higher number of clusters would have provided much less improvement in variation explained per cluster than the previous five clusters, but a higher number of clusters would not have provided a sufficient number of reference sites in each cluster for metric modeling. The numbers of reference sites

Effect of site grouping methods and metric modeling on MMI performance

Hierarchical metric modeling, with site grouping and site specific metric modeling, improved lake diatom MMI performance compared to accounting for natural variation in MMIs by either ecoregions or site specific models. Stevenson et al. (2013) argued that performance of an adjusted MMI (NLA-MLR) was better than an unadjusted MMI for the USEPA's NLA data, where adjustments for natural variation were made with site specific models for MMIs (not individual metrics as in this study). In that study,

Conclusions

Hierarchical modeling improved diatom MMI performance for lakes with a combination of site grouping and modeling expected reference values of metrics within site groups. Modeled MMIs within diatom typologies had the highest overall performance and sensitivity to HDG when evaluated with all 1031sites at a national scale. However, when MMI performance was evaluated for each site group to assess consistency and to follow common USEPA methods, there was little performance difference between

Acknowledgements

We thank K. A. Blocksom and J. van Sickle for providing an original version of R code to calculate diatom multimetric indices. RJS was partially supported by a cooperative agreement with the USEPA (Grant R835203).

References (45)

I. Dodkins et al.
Developing an optimal river typology for biological elements within the Water Framework Directive
Water Res.
(2005)
J.R. Leathwick et al.
Complementarity-based conservation prioritization using a community classification, and its application to riverine ecosystems
Biol. Conserv.
(2010)
D.R. Schoolmaster et al.
An algorithmic and information-theoretic approach to multimetric index construction
Ecol. Indic.
(2013)
P.F. Smith et al.
A comparison of random forest regression and multiple linear regression for prediction in neuroscience
J. Neurosci. Methods
(2013)
T. Tang et al.
Accounting for regional variation in both natural environment and human disturbance to improve performance of multimetric indices of lotic benthic diatoms
Sci. Total Environ.
(2016)
M.W. Beck et al.
A review of research on the development of lake indices of biotic integrity
Environ. Rev.
(2009)
K.A. Blocksom
A performance comparison of metric scoring methods for a multimetric index for Mid-Atlantic Highlands streams
Environ. Manag.
(2003)
L. Breiman
Random forests
Mach. Learn.
(2001)
Y. Cao et al.
Modeling natural environmental gradients improves the accuracy and precision of diatom-based indicators
J. N. Am. Benthol. Soc.
(2007)
D.M. Carlisle et al.
Biological assessments of Appalachian streams based on predictive models for fish, macroinvertebrate, and diatom assemblages
J. N. Am. Benthol. Soc.
(2008)

D.F. Charles et al.

Paleoecological analysis of lake acidification trends in North America and Europe using diatoms and chrysophytes

D.R. Cutler et al.

Random forests for classification in ecology

Ecology

(2007)

S.P. Davies et al.

The biological condition gradient: a descriptive model for interpreting change in aquatic ecosystems

Ecol. Appl.

(2006)

J. Davy-Bowker et al.

A comparison of the European Water Framework Directive physical typology and RIVPACS-type models as alternative methods of establishing reference conditions for benthic macroinvertebrates

Hydrobiologia

(2006)

S.S. Dixit et al.

Assessing water quality changes in the lakes of the northeastern United States using sediment diatoms

Can. J. Fish. Aquat. Sci.

(1999)

J. Elith et al.

A working guide to boosted regression trees

J. Anim. Ecol.

(2008)

E.C. Ellis et al.

Putting people in the map: anthropogenic biomes of the world

Front. Ecol. Environ.

(2008)

European Union

Directive 2000/60/EC of the European Parliament and of the Council of 23 October 2000 establishing a framework for Community action in the field of water policy. The European Parliament and the Council of the European Union

Off. J. Eur. Communities

(2000)

C.P. Hawkins et al.

The reference condition: predicting benchmarks for ecological and water-quality assessments

J. N. Am. Benthol. Soc.

(2010)

D. Hering et al.

Assessment of European streams with diatoms, macrophytes, macroinvertebrates and fish: a comparative metric-based analysis of organism response to stress

Freshw. Biol.

(2006)

J.W. Hollister et al.

Modelling lake trophic state: a random forest approach

Ecosphere

(2016)

R.M. Hughes et al.

A biointegrity index (IBI) for coldwater streams of western Oregon and Washington

T. Am. Fish. Soc.

(2004)

Cited by (13)

Improving biological condition assessment accuracy by multimetric index approach with microalgae in streams and lakes
2021, Science of the Total Environment
Citation Excerpt :
Therefore, site-grouping by diatom typologies are assumed to be better than ecoregions on accounting for natural variation and generating MMI with good performance. Surprisingly, up to now, the evidence that support a better performance of site grouping by typologies was not strong either in lakes (Table 2, Liu and Stevenson, 2017) or in streams and rivers (Tang et al., 2016). The most possible reason could not be the relatively small amounts of natural variation among sites explained by typology (Stevenson et al., 2018) but due to the insufficient representative nature of typologies based on only diatom metrics (Liu et al., 2020a).
Multimetric index (MMI) approach is a broadly used in ecological assessment because it can integrate information of various kinds of ecologically related metrics of freshwater ecosystems and provide an easily understandable score for purpose of further evaluation and managements. Accounting for natural variation and disentangling covariation between natural environmental factors and human disturbance factors are imperative for an accurate assessment. Lots of progress has been made recently on the aforementioned two aspects. Three approaches, a priori classification of sites by regions or typologies, site-specific modeling of expected reference condition and varying metrics in site groups, have been tested in lakes and streams to improve assessment accuracy. All existed studies support that site-specific modeling can efficiently account for natural variation and generate a MMI with good performance. However, until now, no strong evidence has shown that diatom/blue-algae typologies are better than regionalization frameworks on accounting for natural variation either in lakes or in streams. To separate the natural variation explained by site specific modeling from that of varying metrics is necessary for a thorough and accurate evaluation on the valuableness of site-grouping by typologies. Different performance of varying metrics among site groups of streams and lakes was most probably caused by the lack of representativeness of diatom metrics on biological condition rather than the complex multi-stressor gradients in streams and rivers. A recent study showed that blue-green algae enhanced performance of diatom-based MMI on defining lake condition under high level of human disturbance. On the other hand, with more and more extensive and intensive use of statistics techniques in developing MMI, we also discussed some statistical challenges faced by scientists in field of ecological assessment, especially on setting significance level of a statistical test and multiple comparison issue in MMI performance comparison.
Benthic algae assessments in the EU and the US: Striving for consistency in the face of great ecological diversity
2021, Ecological Indicators
Freshwaters face multiple environmental problems including eutrophication, acidification, salinization, and climate-change, all of which can lead to impairment of ecosystem structure and function. Furthermore, these stressors often act in combination. Benthic algal-based assessments to quantify impairment are used in both the EU and US. In this review, we use case studies, experience, and the literature to compare concepts, approaches, and methods between the EU and US to offer an updated picture of benthic algal-based assessments. Both the US and EU are composed of numerous constituent states having considerable flexibility to adopt individual methods. The goal of this work is to synthesize the various approaches that are used across the EU and US. Specifically, we compare and contrast benthic algal assessment performed in response to core legislation – the Water Framework Directive in the EU and the Clean Water Act in the US, with a particular focus on the steps taken to ensure consistency at different stages of the process. This includes consideration of approaches to sampling design and field methods, taxonomic resolution and laboratory harmonization, metric selection and choice of algal groups, assessment of stressors and stressor/response relationships. A number of commonalities emerged during this process, particularly the focus on diatoms over other algal groups. However, there are also a number of key differences, including more widespread use of multimetric indices in the US compared with the EU. Finally, we consider emerging opportunities, including the potential for using metagenomic approaches for bioassessment in the future.
Blue-green algae enhanced performance of diatom-based multimetric index on defining lake condition under high level of human disturbance
2020, Science of the Total Environment
Citation Excerpt :
However, we did not evaluate the effect of incorporating soft-bodied planktonic algae metrics on MMI performance in lakes because the evaluation was beyond the scope of our previous paper. For lake/stream biological condition assessment, algal assessment is mainly based on structural and functional attributes of either soft-bodied benthic/planktonic algae or diatoms (Phillips et al., 2012; Carvalho et al., 2013; Thackeray et al., 2013; Fetscher et al., 2014; Poikane et al., 2016; Liu and Stevenson, 2017). Soft-bodied algae metrics are not commonly used to develop MMI in the US possibly for three reasons: first, the laboratory procedures of soft-bodied algae taxonomic analysis of the USEPA resulted in a dominant of total algal biovolume of live diatoms for most samples, which makes it harder find qualified soft-bodied algae metrics (Stancheva and Sheath, 2016).
Degradation of lake conditions could result from many stressors generated by human disturbance. Accurately defining lake ecological condition by multimetric index (MMI) method is of great importance for tracking source of stressors and lake management. For algal assessment, seldom have structural and functional attributes of soft-bodied planktonic algae metrics, one important dimension of biological condition, been used to develop MMI in conjunction with diatom metrics. Another thing is that some researchers found MMI method does not perform well in mid- and high-disturbed lakes. To test the aforementioned questions, we used data sets of the 2007 National Lake Assessment project of the USEPA to develop MMIs with and without using soft-bodied planktonic algae metrics for plains and lowlands area (PLNLOW, high disturbed region of the US) and across the conterminous US. Compared to site groups modeled by single diatom assemblages, we found integrating soft-bodied planktonic algae metric (especially blue-green algae metric) into developing MMIs can significantly improve performance of MMI in PLNLOW region. The separation powers of MMIs of five level III ecoregions, developed by incorporating blue-green algae metric, are consistently higher than those developed by single diatom assemblages (p-value = 0.029). However, when blue-green algae metric was applied to develop MMI along with diatom metrics in the national scale assessment, performances of MMIs are similar to that developed by diatom metrics (0.14 < p-value < 0.86). Different performance of MMIs developed by integrating blue-green algae metric at different spatial scales indicated the usefulness of blue-green algae metric in ecological assessment in mid- and high- disturbed lakes and a tiered approach for using diatom and blue-green algae metric in ecological assessment. We suggest using blue-green algae metric in combined with diatom metric to develop MMI when lakes are mid- and high-disturbed, while a routine diatom assessment would be enough for minimally disturbed sites.
Annual changes in periphyton communities and their diatom indicator species, in the littoral zone of a subtropical urban lake restored by submerged plants
2020, Ecological Engineering
Biological indicators of assessing surface waters depend on regionally based indices and sensitivity to environmental conditions. Periphytic algae often used as a potential biological assessment method for its environmental sensitivity characteristics. Here we studied the characteristics of the periphytic algae community and diatom species in the subtropical urban landscape lake known as West Lake, Hangzhou, China, which is in the process of ecological restoration. The changes in species composition, biomass and the community diversity of periphytic algae were determined. 71 taxa of 54 genera of periphytic algae were observed. The Bacillariophyta, Chlorophyta and Cyanophyta dominated in the community. The Bacillariophytes exhibited the highest species richness, whereas the Cyanophytes showed the highest cell density. Principal component analysis environmental variables showed that the TN was main nutrient factor to periphytic algae in the West Lake. In terms of the relatively high TN contents characteristics in subtropical urban landscape lake West Lake, the ecological optimum values and tolerance values of TN about diatom were calculated by weighted regression method. Synedra capitata, Achnanthes minutissima, Gomphonema acuminatum, Navicula graciloides were extracted as indicator species for environmental gradients of TN in the West Lake. Those findings provide a theoretical basis for the use of diatoms biological index to improve the effectiveness of monitoring and evaluating the nutritional level of TN in urban landscape lakes system. It would be significant to identify the nutritional status of the subtropical shallow water bodies and advise related management.
Comment: Averaging statistics of multimetric index leading to an inaccurate evaluation on methods of defining biological condition of streams/rivers in ecological assessment
2019, Science of the Total Environment
Advancing evaluation of bioassessment methods: A reply to Liu and Cao
2018, Science of the Total Environment
A series of three papers was written about the development of multimetric indices (MMIs) using diatoms in rivers, streams and lakes for transcontinental surveys conducted by the United States Environmental Protection Agency. Stevenson et al. (2013) used the surface sediment diatom data from the 2007 National Lake Assessment to develop national scale site specific models for MMIs to account for natural variation in condition among sites. Liu and Stevenson (2017) also used the 2007 lakes data to evaluate performance of MMIs by grouping sites by ecoregions or typologies (naturally similar types of lakes defined by similarity in diatom species composition) with site specific metric models (SSMMs) that adjust metrics for natural variability among sites. Tang et al. (2016) used benthic diatom data from the 2008–2009 National River and Stream Assessment to develop SSMMs and MMIs by ecoregion and typology. All three studies showed that SSMMs improved performance of diatom MMIs by accounting for natural variation among sites. None of the studies provided consistent evidence that grouping sites by typologies produced better MMI performance than grouping sites by ecoregions.
Liu and Cao (2018) criticized the Tang et al. (2016) paper for using means and standard errors to evaluate relative performance of MMI calculation methods at the site group scale, however, their criticism is incorrect. Actually, Tang et al. (2016) only used means to summarize and report relative performance of MMI calculation methods in the body of the paper. Tang et al. (2016) appropriately used non-parametric rank sum approaches to evaluate the probability that the multiple MMI calculations for separate site groups were the same for ecoregion (n = 9) and typology (n = 7) site groups. Liu and Stevenson (2017) used this same non-parametric approach for tests of lake diatom MMIs. Liu and Cao's (2018) concerns can be addressed by distinguishing between the goals and methods used for testing and evaluation of MMI calculation methods at the national and site-group scales. Tang et al. (2016) did not aggregate data across site groups to test MMI performance at the national scale because they were following standard EPA methods that develop separate MMIs for each site group. In conclusion, Liu and Cao (2018) misunderstood the MMI evaluation in Tang et al. (2016) and added no new information to this body of work, because all the concerns they raised were discussed in Liu and Stevenson (2017).

View all citing articles on Scopus

View full text

Improving assessment accuracy for lake biological condition by classifying lakes with diatom typology, varying metrics and modeling multimetric indices

Highlights

Abstract

Graphical abstract

Introduction

Section snippets

Data sets

Site grouping by diatom typology

Effect of site grouping methods and metric modeling on MMI performance

Conclusions

Acknowledgements

Water Res.

Biol. Conserv.

Ecol. Indic.

J. Neurosci. Methods

Sci. Total Environ.

A review of research on the development of lake indices of biotic integrity

Environ. Rev.

A performance comparison of metric scoring methods for a multimetric index for Mid-Atlantic Highlands streams

Environ. Manag.

Random forests

Mach. Learn.

Modeling natural environmental gradients improves the accuracy and precision of diatom-based indicators

J. N. Am. Benthol. Soc.

Biological assessments of Appalachian streams based on predictive models for fish, macroinvertebrate, and diatom assemblages

J. N. Am. Benthol. Soc.

Paleoecological analysis of lake acidification trends in North America and Europe using diatoms and chrysophytes

Random forests for classification in ecology

Ecology

The biological condition gradient: a descriptive model for interpreting change in aquatic ecosystems

Ecol. Appl.

A comparison of the European Water Framework Directive physical typology and RIVPACS-type models as alternative methods of establishing reference conditions for benthic macroinvertebrates

Hydrobiologia

Assessing water quality changes in the lakes of the northeastern United States using sediment diatoms

Can. J. Fish. Aquat. Sci.

A working guide to boosted regression trees

J. Anim. Ecol.

Putting people in the map: anthropogenic biomes of the world

Front. Ecol. Environ.

Directive 2000/60/EC of the European Parliament and of the Council of 23 October 2000 establishing a framework for Community action in the field of water policy. The European Parliament and the Council of the European Union

Off. J. Eur. Communities

The reference condition: predicting benchmarks for ecological and water-quality assessments

J. N. Am. Benthol. Soc.

Assessment of European streams with diatoms, macrophytes, macroinvertebrates and fish: a comparative metric-based analysis of organism response to stress

Freshw. Biol.

Modelling lake trophic state: a random forest approach

Ecosphere

A biointegrity index (IBI) for coldwater streams of western Oregon and Washington

T. Am. Fish. Soc.