Gaussian mixture discriminant analysis and sub-pixel land cover characterization in remote sensing

doi:10.1016/S0034-4257(02)00172-4

Remote Sensing of Environment

Volume 84, Issue 4, 10 April 2003, Pages 550-560

https://doi.org/10.1016/S0034-4257(02)00172-4 Get rights and content

Abstract

Mixture analysis is a necessary component for capturing sub-pixel heterogeneity in the characterization of land cover from remotely sensed images. Mixture analysis approaches in remote sensing vary from conventional linear mixture models to nonlinear neural network mixture models. Linear mixture models are fairly simple and generally result in poor mixture analysis accuracy. Neural network models can achieve much higher accuracy, but typically lack interpretability. In this paper we present a mixture discriminant analysis (MDA) model for inferring land cover fractions within forest stands from Landsat Thematic Mapper images. Specifically, individual class distributions are modeled as mixtures of subclasses of Gaussian distributions, and land cover fractions are estimated using the corresponding posterior probabilities. Compared to a benchmark study on accuracy of mixture models with Plumas National Forest data, this MDA model easily outperforms traditional linear mixture models and parallels the performance of the ARTMAP neural network mixture model. In other words, the MDA model is observed to successfully combine the performance characteristics of more complex neural network models (due to the nonlinear nature of its classification rules), with the ease of interpretation associated with linear mixture models (due to its relatively simple structure). MDA models therefore offer an attractive alternative for addressing the mixture modeling problem in remote sensing.

Introduction

The extraction of land cover information from remote sensing images traditionally is viewed as a classification problem which labels each pixel in the image as one of only a few possible classes. However, in reality, all degrees of mixing of pure land cover classes within pixels can be found due to the continuum of variation found in the landscape (Foody, 1996b) and the intrinsic mixed nature of most land covers (Schowengerdt, 1996). Hence, discretization of land cover into a limited number of categories contributes to a loss of information. Alternatively, mixture modeling in remote sensing predicts the respective fractions of land cover classes within pixels and characterizes land cover more accurately by decomposing a pixel into a small number of “pure” land cover classes. The resulting mixture map represents the fractions of pure land covers within pixels. For example, a pixel may be denoted in such maps by the following mixture fractions—80% conifer, 10% hardwood, and 10% brush. A traditional thematic map would label it as conifer by majority rule. This mixture information is very important for forestry, wildlife conservation (Woodcock, Gopal, & Albert, 1996), and global climate modeling (DeFries, Townshend, & Hansen, 1999). Classification and mixture analysis are not mutually exclusive. One nontrivial use of the mixture information is that discrete classification maps of any type can be produced out of the continuous land cover information if desired. For example, Adams et al. (1995) classified remote sensing images according to the dominant ground cover inferred from mixture analysis. In both classification and mixture analysis, the intra-class variability caused by factors such as age, health condition, and species uniformity often adds complexity to the task.

Several attempts have been made to characterize land cover at sub-pixel level using remote sensing data, including linear mixture models Adams et al., 1995, Roberts et al., 1998, Roberts et al., 1993, Smith et al., 1990, neural networks Atkinson et al., 1997, Carpenter et al., 1999, Foody, 1998, Foody et al., 1997, fuzzy classifiers (Foody, 1996a), maximum likelihood classifiers Foody et al., 1992, Häme et al., 2001, Schowengerdt, 1996, regression trees (DeFries et al., 1997), and decision trees (McIver & Friedl, 2002). Prior research has shown that linear mixture models, which yield simple linear decision rules, often generate poor to moderate results. In addition, as demonstrated by researchers Borel & Gerstl, 1994, Ray & Murray, 1996, linear mixture models may not be suitable in cases when multiple scattering results in nonlinear mixing. In this context, a nonlinear decision rule can produce better results. Atkinson et al. (1997) applied a mixture model based on a Multilayer Perceptron neural network to decompose AVHRR imagery. The mixture information from their model was more accurate compared to that generated through linear mixture models and fuzzy c-means classifiers. Carpenter et al. (1999) presented an algorithm for mixture estimation based on an ARTMAP neural network, and applied it for identifying life form components of the vegetation mixture from Landsat Thematic Mapper (TM) imagery. The ARTMAP-based mixture model was able to capture nonlinear boundaries between classes and thus performed better in terms of accuracy compared with the conventional linear mixture models.

For purposes of comparison, we will take methods based on linear mixture models and ARTMAP models as representative extremes from this literature. The commonly used linear mixture model¹ is effectively a constrained linear model, wherein the spectra of a mixed pixel is modeled as a linear combination of spectra of “pure” land covers called “endmembers” (with weights constrained to be positive and sum to one) Adams et al., 1995, Smith et al., 1990. The assumptions are independent sampling and a single common, multivariate Gaussian error model. In addition, least squares is used to estimate class fractions from observed data. The end result captures the effect of land cover mixing at the level of the mean, but is often unable to truly match the multi-modal nature of the underlying data. Correspondingly, these methods tend to exhibit a relatively poor level of accuracy in most situations. For example, in the study by Carpenter et al. (1999), the spectra of endmembers of each of the four land cover classes varied to a great degree, resulting in poor accuracy. That prompted a selection of two sets of endmembers, “exterior” endmembers and “interior” endmembers. The “exterior” endmembers were the pixels whose spectral values were at the exterior of the scatter plots of TM Bands 3 and 4 and TM Bands 4 and 5 formed by endmembers of all classes. The “interior” endmembers were at the interior of the scatter plots. These “exterior” and “interior” endmembers were averaged respectively to get two sets of mean spectra. While the two sets of endmembers could give fairly good results for some classes, neither could produce good overall accuracy. Neural network models, however, like those based on the ARTMAP architecture of Carpenter et al., 1999 have been reported to estimate fractional coverage with much higher accuracy, due to their ability to represent highly complex nonlinear functions. However, this feature has also been accompanied by criticism that neural networks can be difficult to use and yield little in the way of explanation or interpretation, despite the fact that researchers in the neural network community have attempted to interpret its “black box” nature to some degree (e.g. Liu, Gopal, & Woodcock, 2001, Chap. 12).

In this paper, we develop a mixture discriminant analysis (MDA) model for estimating fractions of four land cover classes, conifer, hardwood, barren, and brush within forest stands from Landsat TM images. While MDA models have been around informally for some years now in fields like statistics and pattern recognition, they seem to have been explored formally only recently by Hastie and Tibshirani (1996) within statistics and, to the best of our knowledge, have found no application to land cover characterization in remote sensing to date. Employing this MDA framework, we model each land cover class distribution as a mixture of subclasses of multivariate Gaussian distributions. We discuss the training of this model and propose an estimator based on the posterior distribution of classes, given data. In the spirit of Carpenter et al. (1999), and using the same data, we conduct a numerical study in which we compare our MDA-based method with the methods based on the linear mixture and the ARTMAP neural network mixture models described by Carpenter et al., 1999. We find that with little loss in simplicity and interpretability over the linear mixing approach, our MDA approach is able to nearly match the performance of the neural network approach. In addition, the results of the MDA method are more interpretable and statistically based compared with the neural network approach.

The outline of this paper is as follows. Section 2 describes the MDA model for the mixture problem in remote sensing. Section 3 describes the Plumas National Forest data used in this study, and illustrates how MDA is trained and applied for mixture analysis. Section 4 compares the performance of these mixture methods in terms of mixture analysis accuracy. Finally this paper ends with some discussions and conclusions in 5 Discussion, 6 Conclusions.

Section snippets

Model description: mixture discriminant analysis (MDA)

In this section, we briefly describe the mixture discriminant analysis (MDA) modeling framework, as outlined in Hastie and Tibshirani (1996). These authors consider MDA in some generality, focusing in particular on a number of variations on the basic modeling and fitting strategy, and consider its application to tasks such as the recognition of handwritten digits. Our focus is on the adaptation of this framework to sub-pixel land cover characterization in remote sensing. MDA can be viewed as an

Field measurement and satellite sensor data

The study area, Plumas National Forest of California, is characterized by temperate conifer forests mixed with chaparral brush fields and hardwood forests. For the purpose of forest management, the quantification of conifer, hardwood, and brush within stands is useful (Carpenter et al., 1999). As a result, four land cover classes are identified, i.e., conifer, hardwood, brush, and barren.

The data used in this study consist of two components: field measurements of land cover fractions and the

Results

We compare the results of MDA with those of the two linear mixture models and the ARTMAP mixture model previously published by Carpenter et al. (1999). In that study, a linear mixture model was tested using two different sets of endmembers—exterior endmembers and interior endmembers—which were chosen to address the variability among the “pure” endmembers. Exterior endmembers were the means of the exterior sites in spectral measurement space of all the pure sites while interior endmembers are

Discussion

Classification of land cover is one of the primary objectives of the use of remote sensing data. Increasingly, global climate models and terrestrial ecosystem models require specification of mixtures of land covers DeFries et al., 1999, Woodcock et al., 1996. Fraction estimation is a difficult task given the spectral overlap of the land cover classes and the spectral variability within classes, as is manifested in the scatter plot (Fig. 1) of the TM Bands 3 and 4, and the histograms of Bands 4

Conclusions

The MDA approach captures intra-class variability by modeling each class distribution as a mixture of Gaussian subclass distributions. The posterior probabilities from MDA are assumed to represent the sub-pixel land cover fractions. MDA outperforms linear mixture models and is similar in performance to the ARTMAP neural network mixture model due to the nonlinear nature of its decision boundaries, but without losing the ease of interpretation due to its relatively simple structure. MDA models

Acknowledgements

This research was supported by NSF Grant BCS 0079077 and ONR Award N00014-99-1-0219. We would like to thank Curtis Woodcock, Gail Carpenter, and the staff at the Region 5 Remote Sensing Laboratory of the U.S. Forest Service for providing the data. We also thank the two anonymous reviewers whose valuable comments and suggestion greatly improved this manuscript.

References (30)

J.B Adams et al.
Classification of multispectral images based on fraction endmembers, application to land cover change in the Brazilian Amazon
Remote Sensing of Environment
(1995)
C Borel et al.
Nonlinear spectral mixing models for vegetative and soil surfaces
Remote Sensing of Environment
(1994)
R DeFries et al.
Subpixel forest cover in Central Africa from multisensor, multitemporal data
Remote Sensing of Environment
(1997)
T Häme et al.
AVHRR-based forest proportion map of the Pan-European area
Remote Sensing of Environment
(2001)
D.K McIver et al.
Using prior probabilities in decision-tree classification of remotely sensed data
Remote Sensing of Environment
(2002)
T.W Ray et al.
Nonlinear spectral mixing in desert vegetation
Remote Sensing of Environment
(1996)
D.A Roberts et al.
Mapping chaparral in the Santa Monica mountains using multiple endmember spectral mixture models
Remote Sensing of Environment
(1998)
D.A Roberts et al.
Green vegetation, nonphotosynthetic vegetation, and soils in AVIRIS data
Remote Sensing of Environment
(1993)
R.A Schowengerdt
On the estimation of spatial–spectral mixing with classifier likelihood functions
Pattern Recognition Letters
(1996)
M.O Smith et al.
Vegetation in deserts: I. A regional measure of abundance from multispectral images
Remote Sensing of Environment
(1990)

A.H Strahler

The use of prior probabilities in maximum likelihood classification of remotely sensed data

Remote Sensing of Environment

(1980)

P.M Atkinson et al.

Mapping sub-pixel proportional land cover with AVHRR imagery

International Journal of Remote Sensing

(1997)

C.A Bateson et al.

Endmember bundles: a new approach to incorporating endmember variability into spectral mixture analysis

IEEE Transactions on Geoscience and Remote Sensing

(2000)

G.A Carpenter et al.

A neural network method for mixture estimation for vegetation mapping

Remote Sensing of Environment

(1999)

R.S DeFries et al.

Continuous fields of vegetation characteristics at the global scale at 1-km resolution

Journal of Geophysical Research

(1999)

Cited by (114)

Improved Gaussian mixture model to map the flooded crops of VV and VH polarization data
2023, Remote Sensing of Environment
Accurate and timely monitoring of flooded crop areas is crucial for disaster rescue and loss assessment. However, most flooded crop monitoring methods based on synthetic aperture radar (SAR) imagery were developed for rice, which is probably inappropriate for crops with complex canopy structures that strongly attenuate SAR signals. Additionally, these methods often rely on empirical thresholds and region-specific reference samples, limiting their reliability and applicability on a larger spatial scale. To address these issues, we developed a novel flooded crop mapping approach at a regional scale using Sentinel-1 time-series data and an unsupervised Gaussian Mixture Model (GMM). Our approach leverages a Flood Separability Index (FSI) derived from the fitted probability density function of flooded and non-flooded crop areas in a GMM. This allows us to overcome the limitations of manual input selection in previous studies. The multi-temporal GMM was constructed using the time-series images with optimal polarization to estimate the flooded crop extents on a regional scale. We also investigated the scattering mechanisms of three typical crop disaster structures within an agricultural landscape area. Our results indicate that the proposed multi-temporal GMM is robust in crop planting areas with complex canopy structures. The performance of both single-temporal and multi-temporal GMMs surpasses that of baseline methods such as Otsu and K-means. Compared with VV polarization, VH polarization exhibits greater potential for accurately mapping flooded crops in complex agricultural regions. Our approach does not require labeled samples or many predefined parameters, making it fast and feasible for mapping flooded crops with complex canopy structures in large spatial areas.
Mapping multi-layered mangroves from multispectral, hyperspectral, and LiDAR data
2021, Remote Sensing of Environment
Citation Excerpt :
While the swampy environment and inaccessibility to mangrove forests often hinder field investigation, remote sensing technology has been applied in mangrove studies for the past three decades (Blasco et al., 1992; Chun et al., 2015; Green et al., 1998; Wang et al., 2004b, 2015), which is still a timely and efficient tool for mangrove mapping and monitoring. The advent of high spatial resolution sensors such as SPOT (HVR, HRVIR, or HRG), Quickbird, IKONOS, and GeoEye that provide meter-level resolution images improved vegetation classification at the species level (Giri et al., 2014; Ju et al., 2003; Wang et al., 2004b, 2004a; Zhou et al., 2009). New generation sensors provide images with both high spatial resolution and novel bands at some spectral regions such as the red edge band which was found strongly related to chlorophyll concentration in leaves (Clevers, 1999; Mutanga and Skidmore, 2007).
Understanding species distribution and canopy structure of mangrove forests is imperative for flora and fauna conservation in mangrove habitats. However, most mangrove studies focused on the top canopy layer without exploring the vertical structure of mangroves. This paper presents multi-layered mangrove mapping which considered both overstory and understory detection and species classification using multispectral WorldView-3 (WV-3) data, airborne hyperspectral images (HSI), and LiDAR point cloud. First, LiDAR returns were stratified into the overstory and understory by analyzing the profile of return height, which helped understand the vertical structure of the mangrove stands. Second, three classification algorithms Random Forest (RF), Support Vector Machine (SVM), and Convolutional Neural Network (CNN) were compared by applying WV-3, HSI, LiDAR data, and their combinations to map seven vegetative species. Feature selection was conducted to identify important features and the optimal feature size prior to classification tasks. The measured and estimated understory canopy heights reached a high correlation coefficient of 0.71, which demonstrated the effectiveness of using LiDAR data and the proposed procedure to stratify multi-layered canopies. The combined HSI and LiDAR data produced satisfactory results by the three classifiers with overall accuracy (OA) varying from 0.86 to 0.88. And the species was also accurately mapped by integrating WV-3 and LiDAR data using both RF and SVM algorithms with OA attaining between 0.84 and 0.86. The results of this study highlight that (1) LiDAR data provided superior information to map the vertical structure of multi-layered mangroves, which provided valuable information to classify single-layered and dual-layered Kandelia obovata with understory beneath; (2) the combination of spectral and LiDAR features improved mangrove species classification; (3) and species mapping results derived from combined datasets appeared to be more influential by LiDAR features when using RF and SVM, but spectral features played a more important role in CNN.
Assessing, mapping, and optimizing the locations of sediment control check dams construction
2020, Science of the Total Environment
Check dams are considered to be one of the most effective measures for conservation of the soil and water resources. However, identifying the most suitable sites for the installation of check dams remain quite demanding. This research investigates and compares five machine learning algorithms (MLAs) – boosted regression trees (BRT), multivariate adaptive regression spline (MARS), mixture discriminant analysis (MDA), random forest (RF), and support vector machine (SVM) – for generating check-dam site-suitability maps (CDSSMs) and assessing them in Firuzkuh County, Iran. First, the locations of 475 existing check dams were monitored, registered, and divided into calibration (70%) and testing datasets (30%) for training and validation of the models. Fourteen check-dam conditioning factors (CDCFs) were selected and checked for multicollinearity. The relative importance of the CDCFs assessed using the elastic net (ENET) algorithm. Results demonstrated that distance from river (DFR) and drainage density (DD) to be the most significant factors for mapping the suitable sites for the erection of check dams. This research revealed that all of five MLAs had excellent accuracy for predicting the check-dam site-suitability with high AUC values: RF (0.966), SVM (0.878), MARS (0.878), MDA (0.844), and BRT (0.843). The most accurate model (RF) showed that 16.95%, 35.55%, 31.08%, and 16.42% of study area comes under low, moderate, high, and very high suitability classes. The outcome achieved by this research will be helpful to sustainability planners and managers in constructing check dams at suitable sites for better conservation of soil and water resources.
Disentangling fractional vegetation cover: Regression-based unmixing of simulated spaceborne imaging spectroscopy data
2020, Remote Sensing of Environment
The next generation of spaceborne imaging spectrometers will enable hyperspectral analysis of vegetation cover across large spatial extents. Spectral unmixing provides a means to assess subpixel vegetation composition in such imagery. Here we implement a regression-based unmixing approach to generate fractional vegetation cover on a regional scale from a simulated Environmental Mapping and Analysis Program (EnMAP) satellite scene derived from Airborne Visible InfraRed Imaging Spectrometer (AVIRIS) imagery acquired over the San Francisco Bay Area, California, USA, an area with a mixture of temperate and Mediterranean climate forests, woodlands and shrublands. A hierarchical classification scheme was implemented that considered fractional cover of vegetation as a whole (vegetation vs non-vegetation), vegetation life forms (woody vs non-woody vegetation; tree vs shrub vs grass), and tree leaf type (needleleaf vs broadleaf). A Gaussian Process Regression (GPR) model was trained using synthetically-mixed training data generated from an endmember library, and mapping accuracy was assessed using an independent validation dataset across four ecoregions. Our approach was able to effectively model landscape patterns at all levels of the class hierarchy. Site-wide map accuracy was highest when mapping generic vegetation fractions (MAE = 3.8%) and expectedly decreased at more complex hierarchy levels, with highest errors observed when separating tree and shrub fractions. Still, fraction estimates of needleleaf trees (MAE = 10.6%), broadleaf trees (MAE = 13.1%) and shrubs (MAE = 15.3%) were mapped with low overall error. Using Landsat imagery led to an average decrease in map accuracy of 1.9% when compared to hyperspectral image analysis and a maximum decrease of 3.5% when separating broadleaf and needleleaf trees across all sites. Further, a single regional model was shown to yield comparable results to multiple local ecoregion-based models, facilitating the analysis of large regions without creating a separate model for each region. Our results highlight the utility of regression-based approaches for quantitative vegetation mapping, which is of particular interest for future spaceborne imaging spectroscopy missions operating across large areas at moderate spatial resolution.
Groundwater spring potential assessment using new ensemble data mining techniques
2020, Measurement: Journal of the International Measurement Confederation
The growing demand and exigency for groundwater resources warrant the demarcation of groundwater spring potential zones (GSPZ) for effective sustainable strategy in groundwater identification, conservation, and management. Here we utilized a novel data mining (DM) ensemble to generate groundwater spring potential maps (GSPMs) by combining RF-BRT (random forest-boosted regression tree), MARS-SVM (multivariate adaptive regression spline-support vector machine) and, FDA-GLM-MDA (functional data analysis-generalized linear model-mixture discriminant analysis). Initially, an aggregate of 1726 groundwater spring locations was collected from the regional water company of Tehran Province and field investigation, in which 1208 springs (70%) were taken for training purposes and the remaining 518 (30%) springs were applied for the validation process. Twelve conditioning factors including DEM (digital elevation model)/elevation, fault density, aspect, rainfall, distance from rivers, distance from faults, slope, MRVBF (multiresolution index of valley bottom flatness), TWI (topographic wetness index), lithology, land use/land cover, and permeability were utilized for mapping process and their importance in predicting the groundwater spring potential. The variable importance (VI) analysis using SVM (support vector machine) reveals that the most significant conditioning factors in the prediction process are rainfall, TWI, DEM-elevation, distance from rivers, slope, distance from faults, and MRVBF. The GSPMs generated from novel data-mining (DM) ensembles were validated using the cut-off reliant (recall, fallout, F-measure, accuracy, precision, specificity, TSS: true skill statistic, Cohen’s kappa, fourfold plot, CCI: corrected classified instances) and cut-off independent (ROC-AUC: receiver operating characteristic-area under the curve) measures. The outcome of the validation measures shows that RF-BRT has the superior values of recall, F-measure, overall accuracy, precision, specificity, TSS, Cohen’s kappa, fourfold plot, CCI followed by MARS-SVM, and FDA-GLM-MDA whereas the AUC value of RF-BRT (0.955), MARS-SVM (0.934), and FDA-GLM-MDA (0.914) also display similar result. The GSPMs generated using the novel DM ensemble models in our study can be utilized by policymakers in implementing the strategies for effective land use planning and sustainable groundwater management.
Diagnosis of degraded pastures using an improved NDVI-based remote sensing approach: An application to the Environmental Protection Area of Uberaba River Basin (Minas Gerais, Brazil)
2019, Remote Sensing Applications: Society and Environment
Pasture degradation represents a global environmental problem that urges mitigation. A fundamental step towards restoration of degraded pastures is the identification and accurate mapping of these areas. In Brazil, the area of degraded pastures is immense and therefore remote sensing is a cost-effective way to map it. In this study, an improved method based on NDVI values extracted from satellite images is presented, and tested in the Environmental Protection Area of Uberaba River Basin (EPAURB) located in the state of Minas Gerais, Brazil. The EPAURB covers an area of approximately 528.1 km², 50.9% of which is pasture. The innovative features of this method comprise: 1) the mapping is preceded by the definition of NDVI fingerprints for healthy, smoothly degraded, moderately degraded and degraded pasture (called physiognomies), based on non linear relationships between NDVI values and time; 2) the mapping of physiognomies accounts for the influence of geology and weather seasonality on the NDVI values. In the EPAURB the physiognomic categories were set by visual inspection and evaluation of soil characteristics (e.g., organic matter, nutrients, resistance to penetration) in the so-called characterization ground truth sites also termed buffers. Resistance to penetration and several other soil parameters showed statistically different (p ≤ 0.05) values among physiognomies. The definition of fingerprints was based on a 4-year record (2013–2016) of NDVI 16-day composite (MOD13Q1) 250 m time-series data. The map of degraded pastures was delineated on the basis of comparisons between the NDVI values of 23 satellite images covering the year of 2016 (termed NDVI_pixel) and corresponding characteristic NDVI values of degraded pasture physiognomy extracted from the corresponding fingerprint (termed NDVI_buffer). Whenever NDVI_buffer,min ≤ NDVI_pixel ≤ NDVI_buffer,max a repetition counter (n) increased one unit. For n ≥ 3 the pixel was classified as degraded pasture. The results exposed 160.1 km² of degraded pasture for 3 ≤ n ≤ 18, which represents 60% of all pasture land. The areas mapped as degraded pasture were subject to a field check in 38 so-called validation ground truth sites, using resistance to penetration as validation parameter, with 84.1% success. Given the serious environmental damage posed by pasture degradation, several mitigation measures were discussed including the protection of degraded soil through the “polluter pays principle”.

View all citing articles on Scopus

View full text

Gaussian mixture discriminant analysis and sub-pixel land cover characterization in remote sensing

Abstract

Introduction

Section snippets

Model description: mixture discriminant analysis (MDA)

Field measurement and satellite sensor data

Results

Discussion

Conclusions

Acknowledgements

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Pattern Recognition Letters

Remote Sensing of Environment

Remote Sensing of Environment

Mapping sub-pixel proportional land cover with AVHRR imagery

International Journal of Remote Sensing

Endmember bundles: a new approach to incorporating endmember variability into spectral mixture analysis

IEEE Transactions on Geoscience and Remote Sensing

A neural network method for mixture estimation for vegetation mapping

Remote Sensing of Environment

Continuous fields of vegetation characteristics at the global scale at 1-km resolution

Journal of Geophysical Research