Super-resolution land cover mapping with indicator geostatistics

doi:10.1016/j.rse.2006.04.020

Remote Sensing of Environment

Volume 104, Issue 3, 15 October 2006, Pages 264-282

https://doi.org/10.1016/j.rse.2006.04.020 Get rights and content

Abstract

Many satellite images have a coarser spatial resolution than the extent of land cover patterns on the ground, leading to mixed pixels whose composite spectral response consists of responses from multiple land cover classes. Spectral unmixing procedures only determine the fractions of such classes within a coarse pixel without locating them in space. Super-resolution or sub-pixel mapping aims at providing a fine resolution map of class labels, one that displays realistic spatial structure (without artifact discontinuities) and reproduces the coarse resolution fractions. In this paper, existing approaches for super-resolution mapping are placed within an inverse problem framework, and a geostatistical method is proposed for generating alternative synthetic land cover maps at the fine (target) spatial resolution; these super-resolution realizations are consistent with all the information available.

More precisely, indicator coKriging is used to approximate the probability that a pixel at the fine spatial resolution belongs to a particular class, given the coarse resolution fractions and (if available) a sparse set of class labels at some informed fine pixels. Such Kriging-derived probabilities are used in sequential indicator simulation to generate synthetic maps of class labels at the fine resolution pixels. This non-iterative and fast simulation procedure yields alternative super-resolution land cover maps that reproduce: (i) the observed coarse fractions, (ii) the fine resolution class labels that might be available, and (iii) the prior structural information encapsulated in a set of indicator variogram models at the fine resolution. A case study is provided to illustrate the proposed methodology using Landsat TM data from SE China.

Introduction

Sensors with spatial resolution larger than the extent of classes on the ground yield mixed pixels, i.e., pixels whose spectral signature is a composite of signatures of different classes. Spectral unmixing is the procedure of determining the fractions of such classes that occupy any coarse pixel; see, for example, Richards and Jia (1999) or Tso and Mather (2001) for a survey of unmixing methods. Spectral unmixing, however, provides only class fractions without locating the constituent classes, or for that matter the end members used for unmixing, within any coarse pixel. That additional task lies in the realm of super-resolution mapping, also termed sub-pixel mapping, or downscaling; see, for example, Atkinson (2001).

In this paper, we view super-resolution mapping from an inverse problem perspective (Bertero and Boccacci, 1998, Menke, 1989, Tarantola, 2005): that of reconstructing a fine resolution map of class labels from a set of coarse class fractions. The forward problem of computing coarse fractions from a fine resolution map of class labels is trivial. The inverse problem, however, is under-determined, in that it has multiple plausible solutions: many fine resolution class maps can lead to an equally good reproduction of the available coarse fractions. In order to solve such an under-determined inverse problem, one needs to invoke prior information that will resolve the inherent ambiguity. This prior information should pertain to the fine (target) spatial resolution, so that it constrains the “space” of possible spatial patterns of classes that can occur at that resolution. In what follows, we will occasionally refer to this fine resolution prior information as a model of spatial structure, structural model, or structural information, which, in this work, can also be loosely regarded as pertaining to texture (Tso & Mather, 2001).

The most primitive form of spatial structure is the rather unrealistic assumption of classes randomly distributed at the fine spatial resolution. Another particular form of prior information is the assumption of maximum class auto-correlation at the target resolution, which underpins several approaches for super-resolution mapping (Atkinson, 2001, Atkinson et al., 1997, Mertens et al., 2003, Tatem et al., 2001, Verhoeye and De Wulf, 2001). This prior structural model might be appropriate when the extent of spatial patterns on the ground is much larger than that of the coarse pixel, a scenario termed H-resolution by Jupp et al. (1988). Such a model, however, is too rigid, in that it cannot be adapted to generic scenes.

Moving away from the rigid notion of maximum spatial auto-correlation, prior information has been specified implicitly or explicitly in the form of parametric indicator variogram models. Atkinson (2001) and Makido and Shortridge (2005) used an iterative class swapping procedure to generate super-resolution maps, whereby a parametric function of distance (e.g., exponential decay) was used as a proxy for a formal indicator variogram model to determine the benefit of changing a simulated class label at any particular pixel in each iteration. Tatem et al., 2002, Tatem et al., 2003 explicitly used indicator variogram models as objective function components within an iterative optimization framework. Prior information has also been specified in the form of interactions between predefined groups of pixels (cliques) linked to the parametric energy function of a Markov Random Field model (Kasetkasem et al., 2005, Tso and Mather, 2001). Alternatively, that prior information could be extracted from analog images, by computing directly without any parametrization the probability of occurrence of different spatial patterns, i.e., sets of class labels over groups of pixels (Strebelle, 2002). This latter approach to prior information specification can account for complex spatial patterns (e.g., meandering objects) that cannot be adequately characterized by two-point statistics, such as indicator variograms.

In this work, we assume knowledge of the coarse resolution fractions; we do not address how such fractions were derived from the original satellite reflectance values. Moreover, we assume that these fractions are exact measurements with no error or inherent uncertainty; the possibility of relaxing this assumption is not pursued here due to space limitations. In addition, and contrary to many existing approaches to super-resolution mapping, we also consider the case whereby the analyst has access to a typically small set of class labels at the fine resolution, possibly obtained via ground surveys. Such sparse, with respect to the abundant coarse fractions, fine resolution data provide information on the actual location of classes at the target resolution, and hence should be reproduced exactly in the final maps.

After a prior model of spatial structure has been specified, a probabilistic formulation of inverse problems seeks to determine the conditional probability distribution of the unknowns given the available data; that probability distribution encapsulates our uncertainty about the unknown attribute values given the current level of information (Tarantola, 2005). In the context of super-resolution mapping, this amounts to determining the multivariate or multi-pixel conditional probability of obtaining any particular spatial combination of class labels at the target resolution, given the abundant coarse fractions and possibly some sparsely sampled class labels at that target resolution. That conditional distribution is typically complex with multiple modes and may not be analytically tractable. Instead of determining a single summary measure from the conditional distribution, such as its mean or mode, that distribution is “explored” by generating multiple samples or realizations from it (Kaipio and Somersalo, 2004, Mosegaard and Tarantola, 1995, Sambridge and Mosegaard, 2002, Tarantola, 2005). In our setting, this amounts to generating alternative simulated super-resolution maps of class labels that are consistent with all the information available; that is, the prior structural model, the coarse fractions, and the fine class labels if available. These simulated super-resolution maps can then be used to determine the likelihood of occurrence of patterns of classes over various groups or templates of pixels, by calculating the frequency of occurrence of such class patterns over the realizations.

More importantly, by using these synthetic super-resolution maps as inputs to a process simulation model, e.g., in a wildfire propagation simulator, one can build a probability distribution for the model outputs in a Monte Carlo framework (Bachmann & Allgöwer, 2002). By fixing other input variables to a nominal value or set of values, one could also assess via Monte Carlo simulation the sensitivity of that model to an uncertain land cover map; see Crosetto et al. (2001) for a discussion of uncertainty and sensitivity analysis techniques in a remote sensing context. In our case, that sensitivity would pertain to the lack of class labels at the appropriate model resolution.

One of the earliest approaches to super-resolution land cover mapping is that of Verhoeye and De Wulf (2001), who proposed a deterministic solution based on linear programming. This approach, however, does not acknowledge the existence of multiple super-resolution maps due to multiple optima in a linear programming formulation. In general, most existing algorithms for super-resolution land cover mapping are iterative in nature, and (rightfully so) yield different such maps depending on the initial map used in the iteration procedure. Mertens et al. (2003) adopted the same objective function as Verhoeye and De Wulf (2001), but used a genetic algorithm to search for plausible super-resolution maps. Tatem et al. (2001) trained a Hopfield neural network to optimize an initial super-resolution map (used for further iterations) with the simultaneous objectives of coarse fraction reproduction and spatial auto-correlation maximization; that method was successfully tested on an actual case study (Tatem et al., 2003). Mertens et al. (2004) used wavelets to account for the resolution difference between fine class labels and coarse fractions. At the fine scale, a neural network was trained to estimate the wavelet coefficients, from which a super-resolution map was reconstructed. Tatem et al. (2002) extended their neural network approach to account for indicator variogram models. Atkinson (2001) and Makido and Shortridge (2005) adopted a swapping algorithm, as used in spatial simulated annealing, to construct plausible super-resolution maps. In these latter works, the coarse fractions were matched exactly by construction. This was achieved by applying the swapping algorithm to an initial purely random super-resolution map comprised of the correct class fractions within each coarse pixel.

The common concern with the above iterative approaches is their rate of convergence and computational burden, associated with the repetitive evaluation of mismatch between simulated and observed coarse fractions, and most importantly between simulated and expected spatial structure. Other more complex sampling methods, such as Markov Chain Monte Carlo methods and simulated annealing are also iterative, and can become computationally prohibitive due to slow convergence; see, for example, Kaipio and Somersalo (2004). In addition, none of the existing approaches for super-resolution mapping accounts for fine resolution data in the form of a sparse set of class labels at informed fine pixels. The recently developed probability perturbation method of Caers and Hoffman (2006), although still iterative, appears to be less computational expensive, and warrants further attention in the context of super-resolution mapping.

In this paper, we propose a novel approach for super-resolution land cover mapping based on the geostatistical methods of indicator Kriging (Journel, 1983) and indicator stochastic simulation (Journel & Alabert, 1989), accounting explicitly for the resolution difference between the available coarse fractions and the sought-after class labels. The proposed approach: (i) is non-iterative and computationally inexpensive, (ii) offers exact, within round-off errors, reproduction of coarse resolution fractions, (iii) ensures exact reproduction of observed class labels at informed fine resolution pixels that might be available, and (iv) closely reproduces a set of indicator variogram models linked to transition probabilities of class labels from one fine pixel to another. Since we explicitly acknowledge that there are multiple solutions to super-resolution land cover mapping, the end product of our method is a set of alternative realizations or maps of class labels having the properties listed above.

Section 2.1 describes the links between the spatial statistics of the fine resolution class labels and the coarse resolution class fractions. These links are exploited in Section 2.2 to derive, via indicator coKriging, the conditional probability of class occurrence at any fine resolution pixel, given the neighboring coarse fractions and possibly some fine resolution sample class labels. In Section 2.3, the coKriging formulation is used within a modified sequential indicator simulation framework to generate alternative realizations of fine resolution class labels with the properties listed above. A case study is provided in Section 3, illustrating the applicability of our proposed methodology for super-resolution land cover mapping using data from a Landsat TM scene over SE China. Lastly, we offer some discussion and recommendations for future work in Section 4.

Section snippets

Methodology

Let c(v) denote the, usually unknown, class at a generic fine resolution pixel v = v(u), with u being the coordinate vector of its centroid; the area of that pixel v is denoted as |v|. It is assumed that at this fine resolution c(v) can take one of K mutually exclusive and collectively exhaustive labels, i.e., c(v) = k, with k = 1,…,K. The set of all true class labels constitutes the unavailable super-resolution image, and can be arranged in a (M × 1) vector c = [c(v_m), m = 1,…,M]^T, where superscript T

Case study

To demonstrate our proposed super-resolution mapping approach, we consider a reference land cover classification derived from a Landsat TM scene of an area in the Pearl River Delta, South East China; see Seto et al. (2002) for more details. The reference land cover class map, shown in Fig. 1, has a dimension 15 km × 15 km, and includes 500 × 500 fine resolution pixels. Each pixel has a size 30 m × 30 m, and is considered as belonging to one of K = 3 broadly defined land cover classes: vegetation (white

Discussion and conclusions

In this paper, super-resolution land cover mapping is viewed as an under-determined inverse problem, that of constructing fine resolution land cover maps from coarse class fraction data. We explicate the necessity of a prior model of spatial structure for land cover at the fine (target) resolution to resolve the inherent ambiguity of such an ill-posed inverse problem and make it solvable. In addition, we demonstrate that existing super-resolution land cover mapping solutions invoke such a prior

Acknowledgments

The first author acknowledges partial funding from a PhD scholarship from the program: Fonds Québécois de la recherche sur la nature et les technologies from the government of Quebec, Canada. The second author acknowledges partial funding provided by the National Science Foundation under Award BCS #0422599. Both authors would like to thank Prof. André Journel at Stanford University for his constructive comments on early versions of this paper. The constructive comments of three anonymous

References (37)

M. Crosetto et al.
Uncertainty propagation in models driven by remotely sensed data
Remote Sensing of Environment
(2001)
T. Kasetkasem et al.
Super-resolution land cover mapping using a Markov random field approach
Remote Sensing of Environment
(2005)
K.C. Mertens et al.
Sub-pixel mapping and sub-pixel sharpening using neural network predicted wavelet coefficients
Remote Sensing of Environment
(2004)
A.J. Tatem et al.
Super-resolution land cover pattern prediction using a Hopfield neural network
Remote Sensing of Environment
(2002)
P.M. Atkinson
Super-resolution target mapping from soft-classified remotely sensed imagery
P.M. Atkinson et al.
Defining an optimal size of support for remote-sensing investigations
IEEE Transactions on Geoscience and Remote Sensing
(1995)
P.M. Atkinson et al.
Mapping sub-pixel proportional land cover with AVHRR imagery
International Journal of Remote Sensing
(1997)
A. Bachmann et al.
Uncertainty propagation in wildland fire behaviour modeling
International Journal of Geographical Information Science
(2002)
J.A. Benediktsson et al.
Consensus theoretic classification methods
IEEE Transactions on Systems, Man, and Cybernetics
(1992)
M. Bertero et al.
Introduction to inverse problems in imaging
(1998)

R. Bordley

A multiplicative formula for aggregating probability assessments

Management Science

(1982)

J. Caers et al.

The probability perturbation method: A new look at Bayesian inverse modeling

Mathematical Geology

(2006)

S.F. Carle et al.

Transition probability-based indicator geostatistics

Mathematical Geology

(1996)

R.C. Gonzalez et al.

Digital image processing

(2002)

P. Goovaerts

Geostatistics for natural resources evaluation

(1997)

A.G. Journel

Non-parametric estimation of spatial distributions

Mathematical Geology

(1983)

A.G. Journel

Combining knowledge from diverse sources: An alternative to traditional data independence hypotheses

Mathematical Geology

(2002)

A.G. Journel et al.

Non-Gaussian data expansion in the earth sciences

Terra Nova

(1989)

Cited by (137)

Very fine spatial resolution urban land cover mapping using an explicable sub-pixel mapping network based on learnable spatial correlation
2023, Remote Sensing of Environment
Sub-pixel mapping is the prevailing approach for dealing with the mixed pixel effect in urban land use/land cover classification, by reconstructing the sub-pixel-scale distribution inside each mixed-pixel based on spatial autocorrelation. However, 1) traditional spatial autocorrelation is limited to a local window, which cannot model the teleconnection between two locations or objects that are far apart and 2) autocorrelation is based on the idea of “the more proximate, the more similar”, which relies on a distance-weight decay parameter and cannot characterize the rich variety of mutual information in spatially heterogenous areas in urban. In this research, we develop and demonstrate a learnable correlation-based sub-pixel mapping (LECOS) method. 1) We use the “mutual retrieval” mechanism of the self-attention operation to model teleconnections that enable more distant locations or objects to be mutually correlated and 2) we design a parameter-free “self-attention in self-attention” operation to learn adaptively the diverse global correlation patterns between pixel and sub-pixel. The learned spatial correlations are then used for reasoning the sub-pixel-scale distribution of each class. We validated our method on the most challenging public datasets of urban scenes, which exhibit considerable spatial heterogeneity with complex structures and broken objects. The learned building-tree, building-road and road-tree correlation patterns contributed most to the sub-pixel reconstruction result of the urban scenes, consistent with in-situ reference data. We further explored the model's explicability in a large-area of several metropolises in China, by mapping land cover in these cities at a 2 m very fine spatial resolution using 10 m Sentinel-2 input images, and found that the derived result not only revealed rich urban spatial heterogeneity, but also that the learned correlation was indicative of urban pattern dynamics, suggesting the potential for greater understanding of issues such as urban fairness, accessibility, human exposure and sustainability.
Generating continuous fine-scale land cover mapping by edge-guided maximum a posteriori based spatiotemporal sub-pixel mapping
2022, Science of Remote Sensing
Global land cover mapping activities are of great important for retrieving the environment we are living in. However, the trade-off between spatial and temporal resolution makes it difficult to obtain the continuous fine-scale land cover product for detail and frequent land surface analysis. To overcome the difficulty, this study proposed a modified maximum a posteriori (MAP) based spatiotemporal sub-pixel mapping (SSM), for a long-term and fine-scale land cover mapping. The proposed approach consists of three parts, i.e., data fidelity term, spatial regularization term, and enhanced temporal regularization term, which can make use of historical fine-scale image for a better super-resolution of the current coarse image. One regional experiment was implemented for the algorithm validation, and one real experiment was investigated, by generating continuous land cover mapping products (2010–2015) at Landsat-like spatial resolution from time series of MODIS images. Compared with traditional methods, the proposed approach reconstructed more pleasant spatial details and had greater quantitative accuracy, outperforming the state-of-the-art spatiotemporal sub-pixel mapping method by averagely 2.58% for all the inspected dates in OA metric. This study not only provides a generalized research framework for generating continuous fine-scale land cover product from daily available MODIS images, but also validates the usability of the time series products for detail land cover monitoring, and reveals a finding that the study area of Wuhan is a well practitioner of the sustainable development policy during its urbanization process.
A review of geostatistical simulation models applied to satellite remote sensing: Methods and applications
2021, Remote Sensing of Environment
Despite an ever-increasing number of spaceborne, airborne, and ground-based data acquisition platforms, remote sensing data are still often spatially incomplete or temporally irregular. While deterministic interpolation techniques are often used, they tend to create unrealistic spatial patterns and generally do not provide uncertainty quantification. Geostatistical simulation models are effective in generating an ensemble of realistic and equally probable realizations of an unmeasured phenomenon, allowing data uncertainty to be propagated. These models are commonly used in several fields of earth science, and in recent years, they have been applied widely to remotely sensed data. This study provides the first review of the applications of geostatistical simulation to remote sensing data. We review recent geostatistical simulation models relevant to satellite remote sensing data and discuss the characteristics and advantages of each approach. Finally, the applications of each geostatistical simulation model are categorized in different domains of natural sciences, including soil, vegetation, topography, and atmospheric science.
A new downscaling-integration framework for high-resolution monthly precipitation estimates: Combining rain gauge observations, satellite-derived precipitation data and geographical ancillary data
2018, Remote Sensing of Environment
Citation Excerpt :
The performance of various methods (i.e. DTRMM_GDA, DTRMM_KED, DTRMM_GWR4 and DTRMM_GWRK4) may be influenced by the accuracy of downscaled TRMM estimates and spatial characteristics of precipitation datasets derived from rain gauge observations. Previous works have pointed out that downscaling should be regarded as an under-determined inversion work (Boucher and Kyriakidis, 2006; Park, 2013). To further explore the error sources of different approaches, we investigate the relationship between the performance of the merging algorithms and the accuracy of the downscaled TRMM precipitation data, and the relationship between the performance of various rainfall estimates and the global Moran's I values.
Deriving high quality precipitation estimates at high spatial resolution is of prime importance for many hydrological, meteorological, and environmental investigations. Rain gauge observations and satellite-derived precipitation data are two main sources of precipitation estimates. Gauge observations are accurate and reliable, but are heavily point-based and sparse in areas of rugged or complex terrains. Satellite-derived precipitation products can cover large areas, but they are generally characterized by inherent bias. To optimize the use of both datasets, we propose in this paper, a downscaling-integration framework to generate high quality monthly precipitation datasets at 1 km spatial resolution by merging rain gauge observations and TRMM 3B43 products. Firstly, an area-to-point kriging (ATPK) approach is used to downscale the original TRMM product to 1 km, so as to ensure a fair comparison with rain gauge data. Then, the downscaled TRMM precipitation datasets are integrated with the gauge observations using geographically weighted regression kriging (GWRK). The geographical factors (i.e. longitude, latitude and elevation) are also used as auxiliary variables in the GWRK model. Applying this approach to an experiment conducted at the middle and lower reaches of the Yangtze River in China from 2001 to 2014 shows that: (1) the downscaled monthly TRMM precipitation data by ATPK are more accurate than the original TRMM estimates; (2) the GWRK model employing the downscaled TRMM precipitation data and geographical factors provides better monthly precipitation estimates than the conventional ordinary kriging (OK) interpolation and the commonly used merging methods (i.e. geographical difference analysis, GDA and kriging with external drift, KED); (3) the GWRK method reduces the influence of the inaccuracy (bias) of satellite-derived precipitation data on the precipitation estimates compared to GDA. The approach presented in this study has provided an efficient alternative for solving the scale mismatch problem between point-based gauge data and low resolution satellite data, and producing improved precipitation data at high spatial resolution.
Super-resolution mapping using cellular automata model
2023, Research Square
Super-Resolution Mapping with a Fraction Error Eliminating CNN Model
2023, IEEE Transactions on Geoscience and Remote Sensing

View all citing articles on Scopus

View full text

Super-resolution land cover mapping with indicator geostatistics

Abstract

Introduction

Section snippets

Methodology

Case study

Discussion and conclusions

Acknowledgments

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Remote Sensing of Environment

Super-resolution target mapping from soft-classified remotely sensed imagery

Defining an optimal size of support for remote-sensing investigations

IEEE Transactions on Geoscience and Remote Sensing

Mapping sub-pixel proportional land cover with AVHRR imagery

International Journal of Remote Sensing

Uncertainty propagation in wildland fire behaviour modeling

International Journal of Geographical Information Science

Consensus theoretic classification methods

IEEE Transactions on Systems, Man, and Cybernetics

Introduction to inverse problems in imaging

A multiplicative formula for aggregating probability assessments

Management Science

The probability perturbation method: A new look at Bayesian inverse modeling

Mathematical Geology

Transition probability-based indicator geostatistics

Mathematical Geology

Digital image processing

Geostatistics for natural resources evaluation

Non-parametric estimation of spatial distributions

Mathematical Geology

Combining knowledge from diverse sources: An alternative to traditional data independence hypotheses

Mathematical Geology

Non-Gaussian data expansion in the earth sciences

Terra Nova