Next Article in Journal
Data Fusion for Cross-Domain Real-Time Object Detection on the Edge
Previous Article in Journal
Multi-Focus Image Fusion via Distance-Weighted Regional Energy and Structure Tensor in NSCT Domain
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Scale Effect of Land Cover Classification from Multi-Resolution Satellite Remote Sensing Data

1
School of Geographical Sciences, Qinghai Normal University, Xining 810008, China
2
Qinghai Province Key Laboratory of Physical Geography and Environmental Process, Xining 810008, China
3
Ministry of Education Key Laboratory of Tibetan Plateau Land Surface Processes and Ecological Conservation, Xining 810008, China
4
Academy of Plateau Science and Sustainability, Xining 810008, China
*
Author to whom correspondence should be addressed.
Sensors 2023, 23(13), 6136; https://doi.org/10.3390/s23136136
Submission received: 23 May 2023 / Revised: 23 June 2023 / Accepted: 30 June 2023 / Published: 4 July 2023
(This article belongs to the Section Remote Sensors)

Abstract

:
Land cover data are important basic data for earth system science and other fields. Multi-source remote sensing images have become the main data source for land cover classification. There are still many uncertainties in the scale effect of image spatial resolution on land cover classification. Since it is difficult to obtain multiple spatial resolution remote sensing images of the same area at the same time, the main current method to study the scale effect of land cover classification is to use the same image resampled to different resolutions, however errors in the resampling process lead to uncertainty in the accuracy of land cover classification. To study the land cover classification scale effect of different spatial resolutions of multi-source remote sensing data, we selected 1 m and 4 m of GF-2, 6 m of SPOT-6, 10 m of Sentinel-2, and 30 m of Landsat-8 multi-sensor data, and explored the scale effect of image spatial resolution on land cover classification from two aspects of mixed image element decomposition and spatial heterogeneity. For the study area, we compared the classification obtained from GF-2, SPOT-6, Sentinel-2, and Landsat-8 images at different spatial resolutions based on GBDT and RF. The results show that (1) GF-2 and SPOT-6 had the best classification results, and the optimal scale based on this classification accuracy was 4–6 m; (2) the optimal scale based on linear decomposition depended on the study area; (3) the optimal scale of land cover was related to spatial heterogeneity, i.e., the more fragmented and complex was the space, the smaller the scale needed; and (4) the resampled images were not sensitive to scale and increased the uncertainty of the classification. These findings have implications for land cover classification and optimal scale selection, scale effects, and landscape ecology uncertainty studies.

1. Introduction

Remote sensing provides data on a large scale and around the clock for use in various industries. Land cover classification data obtained by remote sensing are essential data for studying surface processes and for climate model simulations [1,2,3,4]. In recent decades, with the development of space science and multi-platform remote sensing, multi-sensor and multi-angle technologies, the spatial resolution, spectral resolution, and temporal resolution of remote sensing images have been improving. Spatial resolution is one of the basic characteristics of remote sensing images, and the scale effect in remote sensing is a key research problem. Woodcock [5] considered that spatial resolution should be similar to the scale of observation. Multi-source satellite remote sensing images have become the basic data source for regional, national, and global mapping. Studies have shown that land cover mapping is influenced by the spatial resolution of remote sensing images, which has an obvious spatial scale effect [6,7,8]. The scale effect of the information acquired by remote sensing is the key to obtaining optimal scale land cover mapping based on the optimal resolution for a particular study area. Han et al. [9] used the method of information entropy to solve the average entropy of category differentiability of image data at each scale and calculate the optimal scale. They showed that the optimal scale has a relationship with the spatial distribution characteristics of the features. Treitz [10] used the variance function to calculate the optimal spatial resolution based on the theory of spatial autocorrelation analysis of spatial statistics and concluded that the optimal scale was related to the ground scene and sensor parameters. Dongping Ming et al. [11] proposed an improved local variance method based on the variable window and variable resolution to determine the optimal resolution using local variance as a measure. Their results are not applicable to a large range of complex environments. Feng et al. [12] used a Triangular Prism Method and Double Blanket Method to determine the resolution of images with three fractal dimensions, and the results had uncertainties. Ming et al. [13] studied the optimal spatial resolution of different features in remote sensing images by using the improved method of average local variance and concluded that variance increases with increasingly complex feature information. The above studies are based on the optimal spatial resolution calculated by geostatistical and classical statistical methods, and there is a large uncertainty in the research conclusions. Research on the scale effect of remote sensing images still lacks clear conclusions.
Generally speaking, features have inherent scales, and the expression at the inherent scale is the most realistic representation of the features [1,14]. However, it is difficult to obtain images with different spatial resolutions from multiple satellites covering the same area due to weather conditions, cloud cover, sensor performance, satellite transit times, and other factors, so most current studies of scale effects often use the same image resampled to obtain data with different spatial resolutions [6,15,16,17,18,19,20,21,22]. However, due to the existence of spatial heterogeneity, resampling can cause distortion of features and loss of spectral information, and the resampled images are different from the real satellite images at a specific spatial scale. Therefore, the results of these studies are still somewhat questionable [23,24].
Markham and Townshend [25] argued that remote sensing classification accuracy is mainly influenced by two factors. The first factor is the image elements at the edge between categories in the classification results, i.e., the hybrid image elements. When the spatial resolution of the image increases, the number of hybrid image elements at the edge between different ground feature categories decreases, and the classification accuracy increases. The second factor is spatial heterogeneity. When the spatial resolution increases, the variability of spectral features within the same feature category increases, which causes the inter-category separability to decrease, thus leading to a decrease in classification accuracy. On the surface, the accuracy effects of spatial resolution variation in relation to mixed image elements and spatial heterogeneity are contradictory. However, the variation in classification accuracy ultimately still depends on the relative relationship between the spatial resolution of the image and the size of the target within the scene. For larger homogeneous targets, the reduction in spatial resolution only increases the number of hybrid pixels at the edges but does not cause a change of spectral variability between pixels within the target, so the classification accuracy is reduced. In contrast, for targets with large spectral-spatial heterogeneity, the reduction in spatial resolution increases the number of hybrid pixels at the edges, but the smoothing effect of the reduced spatial resolution may improve the accuracy of the final classification results, which may reduce the intra-class spectral variation and increase the distinguishability between classes. Woodcock and Strahler [5] argued that the net effect of these two conflicting factors is a function of the environment of the image scene. Therefore, it is necessary to analyze the effect of spatial resolution variation on land cover classification accuracy based on multi-source remote sensing data in terms of both mixed image element decomposition and spatial heterogeneity and to study the optimal scale effect of land cover classification at different spatial resolutions from multi-source remote sensing data.
In this study, we selected 1 m and 4 m data from GF-2 satellite, 6 m data from SPOT-6, 10 m data from Sentinel-2, and 30 m data from Landsat-8 OLI to quantitatively investigate the relationship between land cover classification results and different spatial resolution from multiple satellite remote sensing data, and explore how the classification accuracy varies with spatial resolution, and to investigate whether the resampled remote sensing data have any influence on the scale analysis comparing with real remote image data, and which scale can most accurately represent the ground truth distribution characteristics of land cover. The results can provide a reference for selecting the optimal scale for land cover classification and a basis for scale conversion.

2. Study Area and Data Pre-Processing

2.1. Overview of the Study Area

The Huangshui River is an important first-class tributary of the upper reaches of the Yellow River, and the Huangshui basin is located in the northeast of Qinghai province, between 36°02′–37°28′ N, 100°42′–103°04′ E (Figure 1). The basin area covers 16,120 km2. The main cities in the basin include Xining City and Haidong City, the main population-gathering areas in Qinghai province. The land cover is greatly affected by human activities, with diverse feature types and fragmented feature patches. Xining is the capital city, as well as the political, economic, transportation, and cultural center of Qinghai province. Its administrative area includes four districts and three counties (Huangyuan County, Datong County, and Huanzhong County). Haidong City includes two municipal districts (Ledu District and Pingan District), as well as Minhe Hui and Tu Mutual Autonomous County. The topography of the whole watershed is undulating and diverse, dominated by hills and medium-high mountains. Two typical areas in two important cities in the Huangshui basin were selected for our study. One area is located in Duoba New Area of Xining, which is a key development and construction area of Xining with a complex topography and typical land cover type; and another area is located in the Ping’an District of Haidong, which is an important transportation hub of the Qinghai-Tibet Plateau and the main foreign port of Qinghai Province. Xining Caojiabao International Airport is located in this area. The topography is relatively flat, and the land cover type is typical.

2.2. Data Sources

The satellite images used in this study are from Chinese GF-2, French SPOT-6, ESA Sentinel-2, and U.S. Landsat-8. Among them, GF-2 is the first batch of satellites launched by the major project of China’s high-resolution earth observation system. It is the civil remote sensing satellite with the highest spatial resolution and the largest observation width developed by China. It is equipped with two high-resolution 1 m panchromatic and 4 m multi-spectral cameras. SPOT-6 was successfully launched by the French Space Center on 22 September 2012. It has an orbital altitude of 695 km and a spatial resolution of 6 m. It records images in multi-spectral blue, green, red, and near-infrared bands and 1.5 m panchromatic bands with a standard image coverage of 60 km × 60 km. Sentinel-2A is the second satellite of the European Space Agency of the European Union’s Copernicus Earth Observation Program. It was launched on 23 June 2015 for the Global Monitoring for Environment and Security program. Sentinel-2A carries a multi-spectral imager with 13 spectral bands, a strip width of 290 km, and a revisit period of 10 days. Landsat-8 is a U.S. Landsat program that was successfully launched on 11 February 2013. Landsat-8 carries the Land Imager (OLI) and the Thermal Infrared Sensor (TIRS). The OLI Land Imager includes nine bands with a spatial resolution of 30 m. The satellite sensors and their parameters are listed in Table 1 below.

3. Research Methodology

The flow chart in Figure 2 shows the mapping and analysis methods applied in this study. Land use/cover classification and analysis of scale effects are the main steps involved. The following sections describe the analysis scheme and several relevant steps in this study in detail.

3.1. Ensemble Classification Methods

Ensemble learning (EL) classification methods based on multiple classifiers have been shown to be some of the most effective methods for remote sensing image classification [26,27,28]. EL trains various base classifiers separately and then combines them with related combination methods (e.g., bagging, augmentation, or voting) to produce the final classification results. Bagging ensemble methods use the same training algorithm to train several subsets, and each classifier randomly selects the training data, which means that different subsets of the same sample can be selected [29]. Then, the output of each classifier is used for voting decisions. The random forest (RF) algorithm is based on the bagging ensemble method, with a small adjustment so that the correlation between individual trees is reduced [30]. The Boosting ensemble method is an improvement on RF. The classification principle is to iteratively train a series of weak classifiers. Higher weighted attention is used to correctly classify in the next learning round, and the final result is determined by the maximum number of votes classified by the weak classifier [31]. The gradient boosting decision tree (GBDT), an algorithm among boosting ensemble methods, has been proved to be one of the most effective algorithms. It is known for its excellent performance, and recent results in many research areas have shown that it outperforms various other classifiers [32,33,34]. In this paper, we study the application of the EL method to investigate the spatially optimal scale of land cover.

3.1.1. RF

RF belongs to the classification and prediction method that integrates a set of classification and regression tree (CART) decision trees. RF is the most representative bagging ensemble learning algorithm that combines bagging ensemble learning and random subspace methods to reduce overfitting [35]. In the classification process, data sets with different subsamples are randomly selected. Several decision trees are trained using different feature subsamples, and the results of the subsample decision trees are voted on to output the final classification results. RF input data does not require magnitude processing and can automatically handle missing values. It is one of the most commonly used machine learning algorithms.

3.1.2. GBDT

GBDT is a Boosting ensemble machine learning method that combines multiple decision trees. GBDT is a residual model in the direction of gradient descent. It is based on the process of upgrading weak classifiers to strong classifiers. Each iteration reduces the residuals of the last iteration and constantly adjusts the weights of misclassified samples to improve the accuracy of classification. GBDT can fit the true distribution of data and has a strong generalization ability. GBDT has good overall performance due to the complementary strengths of the weak classifiers [36,37].

3.2. Linear Decomposition Method

The lower the spatial resolution of an image, the higher the probability that an image element contains two or more features. In ensemble learning classification, the mixed image elements are assigned to the category with the highest probability. If each hybrid image element can be decomposed and the percentage of overlay-type components to the image element can be solved, the uncertainty of classification results can be quantified, resulting in multiple hybrid image element decomposition models. The same idea can be used to calculate the percentage of various features in an image element for the classified results of low-resolution images, and the classification uncertainty of low-resolution images can be evaluated.
The principle of the hybrid image decomposition model is to decompose each hybrid image element and solve for the percentage of the overlay-type components in the decomposed image elements. The hybrid image decomposition model allows the uncertainty of classification results to be quantified. For the classified results, the same method can be used to evaluate the classification uncertainty of low-resolution images by calculating the percentage of each feature type in each image element of the low-resolution image classification results.
In this study, the area corresponding to the image element size of 1 m × 1 m after the fusion of 1 m panchromatic and 4 m multi-spectral images of GF-2 image is used as a sliding window. The land cover types contained in each window and the percentage of each type are counted on the classification results of 4 m of GF-2, 6 m of SPOT-6, 10 m of Sentinel-2, and 30 m of Landsat-8, respectively. The classification results of GF-2–4 m, SPOT-6, Sentinel-2, and Landsat-8 are only one category of cultivated land, forest land, grassland, water, built-up land, and bare land, while the GF-2 classification results of 1 m after fusion are a linear combination of each category expressed as:
a 1 × c c u l t i v a t e l a n d + a 2 × c f o r e s t l a n d + a 3 × c g l a s s l a n d + a 4 × c w a t e r l a n d + a 5 × c b u i l t u p l a n d + a 6 × c b a r e l a n d = 1 where   a 1 + a 2 + a 3 + a 4 + a 5 + a 6 = 1   and   a 1 ,   a 2 ,   a 3 ,   a 4 ,   a 5 ,   a 6 [ 0 , 1 ]
where a i is the classification results of different scale images (4 m of GF-2, 6 m of SPOT-6, 10 m of Sentinel-2, and 30 m of Landsat-8) correspond to the percentage of each land cover type in GF-2 classification results of 1 m, c c u l t i v a t e l a n d , c f o r e s t l a n d , c g l a s s l a n d , c w a t e r l a n d , c b u i l t u p l a n d , c b a r e l a n d represent the land cover types: cultivated land, forest land, grassland, water, built-up land, and bareland, respectively. The value of c is 1 or 0. If there is one of the six land cover types, c is 1; if there is no one of the six land cover types, c is 0.

3.3. Spatial Heterogeneity Method

Spatial heterogeneity refers to the heterogeneity and complexity of the spatial distribution of ecological processes and patterns [38]. Spatial heterogeneity can generally be understood as the sum of spatial patchiness and gradient. Spatial pattern, heterogeneity, and patchiness are characteristics dependent on scale. We can define landscape indices to quantitatively describe landscape characteristics, landscape pattern information, and spatial heterogeneity, reflecting the structural composition characteristics and spatial configuration relationships of the landscape [39]. To analyze the spatial heterogeneity of different study areas, five landscape indices, namely area-weighted mean patch area (AREA_AM), largest patch index (LPI), aggregation index (AI), splitting index (SPLIT), landscape shape index (LSI), were selected at the landscape level in terms of area, shape, aggregation, and distribution.
AREA_AM is the area-weighted average patch area, calculated by the following formula:
A R E A _ A M = i = 1 n w i j × a i j i = 1 n w i j
where a i j represents the area of a patch of a landscape element, w i j is the weight size.
LPI is the proportion of the largest patch in a patch type occupying the entire landscape area, in the range 0 < LPI ≤ 100, calculated by the following formula:
L P I = max a i j A × 100 %
where a i j represents the area of a patch of a landscape element, and A is the total area of all landscapes.
AI is the aggregation index, which reflects the spatial configuration characteristics of landscape elements. The smaller AI, the higher the dispersion of the landscape. Conversely, the lower the dispersion of the landscape, the calculation formula is as follows:
A I = g i i max g i i × 100 %
where g i i is the number of adjacencies between different plaque types.
SPLIT is the landscape separation degree, which refers to the separation degree of individual distribution of different numbers of patches in a certain landscape type, calculated as follows:
S P L I T = D i j A i j × 100 %
where D i j is the distance index of landscape type, A i j is the area index of landscape type.
LSI is the landscape shape index, reflecting the complexity of the overall landscape shape. The closer LSI is to 1, the simpler the overall landscape shape. The larger LSI is, the more complex. The calculation formula is as follows:
L S I = 0.25 E A
where E is the total length of all patch boundaries in the landscape, A is the total area of the landscape.

4. Results and Analysis

4.1. Classification Results and Accuracy Analysis

In our study, the classification input features are 1 m and 4 m for GF-2, 6 m for SPOT-6, 10 m for Sentinel-2, and 30 m for Landsat-8 in the red, green, blue, and near-infrared bands, respectively. The classification samples are actual land cover type samples acquired by handheld GPS in the field, of which 80% are training samples, and 20% are validation samples. The classification parameters are set the same for different resolutions in the same classification method. In order to reduce the influence of other factors, the scale effects of the classification results are explored with the same inputs, the same samples, and the same parameter settings.
The classification results based on RF and GBDT, two ensemble methods to classify land cover in two respective study areas, are shown in Figure 3 and Figure 4. The classification results were evaluated using the producer’s accuracy, user’s accuracy, f1-score, overall accuracy, and Kappa coefficient, as shown in Figure 5, Figure 6 and Figure 7 and Table 2.
The comprehensive analysis of the classification results of the five different spatial resolution images shows that higher spatial resolution of the images allows for better extraction features with smaller areas or sizes. For instance, small features such as reservoir pits are difficult to be extracted on images of 10 m and 30 m resolution but can be extracted on high-resolution images such as 1m and 4 m images of GF-2. Cultivated land is extracted more completely and clearly outlined on mesoscale 6 m and 10 m images, while the area of cultivated land extracted on 30 m upper images is large. The classification results are better for the small-scale resolution of feature types with larger areas and single features; for instance, grassland is better classified on 10 m and 30 m images with less pepper and better consistency.
It can be seen from Table 2 that the spatial resolution of the images largely affects the classification results of the images. For the five resolution images included in this study, the overall accuracy exceeds 84.54%. As a whole, the classification accuracy is consistent with related studies [40,41]. The effect of different classification methods on the same spatial resolution is not very significant. The classification accuracy varies by about 6% with a change of spatial resolution from 1–30 m, while the accuracy of different classification methods of the same resolution varies by about 1%, indicating that land cover classification accuracy is closely related to the spatial resolution of remote sensing images. The classification accuracy of GBDT is higher than that of RF, which indicates that the effect of boosting ensemble classification is better than that of bagging ensemble classification. The classification accuracy of the two ensemble classification methods was the highest at the spatial resolution of 4 m and 6 m and decreased when the image resolution exceeded or was smaller than 4 m. In terms of classification accuracy analyses, the optimal spatial scale of land cover for both study areas is 4–6 m, which is consistent with related studies [11,42].
The overall producer’s accuracy of GBDT is higher than that of RF. Among them, the average producer’s accuracy of GF-2 (4 m) in Region A was the highest; the average producer’s accuracy of SPOT-6 and Sentinel-2 was higher in Region B (Figure 5). The producer’s accuracy of water area is generally higher, and the producer’s accuracy of bare land is generally lower. The extreme differences between the maximum and minimum values of the producer’s accuracies of different land cover types on Sentinel-2 are larger, specifically: the extreme differences are larger in Region A than in Region B. This indicates that the distribution of the producer’s accuracies of different land cover types in Region B is more concentrated than that in Region A, and the uncertainties of different feature types in Region A are higher.
The user’s accuracy of GBDT is generally higher than that of RF. Among them, the average user’s accuracy of GF-2 (4 m) in Region A is the highest; the average user’s accuracy of SPOT-6 and Sentinel-2 is higher in Region B (Figure 6). The user’s accuracy of forest land in Region A is generally higher; the user’s accuracy of unutilized land in Region B is lower. The larger extreme difference between Region A and Region B indicates that the distribution of user accuracy for different land cover types in Region B is more concentrated than that in Region A. The uncertainty of different feature types in Region A is higher.
The f1-score is the average of producer accuracy and user accuracy. The average f1-score accuracy of GF-2 (4 m) in Region A is the highest; the average f1-score accuracy of SPOT-6 and Sentinel-2 is higher in Region B; the f1-score of Landsat-8 is lower in both areas (Figure 7). Based on the classification accuracy, it can be concluded that the spatial resolution dependence of the watershed on the image is low, while the dependence on woodland and cropland is high. The accuracy of GF-2 (4 m) is better in Region A, and the accuracy of SPOT-6 and Sentinel-2 is better in Region B. Therefore, the uncertainty of Region A is higher than that of Region B, and the optimal scale accuracy of Region B corresponds to a lower spatial resolution than the optimal scale accuracy of Region A.

4.2. Optimal Scale Analysis Based on Linear Decomposition

Based on the linear decomposition method, the 1 m classification results of GF-2 fusion were used as a reference for Landsat-8, Sentinel-2, and SPOT-6 classification results. The fitted equations and related parameters were established using the curve fitting method and are summarized in Table 3. The coefficients of determination are greater than 0.85 (p < 0.01) in both study areas. The fitted curves are shown in Figure 8.
The scale effect of the fitted curves based on the linear decomposition method in Region A and Region B shows that the mean value of the decomposition tends to increase and then decrease as the scale increases (Figure 8). The optimal classification scale of Region A is 5 m, and the optimal scale of Region B is 13 m. The optimal scale of decomposition is different for different study areas, mainly because of the different distribution patterns of topographic and geomorphological complexity and land cover types in the two areas, and therefore the spatial heterogeneity is different. Region A has a more complex topography and more fragmented patches. Region B is located in Xining Caojiabao Airport, with relatively flat topography and regular feature patches, boundaries, and shapes. Therefore, the optimal land cover scale of Region B is larger than that of Region A. That is, the spatial resolution of remote sensing images used for the classification of Region B is larger than that of Region A.
To compare the scale effect of image resampling on land cover, resampling was performed based on the same scene image. To compare data sources and maintain maximum spectral information, the sampling method chosen used the nearest neighbor sampling method with the least information loss. This has less impact on the amount of image information and is especially suitable for resampling before classification [43]. Therefore, the GF-1 fused 1 m image was resampled into 4, 6, 10, and 30 m by the nearest neighbor sampling method. The results of the linear decomposition based on resampling are shown in Figure 9.
It can be seen that the linear decomposition result based on the classification after resampling is not sensitive to the scale (Figure 9). The linear decomposition gradually decreases as the scale rises. The linear decomposition value and scale of the classification after resampling do not show a trend of first increasing and then decreasing, probably because resampling introduces errors.

4.3. Spatial Heterogeneity Analysis

Land cover classification of remote sensing images is mainly influenced by two factors: mixed image elements and spatial heterogeneity. Mixed image element decomposition shows that the choice of the optimal scale for land cover classification is related to the study area. The more complex the study area, the higher the resolution or the larger the scale of the image for classification. To analyze the spatial heterogeneity of different study areas, five landscape indices, namely AREA_AM, LPI, AI, SPLIT, and LSI, were selected from the aspects of the area, shape, aggregation, and distribution at the landscape level. The landscape indices values were calculated based on the landscape pattern analysis software Fragstats4.2. The calculation results in two areas based on two classification methods are shown in Table 4.
From the above tables, the average patch area of Region B is larger than that of Region A. The maximum patch area of Region B is twice as large as that of Region A. The proportion of the largest patches to the landscape area in Region B is larger than that in Region A, the aggregation is better, and the shape is more regular. Region A is more spatially heterogeneous than Region B. Therefore, the classification of land cover types in Region B corresponds to a larger scale, consistent with the results of linear decomposition.
According to our results, the optimal classification result is not necessarily the optimal land cover scale. The classification results are the expression of macroscopic feature patterns, and factors such as shape, patchiness, and aggregation separability associated with feature characteristics determine the optimal scale of land cover. The more fragmented and complex the features are, the smaller the optimal land cover scale needs to be to finely represent the feature distribution.

5. Discussion

5.1. Discussion of Classification Methods

RF and GBDT are the most popular bagging and boosting ensemble methods based on decision trees. In our study, the GBDT model outperforms the RF model due to the different ensemble of trees in the ensemble methods. RF uses bagging (bootstrap resampling) method to construct different training sets and determines the final classification result using the maximum number of votes for the results of different datasets. GBDT uses gradient boosting to create a tree based on the residuals of the previously created tree. Both RF and GBDT have advantages, such as dealing with nonlinearities and limiting overfitting. Dietterich [44] compared bagging and boosting ensembles and found that the noise in the boosting dataset is less than that in the bagging dataset, which means that boosting is sensitive to the noise in the data. The signal-to-noise ratios of data at different scales are different, but in this study, RF and GBDT are better in GF-2, SPOT-6, Sentinel-2, and Landsat-8 classifications. The ensemble classifier has complementary advantages over a single classifier and can improve classification accuracy, in accordance with Feng et al. [45].
Yang et al. [46] used three methods, ISODATA, MLC, and SVM, to classify land cover in the Poyang Lake region, a region with strong spatial heterogeneity during the dry period. Their results indicate that different land cover classification methods may lead to different classification results for the same remote sensing data. In the same geographical area, the classification results may be different when using the same classification methods for land cover classification of remote sensing data with different resolutions. Even if the same classification method is used to classify land cover data with different resolutions in the same geographical area, the classification results may be different. By contrast, in our study, the classification results from RF and GBDT show more consistent results on different remote sensing data, and both SPOT-6 and GF-2 (4 m) have higher classification accuracy in two different areas. Therefore, the advantages of the ensemble method are better for multi-scale data usability.

5.2. Uncertainty Analysis of Classification

Determining the optimal classification system is the premise and foundation of land cover classification. A suitable classification system should consider both the actual situation of the study area and the spectral and spatial resolution of image data. At present, the national land use/cover classification system for remote sensing monitoring gives six major categories: cultivated land, forest land, grassland, water area, urban and rural industrial and mining residential construction land, and bare land. Our study areas are located in a high altitude, hilly, middle, and high mountainous terrain area and consider only six major categories of land cover classification. A more detailed classification may cause more complicated linear decomposition and spatial heterogeneity. Therefore, the conclusions drawn in this paper are applicable to the six major categories of land cover classification, and more detailed land classification may require further study.
From the analysis of classification accuracy after resampling, the overall classification accuracy of real images with different resolutions and the f1-score of each category after resampling in study Region A and Region B are shown in Figure 10. From the figure, it can be seen that the overall classification accuracy and consistency between each category are not very good. The consistency in study Region B is better than that in study Region A. The greater the regional heterogeneity, the greater the effect on the images after resampling. Combining the analysis of Region A and Region B, the consistency was higher on urban and rural industrial and mining residential construction land and forest land, and the uncertainty was larger in other categories, which is basically similar to the findings of Xu et al. [47]. It further indicates that resampling increases the uncertainty of the classification.

5.3. Uncertainty Analysis Based on Linear Model

The statistics of feature categories in Region A and Region B for the two ensemble learning classification methods are shown in Figure 11, Figure 12, Figure 13 and Figure 14. The table refers to the mean and standard deviation of the image elements of each category on the SPOT-6, Sentinel-2, and Landsat-8 classification images. This is similar to the misclassification cost (MCC) introduced by Defries et al. [48] for land cover classification accuracy evaluation, where the degree of misclassification is different between categories.
According to the statistical results of regions A and B based on the linear decomposition method, it can be seen that with the decrease in spatial resolution, the mean results of different ground object types decrease, indicating that the decrease in spatial resolution increases the number of mixed pixels, the spatial heterogeneity, and the spatial uncertainty.
In the two study areas, the proportions of cropland, grassland, urban and rural industrial and mining residential construction land, and bare land are larger, and their consistency of classification results on SPOT-6, Sentinel-2, and Landsat-8 images is higher. There is less uncertainty in the attribution of image categories for these four features. The mean value of water is lower because the areas under water are smaller and distributed in strips, corresponding to a smaller scale. The mean value of GF-2 (4 m) is the largest in both study areas, 0.5957 and 0.6945, indicating 40% uncertainty in Region A and 30% uncertainty in Region B. The mean standard deviation is around 0.3, indicating fewer fluctuations and more reliable mean decomposition.

6. Conclusions

In this paper, we explored the scale effect of land cover classification from mixed image element decomposition and spatial heterogeneity using multi-source and multi-scale satellite remote sensing data and came to the following conclusions:
(1)
GF-2, SPOT-6, Sentinel-2, and Landsat-8 images with different spatial resolutions based on GBDT and RF were used for classification studies, and GF-2 and SPOT-6 had the best classification results. Therefore, the optimal scale based on classification accuracy is 4–6 m.
(2)
The optimal scale based on linear decomposition is related to the study area, and the optimal scale is different for different study areas.
(3)
The optimal scale of land cover classification is related to spatial heterogeneity. The more fragmented and complex the space, the smaller the scale needed.
(4)
Images based on resampling do not reflect the characteristics of the actual scale images well, are insensitive to scale effects and increase the uncertainty of the classification.
(5)
The best classification result is not necessarily the optimal land cover scale. The classification result is only a representation of the macroscopic feature pattern. Factors such as shape, patchiness, and aggregation separability associated with feature characteristics determine the optimal land cover scale.

Author Contributions

Conceptualization, R.L. and X.G.; methodology, R.L.; software, R.L. and H.Z.; validation, F.S.; formal analysis, X.G; investigation, R.L. and F.S.; resources, X.G.; data curation, F.S.; writing—original draft preparation, R.L.; writing—review and editing, R.L.; supervision, X.G.; project administration, X.G; funding acquisition X.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Natural Science Foundation of the Qinghai Science and Technology Department (Grant No. 2021-ZJ-913).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available upon request from the first author.

Acknowledgments

We sincerely thank the Natural Resources Remote Sensing Center of Qinghai Province for providing the GF-2 satellite data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Defries, R. Terrestrial vegetation in the coupled human-earth system: Contributions of remote sensing. Ann. Rev. Environ. Resour. 2008, 33, 369–390. [Google Scholar] [CrossRef]
  2. Turner, W.; Spector, S.; Gardiner, N.; Fladeland, M.; Sterling, E.; Steininger, M. Remote sensing for biodiversity science and conservation. Trends Ecol. Evol. 2003, 18, 306–314. [Google Scholar] [CrossRef]
  3. Cihlar, J. Land cover mapping of large areas from satellites: Status and research priorities. Int. J. Remote Sens. 2000, 21, 1093–1114. [Google Scholar] [CrossRef]
  4. Gong, P.J.; Wang, J.; Yu, L.; Zhao, Y.; Zhao, Y.; Liang, L.; Niu, Z.; Huang, X.; Fu, H.; Liu, S.; et al. Finer resolution observation and monitoring of global land cover: First mapping results with Landsat TM and ETM+ data. Int. J. Remote Sens. 2013, 34, 2607–2654. [Google Scholar] [CrossRef] [Green Version]
  5. Woodcock, C.E.; Strahler, A.H. The factor of scale in remote sensing. Remote Sens. Environ. 1987, 21, 311–332. [Google Scholar] [CrossRef]
  6. Chen, D.; Stow, D.A.; Gong, P. Examining the effect of spatial resolution and texture window size on classification accuracy: An urban environment case. Int. J. Remote Sens. 2004, 25, 2177–2192. [Google Scholar] [CrossRef]
  7. Lu, D.; Weng, Q. A survey of image classification methods and techniques for improving classification performance. Int. J. Remote Sens. 2007, 28, 823–870. [Google Scholar] [CrossRef]
  8. Lv, Z.Y.; He, H.Q.; Benediktsson, J.A.; Huang, H. A generalized image scene decomposition-based system for supervised classification of very high resolution remote sensing imagery. Remote Sens. 2016, 8, 814. [Google Scholar] [CrossRef] [Green Version]
  9. Han, P.; Gong, J.Y.; Li, Z.L.; Bo, Y.C.; Cheng, L. Selection of optimal scale in remotely sensed image classification. J. Remote Sens. 2010, 14, 507–518. [Google Scholar]
  10. Treitz, P. Variogram analysis of high spatial resolution remote sensing data: An examination of boreal forest ecosystems. Int. J. Remote Sens. 2001, 22, 3895–3900. [Google Scholar] [CrossRef]
  11. Ming, D.P.; Wang, Q.; Yang, J.Y. Spatial scale of remote sensing image and selection of optimal spatial resolution. J. Remote Sens. 2008, 12, 529–537. [Google Scholar]
  12. Feng, G.X.; Ming, D.P. Fractal based method on selecting the optimal spatial resolution for remote sensing images. J. Geogr. Inf. Sci. 2015, 17, 478–485. [Google Scholar]
  13. Ming, D.; Yang, J.; Li, L.; Song, S. Modified ALV for selecting the optimal spatial resolution and its scale effect on image classification accuracy. Math. Comput. Model. 2011, 54, 1061–1068. [Google Scholar] [CrossRef]
  14. Marceau, D.J.; Hay, G.J. Remote sensing contributions to the scale issue. Canadian J. Remote Sens. 1999, 25, 357–366. [Google Scholar] [CrossRef]
  15. Li, X.; Cheng, G.; Liu, S.; Xiao, Q.; Ma, M.; Jin, R.; Che, T.; Liu, Q.; Wang, W.Z.; Qi, Y.; et al. Heihe Watershed Allied Telemetry Experimental Research (HiWATER): Scientific objectives and experimental design. Bull. Am. Meteorol. Soc. 2013, 94, 1145–1160. [Google Scholar] [CrossRef]
  16. Pu, R.L.; Susan, B. Mapping seagrass coverage and spatial patterns with high spatial resolution IKONOS imagery. Int. J. Appl. Earth Obs. Geoinf. 2017, 54, 145–158. [Google Scholar] [CrossRef]
  17. Ling, Y.; Ehlers, M.; Usery, E.L.; Madden, M. Effects of spatial resolution ratio in image fusion. Int. J. Remote Sens. 2008, 29, 2157–2167. [Google Scholar] [CrossRef]
  18. Hsieh, P.F.; Lee, L.C.; Chen, N.Y. Effect of spatial resolution on classification errors of pure and mixed pixels in remote sensing. IEEE Trans. Geosci. Remote Sens. 2001, 39, 2657–2663. [Google Scholar] [CrossRef]
  19. Bo, Y.; Wang, J.; Li, X. Exploring the scale effect in land cover mapping from remotely sensed data: The statistical separability-based method. In Proceedings of the 2005 IEEE International Geoscience and Remote Sensing Symposium (IGARSS’05), Seoul, Republic of Korea, 25–29 July 2005; pp. 3884–3887. [Google Scholar] [CrossRef]
  20. Goward, S.N.; Davis, P.E.; Fleming, D.; Miller, L.; Townshend, J.R. Empirical comparison of Landsat 7 and Ikonos multispectral measurements for selected earth observation system (EOS) validation sites. Remote Sens. Environ. 2003, 88, 80–99. [Google Scholar] [CrossRef]
  21. Meddens, A.J.H.; Hicke, J.A.; Vierling, L.A. Evaluating the potential of multispectral imagery to map multiple stages of tree mortality. Remote Sens. Environ. 2011, 115, 1632–1642. [Google Scholar] [CrossRef]
  22. Schaaf, A.N.; Dennison, P.E.; Fryer, G.K.; Roth, K.L.; Roberts, D.A. Mapping plant functional types at multiple spatial resolutions using imaging spectrometer data. GIScience Remote Sens. 2011, 48, 324–344. [Google Scholar] [CrossRef]
  23. Moody, A.; Woodcock, C.E. Scale-dependent errors in the estimation of land-cover proportions: Implications for global land-cover datasets. Photogramm. Eng. Remote Sens. 1994, 60, 585–594. [Google Scholar]
  24. Wang, G.; Gertner, G.; Anderson, A.B. Up-scaling methods based on variability-weighting and simulation for inferring spatial information across scales. Int. J. Remote Sens. 2004, 25, 4961–4979. [Google Scholar] [CrossRef]
  25. Markham, B.L.; Townshend, J. Land cover classification accuracy as a function of sensor spatial resolution. In Proceedings of the International Symposium on Remote Sensing of Environment, Ann Arbor, MI, USA, 11–15 May 1981. [Google Scholar]
  26. Saini, R.; Ghosh, S.K. Ensemble classifiers in remote sensing: A review. In Proceedings of the International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, 5–6 May 2017. [Google Scholar]
  27. Ghimire, B.; Rogan, J.; Galiano, V.; Panday, P.; Neeti, N. An evaluation of bagging, boosting, and random forests for land-cover classification in Cape Cod, Massachusetts, USA. GISci. Remote Sens. 2012, 49, 623–643. [Google Scholar] [CrossRef]
  28. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning. Data Mining, Inference, and Prediction, 2nd ed.; Springer Series in Statistics: New York, NY, USA, 2009. [Google Scholar]
  29. Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef] [Green Version]
  30. Ho, T.K. A data complexity analysis of comparative advantages of decision forest constructors. Pattern Anal. Appl. 2002, 5, 102–112. [Google Scholar] [CrossRef]
  31. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
  32. Sun, R.; Wang, G.Y.; Zhang, W.Y.; Hsu, L.T.; Ochieng, W.Y. A gradient boosting decision tree based GPS signal reception classification algorithm. Appl. Soft Comput. 2020, 86, 105942. [Google Scholar] [CrossRef]
  33. Lombardo, L.; Cama, M.; Conoscenti, C.; Märker, M.; Rotigliano, E. Binary logistic regression versus stochastic gradient boosted decision trees in assessing landslide susceptibility for multiple-occurring landslide events: Application to the 2009 storm event in Messina (Sicily, southern Italy). Nat. Hazards 2015, 79, 1621–1648. [Google Scholar] [CrossRef]
  34. Tama, B.A.; Rhee, K.H. An in-depth experimental study of anomaly detection using gradient boosted machine. Neural Comput. Appl. 2019, 31, 955–965. [Google Scholar] [CrossRef]
  35. Chan, J.C.W.; Paelinckx, D. Evaluation of random forest and Adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery. Remote Sens. Environ. 2008, 112, 2999–3011. [Google Scholar] [CrossRef]
  36. Freund, Y.; Schapire, R.E. Experiments with a new boosting algorithm. Int. Conf. Mach. Learn. 1996, 96, 148–156. [Google Scholar]
  37. Demarchi, L.; Bund, W.; Pistocchi, A. Object-based ensemble learning for pan-European riverscape units mapping based on Copernicus VHR and EU-DEM data fusion. Remote Sens. 2020, 12, 1222. [Google Scholar] [CrossRef] [Green Version]
  38. Wu, J.G. Landscape Ecology Pattern, Process, Scale and Hierarchy; Higher Education Press: Beijing, China, 2007. [Google Scholar]
  39. Fu, B.J. Principles and Applications of Landscape Ecology, 2nd ed.; Science Press: Beijing, China, 2011. [Google Scholar]
  40. Ota, T.; Mizoue, N.; Yoshida, S. Influence of using texture information in remote sensed data on the accuracy of forest type classification at different levels of spatial resolution. J. For. Res. 2011, 16, 432–437. [Google Scholar] [CrossRef]
  41. Atkinson, P.M.; Aplin, P. Spatial variation in land cover and choice of spatial resolution for remote sensing. Int. J. Remote Sens. 2004, 25, 3687–3702. [Google Scholar] [CrossRef]
  42. Chen, J.; Deng, M.; Xiao, P.F.; Yang, M.H.; Mei, X.M.; Liu, H.M. Optimal spatial scale choosing for high resolution imagery based on texture features frequency analysis. J. Remote Sens. 2011, 15, 492–511. [Google Scholar]
  43. Dalponte, M.; Bruzzone, L.; Gianelle, D. Tree species classification in the Southern Alps based on the fusion of very high geometrical resolution multispectral/hyperspectral images and LiDAR data. Remote Sens. Environ. 2012, 123, 258–270. [Google Scholar] [CrossRef]
  44. Dietterich, T.G. An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization. Mach. Learn. 2000, 40, 139–157. [Google Scholar] [CrossRef]
  45. Feng, L.; Zhang, Z.; Ma, Y.; Du, Q.; Williams, P.; Drewry, J.; Luck, B. Alfalfa yield prediction using UAV-based hyperspectral imagery and ensemble learning. Remote Sens. 2020, 12, 2028. [Google Scholar] [CrossRef]
  46. Yang, Y.; Liu, Y.B.; Ruan, R.Z.; Ye, C.; Lu, P.P. Scale-induced uncertainty in MODIS-based land cover classification. J. Remote Sens. 2012, 16, 868–880. [Google Scholar]
  47. Xu, K.J.; Tian, Q.J.; Yang, Y.J.; Yue, J.; Tang, S.F. How up-scaling of remote-sensing images affects land-cover classification by comparison with multiscale satellite images. Int. J. Remote Sens. 2019, 40, 2784–2810. [Google Scholar] [CrossRef]
  48. DeFries, R.S.; Chan, J.C.W. Multiple criteria for evaluating machine learning algorithms for land cover classification from satellite data. Remote Sens. Environ. 2000, 74, 503–515. [Google Scholar] [CrossRef]
Figure 1. Location map of the study area.
Figure 1. Location map of the study area.
Sensors 23 06136 g001
Figure 2. Flow chart map of land cover mapping and analysis methods.
Figure 2. Flow chart map of land cover mapping and analysis methods.
Sensors 23 06136 g002
Figure 3. RF and GBDT land cover classification maps for Region A.
Figure 3. RF and GBDT land cover classification maps for Region A.
Sensors 23 06136 g003aSensors 23 06136 g003b
Figure 4. RF and GBDT land cover classification maps for Region B.
Figure 4. RF and GBDT land cover classification maps for Region B.
Sensors 23 06136 g004aSensors 23 06136 g004b
Figure 5. Producer’s accuracy Chart.
Figure 5. Producer’s accuracy Chart.
Sensors 23 06136 g005
Figure 6. User’s accuracy Chart.
Figure 6. User’s accuracy Chart.
Sensors 23 06136 g006aSensors 23 06136 g006b
Figure 7. f1-score accuracy Chart.
Figure 7. f1-score accuracy Chart.
Sensors 23 06136 g007
Figure 8. Upper scale fit of Region A and Region B.
Figure 8. Upper scale fit of Region A and Region B.
Sensors 23 06136 g008
Figure 9. Linear decomposition after resampling on Region A and Region B.
Figure 9. Linear decomposition after resampling on Region A and Region B.
Sensors 23 06136 g009
Figure 10. Scale fit after resampling in regions A and B.
Figure 10. Scale fit after resampling in regions A and B.
Sensors 23 06136 g010aSensors 23 06136 g010b
Figure 11. Class mean statistics of classification results at different scales for Region A.
Figure 11. Class mean statistics of classification results at different scales for Region A.
Sensors 23 06136 g011
Figure 12. Category standard deviation statistics of classification results at different scales for Region A.
Figure 12. Category standard deviation statistics of classification results at different scales for Region A.
Sensors 23 06136 g012
Figure 13. Class mean statistics of classification results at different scales for Region B.
Figure 13. Class mean statistics of classification results at different scales for Region B.
Sensors 23 06136 g013
Figure 14. Category standard deviation statistics of classification results at different scales for Region B.
Figure 14. Category standard deviation statistics of classification results at different scales for Region B.
Sensors 23 06136 g014
Table 1. Satellite sensors and their parameters.
Table 1. Satellite sensors and their parameters.
Satellite SensorCountrySpatical Resolution (m)Wave BandWavelength Coverage (μm)Acquisition Data
GF-2China1, 4Blue, Green,
Red, Near-infrared
(R, G, B, NIR)
0.45~0.52/0.52~0.59
0.63~0.69/0.77~0.89
21 July 2017
SPOT-6France60.455~0.525/0.530~0.590
0.625~0.695/0.760~0.890
19 July 2017
Sentinel-2AESA100.440~0.538/0.537~0.582
0.646~0.684/0.760~0.890
27 July 2017
Landsat-8America300.450~0.515/0.525~0.600
0.630~0.680/0.845~0.885
11 September 2017
Table 2. Classification of overall accuracy and Kappa coefficient of the two classification methods.
Table 2. Classification of overall accuracy and Kappa coefficient of the two classification methods.
RegionSatellite SensorRFGBDT
OA (%)KappaOA (%)Kappa
AGF-2 (1 m)88.030.8588.430.86
GF-2 (4 m)89.500.8790.290.88
SPOT-691.560.8992.190.90
Sentinel-2A89.730.8791.850.89
Landsat-884.540.8087.010.84
BGF-2 (1 m)86.530.8388.830.86
GF-2 (4 m)91.010.8992.810.91
SPOT-690.570.9092.050.90
Sentinel-2A91.170.8990.390.88
Landsat-885.570.8286.960.83
Table 3. Fitting model and related parameters.
Table 3. Fitting model and related parameters.
RegionFitting EquationR2p
A y = 0.4397 + 0.5906 × 10 3 × x 0.6393 × 10 4 × x 2 0.9110.002
B y = 0.5088 + 7.5814 × 10 3 × x 0.2888 × 10 3 × x 2 0.8770.005
Table 4. Landscape index values based on RF and GBDT in Region A and Region B.
Table 4. Landscape index values based on RF and GBDT in Region A and Region B.
RegionSatellite SensorRFGBDT
AREA_AMLPIAISPLITLSIAREA_AMLPIAISPLITLSI
AGF-2 (1 m)361.9312.3988.0323.47552.93331.0211.6481.8225.66839.41
GF-2 (4 m)605.5512.3081.1330.32321.02599.2112.0677.1130.63388.99
SPOT-6693.9610.9384.4726.50176.83761.2012.3679.3824.15234.33
Sentinel-2A1488.9416.3683.5812.33112.551012.5612.6176.6918.13159.15
Landsat-8951.7417.3964.8619.2880.50694.409.2762.5826.4285.63
BGF-2 (1 m)533.9215.8889.6013.86534.97531.1915.9283.4313.93713.88
GF-2 (4 m)1352.4622.2989.789.41145.591412.4123.3486.789.01187.78
SPOT-61421.0524.6687.888.96115.281626.0823.4883.929.34152.53
Sentinel-2A1660.9227.4085.127.6785.211586.0026.4680.118.03113.43
Landsat-81401.2223.9275.649.0947.021314.0623.4872.479.5052.94
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, R.; Gao, X.; Shi, F.; Zhang, H. Scale Effect of Land Cover Classification from Multi-Resolution Satellite Remote Sensing Data. Sensors 2023, 23, 6136. https://doi.org/10.3390/s23136136

AMA Style

Li R, Gao X, Shi F, Zhang H. Scale Effect of Land Cover Classification from Multi-Resolution Satellite Remote Sensing Data. Sensors. 2023; 23(13):6136. https://doi.org/10.3390/s23136136

Chicago/Turabian Style

Li, Runxiang, Xiaohong Gao, Feifei Shi, and Hao Zhang. 2023. "Scale Effect of Land Cover Classification from Multi-Resolution Satellite Remote Sensing Data" Sensors 23, no. 13: 6136. https://doi.org/10.3390/s23136136

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop