Next Article in Journal
Analysis of the E-Region Low-Altitude Quasi-Periodic Event in Low-Latitudes of China
Previous Article in Journal
Seasonal Cooling Effect of Vegetation and Albedo Applied to the LCZ Classification of Three Chinese Megacities
Previous Article in Special Issue
Mapping Coastal Wetlands and Their Dynamics in the Yellow River Delta over Last Three Decades: Based on a Spectral Endmember Space
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimating the Surface Fuel Load of the Plant Physiognomy of the Cerrado Grassland Using Landsat 8 OLI Products

by
Micael Moreira Santos
1,*,
Antonio Carlos Batista
1,
Eduardo Henrique Rezende
2,
Allan Deyvid Pereira Da Silva
2,
Jader Nunes Cachoeira
2,
Gil Rodrigues Dos Santos
3,
Daniela Biondi
1 and
Marcos Giongo
2
1
Department of Forest Sciences, Federal University of Paraná, Curitiba 80210-170, Brazil
2
Environmental Monitoring and Fire Management Center—CeMAF, Federal University of Tocantins, Gurupi 77410-530, Brazil
3
Department of Phytopathology, Federal University of Tocantins, Gurupi 77410-530, Brazil
*
Author to whom correspondence should be addressed.
Remote Sens. 2023, 15(23), 5481; https://doi.org/10.3390/rs15235481
Submission received: 22 September 2023 / Revised: 9 November 2023 / Accepted: 14 November 2023 / Published: 23 November 2023

Abstract

:
Techniques and tools meant to aid fire management activities in the Cerrado, such as accurately determining the fuel load and composition spatially and temporally, are pretty scarce. The need to obtain fuel information for more efficient management in a considerably heterogeneous, biodiverse, and fire-dependent environment requires a constant search for improved remote sensing techniques for determining fuel characteristics. This study presents the following objectives: (1) to assess the use of data from Landsat 8 OLI images to estimate the fine surface fuel load of the Cerrado during the dry season by adjusting multiple linear regression equations, (2) to estimate the fuel load through random forest and k-nearest neighbor (k-NN) algorithms in comparison to regression analyses, and (3) to evaluate the importance of predictor variables from satellite images. Therefore, 64 sampling units were collected, and the pixel values associated with the field plots were extracted in a 3 × 3-pixel window surrounding the reference pixel. For multiple linear regression analyses, the R2 values ranged from 0.63 to 0.78, while the R2 values of the models fitted using the random forest algorithm ranged from 0.52 to 0.83 and the R2 values of those fitted using the k-NN algorithm ranged from 0.30 to 0.68. The estimates made through multiple linear regression analyses showed better results for the equations adjusted for the beginning of the dry season (May and June). Adopting the random forest algorithm resulted in improvements in the statistical metrics of evaluation of the fuel load estimates for the Cerrado grassland relative to multiple linear regression analyses. The variable fraction-soil (FS) exerted the most significant effect on surface fuel load estimates, followed by the vegetation indices NDII, GVMI, DER56, NBR, and MSI, all of which use near-infrared and short-wave infrared channels in their calculations.

1. Introduction

Fire is common in the Cerrado biome, mainly in open plant physiognomies predominated by herbaceous and grassland vegetation. According to Miranda et al. [1], fires in the Cerrado, like in other savannas, can be characterized as surface fires that consume the fine fuels of the grass fuel layer. Fine surface fuels represent the layer of particles less than 0.6 cm in thickness, consisting of litter, grasses, herbs, and downed woody material [2]. For Rothermel [3], fuel represents the organic matter available for ignition and combustion, and this is characterized as the only fire-related factor that can be controlled by human action. Knowing the fuel characteristics is essential for determining fire behavior and making decisions in integrated fire management and wildfire suppression activities. However, determining the characteristics of fuel is temporally and spatially complex [4,5]. Li et al. [6] reported that determining fuel characteristics demands high costs and a considerable time for sampling. Roberts et al. [7] indicated the following essential attributes for understanding fire behavior: fuel type, fuel biomass, fuel moisture, and fuel condition (live or dead). The fuel load is a crucial variable commonly used in various fire management strategies, such as fire risk assessments and fire behavior prediction [3,8]. The fuel load has a significant influence on the modeling of biomass and carbon estimates, and its knowledge for land managers is fundamental [9]. Nevertheless, there are few studies on the Cerrado that involve determining the characteristics of its flammable material.
Remote sensing techniques are essential for estimating several fuel characteristics as more studies are conducted and improvements are made. According to Roberts et al. [7], the products obtained using various remote sensing techniques can help assess wildfire hazards, which include the following: (i) indirect measurements of live fuel moisture, (ii) measurements of live herbaceous biomass, (iii) measurements of the fuel condition, and (iv) detailed classifications of the fuel type. Van Wagtendonk and Root [10] noted that information relative to fuel is often presented as fuel model maps; fuel models are used to determine the fuel load, size, depth, and moisture of extinction. A wide range of studies have estimated fuel variables through remote sensing products. However, there is a significant lack of studies concerning the surface fuel of grassland environments of the Cerrado biome. Several studies have characterized and estimated forest biomass using Landsat imagery [11,12,13,14,15], MODIS products [16,17], and Lidar sensors [18,19]. Various studies that have used remote sensing products mostly relate to the characterization of forest variables, but few have worked with estimates of surface fuel variables, which are more related to the occurrence of surface fires, quite recurrent in grassland and savanna areas.
The use of vegetation indices has been widely addressed. They are used in biomass studies to determine fuel moisture [20,21]. Other studies using spectral mixture techniques have also been conducted [22,23]. However, few are related to determining surface fuel’s physical characteristics, except for Franke et al. [24], who applied spectral mixture techniques to map fine fuel accumulation. The significant difference in this study is the indirect estimation of fine surface fuel classified by type into live and dead fuel and of its behavior during the dry season, differing between the beginning and end of the dry period using remote sensing products. Most studies that estimate the fuel load perform the modeling without considering different fuel types. They do not consider the seasonality between other times of the year or more critical periods and do not offer vital information for risk estimates of forest fires. Few studies consider the use of spectral mixture analysis products associated with machine learning algorithms in estimating the fuel load of grasses, mainly in environments with high spatio-temporal variability, such as the Cerrado biome. In this sense, the main objectives of this study are as follows:
(1)
To evaluate the performance of multiple linear regression equations adjusted for load estimation in classes of live and dead fine fuels, considering the beginning and end of the dry season, based on the reflectance of Landsat 8 OLI images, vegetation indices, and fraction values (F-values) of the spectral mixture analysis (SMA);
(2)
To assess the use of random forest and k-nearest neighbor algorithms to estimate the fine fuel load in different classes in comparison to traditional multiple linear regression analyses;
(3)
To analyze the importance of each predictor variable from remote sensing products in random forest models.

2. Materials and Methods

2.1. Study Area

Collections in the field were performed in the central southern portion of the Serra Geral do Tocantins Ecological Station (EESGT), a fully protected conservation unit with an area of 716,306 ha, located in the Cerrado biome in the Jalapão region in Brazil. According to the Köppen climate classification, the region’s climate is classified as Aw (tropical savanna climate), and its annual precipitation ranges from 1400 to 1500 mm [25], which is higher than the annual potential evapotranspiration. In the region, summer occurs between October and April and is rainy and winter occurs between May and September and is dry [26]. The predominant plant physiognomy is grassland: pure grassland (campo limpo), pure wet grassland (campo limpo úmido), grassland with scattered shrubs and trees (campo sujo), wet grassland with scattered shrubs and trees (campo sujo úmido), and rocky grassland (campo rupestre) [27,28]. The predominant soil type is quartz sand or quartzipsamments, and it has a sandy or loamy texture at least 2 m deep and may have up to 15% clay. The location’s relief varies from relatively flat to gently undulating, with average elevations between 300 and 500 m [29]. Figure 1 presents the study area and the spatial distribution of data sampling units.

2.2. Field Survey

A field survey of the variables of the surface fuel load was conducted by installing data sampling units that were collected during the periods from 11 May 2017 to 31 May 2017 (May), from 6 June 2017 to 29 June 2017 (June), from 5 August 2017 to 25 August 2017 (August), and from 8 September 2017 to 24 September 2017 (September). Average air temperatures at the time of samplings ranged from 24.4 to 31.0 °C from the beginning (May and June) to the end (August and September) of the dry season, respectively. The relative humidity of the air presented averages at the time of samplings, which varied from 41.0 to 27.3% from the beginning to the end of the dry season. Each sampling unit had two 30 m transects spaced 14 m apart. Markings were made on the transects to install, by prior draw, eight sub-samples of surface fuel with an area of 0.25 m2 (0.5 × 0.5 m), and fuel was collected from them using the destructive method. The size of the sub-samples was chosen for a broader sampling that could provide a more significant variability of fuel samples in a shorter time. Accordingly, 64 data sampling units separated by a distance of over 100 m from each other were collected for analyses using satellite images. It is noteworthy that data sampling was carried out in the area of viable access, given the existence of an area of difficult access further north (Figure 1).
The destructive sampling involved separating the material according to its physiological state (live or dead) into different diameter classes (timelag), adapting the methodology proposed by Schroeder and Buck [30] and Brown et al. [2]. Thus, the surface fuels were classified as follows: (i) dead grass fuel, (ii) 1 h downed wood debris (<0.64 cm diameter), (iii) live grass fuel, and (iv) live shrub fuel (<0.64 cm diameter). As the study aimed to estimate fine fuels under 0.64 cm in diameter, which exert a more significant influence on the fire spread in regions with open vegetation in the Cerrado biome, the 10 h (0.64–2.54 cm) and 100 h (2.54–7.62 cm) classes were not considered. In addition to the abovementioned classes, the analyses and estimates took into account the total fine live (grass + shrub < 0.64 cm diameter) and fine dead (grass + 1 h downed wood debris) fuels and the total fine fuel load (live and dead; <0.64 cm diameter). After collecting the sub-samples from the field, they were all dried until they reached a constant mass. Next, the dry mass was determined, and its respective fuel moisture content was calculated using the ratio between wet mass and dry mass on a dry-weight basis and then converted to a percentage. Figure 2 presents a diagram exemplifying the collection locations, surface fuel data sampling, and characterization process.

2.3. Obtaining and Processing Satellite Imagery

Images from the Landsat 8 OLI satellite were used in this study. They were obtained for the approximate dates of four collections of field samples during the dry season in May and June (beginning of the dry season) and in August and September (end of the dry season), corresponding to the following dates (dd month yyyy): 5 May 2017, 6 June 2017, 22 June 2017, 24 July 2017, 9 August 2017, and 10 September 2017. To cover the entire study area, images from 3 Landsat 8 OLI scenes were used, namely: 220/067, 221/067, and 221/068. The reflectance values of the bands were extracted, and the vegetation indices were calculated through the cloud-based geospatial data processing platform Google Earth Engine (GEE). The GEE platform provides easy access to high-performance computing resources for large-scale processing of geospatial data sets [31]. Landsat 8 OLI Surface Reflectance data were generated using the Land Surface Reflectance Code (LaSRC) algorithm [32]. The reflectance values were obtained from the Landsat 8 OLI sensor bands in the following channels: (i) blue (0.45–0.51 µm), (ii) green (0.53–0.59 µm), (iii) red (0.64–0.67 µm), (iv) near-infrared (0.85–0.88 µm), (v) short-wave infrared (1.57–1.65 µm), and (vi) short-wave infrared (2.11–2.29 µm).

2.3.1. Vegetation Indices (VIs)

Based on the combination of reflectance in different spectral bands, vegetation indices (VIs) were calculated. VIs are an efficient way to attain empirical information from multispectral sensors and are frequently used in various scientific studies that set out to better explain or determine the behavior of certain vegetation variables [10,11,33,34]. Therefore, a total of 20 vegetation indices were calculated and incorporated into the analyses as independent variables: (i) Normalized Difference Vegetation Index (NDVI) [35], (ii) Visible Atmospherically Resistant Index (VARI) [36], (iii) Visible Green Index (VIgreen) [36], (iv) Simple Ratio (SR) [37], (v) Structure Insensitive Pigment Index (SIPI) [38], (vi) Soil-Adjusted Vegetation Index (SAVI) [39], (vii) Normalized Difference Water Index (NDWI) [40], (viii) Normalized Difference Infrared Index (NDII6) [41], (ix) Normalized Burn Ratio (NBR) [41], (x) Modified Vegetation Index (MVI) [42], (xi) Modified Simple Ratio (MSR) [43], (xii) Moisture Stress Index (MSI) [44], (xiii) Modified Normalized Difference Water Index (MNDWI) [45], (xiv) Integral (INT) [46], (xv) Global Vegetation Moisture Index (GVMI) [33], (xvi) Enhanced Vegetation Index (EVI) [47], (xvii) Derivative-band 5 (ivp), band 6 (siwir1) (DER56) [48], (xviii) Derivative-band 4 (vm), band 5 (ivp) (DER45) [48], (xix) Derivative-band 3 (vd), band 4 (vm) (DER34) [48], and (xx) Derivative-band 2 (az), band 3 (vd) (DER23) [48]. The vegetation indices analyzed in this study show relationships with the fuel load and moisture, as well as with the physiological state of the fuel. These vegetation indices were chosen with the aim to analyze their viability for estimating the fuel load from Landsat 8 OLI images, considering seasonality during the dry season in a Cerrado environment.

2.3.2. Spectral Mixture Analysis (SMA)

The spatial variability of vegetation within a pixel is a strong limiting factor in ascertaining the fuel condition and load information. Spectral mixture analysis (SMA) has considerable potential to estimate the fuel condition and fuel moisture content at the sub-pixel level [34]. SMA is a widely applied technique in remote sensing for obtaining ecologically relevant and significant components of an image pixel [49]. In SMA, two or more reference spectra (endmembers), such as green vegetation, dry vegetation, soil, and shade, are modeled as linear combinations to estimate the sub-pixel fractions of each component [50,51].
After retrieval, the Landsat 8 OLI images underwent the atmospheric correction process to eliminate interferences from the atmosphere to extract data and determine reflectance values [32,52]. Next, color composites were obtained and used in subsequent study steps. The color composite of the Landsat 8 OLI image used in the initial collection for the spectral mixture analyses was 7 (short-wave infrared-SWIR 2), 5 (near-infrared), and 4 (red). After processing the satellite images, the creation of a spectral library began. First, the surface targets for SMA were divided into (i) non-photosynthetic dry vegetation (NPV), (ii) green vegetation (GV), and (iii) exposed soil. This classification was chosen because green vegetation, dry vegetation, and soil dominate Cerrado grassland environments. Several candidates for “pure pixels,” known as endmembers, were selected from each target. Thus, based on the specific spectral behavior of each target, regions of interest (ROI) whose reflectance presented spectral behaviors that were as faithful as possible to the characteristic spectral curve of the respective target were delimited.
To fulfill the objectives proposed herein, endmembers were chosen in four representative images of the four months of the dry season (May, June, August, and September) according to the dates of the Landsat 8 images previously mentioned. After the selection of all the pure pixel candidates of the features, as mentioned earlier, based on the spectral curves, the process of selecting endmembers based on the lowest statistical errors began. The multiple endmember spectral mixture analysis (MESMA) methodology was applied to perform this procedure [53] using MESMA QGIS Plugin (version 1.0.8) [54]. MESMA’s results produce grayscale images whose values (called fraction values (F-values)) represent the estimated relative degree in which each pixel corresponds to its respective reference spectrum. Therefore, the images generated with F-values were for the GV, NPV, and soil components.

2.3.3. Data Extraction

The pixel values associated with the field plots were extracted in a 3 × 3-pixel window around the reference pixel (pixel associated with the field plot). This way, the median values of the pixels in each 3 × 3-pixel polygon related to the field plots were extracted [11,55,56,57,58,59]. The data were exported and tabulated to compare the information collected from the plots in the field with the information from the spectral mixture analysis, reflectance, and calculated vegetation indices.

2.4. Statistical Analysis of the Data

2.4.1. Multiple Linear Regression Analysis

After processing the data from the Landsat 8 OLI satellite images, all the information associated with the collection points in the field were extracted in a 3 × 3-pixel window. Next, all the data were tabulated and correlated with the fuel load values in the different classes. In multiple linear regression analyses, the fuel load was a dependent variable, especially for the fine dead fuels (dead grass fuel, 1 h downed wood debris, and total fine dead fuel) and total fine fuel (live and dead, <0.64 cm diameter) classes. The predictor (independent) variables were divided into three main groups: (i) fraction values (F-values) from the spectral mixture analysis (SMA), (ii) reflectance of the Landsat 8 OLI images in different spectral bands, and (iii) the calculated vegetation indices. The stepwise method was adopted to select the most relevant variables for the regression equations. Also, the adjustments of multiple linear regression equations considered the following: (i) adjusting equations considering 32 plots collected in the initial months of the dry season (May and June), (ii) adjusting equations considering 32 plots collected in the final months of the dry season (August and September), and (iii) adjusting equations for the entire period of the dry season (all 64 plots). Figure 3 presents a flowchart of the methodology applied in this study.

2.4.2. Random Forest Machine Learning Algorithm

The random forest (RF) machine learning algorithm is a non-parametric method developed by Breiman [60] that has been widely adopted in various studies [12,16,58,61]. The RF algorithm combines several predictor trees generated by an independently sampled random vector with a similar distribution for all trees in the forest [62]. Compared to traditional linear regression methods, random forest models exhibit several advantages in above-ground biomass estimation, such as better accuracy and fewer non-linear problems [63].
Although the training and test sets were divided randomly in R, a specific code command (set seed) was used to guarantee the model’s reproducibility—in other words, to run the model with the same previously defined data sets (training and test). To implement a random forest, two parameters must be defined: the number of trees (ntree) and the number of features in each split (mtry). Regarding ntree, in this study, it was observed that the prediction accuracy with the random forest algorithm did not increase significantly when ntree presented values greater than 1000. The ideal ntree value defined was ntree = 1000 to guarantee the reliability of the prediction results without affecting computational efficiency [64,65]. For mtry, the values were defined as one-third of the total number of predictor variables [60,66,67,68]. These varied between mtry = 4 and 5 for models adjusted with predictor variables selected using the stepwise method and mtry = 10 for models adjusted with all predictor variables. The “randomForest” package in the R program was used to apply the algorithm. The importance of the predictor variables used in the random forest models was evaluated with the increase in the mean-square error (% IncMSE), which measures the effect of a variable’s predictive potential when subjected to a random permutation. In short, this metric shows how much the model’s accuracy decreases if a particular predictor variable is left out of the model.

2.4.3. K-Nearest Neighbor Machine Learning Algorithm

Another machine learning algorithm used to verify the feasibility of estimating the fuel load from satellite imagery products was the k-nearest neighbor (k-NN) algorithm. The k-NN algorithm is used for classification and regression. It predicts the values of specific variables based on the similarity in a covariate space between the point and other points representing the values of the variables [69]. The use of the k-NN algorithm associated with remote sensing data has played a key role in forest studies, and there have been several applications in forest inventories in the field that aimed to improve the accuracy of estimates [70,71]. The k-NN algorithm was applied using the “Caret” package in R. There is one essential parameter in this algorithm, K, or the number of neighbors that need to be optimized. This parameter specifies the voting system in the k-NN algorithm wherein the K nearest neighbors are selected, and then it sets the output model [72]. The K values were determined based on the lowest RMSE values, and the optimal values were chosen for each estimated fuel variable.

2.4.4. Cross-Validation

Cross-validation evaluates a model’s performance and provides a nearly unbiased prediction error estimator [73]. The multiple linear regression equations, random forest models, and k-NN algorithm models were subjected to cross-validation with the k-fold cross-validation technique using 10 folds. The k-fold cross-validation technique is a robust method for estimating a model’s accuracy, and it generally provides more accurate estimates of the test error rate [11,74,75]. The following statistical metrics were used to validate the equations: (i) the values of the squared coefficient of determination (R2), (ii) the adjusted squared coefficient of determination (R2adj), (iii) the root-mean-square error (RMSE), and (iv) the mean absolute error (MAE). The validations considered the adjustments of the multiple linear regression models and the random forest and k-nearest neighbor algorithm models, with an emphasis on the models with the best fit considering all the independent variables of the study and on the models with the best fit considering the independent variables selected using the stepwise method.

3. Results

3.1. Multiple Linear Regression Analysis

Table 1 presents the metrics of the multiple linear regression equations, which were validated by considering the study’s independent variables. The multiple linear regression equations performed best with adjustments using the stepwise method. Table 1 presents only the metrics of the equations adjusted using this method. The equations validated using the k-fold cross-validation method, which were notable for the coefficient of determination (R2), were for the estimation of the total fine dead fuel (grass + 1 h downed wood debris), with R2 = 0.92, and dead grass fuel, with R2 = 0.89. These equations mentioned earlier are for the beginning of the dry season (p < 0.05). However, the lowest RMSE and MAE values were observed for estimates of dead grass fuel at the beginning of the dry season, with values below 0.40. The statistical metrics verified for the end of the dry season were lower than those observed for the equations adjusted for the beginning of the dry season in the study area.
For the entire period of the dry season, the R2 values ranged from 0.57 to 0.78. The equation for estimating dead grass fuel performed the best, with R2 = 0.78, followed by the stepwise equation for calculating the total fine dead fuel (grass + 1 h downed wood debris), with R2 = 0.73 (p < 0.05). The R2adj values of the equations subjected to cross-validation may be negative without a lower limit and present a maximum value of 1, as noted by Peterson et al. [76], which was the case for the equations at the end of the dry season. The lowest RMSE and MAE values were found to estimate dead grass fuel.

3.2. Estimates Using the Random Forest Algorithm

3.2.1. Model Performance Metrics

To use the random forest algorithm, the entire dry season was considered for the adjustment of the models. The models were not divided for the dry season’s beginning and end, as was performed in multiple linear regression analyses. Like the regression equations, the random forest algorithm was adjusted for each estimated variable by considering all the study’s independent variables and the variables selected using the stepwise method. Unlike the multiple linear regression equations, the random forest models presented a slight advantage in the adjustments with all the independent variables compared to the adjustments with the variables selected using the stepwise method, as shown in Table 2. The R2 values with the random forest algorithm ranged from 0.52 to 0.83; the highest values were found to estimate dead grass fuel and the total fine dead fuel.
To estimate the total fine fuel (live and dead), the best performance was related to the model using all the independent variables, with R2 = 0.62. For the RMSE and MAE values of the estimates using the random forest algorithm, the lowest values were recorded for the estimation of dead grass fuel, as 0.33 and 0.23, respectively. In contrast, the highest RMSE and MAE values corresponded to the estimation of the total fine fuel load (live and dead). Figure 4 and Figure 5 show the fine fuel load mappings in the Serra Geral do Tocantins Ecological Station, with the best fits for each fuel load category analyzed.

3.2.2. Importance of the Variables in the Random Forest Models

In analyzing the results of the importance of the variables in the RF models presented in Figure 6, the considerable importance of the fraction-soil (FS; F-values) variable from the spectral mixture analysis (SMA) was noteworthy in all the models addressed. In all situations, the FS variable increased the mean-square error (% IncMSE) above 20% in all the models presented. This means that the removal of the predictor FS variable resulted in an increase in the mean-square error above 20% for the estimated fuel variables. Among the fuel variables in which FS exerted a greater effect, the total fine dead fuel (grass + 1 h downed wood debris) and dead grass fuel were notable, presenting values of 33.5% and 29.1%, respectively.
The vegetation indices NDII, NBR, GVMI, and DER56 emerged as the second-most essential variables considering the estimates of dead grass fuel, 1 h downed wood debris, the total fine dead fuel (grass + 1 h downed wood debris), and the total fine fuel (live and dead), respectively. The GVMI appeared most frequently among the vegetation indices and was most important in the fitted random forest models. It was not a variable of importance only for estimating the total fine fuel. Out of the bands’ reflectance, the short-wave infrared channel (SWIR2-2.11–2.29 µm) occurred most frequently and had the most significant importance. Figure 6 graphically illustrates the importance of the random forest algorithm models with the best fit.

3.3. Estimates using the K-Nearest Neighbor Algorithm

Similarly, the data from the same database used for the application of the RF algorithm were executed with the k-nearest neighbor (k-NN) algorithm so that two models were built for each fuel load variable: one model considering all the independent variables of the study and another model considering only the variables selected using the stepwise method. As a parameter for using the k-NN algorithm, the K values were chosen based on the lowest RMSE values, ranging from 5 to 13. Table 3 demonstrates the K values chosen for each k-nearest neighbor algorithm model.
Like the random forest models, the k-nearest neighbor models presented slightly superior performances for the adjustments made with all the predictor variables of the study in comparison with those selected using the stepwise method. The R2 values ranged from 0.30 to 0.68, and the k-NN models with the best fit were those for estimating dead grass fuel, followed by the model for estimating the total fine dead fuel, with R2 = 0.54. Table 4 provides the statistical metrics of the k-NN algorithm models.
Regarding the RMSE and MAE values, the estimates using the k-NN algorithm presented the lowest values for the estimation of dead grass fuel, as 0.37 and 0.30, respectively. The highest RMSE and MAE values with the k-NN models were found to estimate the total fine fuel load (live and dead) (Table 4).

4. Discussion

Given the low correlations of the live fuel classes (live grass and live shrub), statistical analyses for their estimation could not be conducted. However, according to the findings of Santos et al. [77], the low moisture level of Cerrado grassland species is the main factor behind the increase in the fuel’s flammability. Moreover, the proportion of fine dead fuel to the proportion of live fuel during the dry season in the biome can reach 70%, or over 80%, considering only the grass fuel of the area, based on the fuel characterization by Santos et al. [78]. In addition, the fine thickness of 1 h downed wood debris and grass fuels in a physiologically inactive state (e.g., grass and 1 h downed wood debris) makes the moisture content of these fuels more sensitive to changes in atmospheric conditions, thus affecting their ignition capacity and leaving areas with a predominance of these types of fuels more prone to the occurrence of wildfires, especially in the dry season [79]. As reported by Slijepcevic et al. [80] and Soares et al. [81], low fuel moisture is one of the main factors contributing to the occurrence of large fires.
Out of the adjustments of the multiple linear regression equations for the beginning and end of the dry season in the Cerrado biome, the equations adjusted for the beginning of the dry season had the best statistical performance. This may be explained by analyzing the results obtained by Santos et al. [78], who characterized the fuel of Cerrado grassland vegetation. In this study, fuel characteristics, such as the number of individuals, the number of species, the grass height, and the fuel load in different classes (timelag), exhibited different behaviors between the first and last months of the dry season in the physiognomy under analysis. The fuel condition mapping showing fractions of the NPV, GV, and soil for the beginning of the dry season (May) in contrast to the fuel condition at the end of the dry season (September) is presented in Figure 7A and Figure 7B, respectively. For example, given the different ages of the fuels present in the area, the load values of dead grass fuel in the first months of the dry season (May and June) showed a statistically significant difference in dead grass fuel between the ages of one and four years. By the end of the dry season (August and September), no statistically significant difference was found among the fuel loads of dead grass fuels between the ages of two and four years (p < 0.05).
Considering all the adjustments of the multiple linear regression equations in the study (beginning, end, and the entire dry period), the R2 values ranged from 0.45 to 0.92. For the whole of the dry period, the highest value was 0.78 for the adjusted equation to estimate dead grass fuel (Table 1). Similar results were reported by Tucker et al. [82], who presented more expressive values, considering only the coefficient of determination (R2) values and the 0.385 µm range of the electromagnetic spectrum as the predictor variable to estimate the total dry biomass (R2 = 0.80). Using Landsat 8 OLI images for fuel load estimation in the Hulunbuir grassland, Bao et al. [14] obtained an R2 value of 0.64 using models based on multiple linear regression. Using Landsat 5 TM imagery to estimate the above-ground biomass in interior Alaska, Ji et al. [11] obtained an R2 value of 0.73 in their regression model. In one of the few studies conducted in the Cerrado, Franke et al. [24] found relationships between variables obtained from spectral mixture analyses and the biomass of fine surface fuels in their regression model, with R2 values of 0.81 (fraction of non-photosynthetic dry vegetation (FNPV)) and 0.65 (fraction-soil (FS)). The results of Franke et al. [24] differ from those of this study: between the variables given by the spectral mixture analysis and fuel load, the most substantial relationships were found for the fraction-soil (FS) values. However, the linear regression models fitted by Franke et al. [24] did not consider a cross-validation analysis. It is possible to note that there are few studies on estimating surface fuel from satellite imagery, especially for the Cerrado biome.
Like the adjustments of the multiple linear regression equations, the models resulting from applying the random forest algorithm with better performance based on the evaluation metrics were for estimating dead grass fuel classes and the total fine fuel class (grass + 1 h downed wood debris). They presented higher values than the other classes, reaching R2 = 0.83. The statistical metrics obtained for estimating the 1 h downed wood debris fuel load demonstrated poor performance in relation to the other variables (Table 2). This may be explained by less spatial continuity compared to the dead grass fuel present in the study area. Dube and Mutanga [83] used Landsat 8 OLI and 7 ETM+ images and the reflectance of the bands and vegetation indices as predictor variables. They tested the random forest algorithm to estimate the above-ground biomass in southern Africa and found coefficient of determination (R2) values ranging from 0.43 to 0.65. Pierce et al. [84] used the RF algorithm and information from Landsat 5 TM, field data, and topographic factors for modeling and mapping canopy fuels in California (USA) and attained pseudo-R2 values ranging from 0.55 to 0.68. Frazier et al. [12] characterized the above-ground biomass in a boreal forest using Landsat temporal segmentation metrics and found R2 values of 0.62 estimated using random forest models. Gao et al. [16] found higher R2 values (0.75) using MODIS sensor data to estimate the above-ground biomass in an Asian region.
As for the importance of the variables of the RF models with the best fit, for the dead fuel classes (dead grass, 1 h downed wood debris, and total fine dead fuels) and the total fine fuel (live and dead), the fraction-soil (FS) variable from the spectral mixture analysis (SMA) was the independent variable that exerted the most significant effect. These results corroborate the higher correlations between the FS variable and the variables obtained from the spectral mixture analysis. The physiognomy of the Cerrado grassland is predominantly open (pure grassland and grassland with scattered shrubs and trees) and has a sparse population of tree species. Given the age of the fuel and its level of cover, greater or less soil exposure becomes quite noticeable in the response to the spectral mixture analysis, which demonstrated an inverse relationship with the surface fuel load. Despite the importance of the FS variable in the RF models, there is no knowledge of its performance in areas with a greater presence of tree species. At least five vegetation indices, namely the NDII, GVMI, DER56, NBR, and MSI, had a degree of importance concerning %IncMSE above 10%. All these vegetation indices revealed the presence of the near-infrared (NIR: 0.85–0.88 µm) and short-wave infrared (SWIR1: 1.57–1.65 µm; SWIR2: 2.11–2.29 µm) channels in their respective calculations. This indicates the importance of the near-infrared and short-wave infrared channels for load estimates using both RF models and multiple linear regression analyses. The near-infrared and short-wave infrared channels have been used to estimate vegetation characteristics in several settings [55,76].
Overall, the statistical metrics used to assess the models were superior for the RF models in relation to the multiple linear regression equations and k-NN models. Thus, the models estimating the surface fuel of the Cerrado grassland using the random forest algorithm with data retrieved from satellite images provided better estimates of the surface fuel. Only the total fine fuel variable (live and dead) estimated using multiple linear regression analysis resulted in a slightly higher coefficient of determination (R2), considering the entire dry period in the study area. However, the RMSE and MAE values were higher than those of the random forest algorithm. Aligned with the results of this study, D’Este et al. [85] evaluated the performance of machine learning models in estimating the fine fuel load in a region of Italy. They observed greater predictive power for the random forest algorithm, with R2 = 0.50, compared to multiple linear regression models and support vector machines, which presented coefficients of determination of 0.40 and 0.39, respectively. Compared with this study, despite different fuel characteristics, the R2 values for estimating the fine dead fuel load (<0.64 cm diameter) using the RF algorithm had a better performance (R2 = 0.83).
The application of the k-NN algorithm to estimate the surface fuel of the plant physiognomy of the Cerrado grassland was unsatisfactory, presenting lower statistical metrics than multiple linear regression analyses and the random forest algorithm. This may be because the k-NN algorithm has a better performance and more comprehensive application in estimating forest variables [69,71,86,87] and due to little knowledge of the assessment of surface fuels.

5. Conclusions

Multiple linear regression analyses showed better statistical results for equations adjusted for the beginning of the dry season (May and June) than those adjusted for the end of the dry season (August and September). This behavior arises from the various changes in fuel characteristics in the Cerrado grassland’s physiognomy during the dry season’s beginning and end. Therefore, modeling to obtain estimates for load, moisture, and other surface fuel characteristics should be performed separately and consider the seasonality throughout the year.
The use of the random forest algorithm contributed to improvements in the evaluation metrics for estimating the Cerrado grassland surface fuel load compared to multiple linear regression analyses and the k-nearest neighbor algorithm. Out of the predictor variables originating from the products of Landsat 8 OLI images, the fraction-soil variable from the spectral mixture analysis exerted the most significant effect on load estimation in the different classes analyzed herein. Accordingly, applying the RF algorithm and the fraction-soil variable is recommended for estimating the Cerrado’s fuel load of open or savanna physiognomies. Vegetation indices also played a considerable role in applying the RF algorithm, especially the NDII, GVMI, DER56, NBR, and MSI, which presented higher effect values than the others in the estimates of the analyzed surface fuel variables.
However, their performance in more closed physiognomies with a more significant presence of tree species, for example, is unknown. Further studies must be conducted to verify the feasibility of using the products of different satellite images in different environments, especially in areas more prone to fires in the biome.

Author Contributions

Conceptualization, M.M.S. and A.C.B.; methodology, M.M.S., A.C.B. and M.G.; software, E.H.R., A.D.P.D.S. and J.N.C.; validation, M.M.S. and A.D.P.D.S.; formal analysis, D.B., G.R.D.S., A.C.B. and M.G.; investigation, M.M.S., A.D.P.D.S. and J.N.C.; resources, M.G. and A.C.B.; data curation, M.M.S. and A.D.P.D.S.; writing—original draft preparation, M.M.S.; writing—review and editing, M.M.S. and A.C.B.; visualization, E.H.R., G.R.D.S. and D.B.; supervision, A.C.B. and M.G.; project administration, M.M.S. and M.G.; funding acquisition, M.G. and A.C.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Council for Scientific and Technological Development (CNPQ)–PELD Jalapão (grant number 443100/2020-9).

Data Availability Statement

Data are available on request from the authors.

Acknowledgments

The authors acknowledge the National Council for Scientific and Technological Development (CNPQ) for financial support granted to carry out this research; the Federal University of Tocantins and the Graduate Program in Forestry and Environmental Sciences (PGCFA) for financial support granted for translating this article through public notice N° 012/2021; and the Serra Geral do Tocantins Ecological Station managers, Ana Carolina Sena Barradas, Marco Assis Borges, and Máximo Menezes Costa, for their support in the planning and logistics of field collections.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Miranda, H.S.; Sato, M.N.; Neto, W.N.; Aires, F.S. Fires in the Cerrado, the Brazilian Savanna. Trop. Fire Ecol. 2009, 427–450. [Google Scholar] [CrossRef]
  2. Brown, J.K.; Oberheu, R.D.; Johnston, C.M. Handbook for Inventorying Surface Fuels and Biomass in the Interior West; Gen. Tech. Rep. INT-129; U.S. Department of Agriculture: Washington, DC, USA; Forest Service: Washington, DC, USA; Intermountain Forest and Range Experimental Station: Washington, DC, USA, 1982; Volume 129, 48p. [CrossRef]
  3. Rothermel, R.C. A Mathematical Model for Predicting Fire Spread in Wildland Fuels; Intermountain Forest & Range Experiment Station: Washington, DC, USA; Forest Service: Washington, DC, USA; US Department of Agriculture, USDA: Washington, DC, USA, 1972; pp. 1–48.
  4. Keane, R.E.; Herynk, J.M.; Toney, C.; Urbanski, S.P.; Lutes, D.C.; Ottmar, R.D. Evaluating the Performance and Mapping of Three Fuel Classification Systems Using Forest Inventory and Analysis Surface Fuel Measurements. For. Ecol. Manag. 2013, 305, 248–263. [Google Scholar] [CrossRef]
  5. Kanmegne Tamga, D.; Latifi, H.; Ullmann, T.; Baumhauer, R.; Bayala, J.; Thiel, M. Estimation of Aboveground Biomass in Agroforestry Systems over Three Climatic Regions in West Africa Using Sentinel-1, Sentinel-2, ALOS, and GEDI Data. Sensors 2023, 23, 349. [Google Scholar] [CrossRef]
  6. Li, Z.; Angerer, J.P.; Jaime, X.; Yang, C.; Wu, X. Ben Estimating Rangeland Fine Fuel Biomass in Western Texas Using High-Resolution Aerial Imagery and Machine Learning. Remote Sens. 2022, 14, 4360. [Google Scholar] [CrossRef]
  7. Roberts, D.A.; Dennison, P.E.; Gardner, M.E.; Hetzel, Y.; Ustin, S.L.; Lee, C.T. Evaluation of the Potential of Hyperion for Fire Danger Assessment by Comparison to the Airborne Visible/Infrared Imaging Spectrometer. IEEE Trans. Geosci. Remote Sens. 2003, 41, 1297–1310. [Google Scholar] [CrossRef]
  8. Kukavskaya, E.A.; Shvetsov, E.G.; Buryak, L.V.; Tretyakov, P.D.; Groisman, P.Y. Increasing Fuel Loads, Fire Hazard, and Carbon Emissions from Fires in Central Siberia. Fire 2023, 6, 63. [Google Scholar] [CrossRef]
  9. Szpakowski, D.M.; Jensen, J.L.R. A Review of the Applications of Remote Sensing in Fire Ecology. Remote Sens. 2019, 11, 2638. [Google Scholar] [CrossRef]
  10. van Wagtendonk, J.W.; Root, R.R. The Use of Multi-Temporal Landsat Normalized Difference Vegetation Index (NDVI) Data for Mapping Fuel Models in Yosemite National Park, USA. Int. J. Remote Sens. 2010, 24, 1639–1651. [Google Scholar] [CrossRef]
  11. Ji, L.; Wylie, B.K.; Nossov, D.R.; Peterson, B.; Waldrop, M.P.; McFarland, J.W.; Rover, J.; Hollingsworth, T.N. Estimating Aboveground Biomass in Interior Alaska with Landsat Data and Field Measurements. Int. J. Appl. Earth Obs. Geoinf. 2012, 18, 451–461. [Google Scholar] [CrossRef]
  12. Frazier, R.J.; Coops, N.C.; Wulder, M.A.; Kennedy, R. Characterization of Aboveground Biomass in an Unmanaged Boreal Forest Using Landsat Temporal Segmentation Metrics. ISPRS J. Photogramm. Remote Sens. 2014, 92, 137–146. [Google Scholar] [CrossRef]
  13. Zhang, G.; Ganguly, S.; Nemani, R.R.; White, M.A.; Milesi, C.; Hashimoto, H.; Wang, W.; Saatchi, S.; Yu, Y.; Myneni, R.B. Estimation of Forest Aboveground Biomass in California Using Canopy Height and Leaf Area Index Estimated from Satellite Data. Remote Sens. Environ. 2014, 151, 44–56. [Google Scholar] [CrossRef]
  14. Bao, W.; Yu, F.; Chang, Y.; Guo, M.; Zhou, F.; Zhong, C. Estimation of Fuel Load Using Remote Sensing Data in Hulunbuir Grassland. Nat. Hazards Res. 2022, 2, 375–383. [Google Scholar] [CrossRef]
  15. Li, Y.; Quan, X.; Liao, Z.; He, B. Forest Fuel Loads Estimation from Landsat ETM+ and ALOS PALSAR Data. Remote Sens. 2021, 13, 1189. [Google Scholar] [CrossRef]
  16. Gao, X.; Dong, S.; Li, S.; Xu, Y.; Liu, S.; Zhao, H.; Yeomans, J.; Li, Y.; Shen, H.; Wu, S.; et al. Using the Random Forest Model and Validated MODIS with the Field Spectrometer Measurement Promote the Accuracy of Estimating Aboveground Biomass and Coverage of Alpine Grasslands on the Qinghai-Tibetan Plateau. Ecol. Indic. 2020, 112, 106114. [Google Scholar] [CrossRef]
  17. Cui, L.; Jiao, Z.; Dong, Y.; Sun, M.; Zhang, X.; Yin, S.; Ding, A.; Chang, Y.; Guo, J.; Xie, R. Estimating Forest Canopy Height Using MODIS BRDF Data Emphasizing Typical-Angle Reflectances. Remote Sens. 2019, 11, 2239. [Google Scholar] [CrossRef]
  18. Goldbergs, G.; Levick, S.R.; Lawes, M.; Edwards, A. Hierarchical Integration of Individual Tree and Area-Based Approaches for Savanna Biomass Uncertainty Estimation from Airborne LiDAR. Remote Sens. Environ. 2018, 205, 141–150. [Google Scholar] [CrossRef]
  19. Xu, Q.; Man, A.; Fredrickson, M.; Hou, Z.; Pitkänen, J.; Wing, B.; Ramirez, C.; Li, B.; Greenberg, J.A. Quantification of Uncertainty in Aboveground Biomass Estimates Derived from Small-Footprint Airborne LiDAR. Remote Sens. Environ. 2018, 216, 514–528. [Google Scholar] [CrossRef]
  20. Zarco-Tejada, P.J.; Rueda, C.A.; Ustin, S.L. Water Content Estimation in Vegetation with MODIS Reflectance Data and Model Inversion Methods. Remote Sens. Environ. 2003, 85, 109–124. [Google Scholar] [CrossRef]
  21. Dennison, P.E.; Roberts, D.A.; Peterson, S.H.; Rechel, J. Use of Normalized Difference Water Index for Monitoring Live Fuel Moisture. Int. J. Remote Sens. 2006, 26, 1035–1042. [Google Scholar] [CrossRef]
  22. Sousa, D.; Small, C. Global Cross-Calibration of Landsat Spectral Mixture Models. Remote Sens. Environ. 2017, 192, 139–149. [Google Scholar] [CrossRef]
  23. Routh, D.; Seegmiller, L.; Bettigole, C.; Kuhn, C.; Oliver, C.D.; Glick, H.B. Improving the Reliability of Mixture Tuned Matched Filtering Remote Sensing Classification Results Using Supervised Learning Algorithms and Cross-Validation. Remote Sens. 2018, 10, 1675. [Google Scholar] [CrossRef]
  24. Franke, J.; Barradas, A.C.S.; Borges, M.A.; Menezes Costa, M.; Dias, P.A.; Hoffmann, A.A.; Orozco Filho, J.C.; Melchiori, A.E.; Siegert, F. Fuel Load Mapping in the Brazilian Cerrado in Support of Integrated Fire Management. Remote Sens. Environ. 2018, 217, 221–232. [Google Scholar] [CrossRef]
  25. Secretaria do Planejamento e da Modernização da Gestão Pública (Seplan). Atlas of Tocantins: Subsidies for Land Management Planning [Atlas Do Tocantins: Subsídios Ao Planejamento Da Gestão Territorial], 6th ed.; Governo do Estado do Tocantins: Palmas, Brazil, 2012.
  26. Instituto Chico Mendes De Conservação Da Biodiversidade (ICMBio). Management Plan for Serra Geral Do Tocantins Ecological Station [Plano de Manejo Para Estação Ecológica Serra Geral Do Tocantins (EESGT)], 1st ed.; Instituto Chico Mendes De Conservação Da Biodiversidade (ICMBio): Brasília, Brazil, 2014; Volume 1.
  27. Arruda, M.B.; von Behr, M. Jalapão: Scientific and Conservation Expedition [Jalapão: Expedição Científica e Conservacionista], 1st ed.; Instituto Brasileiro do Meio Ambiente e dos Recursos Naturais Renováveis (Ibama): Brasília, Brazil, 2002; Volume 1.
  28. Ribeiro, J.F.; Walter, B.M.T. The Main Phytophysiognomies of the Cerrado Biome [As Principais Fitofisionomias Do Bioma Cerrado]; Sano, S.M., Almeida, S.P., Ribeiro, J.F., Eds.; Embrapa: Brasília, Brazil, 2008; Volume 1.
  29. Santos, R.P.; Crema, A.; Szmuchrowski, M.A.; Possapp, J.J.; Nogueira, C.C.; Asano, K.; Kawaguchi, M.; Dino, K. Atlas of the Ecological Corridor of the Jalapão Region [Atlas Do Corredor Ecológico Da Região Do Jalapão], 2nd ed.; Instituto Chico Mendes de Conservação da Biodiversidade: Brasília, Brazil, 2013; Volume 1.
  30. Schroeder, M.J.; Buck, C.C. Fire Weather: A Guide for Application of Meteorological Information to Forest Fire Control Operations; USDA: Washington, DC, USA, 1970; Volume 1.
  31. Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-Scale Geospatial Analysis for Everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
  32. Vermote, E.; Roger, J.C.; Franch, B.; Skakun, S. LASRC (Land Surface Reflectance Code): Overview, Application and Validation Using MODIS, VIIRS, LANDSAT and Sentinel 2 Data’s. In Proceedings of the IGARSS 2018-2018 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Valencia, Spain, 22–27 July 2018; pp. 8173–8176. [Google Scholar] [CrossRef]
  33. Davidson, A.; Wang, S.; Wilmshurst, J. Remote Sensing of Grassland–Shrubland Vegetation Water Content in the Shortwave Domain. Int. J. Appl. Earth Obs. Geoinf. 2006, 8, 225–236. [Google Scholar] [CrossRef]
  34. Yebra, M.; Dennison, P.E.; Chuvieco, E.; Riaño, D.; Zylstra, P.; Hunt, E.R.; Danson, F.M.; Qi, Y.; Jurdao, S. A Global Review of Remote Sensing of Live Fuel Moisture Content for Fire Danger Assessment: Moving towards Operational Products. Remote Sens. Environ. 2013, 136, 455–468. [Google Scholar] [CrossRef]
  35. Rouse, J.W.J.; Haas, R.H.; Schell, J.A.; Deering, D.W. Monitoring Vegetation Systems in the Great Plains with ERTS; NASA Technical Reports Server (NTRS): Hampton, VA, USA, 1974.
  36. Gitelson, A.A.; Kaufman, Y.J.; Stark, R.; Rundquist, D. Novel Algorithms for Remote Estimation of Vegetation Fraction. Remote Sens. Environ. 2002, 80, 76–87. [Google Scholar] [CrossRef]
  37. Birth, G.S.; McVey, G.R. Measuring the Color of Growing Turf with a Reflectance Spectrophotometer1. Agron. J. 1968, 60, 640–643. [Google Scholar] [CrossRef]
  38. Peñuelas, J.; Filella, I.; Gamon, J.A. Assessment of Photosynthetic Radiation-Use Efficiency with Spectral Reflectance. New Phytol. 1995, 131, 291–296. [Google Scholar] [CrossRef]
  39. Huete, A.R. A Soil-Adjusted Vegetation Index (SAVI). Remote Sens. Environ. 1988, 25, 295–309. [Google Scholar] [CrossRef]
  40. Gao, B.C. NDWI—A Normalized Difference Water Index for Remote Sensing of Vegetation Liquid Water from Space. Remote Sens. Environ. 1996, 58, 257–266. [Google Scholar] [CrossRef]
  41. Hunt, E.R.; Rock, B.N. Detection of Changes in Leaf Water Content Using Near- and Middle-Infrared Reflectances. Remote Sens. Environ. 1989, 30, 43–54. [Google Scholar] [CrossRef]
  42. Paltridge, G.W.; Barber, J. Monitoring Grassland Dryness and Fire Potential in Australia with NOAA/AVHRR Data. Remote Sens. Environ. 1988, 25, 381–394. [Google Scholar] [CrossRef]
  43. Chen, J.M. Evaluation of Vegetation Indices and a Modified Simple Ratio for Boreal Applications. Can. J. Remote Sens. 2014, 22, 229–242. [Google Scholar] [CrossRef]
  44. Rock, B.N.; Vogelmann, J.E.; Williams, D.L.; Vogelmann, A.F.; Hoshizaki, T. Remote Detection of Forest DamagePlant Responses to Stress May Have Spectral “Signatures” That Could Be Used to Map, Monitor, and Measure Forest Damage. Bioscience 1986, 36, 439–445. [Google Scholar] [CrossRef]
  45. Xu, H. Modification of Normalised Difference Water Index (NDWI) to Enhance Open Water Features in Remotely Sensed Imagery. Int. J. Remote Sens. 2007, 27, 3025–3033. [Google Scholar] [CrossRef]
  46. Chuvieco, E.; Martín, M.P.; Palacios, A. Assessment of Different Spectral Indices in the Red-near-Infrared Spectral Domain for Burned Land Discrimination. Int. J. Remote Sens. 2010, 23, 5103–5110. [Google Scholar] [CrossRef]
  47. Huete, A.R.; Liu, H.Q.; Batchily, K.; van Leeuwen, W. A Comparison of Vegetation Indices over a Global Set of TM Images for EOS-MODIS. Remote Sens. Environ. 1997, 59, 440–451. [Google Scholar] [CrossRef]
  48. Danson, F.M.; Steven, M.D.; Malthus, T.J.; Clark, J.A. High-Spectral Resolution Data for Determining Leaf Water Content. Int. J. Remote Sens. 1992, 13, 461–470. [Google Scholar] [CrossRef]
  49. Ozkan, S.; Bozdagi Akar, G. Improved Deep Spectral Convolution Network for Hyperspectral Unmixing With Multinomial Mixture Kernel and Endmember Uncertainty. arXiv 2018, arXiv:1808.01104. [Google Scholar]
  50. Cao, S.; Yu, Q.; Sanchez-Azofeifa, A.; Feng, J.; Rivard, B.; Gu, Z. Mapping Tropical Dry Forest Succession Using Multiple Criteria Spectral Mixture Analysis. ISPRS J. Photogramm. Remote Sens. 2015, 109, 17–29. [Google Scholar] [CrossRef]
  51. Mayes, M.T.; Mustard, J.F.; Melillo, J.M. Forest Cover Change in Miombo Woodlands: Modeling Land Cover of African Dry Tropical Forests with Linear Spectral Mixture Analysis. Remote Sens. Environ. 2015, 165, 203–215. [Google Scholar] [CrossRef]
  52. Richter, R. A Spatially Adaptive Fast Atmospheric Correction Algorithm. Int. J. Remote Sens. 1996, 17, 1201–1214. [Google Scholar] [CrossRef]
  53. Tane, Z.; Roberts, D.; Veraverbeke, S.; Casas, Á.; Ramirez, C.; Ustin, S. Evaluating Endmember and Band Selection Techniques for Multiple Endmember Spectral Mixture Analysis Using Post-Fire Imaging Spectroscopy. Remote Sens. 2018, 10, 389. [Google Scholar] [CrossRef]
  54. Crabbé, A.H.; Somers, B.; Roberts, D.A.; Halligan, K.; Dennison, P.; Dudley, K. MESMA QGIS Plugin (Version 1.0.8). 2020. Available online: https://mesma.readthedocs.io/ (accessed on 21 September 2023).
  55. Chuvieco, E.; Riaño, D.; Aguado, I.; Cocero, D. Estimation of Fuel Moisture Content from Multitemporal Analysis of Landsat Thematic Mapper Reflectance Data: Applications in Fire Danger Assessment. Int. J. Remote Sens. 2002, 23, 2145–2162. [Google Scholar] [CrossRef]
  56. Helmer, E.H.; Ruzycki, T.S.; Wunderle, J.M.; Vogesser, S.; Ruefenacht, B.; Kwit, C.; Brandeis, T.J.; Ewert, D.N. Mapping Tropical Dry Forest Height, Foliage Height Profiles and Disturbance Type and Age with a Time Series of Cloud-Cleared Landsat and ALI Image Mosaics to Characterize Avian Habitat. Remote Sens. Environ. 2010, 114, 2457–2473. [Google Scholar] [CrossRef]
  57. Main-Knorn, M.; Cohen, W.B.; Kennedy, R.E.; Grodzki, W.; Pflugmacher, D.; Griffiths, P.; Hostert, P. Monitoring Coniferous Forest Biomass Change Using a Landsat Trajectory-Based Approach. Remote Sens. Environ. 2013, 139, 277–290. [Google Scholar] [CrossRef]
  58. Chrysafis, I.; Mallinis, G.; Gitas, I.; Tsakiri-Strati, M. Estimating Mediterranean Forest Parameters Using Multi Seasonal Landsat 8 OLI Imagery and an Ensemble Learning Method. Remote Sens. Environ. 2017, 199, 154–166. [Google Scholar] [CrossRef]
  59. Marselis, S.M.; Tang, H.; Armston, J.D.; Calders, K.; Labrière, N.; Dubayah, R. Distinguishing Vegetation Types with Airborne Waveform Lidar Data in a Tropical Forest-Savanna Mosaic: A Case Study in Lopé National Park, Gabon. Remote Sens. Environ. 2018, 216, 626–634. [Google Scholar] [CrossRef]
  60. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  61. Zeng, N.; Ren, X.; He, H.; Zhang, L.; Zhao, D.; Ge, R.; Li, P.; Niu, Z. Estimating Grassland Aboveground Biomass on the Tibetan Plateau Using a Random Forest Algorithm. Ecol. Indic. 2019, 102, 479–487. [Google Scholar] [CrossRef]
  62. de Carvalho Junior, W.; Calderano Filho, B.; da Silva Chagas, C.; Bhering, S.B.; Pereira, N.R.; Pinheiro, H.S.K. Multiple Linear Regression and Random Forest Model to Estimate Soil Bulk Density in Mountainous Regions [Regressão Linear Múltipla e Modelo Random Forest Para Estimar a Densidade Do Solo Em Áreas Montanhosas]. Pesqui Agropecu Bras 2016, 51, 1428–1437. [Google Scholar] [CrossRef]
  63. Li, Z.; Xin, X.; Huan, T.; Fan, Y.; Chen, B.; Zhang, B. Estimating Grassland LAI Using the Random Forests Approach and Landsat Imagery in the Meadow Steppe of Hulunber, China. J. Integr. Agric. 2017, 16, 286–297. [Google Scholar] [CrossRef]
  64. Ou, Q.; Lei, X.; Shen, C. Individual Tree Diameter Growth Models of Larch–Spruce–Fir Mixed Forests Based on Machine Learning Algorithms. Forests 2019, 10, 187. [Google Scholar] [CrossRef]
  65. Thanh Noi, P.; Kappas, M. Comparison of Random Forest, k-Nearest Neighbor, and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery. Sensors 2018, 18, 18. [Google Scholar] [CrossRef] [PubMed]
  66. Wang, L.; Zhou, X.; Zhu, X.; Dong, Z.; Guo, W. Estimation of Biomass in Wheat Using Random Forest Regression Algorithm and Remote Sensing Data. Crop J. 2016, 4, 212–219. [Google Scholar] [CrossRef]
  67. Sun, H.; Gui, D.; Yan, B.; Liu, Y.; Liao, W.; Zhu, Y.; Lu, C.; Zhao, N. Assessing the Potential of Random Forest Method for Estimating Solar Radiation Using Air Pollution Index. Energy Convers. Manag. 2016, 119, 121–129. [Google Scholar] [CrossRef]
  68. Liaw, A.; Wiener, M. Classification and Regression by RandomForest. R News 2002, 2, 18–22. [Google Scholar]
  69. Gao, Y.; Lu, D.; Li, G.; Wang, G.; Chen, Q.; Liu, L.; Li, D. Comparative Analysis of Modeling Algorithms for Forest Aboveground Biomass Estimation in a Subtropical Region. Remote Sens. 2018, 10, 627. [Google Scholar] [CrossRef]
  70. Ohmann, J.L.; Gregory, M.J.; Roberts, H.M. Scale Considerations for Integrating Forest Inventory Plot Data and Satellite Image Data for Regional Forest Mapping. Remote Sens. Environ. 2014, 151, 3–15. [Google Scholar] [CrossRef]
  71. Zhang, W.; Zhao, L.; Li, Y.; Shi, J.; Yan, M.; Ji, Y. Forest Above-Ground Biomass Inversion Using Optical and SAR Images Based on a Multi-Step Feature Optimized Inversion Model. Remote Sens. 2022, 14, 1608. [Google Scholar] [CrossRef]
  72. Avand, M.; Janizadeh, S.; Naghibi, S.A.; Pourghasemi, H.R.; Bozchaloei, S.K.; Blaschke, T. A Comparative Assessment of Random Forest and K-Nearest Neighbor Classifiers for Gully Erosion Susceptibility Mapping. Water 2019, 11, 2076. [Google Scholar] [CrossRef]
  73. Pflugmacher, D.; Cohen, W.B.; Kennedy, R.E. Using Landsat-Derived Disturbance History (1972–2010) to Predict Current Forest Structure. Remote Sens. Environ. 2012, 122, 146–165. [Google Scholar] [CrossRef]
  74. Sun, X.; Li, G.; Wang, M.; Fan, Z. Analyzing the Uncertainty of Estimating Forest Aboveground Biomass Using Optical Imagery and Spaceborne LiDAR. Remote Sens. 2019, 11, 722. [Google Scholar] [CrossRef]
  75. Wu, C.; Shen, H.; Shen, A.; Deng, J.; Gan, M.; Zhu, J.; Xu, H.; Wang, K. Comparison of Machine-Learning Methods for above-Ground Biomass Estimation Based on Landsat Imagery. J. Appl. Remote Sens. 2016, 10, 035010. [Google Scholar] [CrossRef]
  76. Peterson, S.H.; Roberts, D.A.; Dennison, P.E. Mapping Live Fuel Moisture with MODIS Data: A Multiple Regression Approach. Remote Sens. Environ. 2008, 112, 4272–4284. [Google Scholar] [CrossRef]
  77. Santos, M.M.; Batista, A.C.; de Carvalho, E.V.; de Cássia Da Silva, F.; Pedro, C.M.; Giongo, M. Relationships between Moisture Content and Flammabilityof Campestral Cerrado Species in Jalapão. Rev. Bras. Ciências Agrárias 2018, 13, 1–9. [Google Scholar] [CrossRef]
  78. Santos, M.M.; Batista, A.C.; da Silva, A.D.P.; Neto, E.G.; Barradas, A.C.S.; Giongo, M. Characterization and Dynamics of Surface Fuel of Cerrado Grassland in Jalapão Region—Tocantins, Brazil. Floresta 2020, 51, 127–136. [Google Scholar] [CrossRef]
  79. Cawson, J.G.; Nyman, P.; Schunk, C.; Sheridan, G.J.; Duff, T.J.; Gibos, K.; Bovill, W.D.; Conedera, M.; Pezzatti, G.B.; Menzel, A. Estimation of Surface Dead Fine Fuel Moisture Using Automated Fuel Moisture Sticks across a Range of Forests Worldwide. Int. J. Wildland Fire 2020, 29, 548–559. [Google Scholar] [CrossRef]
  80. Slijepcevic, A.; Anderson, W.R.; Matthews, S.; Anderson, D.H. Evaluating Models to Predict Daily Fine Fuel Moisture Content in Eucalypt Forest. For. Ecol. Manag. 2015, 335, 261–269. [Google Scholar] [CrossRef]
  81. Soares, R.V.; Batista, A.C.; Tetto, A.F. Forest Fires: Control, Effects and Use of Fire [Incêndios Florestais: Controle, Efeitos e Uso Do Fogo], 2nd ed.; UFPR: Curitiba, Brazil, 2017; Volume 1.
  82. Tucker, C.J. Spectral Estimation of Grass Canopy Variables. Remote Sens. Environ. 1977, 6, 11–26. [Google Scholar] [CrossRef]
  83. Dube, T.; Mutanga, O. Evaluating the Utility of the Medium-Spatial Resolution Landsat 8 Multispectral Sensor in Quantifying Aboveground Biomass in UMgeni Catchment, South Africa. ISPRS J. Photogramm. Remote Sens. 2015, 101, 36–46. [Google Scholar] [CrossRef]
  84. Pierce, A.D.; Farris, C.A.; Taylor, A.H. Use of Random Forests for Modeling and Mapping Forest Canopy Fuels for Fire Behavior Analysis in Lassen Volcanic National Park, California, USA. For. Ecol. Manag. 2012, 279, 77–89. [Google Scholar] [CrossRef]
  85. D’este, M.; Elia, M.; Giannico, V.; Spano, G.; Lafortezza, R.; Sanesi, G. Machine Learning Techniques for Fine Dead Fuel Load Estimation Using Multi-Source Remote Sensing Data. Remote Sens. 2021, 13, 1658. [Google Scholar] [CrossRef]
  86. Chirici, G.; McRoberts, R.E.; Fattorini, L.; Mura, M.; Marchetti, M. Comparing Echo-Based and Canopy Height Model-Based Metrics for Enhancing Estimation of Forest Aboveground Biomass in a Model-Assisted Framework. Remote Sens. Environ. 2016, 174, 1–9. [Google Scholar] [CrossRef]
  87. Mura, M.; McRoberts, R.E.; Chirici, G.; Marchetti, M. Statistical Inference for Forest Structural Diversity Indices Using Airborne Laser Scanning Data and the K-Nearest Neighbors Technique. Remote Sens. Environ. 2016, 186, 678–686. [Google Scholar] [CrossRef]
Figure 1. Study area and spatial distribution of data sampling units. (AD) Fuel sampling zones with random plots distributed in the Cerrado grassland areas in the protected area: Datum SIRGAS 2000, UTM, spindle 23 South. Landsat 8 OLI images (RGB: 754), 221/67 and 221/68 (Path/Row) of date 5 May 2017.
Figure 1. Study area and spatial distribution of data sampling units. (AD) Fuel sampling zones with random plots distributed in the Cerrado grassland areas in the protected area: Datum SIRGAS 2000, UTM, spindle 23 South. Landsat 8 OLI images (RGB: 754), 221/67 and 221/68 (Path/Row) of date 5 May 2017.
Remotesensing 15 05481 g001
Figure 2. Overview of fuel sampling. (A) Physiognomy of some sampling areas, (B) fuel data sampling unit, (C) sub-samples of surface fuel characterization, and (D) number of samples taken at the beginning (May and June) and end of the dry season (August and September).
Figure 2. Overview of fuel sampling. (A) Physiognomy of some sampling areas, (B) fuel data sampling unit, (C) sub-samples of surface fuel characterization, and (D) number of samples taken at the beginning (May and June) and end of the dry season (August and September).
Remotesensing 15 05481 g002
Figure 3. Flowchart of the methodology applied in this study. R2 = coefficient of determination, R2adj = adjusted coefficient of determination, RMSE = root-mean-square error, MAE = mean absolute error.
Figure 3. Flowchart of the methodology applied in this study. R2 = coefficient of determination, R2adj = adjusted coefficient of determination, RMSE = root-mean-square error, MAE = mean absolute error.
Remotesensing 15 05481 g003
Figure 4. Fine fuel load mappings performed using the best RF models: (A,B) estimates for dead grass fuel and (C,D) estimates for 1 h downed wood debris.
Figure 4. Fine fuel load mappings performed using the best RF models: (A,B) estimates for dead grass fuel and (C,D) estimates for 1 h downed wood debris.
Remotesensing 15 05481 g004
Figure 5. Fine fuel load mappings were performed using the best RF models: (A,B) estimates for the total fine dead fuel and (C,D) estimates for the total fine fuel.
Figure 5. Fine fuel load mappings were performed using the best RF models: (A,B) estimates for the total fine dead fuel and (C,D) estimates for the total fine fuel.
Remotesensing 15 05481 g005
Figure 6. Importance of the variables of the random forest (RF) algorithm models with the best fit for fuel load estimation. (A) RF model for dead grass fuel, (B) RF models for 1 h downed wood debris, (C) RF models for the total fine dead fuel, and (D) RF models for the total fine fuel. %IncMSE indicates the increase in the mean-square error as a percentage.
Figure 6. Importance of the variables of the random forest (RF) algorithm models with the best fit for fuel load estimation. (A) RF model for dead grass fuel, (B) RF models for 1 h downed wood debris, (C) RF models for the total fine dead fuel, and (D) RF models for the total fine fuel. %IncMSE indicates the increase in the mean-square error as a percentage.
Remotesensing 15 05481 g006
Figure 7. Fuel condition map consisting of the three sub-pixel fraction images (R: NPV; G: GV; B: soil) in the Serra Geral do Tocantins Ecological Station for (A) the beginning (May 2017) and (B) the end of the dry season (September 2017).
Figure 7. Fuel condition map consisting of the three sub-pixel fraction images (R: NPV; G: GV; B: soil) in the Serra Geral do Tocantins Ecological Station for (A) the beginning (May 2017) and (B) the end of the dry season (September 2017).
Remotesensing 15 05481 g007
Table 1. Statistical metrics of the cross-validation of multiple linear regression models with the best fit for fuel load estimation.
Table 1. Statistical metrics of the cross-validation of multiple linear regression models with the best fit for fuel load estimation.
PerEstimated Fuel (y)Predictor Variables (x)R2R2adjRMSEMAE
1Dead grassNPV; soil; B5: nir; B6: swir 1; B7: swir 2; VARI; SR; SIPI; NDII; MSR; MSI; GVMI; DER56; DER340.890.810.370.32
1 h Downed wood debrisNPV; soil; GV; B4: red; B5: nir; B6: swir 1; B7: swir 2; NDVI; VARI; VIgreen; SR; SIPI; SAVI; NDWI; NRR; MSI; MNDWI; DER34; DER230.810.550.450.37
Total fine dead (grass + 1 h downed wood debris)NPV; soil; GV; B4: red; B5: nir; B6: swir 1; B7: swir 2; VARI; VIgreen; SR; SIPI; SAVI; NDWI; NDII6; NBR; DER450.920.840.490.40
Total fine
(live and dead)
NPV; soil; B2: blue; B4: red; B5: nir; B6: swir 1; NDVI; VARI; VIgreen; SR; SIPI; SAVI; NDWI; NDII6; NBR; MSR; MSI; MNDWI; INTEGRAL; DER45; DER34; DER23; GVMI0.810.371.261.06
2Dead grassNPV; Soil; B2: blue; B3: green; B5: nir; B6: swir 1; B7: swir 2; NDVI; VIgreen; SR; NDWI; NDII6; NBR; MSR; MSI; MNDWI; GVMI; EVI; DER56; DER34; DER230.810.470.440.38
1 h Downed wood debrisNPV; soil; GV; B2: blue; B3: green; B4: red; B5: nir; B7: swir 2; NDVI; VARI; VIgreen; SR; SIPI; SAVI; NDWI; NDII6; NRR; MSR; MSI; INTEGRAL; GVMI; EVI; DER56; DER45; DER34; DER230.45−1.591.901.53
Total fine dead (grass + 1 h downed wood debris)Soil; GV; B2: blue; B3: green; B4: red; B5: nir; B6: swir 1; B7: swir 2; NDVI; VARI; VIgreen; SR; SIPI; SAVI; NDWI; NDII6; NBR; MSR; MSI; INTEGRAL; GVMI; EVI; DER56; DER45; DER34; DER230.64−0.722.241.98
Total fine
(live and dead)
Soil; GV; B2: blue; B3: green; B4: red; B5: nir; B6: swir 1; B7: swir 2; NDVI; VARI; VIgreen; SR; SAVI; NDWI; NDII6; MSR; MSI; INTEGRAL; EVI; DER56; DER45; DER34; DER230.56−0.442.272.01
3Dead grassSoil; B3: green; B4: red; B5: nir; B6: swir 1; VARI; VIgreen; SR; SAVI; NDWI; NDII6; MSR; INTEGRAL; GVMI; EVI; DER560.780.720.360.31
1 h Downed wood debrisNPV; soil; B2: blue; B4: red; B7: swir 2; VARI; SR; SIPI; NDWI; NRR; MSI; INTEGRAL; GVMI; EVI; DER450.570.450.620.51
Total fine dead (grass + 1 h downed wood debris)NPV; soil; B7: swir 2; VARI; VIgreen; SIPI; NBR; MSR; MSI; INTEGRAL; GVMI; EVI; DER450.730.660.750.62
Total fine
(live and dead)
NPV; soil; GV; B2: blue; B4: red; B5: nir; B6: swir 1; B7: swir 2; VARI; SIPI; NDWI; NBR; DER56; DER340.630.541.060.91
Per = dry season period: (1) early dry season (May and June), (2) late dry season (August and September), and (3) all dry season (May to September).
Table 2. Statistical metrics used to assess the models generated using the random forest algorithm.
Table 2. Statistical metrics used to assess the models generated using the random forest algorithm.
Estimated Fuel (y)Predictor Variables (x)R2R2adjRMSEMAE
Dead grassAll predictor variables0.830.710.330.24
* Soil; B3: green; B4: red; B5: nir; B6: swir 1; VARI; VIgreen; SR; SAVI; NDWI; NDII6; MSR; INTEGRAL; GVMI; EVI; DER560.830.780.330.23
1 h Downed wood debrisAll predictor variables0.590.300.580.44
* NPV; soil; B2: blue; B4: red; B7: swir 2; VARI; SR; SIPI; NDWI; NRR; MSI; INTEGRAL; GVMI; EVI; DER450.520.380.610.46
Total fine dead
(grass + 1 h downed wood debris)
All predictor variables0.830.710.590.44
* NPV; soil; B7: swir 2; VARI; VIgreen; SIPI; NBR; MSR; MSI; INTEGRAL; GVMI; EVI; DER450.790.740.630.49
Total fine
(live and dead)
All predictor variables0.620.350.890.75
* NPV; soil; GV; B2: blue; B4: red; B5: nir; B6: swir 1; B7: swir 2; VARI; SIPI; NDWI; NBR; DER56; DER340.550.430.960.81
* Variables selected using the stepwise method in multiple linear regression analyses.
Table 3. K values chosen for each k-nearest neighbor algorithm model.
Table 3. K values chosen for each k-nearest neighbor algorithm model.
Estimated Fuel (y)Predictor Variables (x)RMSE ValueChosen K Value
Dead grassAll predictor variables0.40725
Stepwise *0.44537
1 h Downed wood debrisAll predictor variables0.81837
Stepwise *0.82539
Total fine dead (grass + 1 h downed wood debris)All predictor variables1.10515
Stepwise *1.12339
Total fine
(live and dead)
All predictor variables1.60607
Stepwise *1.613613
* Variables selected using the stepwise method in multiple linear regression analyses.
Table 4. Statistical metrics used to assess the models generated using the k-NN algorithm.
Table 4. Statistical metrics used to assess the models generated using the k-NN algorithm.
Estimated Fuel (y)Predictor Variables (x) 1R2R2adjRMSEMAE
Dead grassAll predictor variables0.680.450.370.30
Stepwise *0.610.490.410.34
1 h Downed wood debrisAll predictor variables0.34−0.130.740.61
Stepwise *0.310.110.760.63
Total fine dead (grass + 1 h downed wood debris)All predictor variables0.540.210.920.78
Stepwise *0.490.370.980.84
Total fine
(live and dead)
All predictor variables0.38−0.071.431.23
Stepwise *0.300.121.531.31
* Variables selected using the stepwise method in multiple linear regression analyses. 1 The same predictor variables are shown in Table 2 for RF models.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Santos, M.M.; Batista, A.C.; Rezende, E.H.; Da Silva, A.D.P.; Cachoeira, J.N.; Dos Santos, G.R.; Biondi, D.; Giongo, M. Estimating the Surface Fuel Load of the Plant Physiognomy of the Cerrado Grassland Using Landsat 8 OLI Products. Remote Sens. 2023, 15, 5481. https://doi.org/10.3390/rs15235481

AMA Style

Santos MM, Batista AC, Rezende EH, Da Silva ADP, Cachoeira JN, Dos Santos GR, Biondi D, Giongo M. Estimating the Surface Fuel Load of the Plant Physiognomy of the Cerrado Grassland Using Landsat 8 OLI Products. Remote Sensing. 2023; 15(23):5481. https://doi.org/10.3390/rs15235481

Chicago/Turabian Style

Santos, Micael Moreira, Antonio Carlos Batista, Eduardo Henrique Rezende, Allan Deyvid Pereira Da Silva, Jader Nunes Cachoeira, Gil Rodrigues Dos Santos, Daniela Biondi, and Marcos Giongo. 2023. "Estimating the Surface Fuel Load of the Plant Physiognomy of the Cerrado Grassland Using Landsat 8 OLI Products" Remote Sensing 15, no. 23: 5481. https://doi.org/10.3390/rs15235481

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop