Country-wide high-resolution vegetation height mapping with Sentinel-2

https://doi.org/10.1016/j.rse.2019.111347

Highlights

  • Vegetation height at 10 m ground sampling distance is regressed from Sentinel-2

  • Country-wide maps are computed for Switzerland and Gabon

  • Mean absolute error (MAE) of 1.7 m in Switzerland and 4.3 m in Gabon

  • The deep convolutional neural network correctly predicts vegetation heights up to 50 m

Abstract

Sentinel-2 multi-spectral images collected over periods of several months were used to estimate vegetation height for Gabon and Switzerland. A deep convolutional neural network (CNN) was trained to extract suitable spectral and textural features from reflectance images and to regress per-pixel vegetation height. In Gabon, reference heights for training and validation were derived from airborne LiDAR measurements. In Switzerland, reference heights were taken from an existing canopy height model derived via photogrammetric surface reconstruction. The resulting maps have a mean absolute error (MAE) of 1.7 m in Switzerland and 4.3 m in Gabon (a root mean square error (RMSE) of 3.4 m and 5.6 m, respectively), and correctly estimate vegetation heights up to >50 m. They also show good qualitative agreement with existing vegetation height maps. Our work demonstrates that, given a moderate amount of reference data (i.e., 2000 km² in Gabon and ≈5800 km² in Switzerland), high-resolution vegetation height maps with 10 m ground sampling distance (GSD) can be derived at country scale from Sentinel-2 imagery.

Introduction

Vegetation height is a basic variable to characterise a forest's structure, and is known to correlate with important biophysical parameters like primary productivity (Thomas et al., 2008), above-ground biomass (Anderson et al., 2006) and biodiversity (Goetz et al., 2007). However, direct measurement of tree height does not scale to large areas and/or high spatial resolution: in-situ observations are in practice only feasible for a limited number of sample plots and logging sites. Airborne light detection and ranging (LiDAR) can map canopy height over ground densely and accurately, but the financial cost and the limited area covered per day only allow for small regional projects (some countries of moderate size have complete coverage, but with long intervals of several years between subsequent acquisitions). Finally, space-borne LiDAR provides world-wide coverage, but the measurements are sparse in both space and time: distances between adjacent profiles are in the tens of kilometres, and nearby observations have been acquired up to 6 years apart. After 7 years of data collection, the point density in Gabon, for example, is only 1.26 shots per km² (Baghdadi et al., 2013). Moreover, each measurement is averaged over a ground footprint of 70 m radius.

Hence, dense wide-area maps of canopy height are typically obtained by regression from multi-spectral satellite images, using in-situ or LiDAR heights as reference data to fit the regression model (Lefsky, 2010, Hudak et al., 2002). This approach has made it possible to produce tree height maps with ground resolutions down to 30 m, by exploiting the Landsat archive (Hansen et al., 2016).

Here, we demonstrate country-wide mapping of canopy height with a ground resolution of 10 m, by regression from Sentinel-2 multi-spectral data. At such high resolution, the spectral signature of an individual pixel is no longer sufficient to predict tree height. Rather, the physical phenomena underlying the monocular prediction of tree height, like shadowing, roughness and species distribution, give rise to reflectance patterns across neighbourhoods of multiple pixels. It is, however, not obvious how to encode the resulting image textures into predictive feature descriptors that support the regression. To sidestep this problem, we resort to deep learning. Recent progress in computer vision and image analysis has impressively demonstrated that very deep convolutional neural networks (CNNs) are able to learn a tailored multi-level feature encoding for a given prediction task from raw images, given a sufficiently large amount of training data. Our experiments reveal that texture patterns are particularly important in areas of high (tropical) forest, extending the sensitivity of the regressor to heights up to ≈55 m. End-to-end learning of rich contextual feature hierarchies underlies several successes of image and raster data analysis, including visual recognition of objects (Krizhevsky et al., 2012), understanding human speech from spectrograms (Abdel-Hamid et al., 2014) and assessment of positions in board games like Go or chess (Silver et al., 2018).
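To make the role of multi-pixel context concrete, the following minimal sketch (in PyTorch; the layer count and width are illustrative assumptions, not the architecture used in this work) shows how a stack of 3 × 3 convolutions progressively widens the pixel neighbourhood that informs each per-pixel height estimate:

```python
# Minimal sketch of a fully convolutional height regressor. Each 3x3 layer
# enlarges the receptive field by 2 pixels, so spectral AND textural cues
# from a growing neighbourhood feed into every per-pixel prediction.
# Depth/width are hypothetical; the paper's network is substantially deeper.
import torch
import torch.nn as nn

class TextureHeightRegressor(nn.Module):
    def __init__(self, in_bands: int = 13, width: int = 64, depth: int = 8):
        super().__init__()
        layers = [nn.Conv2d(in_bands, width, 3, padding=1), nn.ReLU()]
        for _ in range(depth - 1):
            layers += [nn.Conv2d(width, width, 3, padding=1), nn.ReLU()]
        layers.append(nn.Conv2d(width, 1, 1))  # one height value per pixel
        self.net = nn.Sequential(*layers)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x).squeeze(1)          # (B, 13, H, W) -> (B, H, W)

# With depth = 8, each output pixel sees a 17 x 17 window,
# i.e. 170 m x 170 m of context at 10 m GSD.
model = TextureHeightRegressor()
heights = model(torch.randn(1, 13, 128, 128))  # shape (1, 128, 128)
```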

We employ a deep convolutional neural network to regress country-wide canopy height for Gabon and Switzerland from 13-channel Sentinel-2 Level 2A images (corrected to bottom-of-atmosphere reflectance), using reference values obtained from airborne LiDAR scans and photogrammetric stereo matching as training data. The two countries were selected because in both we have access to reference data for training and quantitative evaluation: in Switzerland from the national forest inventory program; in Gabon via NASA's LVIS project. At the same time, the two countries are very different in terms of their geography and biomes, which supports our belief that the proposed approach can be scaled up to global coverage. Importantly, we also find that no long time series or multi-temporal signatures are required. A few observations per pixel (4 to 12) already achieve low prediction errors – in fact, even predicting from a single image yields fairly decent results. This means that, at the 5-day revisit cycle of Sentinel-2, we are able to obtain almost complete coverage using only the 10 clearest images within the leaf-on season (May–September) for Switzerland or within a period of 12 months in tropical forest regions with frequent cloud cover.
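As a hedged sketch of such a training setup (the L1 objective, optimiser and no-data convention below are our illustrative assumptions, not necessarily the exact choices of the paper), a model like the one sketched above could be fitted to rasterised reference heights as follows:

```python
# Training sketch (PyTorch): regress per-pixel canopy height against
# rasterised LiDAR/photogrammetric reference heights with an L1 loss,
# ignoring pixels that carry a no-data value.
import torch

def masked_l1(pred: torch.Tensor, ref: torch.Tensor,
              nodata: float = -1.0) -> torch.Tensor:
    valid = ref != nodata                        # pixels with a reference height
    return (pred[valid] - ref[valid]).abs().mean()

def train(model: torch.nn.Module, loader, epochs: int = 10,
          lr: float = 1e-4) -> None:
    # `loader` is assumed to yield (B, 13, H, W) reflectance tensors and
    # (B, H, W) reference heights in metres.
    optimiser = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        for images, ref in loader:
            optimiser.zero_grad()
            loss = masked_l1(model(images), ref)
            loss.backward()
            optimiser.step()
```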

Our work is, to our knowledge, the first to demonstrate large-scale vegetation height mapping from optical satellites at 10 m GSD. The model is able to retrieve tree heights up to ≈55 m, well beyond the saturation level of existing high-resolution canopy height maps (e.g., Hansen et al., 2016). At the technical level, we are not aware of any other work that employs deep CNNs for canopy height estimation from optical satellite data.

Based on the present work, the next goal is to generate a global, wall-to-wall map of canopy height.

Section snippets

Remote sensing of vegetation height

The most straightforward approach to measuring canopy height over large areas is airborne or spaceborne LiDAR. By directly measuring the range from the sensor both to points near the tree tops and to points on the ground (as well as to further returns in between), LiDAR delivers a direct and very accurate observation of the canopy height over ground, and also makes it possible to derive further information about vegetation structure. That approach was developed as soon as airborne LiDAR systems became available.
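In raster form, the canopy height model (CHM) such surveys deliver is simply the per-pixel difference between a digital surface model and a digital terrain model derived from the point cloud; a minimal sketch (the clipping of negative values is our assumption):

```python
# Canopy height model (CHM) from LiDAR-derived rasters:
# DSM (tree tops) minus DTM (bare ground), per pixel.
import numpy as np

def canopy_height(dsm: np.ndarray, dtm: np.ndarray) -> np.ndarray:
    return np.clip(dsm - dtm, 0.0, None)  # clip small negative artefacts to 0
```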

Sentinel-2

Sentinel-2 is a satellite mission within the European Space Agency's (ESA) Copernicus program, consisting of two identical satellites launched in 2015 and 2017, respectively, with an expected lifetime of 7.25 years. The satellites each carry a multi-spectral instrument, and together reach a revisit time of 5 days. The sensor captures 13 spectral bands with varying spatial resolution (10 m, 20 m, 60 m). Four bands provide 10 m ground sampling distance.
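Stacking all 13 bands into a single 10 m input therefore requires resampling the 20 m and 60 m bands onto the 10 m grid; a minimal sketch with rasterio (the bilinear resampling choice is our assumption, and the band path is a placeholder):

```python
# Read one Sentinel-2 band and resample it to the 10 m grid so that all
# 13 bands can be stacked into a (13, H, W) input array.
import numpy as np
import rasterio
from rasterio.enums import Resampling

def read_band_at_10m(band_path: str, height: int, width: int) -> np.ndarray:
    with rasterio.open(band_path) as src:
        return src.read(1, out_shape=(height, width),
                        resampling=Resampling.bilinear)
```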

Preprocessing

ESA's sen2cor toolbox provides standard algorithms to correct atmospheric effects (Mueller-Wilm, 2018). Following best practice, we use this toolbox for radiometric correction and create the Level 2A product, i.e., bottom-of-atmosphere reflectance. By decreasing variability due to atmospheric effects, the distribution of the image values is homogenised across different sensing dates and geographic regions, which simplifies the regression problem and may lead to improved generalisation.
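For reference, a minimal way to run this correction step from Python (assuming sen2cor is installed so that its L2A_Process command is on the PATH; the SAFE directory below is a placeholder):

```python
# Convert a Level-1C SAFE product to Level-2A (bottom-of-atmosphere
# reflectance) by calling sen2cor's L2A_Process command.
import subprocess

def to_level_2a(safe_dir: str) -> None:
    subprocess.run(["L2A_Process", safe_dir], check=True)

# Example (placeholder product name):
# to_level_2a("/data/S2A_MSIL1C_20180612T103021_N0206_R108_T32TMT.SAFE")
```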

Results and discussion

We quantitatively evaluate our approach on 7 regions in total, 5 in Gabon (GA) and 2 in Switzerland (CH); see Fig. 1. Each region is split into spatially disjoint training, validation and test sets. Depending on the region, four to twelve Sentinel-2 images with overall cloud coverage <70% are available (Table 1). The CNN is trained on images from multiple acquisition dates, under the assumption that vegetation height did not change significantly within the investigated time interval.
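The two error metrics reported throughout are standard; for completeness, a minimal NumPy implementation evaluated over the held-out test pixels of a region:

```python
# Mean absolute error and root mean square error between predicted and
# reference canopy heights (both in metres), over valid test pixels.
import numpy as np

def mae_rmse(pred: np.ndarray, ref: np.ndarray):
    err = pred - ref
    return float(np.abs(err).mean()), float(np.sqrt(np.mean(err ** 2)))
```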

Conclusion

Our proposed data-driven approach allows one to map vegetation height at 10 m resolution. We show that regression from a few Sentinel-2 images achieves low error in the tropics as well as in central Europe, and that our method is suitable for country-scale canopy height mapping in terms of both generalisation and computation time. Our CNN-based learning engine, which is able to exploit spatial context and texture features, can predict a high-resolution vegetation height map from a single Sentinel-2 image.

Acknowledgement

We thank Christian Ginzler from WSL for sharing the reference data for Switzerland. We greatly appreciate the open data policies of the LVIS project and the ESA Copernicus program. The project received funding from Barry Callebaut Sourcing AG, as a part of a Research Project Agreement.

References (58)

  • D. Marmanis et al.

    Classification with an edge: improving semantic image segmentation with boundary detection

    ISPRS J. Photogramm. Remote Sens.

    (2018)
  • E. Naesset

    Determination of mean tree height of forest stands using airborne laser scanner data

    ISPRS J. Photogramm. Remote Sens.

    (1997)
  • O. Abdel-Hamid et al.

    Convolutional neural networks for speech recognition

IEEE/ACM Trans. Audio Speech Lang. Process.

    (2014)
  • J.B. Abshire et al.

    Geoscience laser altimeter system (GLAS) on the ICESat mission: on-orbit measurement performance

    Geophys. Res. Lett.

    (2005)
  • G. Asner et al.

    High-resolution mapping of forest carbon stocks in the Colombian Amazon

    Biogeosciences

    (2012)
  • A. Baccini et al.

    A first map of tropical Africa's above-ground biomass derived from satellite imagery

    Environ. Res. Lett.

    (2008)
  • N.N. Baghdadi et al.

    Viability statistics of GLAS/ICESat data acquired over tropical forests

    IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens.

    (2013)
  • E. Bendersky

    Depthwise separable convolutions for machine learning

  • J.B. Blair et al.

AfriSAR LVIS L2 geolocated surface elevation product, version 1. Boulder, Colorado, USA: NASA National Snow and Ice Data Center Distributed Active Archive Center.

    (2018)
  • Y. Chen et al.

    Deep learning-based classification of hyperspectral data

    IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens.

    (2014)
  • F. Chollet

    Xception: deep learning with depthwise separable convolutions

  • I. Chrysafis et al.

    Assessing the relationships between growing stock volume and Sentinel-2 imagery in a Mediterranean forest ecosystem

    Remote Sens. Lett.

    (2017)
  • S. Clerc et al.

S2 MPC - Data Quality Report. ESA, reference S2-PDGS-MPC-DQR, issue 36.

  • D. Eigen et al.

    Depth map prediction from a single image using a multi-scale deep network

  • G. Foody et al.

    Classification of tropical forest classes from Landsat TM data

    Int. J. Remote Sens.

    (1996)
  • GEDI Team

GEDI ecosystem LiDAR. NASA/University of Maryland

  • C. Ginzler et al.

Countrywide stereo-image matching for updating digital surface models in the framework of the Swiss national forest inventory

    Remote Sens.

    (2015)
  • K. He et al.

    Deep residual learning for image recognition

  • M. Immitzer et al.

    First experience with Sentinel-2 data for crop and tree species classifications in central Europe

    Remote Sens.

    (2016)