Importance of sample size, data type and prediction method for remote sensing-based estimations of aboveground forest biomass

Fassnacht, FE; Hartig F.; Latifi, H; Berger C.; Hernández J.; Corvalan, P.; Koch, B

Keywords: anova, lidar, hyperspectral, sample size, biomass estimation, Prediction method, EO1-Hyperion, HyMap

Abstract

Estimates of forest biomass are needed for various technical and scientific applications, ranging from carbon and bioenergy policies to sustainable forest management. As local measurements are costly, there is a great interest in obtaining reliable estimates over large areas from remote sensing data. Currently, such estimates are obtained with a variety of data sources, statistical methods and prediction standards, and there is no agreement on what are best practices for this task. To improve our understanding of how these different methods affect prediction quality, we first conducted a systematic review of the available literature to identify the most common sensor types and prediction methods. Based on the review, we identified sample size of the reference points on the ground, prediction method (step-wise linear regression, support vector machines, random forest, Gaussian processes and k-nearest neighbor), and sensor type as the main differences that could potentially affect predictive quality. We then compared those factors in two case study areas in Germany and Chile, for which airborne discrete return Light Detection And Ranging (LiDAR) and airborne hyperspectral as well as airborne discrete return LiDAR and spaceborne hyperspectral data were available. For each factor combination, we calculated Pearson's coefficient of correlation between observations and predictions (r(2)) and root mean squared error (RMSE) for bootstrapped estimates using k-fold cross-validation with a varying number of folds. Finally, Analysis of Variance (ANOVA) was used to quantify the influence of the factors on the predictive error of the biomass models. Our results confirm previous findings that predictor data (sensor) type is the most important factor for the accuracy of biomass estimates, with LiDAR being preferable to hyperspectral data. In contrast to some previous studies, complementing LiDAR with hyperspectral data did not improve predictive accuracy. Also the prediction method had a substantial effect on accuracy and was generally more important than the sample size. In most cases, random forest performed best and stepwise linear models worst, judging from r(2) and RMSE under cross-validation. Additional results suggested that r(2) may deliver unrealistically large values when the hold-out sample during the cross-validation is too small. In conclusion, our literature review revealed that different methods for biomass estimation are currently used, with no general agreement on best practices. In our case studies, we found substantial accuracy differences between those methods, with LiDAR data, in combination with a random forest algorithm and a large number of reference sample units on the ground yielding the lowest error for biomass predictions. The comparatively high importance of the statistical prediction method seems particularly relevant, as they suggest that choosing the appropriate statistical method may be more effective than obtaining additional field data for obtaining good biomass estimates. Considering the costs of improving accuracy of global and regional biomass estimates by ground measurements, it seems sensible to invest in further comparative studies, preferably with a wider range of sites and including also RADAR sensors, to establish robust best-practice recommendations for obtaining regional and global biomass estimates from remote-sensing data. (C) 2014 Elsevier Inc. All rights reserved.

Más información

Título según WOS: Importance of sample size, data type and prediction method for remote sensing-based estimations of aboveground forest biomass
Título según SCOPUS: Importance of sample size, data type and prediction method for remote sensing-based estimations of aboveground forest biomass
Título de la Revista: REMOTE SENSING OF ENVIRONMENT
Volumen: 154
Número: 1
Editorial: Elsevier Science Inc.
Fecha de publicación: 2014
Página de inicio: 102
Página final: 114
Idioma: English
DOI:

10.1016/j.rse.2014.07.028

Notas: ISI, SCOPUS