Next Article in Journal
Rainfall-Induced Shallow Landslide Recognition and Transferability Using Object-Based Image Analysis in Brazil
Previous Article in Journal
Comparison and Assessment of Different Land Cover Datasets on the Cropland in Northeast China
Previous Article in Special Issue
The Deep Convolutional Neural Network Role in the Autonomous Navigation of Mobile Robots (SROBO)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Technical Note

Forest Aboveground Biomass Estimation Using Machine Learning Ensembles: Active Learning Strategies for Model Transfer and Field Sampling Reduction

1
Italian Aerospace Research Centre, Via Maiorise snc, 81043 Capua, Italy
2
Department of Electrical Engineering and Information Technology, University of Naples Federico II, Via Claudio 21, 80125 Naples, Italy
3
Latitudo 40 srl, Via Emanuele Gianturco 31C, 80142 Naples, Italy
4
Department of Civil, Chemical and Environmental Engineering, University of Genova, Via Montallegro 1, 16145 Genova, Italy
*
Author to whom correspondence should be addressed.
Remote Sens. 2023, 15(21), 5138; https://doi.org/10.3390/rs15215138
Submission received: 19 July 2023 / Revised: 19 October 2023 / Accepted: 23 October 2023 / Published: 27 October 2023
(This article belongs to the Special Issue Remote Sensing of Anthropic Impact on the Environment)

Abstract

:
Biomass is a crucial indicator of the carbon sequestration capacity of a vegetation ecosystem. Its dynamic is of interest because it impacts on the carbon cycle, which plays an important role in the global climate and its changes. This work presents a novel technique, able to transfer a calibrated regression model between different areas by exploiting an active learning methodology and using Shannon’s entropy as a discriminator for sample selection. Model calibration is performed based on a reference area for which an extended ground truth is available and implemented via regression bootstrap. Then, re-calibration samples for model transfer are selected through active learning, allowing for choosing a limited number of points to be investigated for training data collection. Different sampling strategies and regression techniques have been tested to demonstrate that a significant reduction in the number of calibration samples does not affect the estimation performance. The proposed workflow has been tested on a dataset concerning Finnish forests. Experimental results show that the joint exploitation of regression ensembles and active learning dramatically reduces the amount of field sampling, providing aboveground biomass estimates comparable to those obtained using literature techniques, which need extended training sets to build reliable predictions.

Graphical Abstract

1. Introduction

Forests cover about 40% of the Earth’s ice-free land surface, although, in the recent past, wide forested areas have been converted to agricultural or urban land uses. They constitute the vital core of the whole planet, because they are the sentinels of the carbon cycle and the custodians of its biodiversity [1]. The expected increase in the global population is a further threat, representing a potential trigger for increasing deforestation and overexploitation of resources. The challenge to be faced by both politicians and scientists is the definition of protocols and strategies allowing for the sustainable management of such critical areas [2]. In this context, it is important to support decision-makers with timely information about several biophysical parameters useful to implement adequate planning activities [3]. Among them, the aboveground biomass (AGB) is particularly interesting due to its impact on the carbon cycle and, consequently, on the global climate and its changes [4].
Traditionally, this parameter is estimated through field sampling [5] or the exploitation of remote sensing data [6]. The results of the first methodology are unfeasible for applications at forest scale. Conversely, remote sensing technologies are able to deliver synoptic pictures of large areas with high revisit time. Therefore, as widely demonstrated by the literature, they represent a unique data source to feed methodologies, allowing for AGB estimation at large scale [7,8,9].
To this end, several techniques have been proposed in the literature. Regression-based models are among the most exploited [10,11,12]. They have many advantages, like the possibility to combine heterogeneous data, the low computational complexity and the suitability to deal with sparse sampling, which is the typical case in forest studies [5]. Machine learning methodologies have been exploited to better deal with nonlinearities between measurements and remote sensing data. Previous studies demonstrated the effectiveness of solutions like random forests [13], gradient boost algorithms [14] and support vector machines [15]. However, some concerns have been raised against their parametrization, as tuning can significantly affect the performance of these models [16]. Moreover, most of the studies rely on sparse sampling over large areas for model calibration and validation, thus introducing issues related to generalization [17].
More recently, deep learning solutions are emerging to address AGB estimation problems, as proposed, as an example, in [18,19,20]. In this case, major constraints are related to the sparse sampling typically implemented in forestry applications, as this methodology is usually unsuitable for training neural networks [21]. In the past, SAR scattering models have demonstrated their reliability for this purpose [22,23,24]. However, these solutions require careful parametrization, including the specification of various stand properties to be retrieved through field campaigns [25].
Working at forest scale, the main issue related to the exploitation of algorithms requiring the acquisition of calibration points is probably generalization, as sampling is usually implemented in specific areas [17]. Therefore, a concept of interest is the introduction of methodologies allowing for the transfer of a model from one area to another, sharing similar characteristics [26]. This problem is particularly of interest, as an example, in precision agriculture, since the variations of the AGB are typically higher due to crop cycles and their sensitivity to environmental conditions. In this context, Reference [26] exploited active sampling to select significant samples to transfer a calibrated model for ryegrass AGB and nitrogen content estimation across different seasons. Wan et al. exploited the Kennard–Stone algorithm to select re-calibration samples acquired in different years, to extend the forecasting capabilities of a rice yield model [27]. Reference [28] proposed an active sampling framework to implement multi-year incremental monitoring of grassland biomass and nitrogen content.
Dealing with forests, the problem of the update of an existing model to make it work when applied to other areas, or to the same area in different times, has been scarcely addressed, as the dominant approach is the validation of new techniques fed by variously collected data [4,7,10,13,29]. The most relevant literature on the topic concerns the processing of airborne laser scanning (ALS) data. Reference [30] exploited repeated ALS acquisitions and ground measures to update models predicting aboveground carbon, basal area, stand density index and total stem volume within a study area in northern Idaho (USA). Tompalski et al. demonstrated the feasibility of transferring between different areas of forest attributes derived using ALS data [31]. Reference [32] proposed a methodology to generate new maps of forest structural attributes, integrating ALS samples within pre-fitted models, pointing out that the direct transfer of an available model elsewhere may involve extrapolations or its application to conditions not included in the training dataset.
In this regard, the purpose of this paper is to introduce a strategy to tackle the typical generalization issues found in the past AGB estimation literature. In particular, active learning principles [33] are exploited to build a novel workflow suitable for transferring a calibrated regression model to new areas, through the selection of relevant re-calibration samples. To this end, a novel technique for the selecting of the most informative samples, based on their entropy rank, is introduced. Different from commonly adopted solutions, in which training data are distributed over large areas and used to train a single model, the principal novelty here is the exploitation of several regression models, generated from a reference one, retrained with few samples collected on specific areas. The aim is to extend the forecasting capability of an existing training dataset through the addition of a limited number of samples, made available by new acquisitions, through the implementation of active learning [26,34]. Finally, particular attention is paid to validation. This is one of the weaknesses found in the literature, which tends to assess the performance of the estimation over very large areas using a few tens of plots. In this work, the obtained results will be evaluated against an extended ground truth, accounting for more than 850,000 validation points, derived from the LiDAR processing of data acquired over Finnish forests.
The work is organized as follows. Materials and methods are introduced in Section 2. Experimental results are presented in Section 3 and discussed in Section 4. Conclusions are drawn at the end of the work.

2. Materials and Methods

2.1. Data

In Finland, ALS data are systematically collected for digital terrain mapping and forest parameters estimation [35], including AGB. These data, opportunely integrated with satellite images and historical field sampling, have been exploited by the Finnish Forest Centre to produce the reference AGB data used in this study [36], which analyzes an area of about 85 km2, partitioned in 13 clips of 6.55 km2 extension each. The clips, whose global statistics are reported in Table 1, are quite dishomogeneous in terms of the average AGB amount and its distribution (see detailed data in Table S1, provided as supplementary material). The average AGB ranges between 37.1 t/ha and 81.3 t/ha. Its standard deviation ranges between 22.2 t/ha and 63.2 t/ha. The overall AGB mean and standard deviation are in the order of 69.5 t/ha and 43.1 t/ha. The area is characterized by almost flat topography, with elevation values ranging approximately 100–200 m above sea level, with a lack of significant urban areas and agricultural lands. It belongs predominantly to the middle boreal vegetation zone, with a dominant presence of coniferous [37].
Satellite data, used to predict the AGB, belong to a publicly available dataset (see the data availability statement at the end of the work) composed of several clips. It is presented as a benchmarking tool for AGB studies. Therefore, data are provided without any geographic and/or temporal attributes. For each clip, the dataset includes, beyond AGB data, one year of Sentinel-1 SAR and Sentinel-2 MS observations, made approximately on a monthly basis (see Figure 1). In particular, SAR data have been acquired regularly once per month, due to their insensitivity to atmospheric conditions. MS acquisitions are mostly concentrated during spring and summer in order to cope with massive cloud coverage typically registered in the other periods of the year.

2.2. Methodology

As stated in the Introduction, the purpose of the paper is to introduce a methodology allowing for the transfer of a calibrated partial least squares regression (PLSR) [38] model to different areas sharing similar characteristics. In other words, it is assumed that a model, calibrated using an extended ground truth, is available. The question to be addressed is about how to select relevant samples for fitting this model with new scenes.
To this end, the workflow depicted in Figure 2 is adopted. The inputs are a calibrated regression model and two time series of data, one MS and one SAR, which are exploited to calculate a set of predictors, a total of 159, to be exploited for regression. MS predictors are mostly constituted by vegetation indices. SAR ones are constituted by both intensity-based and texture variables derived from the Haralick co-occurrence matrix [39].
The predictors are calculated for each available acquisition and then averaged on a yearly basis. Following this operation, the processing forks in two branches. The first one is dedicated to the retrieval of the model to be transferred, which is obtained via bootstrap, i.e., a regression calibrated with a random sample selection. Bootstrapping is implemented as follows. It is assumed that the AGB is known in a specific area (Clip 0 in Table S1). Then, among 1000 PLSR experiments calibrated with random sample selection, the one providing the lowest RMSE is selected as reference for transferring.
Model transferring requires that the same predictors are used for each regression. However, the literature highlighted that the large amount of data usually ingested into PLSR can contain irrelevant information that can cause the reduction in the model performance [40]. Therefore, a set of potentially informative predictors is selected through the calculation of the variable importance in prediction (VIP) score [41].
As the VIP score is an output of PLSR, a tentative regression is implemented with this purpose. In other words, this will not be used for AGB estimation. In this phase, the number of latent variables, i.e., the set of orthogonal factors having the best predictive power, is set to 10.
The VIP score is defined, for each variable j, as the sum, over latent variables f, of its PLS-weight value w j weighted by the percentage of explained variance of the specific latent variable S S Y f . For the j-th variable, it holds [41]:
V I P j = J f = 1 F w j f 2 S S Y f F f = 1   F S S Y f ,
where F is the total number of latent variables and J the total number of variables. The relation for the calculation of S S Y f is given by [41]:
S S Y f = b f 2 t f T t f ,
where b f and t f are the PLS inner relation vector of coefficients and the score vector relevant with the f-th latent variable, respectively. It is common practice in the literature to assume that informative variables are associated with a VIP score larger than one. This means that those variables have an above-average influence on the building of the model explaining the observations [42].
According to Figure 2, following the definition of the model to be transferred, active sampling is implemented to select re-calibration data. To this end, the stack composed of all the predictors for the new area is processed with a principal component analysis (PCA) for dimensionality reduction. Then, an ISODATA clustering is implemented to obtain labelled regions. In this case, the maximum number of allowed classes is set to 10.
The classified image provided by ISODATA clustering is treated with connected component labelling [43] to retrieve elementary regions, i.e., the image segments. They are used to calculate the spatial average of the regression variables, which constitute the input for active learning.
As described in [34], active learning techniques can be categorized based on uncertainty [44] or diversity criteria [45]. In the first case, samples are ranked according to their uncertainty. The higher the uncertainty, the better their rank. Among these techniques, those based on variance-based pool of regressors (PAL) are probably the most interesting [46]. Active learning techniques based on diversity criteria, instead, select samples based on the dissimilarities they introduce in the training datasets [34]. Different metrics can be used to assess such dissimilarity like the Euclidean distance [47], the cosine angle distance [48] or the entropy [49], which is the one used in this work according to the definition provided by Shannon [50]:
H = i = 1 N P n log 2 P n ,
where P n is the normalized probability of the n-th histogram quantization level and N is the total number of bins. The entropy gives information about the shape of the histogram and measures the quantity of information carried by a signal. The higher the entropy, i.e., the flatter the histogram, the higher its informative content.
According to Reference [28], the histogram is calculated on the vector constituted by the region-wise average of the predictors. In other words, given the N predictors for the generic orange region provided by the ISODATA clustering depicted in Figure 3, the reference vector for histogram calculation is composed of the N average values of the predictors. The entropy H 1 is calculated according to Equation (3) and stored as metrics for active sampling selection.
The second step of the active sampling process, the vector representative of the green region of Figure 3, is appended to the one previously considered. The entropy H2 of this new vector, constituted by 2N elements is calculated. If the condition H 2 > H 1 is verified, it is retained. Otherwise, the vector corresponding to the green region is discarded. In any case, the entropy value used for the acceptance test is the last one computed. The process continues until all the regions have been tested. Those determining an increase in the entropy against the last available one are appended to the selection and the corresponding regions marked as informative.
In summary, the adopted active sampling procedure can be described as follows:
a.
Consider each retrieved cluster and make the spatial average for each spectral band;
b.
Starting from a randomly selected cluster, calculate its histogram considering all the elements of the average spectral response and its associated entropy H 1 according to Equation (3);
c.
Add another cluster to the dataset by appending its (average) spectral response to the vector constituted by the one previously considered. Calculate the new histogram and its entropy H 2 ;
d.
If H 2 > H 1 mark the cluster as informative and continue the process by adding new clusters to the dataset. Do not delete those marked as not informative. Clusters marked as informative are those to be sampled to retrieve model calibration data.
The average number of samples, i.e., regions, selected by the adopted active sampling methodology is around 301 within areas of about 6.5 km2. In an operational environment, this number can be considered high. Therefore, a simple strategy for reducing the number of samples to be considered for model transfer has been implemented.
The output of the above-discussed procedure is the list of the regions marked as informative. Each of them is associated with an entropy value, calculated by adding the vector representative of the i-th region to the one collecting those already accepted. Moreover, a class ID coming from the ISODATA clustering is available.
Informative regions are then separated, based on the ISODATA class ID, and sorted in descending entropy order. The first k samples of the chart are retained for model calibration. In such a way, the maximum number of calibration samples is fixed to n m a x = k t , where t is the maximum number of allowed classes in ISODATA clustering. In the following experiments, t has been set to 10. Two different settings for k have been tested, i.e., k = 10, yielding n m a x = 100 , and k = 5, yielding n m a x = 50 . In the following, these samples will be referred to as very informative samples (VIS). The adoption of this strategy allowed for the reduction in the areas to be sampled to 73 per clip, in the case of k = 10, and to 36 per clip, in the case of k = 5.
Regressions for test areas are implemented using different strategies. The first one is a PLSR. It is a statistical method, aiming at finding a linear regression model between the observations and the independent variables, by projecting the predicted variables and the observed ones to a new space [38]. This technique is particularly suited when the matrix of predictors has more variables than observations and there is multicollinearity among predictors, as in the case study at hand.
The second regression technique used for AGB predictions Is gradient boosting (GB). It consists of a finite set of weak learners (typically decision trees) that minimize an arbitrary differentiable loss function. A meta-learner assigns weights to each learner and combines their predictive results through voting methods to provide a better predictive performance for regression problems. In particular, the least-squares boosting algorithm has been used [51]. This means that the loss function to be minimized is the mean squared error of the learners. The ensemble of GB and PLSR represents the third regression strategy. In this case, the final result is given by the average of the two predictions.

3. Results

The aggregated results of the proposed AGB estimation methodology for the 12 considered clips are reported in Table 2, together with benchmarking results. The breakdown for each single clip is reported in Table S2 as supplementary material.
Using all the samples selected by the adopted active sampling technique (see Table S2), the obtained RMSE, using PLSR, ranges between 20.8 t/ha and 47.2 t/ha with an average of 28.8 t/ha. GB provided a minimum RMSE of 24.6 t/ha and a maximum one of 48.5 t/ha with an average of 33.7 t/ha. The ensemble resulted in RMSE values ranging between 21.4 t/ha and 44.7 t/ha with an average of 28.8 t/ha.
Reducing the samples to VIS does not significantly affect the estimate (see also Table S2). In the case of k = 10, the RMSE ranges between 21.5 t/ha and 51.6 t/ha with an average of 30.7 t/ha, using PLSR. GB estimates provided RMSE values ranging between 25.1 t/ha and 52.1 t/ha, with an average of 34.2 t/ha. The estimates obtained using the ensemble range between 21.4 t/ha and 46.2 t/ha, with an average of 30.0 t/ha.
When the setting is k = 5 (see Table S2), the obtained RMSE ranges between 21.7 t/ha and 50.3 t/ha, with an average of 31.6 t/ha, using PLSR. GB exploitation provided RMSE values ranging between 24.8 t/ha and 51.4 t/ha with an average of 34.0 t/ha. Using the ensemble, the RMSE ranges between 21.4 t/ha and 49.3 t/ha with an average 30.7 t/ha.
The column named “Inc” reports the results obtained using an incremental model. In this configuration, all the points used to calibrate clips previously analyzed are included within the calibration set for the current regression. In other words, as an example, the calibration set for the third regression includes the data used to train the models used for the first and the second regression. Indeed, the exploitation of more calibration points, referring to different areas, seems to not be beneficial for the estimate, as the average RMSE is higher than the one retrieved using the standard setting in most of the experiments (see Table S2).
Some scatterplots relevant to selected experiments (see Table S2 for clip indexing) are depicted in Figure 4. In particular, the first row of the graphics reports the results of AGB estimation for Clip 0 (Figure 4a) and Clip 4, relative to ensemble regression with the setting k = 5 (Figure 4b) and the neural network output (Figure 4c). The regressions shown in the second row of the graphics concern Clip 7. The one reported in Figure 4d has been obtained using ensemble regression calibrated with all the samples selected by active learning. The plot shown in Figure 4e concerns ensemble regression with k = 5. All the other plots are omitted for brevity, but aggregated results in the plane (R2, RMSE) are shown in Figure 4f, relative to ensemble regression.
The variables used for regression, determined through VIP score thresholding, are all derived from the MS dataset. In particular, the 17 indices reported in Table 3 were informative for model training. Remarkably, none of them are derived from the SAR dataset.
The adopted benchmarking techniques are randomly calibrated PLSR and NN-based estimation. In the first case, a number of samples equal to the one determined via active sampling has been adopted for a fair comparison. Bootstrapping has been implemented to retrieve the minimum RMSE and the average one, for a total of 1000 experiments.
Bootstrapping returned a minimum RMSE (indicated as RMSE* in Table 2) ranging between 21.0 t/ha and 39.4 t/ha (see Table S2). The average of the minimum RMSE is 26.2 t/ha. The column reporting the mean RMSE values ( RMSE ¯ in Table 2), obtained via bootstrap, is useful to aid in pointing out that this method can fail, as not all the experiments can be considered successful whatever the combination of samples is used for model calibration (see Table S2).
NN-based estimation has been implemented using ReUse [17]. This is a UNet network trained to perform a pixel-wise regression task, mapping Sentinel-1 and Sentinel-2 images, concatenated along the spectral axis into a single tensor, into AGB raster. The architecture consists of two pathways. The first is a contracting pathway (the encoder) that captures the image context. The second pathway is a symmetric expanding pathway (the decoder) designed to produce pixel-wise predictions using transposed convolutions. Several skip connections connect the two pathways.
It is worthwhile to remark that the deep learning benchmark was carried out without the usage of training samples belonging to the clips under investigation for AGB estimation. The training phase has been implemented using the same number of clips exploited for validation. As for the bootstrap benchmark, which is typically based on sparse sampling, the training on points belonging to the clips under testing allows the model to learn the characteristics of the specific clip effectively. This approach is unsuitable for deep learning, which is typically trained using patches, here retrieved from clips not considered for AGB estimates.
The network generates a pixel-wise regression map and can extract spatial and spectral features from satellite images using an end-to-end paradigm. In particular, a patch-wise approach is adopted [57]. Each input image and ground truth AGB map is divided into nonoverlapping 32 × 32 pixel patches. The training has been implemented using a ground truth with comparable dimensions of the area to be estimated. It stops when the monitored validation loss stops to improve after 35 epochs. The maximum number of epochs is set to 200. The optimizer used is Adam [58] with default parameters. The learning rate is reduced by a factor of 0.2 if no improvement in validation loss is observed for 25 epochs. The adopted loss function is the mean absolute error. The RMSE values obtained using this technique range between 20.6 t/ha and 43.9 t/ha (see Table S2), with an average of 30.4 t/ha.

4. Discussion

Biomass is a crucial indicator of the carbon sequestration capacity of a vegetation ecosystem [61]. Its dynamic is of interest as it impacts the carbon cycle, which plays an important role in the global climate and its changes. This work introduces a novel framework for AGB estimation based on the transferring of a calibrated regression model between different scenes through active selection re-calibration samples. In this regard, a simple, fully unsupervised technique, exploiting Shannon’s entropy as a discriminator, has been adopted, to select a number of calibration samples compatible with operational environments.
The obtained results testified the reliability of the workflow, which restituted an average RMSE of 28.8 t/ha, in case all the samples selected by the adopted active sampling methodology, on average 46/km2, are exploited. The adoption of the ISODATA-based selection allowed for the reduction in the samples to be collected to about 10/km2 (when the setting was k = 10) and 6/km2 (when the setting was k = 5) with negligible impact on the AGB estimation, whose RMSE increased to 30.0 t/ha when the setting was k = 10 and to 30.7 t/ha when the setting was k = 5. The registered values of the coefficient of determination, R2, as expected, follow the trend of the RMSE, i.e., they tend to increase as the RMSE decreases, as shown in Figure 4f. As a general comment, it is remarkable that the incremental framework, in which all the samples collected for previous estimates are involved in the current regression, did not add value to the estimates.
The aforementioned RMSE values refer to the most performing regression configuration which, interestingly, is provided by the ensemble of PLSR and GB. This configuration outperforms the single regression technique in 18 experiments out of 32. This trend increases as the number of calibration points is reduced. The analysis of the errors reveals that they are strongly correlated with the AGB standard deviation of the clip. The higher the standard deviation, the higher the RMSE. This result is in line with the literature, as reported, as an example, in [62].
As shown in the previous section, the variables selected for regression through VIP scoring belong to the MS dataset. Despite the well-known relation between SAR backscattering and the AGB [8], spectral information in the near-infrared and red edge frequencies, which appear in almost all the indices listed in Table 3, resulted dominant. This is due to the scarce sensitivity of C-band frequencies to AGB within the resolution cell [63]. However, the contribution of SAR features is expected to be more significant using L-band data, as this wavelength typically shows higher correlation with the AGB [8]. This result can be found, as an example, in [62], where the authors highlighted how AGB estimates can benefit from the presence of L-band SAR features among regression variables.
The RMSE values obtained adopting the proposed methodology are comparable with those declared in the literature for the specific application. As an example, Reference [7] claimed an RMSE of 18.9 t/ha on an area having an average AGB of 49.4 t/ha, with a standard deviation of 33.8 t/ha using extreme gradient boost algorithm. Reference [14] reported an RMSE of about 30.0 t/ha in the study of an area with an average AGB of 55.5 t/ha and a standard deviation of 18.2 t/ha, using extreme gradient boosting as well. Vafaei et al. [62] reported an RMSE of 38.7 t/ha, using support vector regression in an area with an average AGB of 206.5 t/ha and a standard deviation of 78.0 t/ha.
All these studies were characterized by the same approach, consisting of sparse sampling of a few plots (149 in Reference [62] and 367 in Reference [7]) distributed over large areas. These data are used in part to train the model and in part to validate it. In other words, the authors used a few samples to calibrate a single model to fit a large area. The validation was performed on a small subset of the ground truth, as the training phase tends to absorb the majority of the collected samples.
The approach proposed here is different. First, the AGB estimation problem is tackled via active learning, which allows for the selection of highly informative calibration samples, thus introducing a further guide for field operations which can be accordingly significantly reduced. It requires extended reference data about a small area, which can be retrieved with a single LiDAR flight, useful to calibrate a machine learning model via bootstrapping, which, as demonstrated in the previous section, is the most effective solution in the presence of extended ground data. This model can be effectively transferred elsewhere through the collection of a few new calibration samples, according to active learning principles. This means that the objective was to demonstrate that a reliable selection of re-calibration samples feeding simple regression models can provide results comparable with methodologies exploiting more sophisticated solutions tuned on the specific case study and/or requiring extended ground data.
In this regard, the proposed active learning technique dramatically reduces the necessity of field sampling, which is the principal limitation in large-scale AGB mapping [5], up to about 6 samples/km2. The concept of active sampling for model transfer also opens the possibility of the exploitation of historical field data in combination with newly acquired ones with the purpose of updating and/or expanding past AGB maps [37]. Differently from the literature, using, as aforementioned, a few plots to retrieve estimation quality metrics, the validation has been performed on an extended ground truth for a total of more than 850,000 validation points.
The reliability of the proposed methodology has been tested through comparison against some of the most popular techniques for AGB estimation, like bootstrap and NN-based estimation. The benchmark revealed that random sampling is not a viable option if an extended ground truth is not available. Using this methodology, in most cases, the average RMSEs are much higher than the typical ones indicated in the literature. On the other hand, in the presence of extended ground data, it can be considered as an option, as the provided minimum RMSEs constitute, in five cases out of seven, the best overall estimation performance (see Table S2).
NNs are popular tools for AGB estimation allowing for reliable estimates, provided that an extended ground truth is available for model training [18,64]. Dealing with AGB estimates, this represents a significant limitation, since reference data are usually retrieved via sparse plot sampling. The obtained RMSE values, using ReUse, testify that NNs are suitable for model transfer, i.e., it is possible to train the net using reference data acquired on a test area and then use the weights for making inference elsewhere. However, the amount of reference data should be comparable with the extension of the area to be predicted.
The performed experiments revealed that the average RMSE values obtained using ReUse are comparable with those provided by the proposed framework and by the best bootstrap performance. This means that the choice of the best solution for the specific AGB estimation problem to be solved can be a function of the available data and computational power. In other words, the usage of machine learning ensembles can be a viable solution to cope with the sparsity of reference data. On the other hand, when extended ground truths can be collected, NNs and bootstrapping, or even a combination of the two, could represent an option, as they can slightly reduce the estimation error.

5. Conclusions

Forests cover about 40% of the Earth’s ice-free land surface and constitute the vital core of the whole planet since they represent the sentinels of the carbon cycle. Monitoring their biomass is key, as this parameter is the principal indicator of the carbon sequestration capacity of a vegetation ecosystem.
Remote sensing literature about forest aboveground biomass estimation is principally oriented towards machine learning, due to its ability to deal with sparse samples acquired during field campaigns. This also allows researchers to cope with large-scale problems, as samples are tendentially distributed all over the area to be investigated.
In this work, the problem of aboveground estimation is tackled using active learning principles, allowing for the transfer of a calibrated regression model through a data-driven selection of the most informative re-calibration samples. In this regard, a technique exploiting Shannon’s entropy as a diversity criterion has been discussed, with a particular focus on how it allows for the dramatic reduction in the number of samples to be collected, to a few samples per square kilometer, which is a quantity suitable for operational environments. The proposed framework requires one reference area, for which an extended ground truth should be available, to calibrate the model to be transferred to the rest of the forest, assuming that its structure does not exhibit significant variations. Reference AGB data can be obtained, as an example, with a LiDAR acquisition.
The estimates provided by the proposed methodology have been compared against techniques set out in the literature that exploit massive reference data for model training, including bootstrapping and neural networks. This investigation revealed that the exploitation of machine learning ensembles, in combination with active learning, provides results, measured in terms of the root mean square errors, fully comparable with the literature. This suggests that the proposed methodology is a viable solution to cope with the sparsity of reference data. On the other hand, in case an extended ground truth is available, both neural networks and bootstrapping represent feasible solutions as they allow for a slight reduction in the estimation error.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/rs15215138/s1, Table S1: AGB statistics for the clips considered in this study; Table S2: AGB estimation results. Bold black values indicate the best performance for each clip. The letter F stands for failed estimate, i.e., having RMSE higher than three times the AGB standard deviation of the clip. The column “Samples” refers to the different active sampling settings tested. The column “Inc” refers to the incremental sampling setting.

Author Contributions

Conceptualization, D.A.; methodology, D.A.; software, D.A., A.E.P., G.G., S.M. and M.R.; validation, D.A. and A.E.P.; formal analysis, D.A. and A.E.P.; investigation, D.A. and A.E.P.; resources, D.A., A.E.P., M.R. and G.G; data curation, D.A., A.E.P., M.R. and G.G; writing—original draft preparation, D.A. and A.E.P.; writing—review and editing, D.A., A.E.P., G.G., M.R., C.S. and S.M.; supervision, D.A. and C.S. All authors have read and agreed to the published version of the manuscript.

Funding

The APC was funded by the Italian Aerospace Research Centre.

Data Availability Statement

The data used in this work are a subset of the full dataset available at https://huggingface.co/datasets/nascetti-a/BioMassters (accessed on 18 July 2023). Further information about data can be found at https://neurips.cc/virtual/2023/poster/73499 (accessed on 18 July 2023).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mitchard, E.T.A. The tropical forest carbon cycle and climate change. Nature 2018, 559, 527–534. [Google Scholar] [CrossRef]
  2. Franklin, S.E. Remote Sensing for Sustainable Forest Management; CRC Press: Boca Raton, FL, USA, 2001. [Google Scholar]
  3. Iverson, L.R.; Graham, R.L.; Cook, E.A. Applications of satellite remote sensing to forested ecosystems. Landsc. Ecol. 1989, 3, 131–143. [Google Scholar] [CrossRef]
  4. Santoro, M.; Cartus, O.; Fransson, J.E.S.; Wegmüller, U. Complementarity of X-, C-, and L-band SAR Backscatter Observations to Retrieve Forest Stem Volume in Boreal Forest. Remote Sens. 2019, 11, 1563. [Google Scholar] [CrossRef]
  5. West, P.W. Tree and Forest Measurement; Springer International Publishing: Cham, Switzerland, 2015. [Google Scholar] [CrossRef]
  6. Lu, D. The potential and challenge of remote sensing-based biomass estimation. Int. J. Remote Sens. 2006, 27, 1297–1328. [Google Scholar] [CrossRef]
  7. Li, Y.; Li, M.; Li, C.; Liu, Z. Forest aboveground biomass estimation using Landsat 8 and Sentinel-1A data with machine learning algorithms. Sci. Rep. 2020, 10, 9952. [Google Scholar] [CrossRef]
  8. Amitrano, D.; Di Martino, G.; Guida, R.; Iervolino, P.; Iodice, A.; Papa, M.N.; Riccio, D.; Ruello, G. Earth environmental monitoring using multi-temporal synthetic aperture radar: A critical review of selected applications. Remote Sens. 2021, 13, 604. [Google Scholar] [CrossRef]
  9. Mitchard, E.T.A.; Saatchi, S.S.; Woodhouse, I.H.; Nangendo, G.; Ribeiro, N.S.; Williams, M.; Ryan, C.M.; Lewis, S.L.; Feldpausch, T.R.; Meir, P. Using satellite radar backscatter to predict above-ground woody biomass: A consistent relationship across four different African landscapes. Geophys. Res. Lett. 2009, 36, L23401. [Google Scholar] [CrossRef]
  10. Lu, D.; Chen, Q.; Wang, G.; Liu, L.; Li, G.; Moran, E. A survey of remote sensing-based aboveground biomass estimation methods in forest ecosystems. Int. J. Digit. Earth 2016, 9, 63–105. [Google Scholar] [CrossRef]
  11. Skowronski, N.S.; Clark, K.L.; Gallagher, M.; Birdsey, R.A.; Hom, J.L. Airborne laser scanner-assisted estimation of aboveground biomass change in a temperate oak–pine forest. Remote Sens. Environ. 2014, 151, 166–174. [Google Scholar] [CrossRef]
  12. Næsset, E.; Gobakken, T.; Bollandsås, O.M.; Gregoire, T.G.; Nelson, R.; Ståhl, G. Comparison of precision of biomass estimates in regional field sample surveys and airborne LiDAR-assisted surveys in Hedmark County, Norway. Remote. Sens. Environ. 2013, 130, 108–120. [Google Scholar] [CrossRef]
  13. Li, Z.; Bi, S.; Hao, S.; Cui, Y. Aboveground biomass estimation in forests with random forest and Monte Carlo-based uncertainty analysis. Ecol. Indic. 2022, 142, 109246. [Google Scholar] [CrossRef]
  14. Li, Y.; Li, M.; Wang, Y. Forest Aboveground Biomass Estimation and Response to Climate Change Based on Remote Sensing Data. Sustainability 2022, 14, 14222. [Google Scholar] [CrossRef]
  15. Monnet, J.-M.; Chanussot, J.; Berger, F. Support Vector Regression for the Estimation of Forest Stand Parameters Using Airborne Laser Scanning. IEEE Geosci. Remote Sens. Lett. 2011, 8, 580–584. [Google Scholar] [CrossRef]
  16. Probst, P.; Boulesteix, A.-L. To Tune or Not to Tune the Number of Trees in Random Forest. J. Mach. Learn. Res. 2018, 18, 1–18. [Google Scholar]
  17. Morais, T.G.; Teixeira, R.F.; Figueiredo, M.; Domingos, T. The use of machine learning methods to estimate aboveground biomass of grasslands: A review. Ecol. Indic. 2021, 130, 108081. [Google Scholar] [CrossRef]
  18. Pascarella, A.E.; Giacco, G.; Rigiroli, M.; Marrone, S.; Sansone, C. ReUse: REgressive Unet for Carbon Storage and Above-Ground Biomass Estimation. J. Imaging 2023, 9, 61. [Google Scholar] [CrossRef]
  19. Narine, L.L.; Popescu, S.C.; Malambo, L. Synergy of ICESat-2 and Landsat for Mapping Forest Aboveground Biomass with Deep Learning. Remote Sens. 2019, 11, 1503. [Google Scholar] [CrossRef]
  20. Zhang, L.; Shao, Z.; Liu, J.; Cheng, Q. Deep Learning Based Retrieval of Forest Aboveground Biomass from Combined LiDAR and Landsat 8 Data. Remote Sens. 2019, 11, 1459. [Google Scholar] [CrossRef]
  21. Hamedianfar, A.; Mohamedou, C.; Kangas, A.; Vauhkonen, J. Deep learning for forest inventory and planning: A critical review on the remote sensing approaches so far and prospects for further applications. For. Int. J. For. Res. 2022, 95, 451–465. [Google Scholar] [CrossRef]
  22. Ningthoujam, R.K.; Balzter, H.; Tansey, K.; Feldpausch, T.R.; Mitchard, E.T.A.; Wani, A.A.; Joshi, P.K. Relationships of S-band radar backscatter and forest aboveground biomass in different forest types. Remote Sens. 2017, 9, 1116. [Google Scholar] [CrossRef]
  23. Ningthoujam, R.K.; Balzter, H.; Tansey, K.; Morrison, K.; Johnson, S.C.; Gerard, F.; George, C.; Malhi, Y.; Burbidge, G.; Doody, S.; et al. Airborne S-band SAR for forest biophysical retrieval in temperate mixed forests of the UK. Remote Sens. 2016, 8, 609. [Google Scholar] [CrossRef]
  24. Mermoz, S.; Réjou-Méchain, M.; Villard, L.; Le Toan, T.; Rossi, V.; Gourlet-Fleury, S. Decrease of L-band SAR backscatter with biomass of dense forests. Remote Sens. Environ. 2015, 159, 307–317. [Google Scholar] [CrossRef]
  25. Ulaby, F.T.; Sarabandi, K.; McDonald, K.Y.L.E.; Whitt, M.; Dobson, M.C. Michigan microwave canopy scattering model. Int. J. Remote Sens. 1990, 11, 1223–1253. [Google Scholar] [CrossRef]
  26. Franceschini, M.H.D.; Becker, R.; Wichern, F.; Kooistra, L. Quantification of Grassland Biomass and Nitrogen Content through UAV Hyperspectral Imagery—Active Sample Selection for Model Transfer. Drones 2022, 6, 73. [Google Scholar] [CrossRef]
  27. Wan, L.; Cen, H.; Zhu, J.; Zhang, J.; Zhu, Y.; Sun, D.; Du, X.; Zhai, L.; Weng, H.; Li, Y.; et al. Grain yield prediction of rice using multi-temporal UAV-based RGB and multispectral images and model transfer—A case study of small farmlands in the South of China. Agric. For. Meteorol. 2020, 291, 108096. [Google Scholar] [CrossRef]
  28. Amitrano, D.; Cicala, L.; De Mizio, M.; Tufano, F. UAV Hyperspectral Characterization of Vegetation Using Entropy-Based Active Sampling for Partial Least Square Regression Models. Appl. Sci. 2023, 13, 4812. [Google Scholar] [CrossRef]
  29. Otgonbayar, M.; Atzberger, C.; Chambers, J.; Damdinsuren, A. Mapping pasture biomass in Mongolia using Partial Least Squares, Random Forest regression and Landsat 8 imagery. Int. J. Remote Sens. 2018, 40, 3204–3226. [Google Scholar] [CrossRef]
  30. Fekety, P.A.; Falkowski, M.J.; Hudak, A.T. Temporal transferability of LiDAR-based imputation of forest inventory attributes. Can. J. For. Res. 2015, 45, 422–435. [Google Scholar] [CrossRef]
  31. Tompalski, P.; White, J.C.; Coops, N.C.; Wulder, M.A. Demonstrating the transferability of forest inventory attribute models derived using airborne laser scanning data. Remote Sens. Environ. 2019, 227, 110–124. [Google Scholar] [CrossRef]
  32. Mauro, F.; Hudak, A.T.; Fekety, P.A.; Frank, B.; Temesgen, H.; Bell, D.M.; Gregory, M.J.; McCarley, T.R. Regional modeling of forest fuels and structural attributes using airborne laser scanning data in Oregon. Remote Sens. 2021, 13, 261. [Google Scholar] [CrossRef]
  33. DTuia, D.; Persello, C.; Bruzzone, L. Domain Adaptation for the Classification of Remote Sensing Data: An Overview of Recent Advances. IEEE Geosci. Remote Sens. Mag. 2016, 4, 41–57. [Google Scholar] [CrossRef]
  34. Berger, K.; Caicedo, J.P.R.; Martino, L.; Wocher, M.; Hank, T.; Verrelst, J. A survey of active learning for quantifying vegetation traits from terrestrial earth observation data. Remote Sens. 2021, 13, 287. [Google Scholar] [CrossRef]
  35. Maltamo, M.; Kinnunen, H.; Kangas, A.; Korhonen, L. Predicting stand age in managed forests using National Forest Inventory field data and airborne laser scanning. For. Ecosyst. 2020, 7, 44. [Google Scholar] [CrossRef]
  36. Stenman, V. Finnish Forest Data-Based Metsään.fi-services. In Big Data in Bioeconomy; Springer International Publishing: Berlin/Heidelberg, Germany, 2021; pp. 309–319. [Google Scholar] [CrossRef]
  37. Tuominen, S.; Pitkänen, T.; Balazs, A.; Kangas, A. Improving finnish multi-source national forest inventory by 3D aerial imaging. Silva Fenn. 2017, 51, 7743. [Google Scholar] [CrossRef]
  38. De Jong, S. SIMPLS: An alternative approach to partial least squares regression. Chemom. Intell. Lab. Syst. 1993, 18, 251–263. [Google Scholar] [CrossRef]
  39. Haralick, R.M.; Shanmugam, K.; Dinstein, I.H. Textural Features for Image Classification. IEEE Trans. Syst. Man Cybern. 1973, SMC-3, 610–621. [Google Scholar] [CrossRef]
  40. Jin, J.; Wang, Q. Evaluation of informative bands used in different PLS regressions for estimating leaf biochemical contents from hyperspectral reflectance. Remote Sens. 2019, 11, 197. [Google Scholar] [CrossRef]
  41. Farrés, M.; Platikanov, S.; Tsakovski, S.; Tauler, R. Comparison of the variable importance in projection (VIP) and of the selectivity ratio (SR) methods for variable selection and interpretation. J. Chemom. 2015, 29, 528–536. [Google Scholar] [CrossRef]
  42. Cocchi, M.; Biancolillo, A.; Marini, F. Chapter Ten—Chemometric Methods for Classification and Feature Selection. In Comprehensive Analytical Chemistry; Elsevier: Amsterdam, The Netherlands, 2018; Volume 82, pp. 265–299. [Google Scholar] [CrossRef]
  43. Shapiro, L.; Stockman, G. Computer Vision; Prentice Hall: Upper Saddle River, NJ, USA, 2002. [Google Scholar]
  44. He, T.; Zhang, S.; Xin, J.; Zhao, P.; Wu, J.; Xian, X.; Li, C.; Cui, Z. An Active Learning Approach with Uncertainty, Representativeness, and Diversity. Sci. World J. 2014, 2014, 827586. [Google Scholar] [CrossRef]
  45. Lu, X.; Zhang, J.; Li, T.; Zhang, Y. Incorporating Diversity into Self-Learning for Synergetic Classification of Hyperspectral and Panchromatic Images. Remote Sens. 2016, 8, 804. [Google Scholar] [CrossRef]
  46. Douak, F.; Melgani, F.; Benoudjit, N. Kernel ridge regression with active learning for wind speed prediction. Appl. Energy 2013, 103, 328–340. [Google Scholar] [CrossRef]
  47. Yuan, B.; Wu, Z.; Zhang, K.; Li, D.; Ma, Q. Application of Active Learning in Carbonate Lithologic Identification. In Proceedings of the 4th International Conference on Artificial Intelligence and Big Data, Chengdu, China, 28–31 May 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 404–408. [Google Scholar] [CrossRef]
  48. Demir, B.; Persello, C.; Bruzzone, L. Batch-Mode Active-Learning Methods for the Interactive Classification of Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2010, 49, 1014–1031. [Google Scholar] [CrossRef]
  49. Patra, S.; Bruzzone, L. A Fast Cluster-Assumption Based Active-Learning Technique for Classification of Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2010, 49, 1617–1626. [Google Scholar] [CrossRef]
  50. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 623–656. [Google Scholar] [CrossRef]
  51. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning; Springer: New York, NY, USA, 2009. [Google Scholar] [CrossRef]
  52. Karnieli, A.; Kaufman, Y.J.; Remer, L.; Wald, A. AFRI—aerosol free vegetation index. Remote Sens. Environ. 2001, 77, 10–21. [Google Scholar] [CrossRef]
  53. Bannari, A.; Morin, D.; Bonn, F.; Huete, A.R. A review of vegetation indices. Remote Sens. Rev. 1995, 13, 95–120. [Google Scholar] [CrossRef]
  54. Main, R.; Cho, M.A.; Mathieu, R.; O’Kennedy, M.M.; Ramoelo, A.; Koch, S. An investigation into robust spectral indices for leaf chlorophyll estimation. ISPRS J. Photogramm. Remote Sens. 2011, 66, 751–761. [Google Scholar] [CrossRef]
  55. Le Maire, G.; François, C.; Dufrêne, E. Towards universal broad leaf chlorophyll indices using PROSPECT simulated database and hyperspectral reflectance measurements. Remote Sens. Environ. 2004, 89, 1–28. [Google Scholar] [CrossRef]
  56. Tucker, C.; Elgin, J.; McMurtrey, J.; Fan, C. Monitoring corn and soybean crop development with hand-held radiometer spectral data. Remote Sens. Environ. 1979, 8, 237–248. [Google Scholar] [CrossRef]
  57. Haboudane, D.; Miller, J.R.; Pattey, E.; Zarco-Tejada, P.J.; Strachan, I.B. Hyperspectral vegetation indices and novel algorithms for predicting green LAI of crop canopies: Modeling and validation in the context of precision agriculture. Remote Sens. Environ. 2004, 90, 337–352. [Google Scholar] [CrossRef]
  58. Pu, R.; Gong, P.; Yu, Q. Comparative Analysis of EO-1 ALI and Hyperion, and Landsat ETM+ Data for Mapping Forest Crown Closure and Leaf Area Index. Sensors 2008, 8, 3744–3766. [Google Scholar] [CrossRef] [PubMed]
  59. Clevers, J.G.P.W.; De Jong, S.M.; Epema, G.F.; Van Der Meer, F.D.; Bakker, W.H.; Skidmore, A.K.; Scholte, K.H. Derivation of the red edge index using the MERIS standard band setting. Int. J. Remote Sens. 2002, 23, 3169–3184. [Google Scholar] [CrossRef]
  60. Shibayama, M.; Salli, A.; Häme, T.; Iso-Iivari, L.; Heino, S.; Alanen, M.; Morinaga, S.; Inoue, Y.; Akiyama, T. Detecting Phenophases of Subarctic Shrub Canopies by Using Automated Reflectance Measurements. Remote Sens. Environ. 1999, 67, 160–180. [Google Scholar] [CrossRef]
  61. Wang, Y.; Pyörälä, J.; Liang, X.; Lehtomäki, M.; Kukko, A.; Yu, X.; Kaartinen, H.; Hyyppä, J. In situ biomass estimation at tree and plot levels: What did data record and what did algorithms derive from terrestrial and aerial point clouds in boreal forest. Remote Sens. Environ. 2019, 232, 111309. [Google Scholar] [CrossRef]
  62. Vafaei, S.; Soosani, J.; Adeli, K.; Fadaei, H.; Naghavi, H.; Pham, T.D.; Bui, D.T. Improving accuracy estimation of Forest Aboveground Biomass based on incorporation of ALOS-2 PALSAR-2 and Sentinel-2A imagery and machine learning: A case study of the Hyrcanian forest area (Iran). Remote Sens. 2018, 10, 172. [Google Scholar] [CrossRef]
  63. Santi, E.; Paloscia, S.; Pettinato, S.; Fontanelli, G.; Mura, M.; Zolli, C.; Maselli, F.; Chiesi, M.; Bottai, L.; Chirici, G. The potential of multifrequency SAR images for estimating forest biomass in Mediterranean areas. Remote Sens. Environ. 2017, 200, 63–73. [Google Scholar] [CrossRef]
  64. Dong, L.; Du, H.; Han, N.; Li, X.; Zhu, D.; Mao, F.; Zhang, M.; Zheng, J.; Liu, H.; Huang, Z.; et al. Application of Convolutional Neural Network on Lei Bamboo Above-Ground-Biomass (AGB) Estimation Using Worldview-2. Remote Sens. 2020, 12, 958. [Google Scholar] [CrossRef]
Figure 1. Composition of the exploited dataset for some sample clips. From left to right, S1 SAR image, natural color S2 image and AGB map. The size of each clip is 256 × 256 pixels. The pixel spacing of the maps is 10 m.
Figure 1. Composition of the exploited dataset for some sample clips. From left to right, S1 SAR image, natural color S2 image and AGB map. The size of each clip is 256 × 256 pixels. The pixel spacing of the maps is 10 m.
Remotesensing 15 05138 g001
Figure 2. Proposed workflow for AGB model transfer. Colored blocks identify input/output data.
Figure 2. Proposed workflow for AGB model transfer. Colored blocks identify input/output data.
Remotesensing 15 05138 g002
Figure 3. Schematic representation of histogram calculations for active sampling.
Figure 3. Schematic representation of histogram calculations for active sampling.
Remotesensing 15 05138 g003
Figure 4. Some scatterplots relevant to the performed experiments. (a) Clip 0, bootstrap (R2 = 0.798, RMSE = 9.98 t/ha), (b) Clip 4, ensemble regression k = 5 (R2 = 0.589, RMSE = 27.6 t/ha), (c) Clip 4 neural network (R2 = 0.731, RMSE = 27.6 t/ha). (d) Clip 7, ensemble regression, all samples (R2 = 0.515, RMSE = 29.1 t/ha), (e) Clip 7 ensemble regression, k = 5 (R2 = 0.423, RMSE = 31.8 t/ha), (f) ensemble regression results in the plane (R2, RMSE) for the different sampling settings. Overall means reported with solid markers.
Figure 4. Some scatterplots relevant to the performed experiments. (a) Clip 0, bootstrap (R2 = 0.798, RMSE = 9.98 t/ha), (b) Clip 4, ensemble regression k = 5 (R2 = 0.589, RMSE = 27.6 t/ha), (c) Clip 4 neural network (R2 = 0.731, RMSE = 27.6 t/ha). (d) Clip 7, ensemble regression, all samples (R2 = 0.515, RMSE = 29.1 t/ha), (e) Clip 7 ensemble regression, k = 5 (R2 = 0.423, RMSE = 31.8 t/ha), (f) ensemble regression results in the plane (R2, RMSE) for the different sampling settings. Overall means reported with solid markers.
Remotesensing 15 05138 g004
Table 1. AGB statistics for the area considered in this study.
Table 1. AGB statistics for the area considered in this study.
Min AGB Clip (t/ha)Max AGB Clip (t/ha)Mean AGB (t/ha)Std AGB (t/ha)Total Area (km2)
22.181.369.543.185.1
Table 2. Aggregated AGB estimation results for the different tested sampling settings and benchmarking techniques. The column “Inc” refers to the incremental sampling setting.
Table 2. Aggregated AGB estimation results for the different tested sampling settings and benchmarking techniques. The column “Inc” refers to the incremental sampling setting.
ProposedBenchmark
PLSRGBEnsembleBootstrapNN
SamplesRMSERMSERMSERMSE* R M S E ¯
AreaIncAreaIncAreaInc
All28.831.333.736.328.830.626.246.830.4
k = 1030.732.134.236.730.031.527.747.7
k = 531.632.234.037.930.732.328.549.7
RMSE*: minimum bootstrap RMSE; RMSE ¯ : average bootstrap RMSE.
Table 3. VIP variables extracted from the regression of the reference area.
Table 3. VIP variables extracted from the regression of the reference area.
NameFormulaRef
Band 2 R 492
Band 5 R 704
Band 8 R 832
Aerosol free vegetation index AFRI 1600 = R 830 0.66 R 1613 R 832 + 0.66 R 1613 [52]
Aerosol free vegetation index AFRI 2100 = R 832 0.5 R 2202 R 832 + 0.5 R 2202 [52]
Ashburn vegetation index AVI = 2 R 832 R 560 [53]
Chlorophyll absorption ratio index CARI = R 704 R 664 670 a + R 664 + b 2 a 2 + 1 0.5 [54]
Difference 800/550 D 800 / 550   = R 832 R 560 [55]
Green difference vegetation index GDVI = R 864 R 560 [56]
Differenced vegetation index MSS DVIMSS = 2.4 R 832 R 664 [53]
Global environment monitoring index GEMI = n 1 0.25 n R 664 0.125 1 R 664 [53]
Misra soil brightness index MSBI = 0.406 R 492 + 0.600 R 560 + 0.645 R 704 + 0.243 R 832 [53]
Misra yellow vegetation index MYVI = 0.723 R 492 + 0.597 R 560 + 0.206 R 704 + 0.278 R 832 [53]
Modified chlorophyll absorption in reflectance index MCARI = R 704 R 832 0.2 R 704 R 560 R 704 R 664 [57]
Nonlinear vegetation index NLI = R 782 2 R 664 [58]
Reflectance at the inflexion point RRE = R 664 + R 782 2 [59]
Simple ratio 833/1649 SR 800 / 1649 = R 832 R 1613 [60]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Amitrano, D.; Giacco, G.; Marrone, S.; Pascarella, A.E.; Rigiroli, M.; Sansone, C. Forest Aboveground Biomass Estimation Using Machine Learning Ensembles: Active Learning Strategies for Model Transfer and Field Sampling Reduction. Remote Sens. 2023, 15, 5138. https://doi.org/10.3390/rs15215138

AMA Style

Amitrano D, Giacco G, Marrone S, Pascarella AE, Rigiroli M, Sansone C. Forest Aboveground Biomass Estimation Using Machine Learning Ensembles: Active Learning Strategies for Model Transfer and Field Sampling Reduction. Remote Sensing. 2023; 15(21):5138. https://doi.org/10.3390/rs15215138

Chicago/Turabian Style

Amitrano, Donato, Giovanni Giacco, Stefano Marrone, Antonio Elia Pascarella, Mattia Rigiroli, and Carlo Sansone. 2023. "Forest Aboveground Biomass Estimation Using Machine Learning Ensembles: Active Learning Strategies for Model Transfer and Field Sampling Reduction" Remote Sensing 15, no. 21: 5138. https://doi.org/10.3390/rs15215138

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop