Next Article in Journal
Detecting Dead Standing Eucalypt Trees from Voxelised Full-Waveform Lidar Using Multi-Scale 3D-Windows for Tackling Height and Size Variations
Next Article in Special Issue
Spatially Explicit Analysis of Trade-Offs and Synergies among Multiple Ecosystem Services in Shaanxi Valley Basins
Previous Article in Journal
Forest Landscape Heterogeneity Increases Shrub Diversity at the Expense of Tree Seedling Diversity in Temperate Mixedwood Forests
Previous Article in Special Issue
Estimating Urban Vegetation Biomass from Sentinel-2A Image Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimation of Forest Biomass in Beijing (China) Using Multisource Remote Sensing and Forest Inventory Data

1
Precision Forestry Key Laboratory of Beijing, Beijing Forestry University, Beijing 100083, China
2
College of Natural Resources and Environment, Northwest A&F University, Yangling 712100, Shanxi, China
*
Author to whom correspondence should be addressed.
Forests 2020, 11(2), 163; https://doi.org/10.3390/f11020163
Submission received: 2 December 2019 / Revised: 20 January 2020 / Accepted: 28 January 2020 / Published: 31 January 2020

Abstract

:
Forest biomass reflects the material cycle of forest ecosystems and is an important index to measure changes in forest structure and function. The accurate estimation of forest biomass is the research basis for measuring carbon storage in forest systems, and it is important to better understand the carbon cycle and improve the efficiency of forest policy and management activities. In this study, to achieve an accurate estimation of meso-scale (regional) forest biomass, we used Ninth Beijing Forest Inventory data (FID), Landsat 8 OLI Image data and ALOS-2 PALSAR-2 data to establish different forest types (coniferous forest, mixed forest, and broadleaf forest) of biomass models in Beijing. We assessed the potential of forest inventory, optical (Landsat 8 OLI) and radar (ALOS-2 PALSAR-2) data in estimating and mapping forest biomass. From these data, a wide range of parameters related to forest structure were obtained. Random forest (RF) models were established using these parameters and compared with traditional multiple linear regression (MLR) models. Forest biomass in Beijing was then estimated. The results showed the following: (1) forest inventory data combined with multisource remote sensing data can better fit forest biomass than forest inventory data alone. Among the three forest types, mixed forest has the best fitting model. Forest inventory variables and multisource remote sensing variables can match each other in time and space, capturing almost all spatial variability. (2) The 2016 forest biomass density in Beijing was estimated to be 52.26 Mg ha−1 and ranged from 19.1381–195.66 Mg ha−1. The areas with high biomass were mainly distributed in the north and southwest of Beijing, while the areas with low biomass were mainly distributed in the southeast and central areas of Beijing. (3) The estimates from the RF model are better than those from the MLR model, showing a high R 2 and a low root mean square error (RMSE). The R 2 values of the MLR models of three forest types were greater than 0.5, and RMSEs were less than 15.5 Mg ha−1, The R 2 values of the RF models were higher than 0.6, and the RMSEs were lower than 13.5 Mg ha−1. We conclude that the methods in this paper can help improve the accurate estimation of regional biomass and provide a basis for the planning of relevant forestry decision-making departments.

1. Introduction

Forest ecosystems are an important component of the terrestrial ecosystem. Forests store 76%~98% of the organic carbon in terrestrial ecosystems [1] and play an irreplaceable role in mitigating global warming caused by the increase in atmospheric carbon dioxide [2]. Forest biomass reflects the material cycle of forest ecosystems and is an important indicator for measuring changes in forest structure and function. Additionally, forest biomass is closely related to the carbon sources and sinks in forest ecosystems [3]. Because the monitoring of forest biomass resources is expensive and time consuming, most countries do not have effective monitoring systems. Therefore, accurate estimations of forest biomass can effectively replace forest monitoring systems and are an important basis for assessing ecosystem processes, the carbon balance of ecosystems and climate change [4].
Meso-scale forest biomass estimations are usually obtained from forest inventory data [5]. In many countries, the use of large-scale forest inventories is considered an effective method for estimating biomass accurately [6]. China conducts a large-scale forest resource survey every five years to provide good data for statistical forest resources. Using these inventory data, forest biomass can be estimated at provincial or national scales [7]. However, with the continuous change in the forest resource structure of China, there have been some problems with these inventory data in regional biomass estimation [8]. To obtain the total volume or biomass of forest, the volume or biomass of one tree is calculated, and then the volumes or biomasses of all the trees in the sample plot are added together. Obviously, this method requires a high amount of manpower and material resources [9]. Moreover, inventory data cannot fully reflect forest information [6,10]. Therefore, we need data that cover a wide area and contain a high amount of vegetation information to supplement forest inventory data.
With developments in technology, remote sensing has increased the possibilities for forest biomass research [11,12]. The use of remote sensing data in the research of meso-scale biomass is an important technical method. Various remote sensing indicators based on optical sensors, such as the normalized difference vegetation index (NDVI) and other factors obtained by image transformations, have been shown to be well correlated with the ground vegetation, providing reliable information for forest biomass estimation [13,14,15]. However, applications with optical data are often limited due to the complexity of biomass in time and space and limitations in the spatial and spectral characteristics of satellite data [16]. More abundant remote sensing data are needed to depict detailed forest information. Lidar can penetrate dense forests, provide accurate three-dimensional information of trees, and then be used to obtain forest biomass [17]. However, because of its limited coverage, high cost and inconvenience to transport, Lidar is not suitable for forest biomass estimation at the meso-scale [18]. Synthetic Aperture Radar (SAR) data, such as L-band Advanced Land Observing Satellite/Phased Array L-band Synthetic Aperture Radar (ALOS/PALSAR) [19] and X-band TerraSAR-X data, are widely used in the estimation of forest biomass [20,21]. SAR is not affected by illumination and climate conditions and it can penetrate vegetation to obtain information, covering relatively large areas in a short period of time [20].
At the meso-scale, many studies have demonstrated the potential of optical and radar remote sensing-derived indicators to estimate forest biomass [22,23]. However, there is a large range and many uncertainties of remote sensing. For example, the resolution of remote sensing images might be insufficient, and the vertical structure information of forest canopies cannot be obtained, which has certain limitations in high biomass areas. Therefore, at the regional scale, the accuracy of forest biomass estimation using remote sensing data is low [24].
Optical and radar remote sensing data can match forest inventory data in time and space [25]. In addition, these data can provide forest attributes and structural information that are missing from inventory data. Therefore, combining multisource remote sensing data with forest inventory data for regional forest biomass research provides a more consistent spatial and temporal analyses than forest inventory data alone. Generally, the uncertainty in the estimation can be reduced by this combination [25]. Furthermore, this combination can promote the application of forest biomass estimation and other ecological research at the meso-scale. However, there is little information on the potential of the combination of sample plot survey data with multisource remote sensing data to estimate and map biomass. Most forest biomass estimation studies focus on the impact of environmental variables on forest biomass [26]. Therefore, it is necessary to better assess and understand the modeling potential of sample survey factors and remote sensing factors to provide decision makers with information on forest resources.
In this study, by synthesizing the existing technical means, we combined forest inventory data with multisource remote sensing data to estimate forest biomass and improve the accuracy of biomass estimation at the regional scale. There are three objectives of the present study: the primary objective is to assess the potential of forest inventory data combined with multisource remote sensing data in modeling and mapping forest biomass. The second objective is to estimate the biomass of different forest types in Beijing in 2016 and provide data support for regional biomass estimation. The third objective is to estimate biomass using multiple linear regression (MLR) and the random forest (RF) model and compare the performances of the two models.

2. Materials and Methods

2.1. Data Collection

2.1.1. Forest Inventory Data

The National Forest Resources Continuous Inventory system is a method of forest resource investigation that aims to understand the status and dynamics of macroforest resources and periodically reviews both with fixed sample plots. It is an important part of the comprehensive monitoring system for forest resources and ecological conditions in China. China’s Ministry of Forestry has carried out eight consecutive national surveys and inventories of forest resources [27]. According to the technical regulations of the national forest resources continuous inventory, systematic sampling is used to lay out fixed sample plots, the size of which is 4 km × 4 km, and the sample plots are laid out at the intersection point of the kilometer network of the newly compiled 50,000 or 100,000 topographic map of the country. To ensure that the sample points are not repeated and missed, computer technology such as GIS, is used as far as possible [28]. In recent years, the collection of forest inventory data depends primarily on manual work, and it is supported by high-tech survey instruments that can automatically collect data to improve the accuracy of inventory results [29].
In this study, we used the Ninth Beijing Forest Inventory data of 2016, which involve 1431 sample plots located in all districts and counties of Beijing, as shown in Figure 1, covering coniferous, broadleaf and mixed coniferous-broadleaf forest types. The dataset describes in detail plot locations, measurement dates and forest compositions. For each plot, multiple attributes were collected, including the mean diameter at breast height (DBH), mean tree height, mean age, crown density, volume, land use and cover, and ecological conditions. The biomass, mean DBH, mean tree height, mean age and crown density were used as the inventory variables to establish the model, as shown in Table 1. The real biomass value was calculated using the equation for the biomass-volume relationship of the stand type and age group [30]. The stem volume of each tree was provided by FID, and the stand volume of each fixed sample plot was the sum of all tree volumes. The area of each plot was 0.0667 hectares.
There are significant differences in topography and administrative functions among different districts in Beijing. From the topographic point of view, the northwest is a mountainous area with a higher terrain, and the southeast is a plain; from the administrative function point of view, the central part of Beijing is the capital and core functional area, the Northwest Mountainous Area and the southwest are ecological conservation functional areas, and the plain is a densely populated scientific and technological innovation and economic development area, which also leads to differences in forest biomass distribution. More than 80% of Beijing’s forest resources are distributed in mountainous counties in the west and north of the city. The forest coverage in mountainous areas of Beijing has reached more than 50%, but the forest area in the plain of southeast Beijing accounts for less than 20% of the whole city.
The forest ecosystem contains arbors, shrubs and herbs, but the amount of biomass from shrubs and herbs is less than the amount of arboreal biomass [31]. Therefore, this study considers only arboreal biomass and does not consider shrub and herb biomass.

2.1.2. Remote Sensing Data and Preprocessing

We used optical (Landsat 8 OLI) and radar (ALOS-2 PALSAR-2) remote sensing sources.

Landsat 8

Landsat 8 Operational Land Imager (OLI, which developed by Bauer Aerospace and Technology Corp, Colorado, USA) images included a 15 m panchromatic band, with a spectral range from 0.500 to 0.680 μm and eight 30 m multispectral bands, with a spectral range from 0.433 to 2.300 μm. They were selected for biomass estimation due to their suitability in terms of their resolution ratio; a spatial resolution of approximately 30 m by 30 m is adequate to assess information at the forest stand level. We selected eight Landsat 8 OLI scenes of Beijing with low cloud cover as the research images. The image range was 122 to 124 paths and 31 to 33 rows. The image acquisition time used in this study was June-August 2016, and the time phase was basically the same as the time phase of the Beijing forest inventory data.
The image preprocessing steps included geometric correction, radiation correction, atmospheric correction, and image clipping. Because the downloaded images were Level-1 data products, the geometric accuracy was high, so only radiation and atmospheric corrections were needed.
Based on the preprocessed Landsat 8 OLI data, we acquired the surface reflectance for 6 bands of the Landsat 8 OLI (Band 2-Band 7) and then acquired the vegetation indices, namely, normalized difference vegetation index (NDVI), difference vegetation index (DVI) and ratio vegetation index (RVI) (Table 2), through band processing and the Landsat 8 OLI image calculation. The NDVI, DVI, and RVI are commonly used vegetation indices that are sensitive to vegetation, as shown in Equations (1)–(3). Another dataset was derived by image transformation from the original satellite band, which involved tasseled cap transformation (TCT) and texture features as shown in Table 2. Tasseled cap transformation, also known as a K-T transform, is an image enhancement method for vegetation information extraction. It can enhance vegetation information of images. After the K-T transform, the same number of components as the number of bands can be obtained, and the second component is the green index, which has a strong relationship with the vegetation coverage and biomass on the ground [32]. Therefore, based on the TCT coefficients of the OLI sensor onboard Landsat 8, we chose the second band generated from the TCT, which was marked as the greenness [32].
In addition, we extracted the texture factor of the image. Texture is an important feature of remote sensing images and can be extracted by using the gray-level cooccurrence matrix (GLCM). Previous research has shown that Band 2 of a Landsat 8 OLI image contains much information about the image; thus, we extracted the texture feature of Band 2. The larger the selected window is, the greater the information content will be [33]. According to the sample area, five texture eigenvalues were extracted from the 15 × 15 window, namely, the mean, variance, contrast, correlation and second moment [34], as shown in Equations (4)–(8):
Normalized Difference Vegetation Index (NDVI):
NDVI = NIR 1 R NIR 1 + R
Difference Vegetation Index (DVI):
RVI = NIR 1 R
Ratio Vegetation Index (RVI):
DVI = NIR 1 R
Mean(ME):
ME = i , j = 0 N 1 i P i j
Variance(VA):
VA = i , j = 0 N 1 P i j ( i ME ) 2
Contrast(CO):
CO = i , j = 0 N 1 i P i j ( i j ) 2
Correlation (CC):
  CC = i , j = 0 N 1 i P i j | ( i ME ) ( j ME ) V A i V A j |
Second Moment(SM):
SM = i , j = 0 N 1 i P i j 2
Using bilinear interpolation, the average values of the remote sensing factors at and near sampling points can be extracted. This method effectively solves the problem that occurs when sampling sites do not match the image completely and can cover areas that the inventory data cannot fully cover, thus improving the estimation accuracy.

ALOS-2/PALSAR-2

We downloaded six images of ALOS-2/PALSAR-2 (L-band) taken in 2016 from Japan Aerospace Exploration Agency (JAXA (http://www.eorc.jaxa.jp/ALOS/en/index.htm)). The PALSAR data had a 25-m spatial resolution and contain two polarized bands, HH and HV. The preprocessing of PALSAR data was completed by the JAXA. The digital numbers (DN) of the PALSAR signal amplitude were extracted and converted to gamma naught backscattering coefficients (dB) in decimal units using the following equation [35,36]:
Γ 0 = 10 × l o g 10 D N 2 C F
where Γ 0 is the backscattering coefficient, DN is the digital number value of pixels, and CF is the calibration factor, which equals −83 [36]. Then, we calculated the sum, difference and ratio values using the backscattering coefficients of HH and HV, as shown in Equations (10)–(12):
sum = Γ H H 0 + Γ H v 0
difference = Γ H H 0 Γ H v 0
ratio = Γ H H 0 / Γ H v 0 ,
where Γ H H 0 and Γ H v 0 are the backscattering coefficients of HH and HV in decibels.

2.2. Multiple Regression Model

The allometric growth equation is the most widely used model for estimating forest biomass. Many studies have confirmed the advantages of the allometric growth equation for estimating forest biomass [37,38,39]. This model regresses a correlated variable (biomass) based on one or more independent variables. The DBH and tree height, as the two most relevant factors of biomass, are often used in biomass prediction in the form of single or compound variables. Based on the allometric model and previous research results, we introduced new variables (Landsat 8 data and backscattering coefficients) into the model to explore its ability to estimate forest biomass, as shown in Equation (13).
ln ( B ) = β 0 + a ln ( d 2 H ) + β 1 x 1 + β 2 x 2 + + β j x j
where B is the biomass of the sample plot, each x j is an independent variable ( j = 1 , 2 , 3 ), β j is the regression coefficient of x j , β 0 is a constant, and a is the regression coefficient of the model.
However, the remote sensing variables were highly collinear. To overcome this problem, we used a stepwise regression analysis method, which gradually screens variables and leaves highly correlated variables that are not collinear in the model, to retain a model that was not very complex and to reduce the number of calculations. The basic idea of stepwise regression is to introduce variables into the model one by one. After introducing an explanatory variable, we need to conduct F-test and t-test for the selected explanatory variables one by one. When the original explanatory variables are no longer significant due to the introduction of later explanatory variables, they will be deleted. To ensure that only significant variables are included in the regression equation before each new variable is introduced. This is a repeated process until neither significant explanatory variables are selected into the regression equation nor insignificant explanatory variables are removed from the regression equation. To ensure that the final set of explanatory variables is optimal.

2.3. Feature Selection and Random Forest Model

We used R to establish an RF model to estimate forest biomass. The RF model was a classification and regression algorithm based on decision trees [40]. By establishing and combining multiple decision tree predictions (1000 trees in our study), the average value of all the decision tree prediction results was taken as the final prediction result [41]. The RF model can effectively alleviate the problem of overfitting and is insensitive to the collinearity between variables, so it is suitable for establishing a nonlinear model [42]. RF is increasingly used to perform biomass regression and estimate forest biomass [43,44].
First, subsets of variables were selected as input for the RF prediction using feature selection to ensure that the input variables were highly correlated with biomass. Feature selection refers to the selection of subsets from the original feature set to optimize a certain evaluation criterion so that the model established with the optimal feature subset can achieve a prediction accuracy similar to or better than that of the model established without feature selection. RF provides an increase in the mean-squared error (percentage of IncMSE, where IncMSE indicates the increase in MSE) for each independent indicator, quantifying the increase in the MSE when the indicator is randomly permuted. This error measures the relative importance of each indicator, where a high IncMSE implies that the indicator has a high weight in the model prediction and vice versa [23]. Then, we used the data after feature selection as the independent variable, forest biomass as the dependent variable, and the random forest software package in R to establish an RF model.

2.4. Model Accuracy Evaluation

To test these models, we assessed the prediction accuracy on randomly selected subsets (20%) of the original dataset retained before the model was developed. To evaluate the advantage of the use of an advanced regression tree model versus more traditional approaches, the performance of the RF model was computed and compared with that of a stepwise multiple linear regression model.
We used the proportion of variance explained ( R 2 ) and the root mean square error (RMSE) to evaluate the model performance on the complete datasets. In addition, we computed the relative RMSE (RMSE%), the bias and the relative bias (bias%). Bias was calculated as the difference between a population mean of the measurements or test results and an accepted reference or true value, R2 values were used to judge the model, and RMSE, Bias%, RMSE% reflect the precision of the model [45].
These statistics were calculated as follows:
R 2 = 1 ( y i y ^ i ) 2 ( y i y ¯ i ) 2
RMSE = ( y i y ^ i ) 2 n 1
RMSE % = RMSE y ¯ i × 100 %
Biasc = 1 n i = 1 n ( y i y ^ i )
Bias % = BIAS y ¯ i × 100 %
where y i is the observed biomass of the plot, y ^ i is the predicted biomass of the plot, and y ¯ i is the mean biomass of n plots.

3. Results

3.1. Univariate Correlation Analysis

Previous studies have typically analyzed the relationship between a single remote sensing variable and the forest biomass or have used the original band and variables transformed from images for feature selection [46,47]. Through the Pearson correlation coefficient (r), we analyzed the ability of each variable to estimate biomass and obtained the correlation between each variable and the biomass, as shown in Table 3.
Among all variables, forest inventory variables were highly correlated with biomass in three forest types. Different remote sensing variables (OLI data and PALSAR data) showed different degrees of correlation. The shortwave infrared (SWIR) optical band (Band 7) showed the greatest relevant biomass among the Landsat data because it allowed an effective separation between high- and low-biomass data. The importance of the SWIR wavelengths in biomass prediction is consistent with previous studies [40]. In addition, Band 5, Band 4, Band 3, Band 2, the NDVI, and the greenness were also highly correlated with biomass, and the most relevant texture factors were the mean, correlation and second moment. The other Landsat variables had little correlation with biomass. SAR data can penetrate dense forests and obtain the vertical structure information of forests, so the PALSAR HH and HV backscatter coefficients and their derivative variables (sum, difference, ratio) were correlated with forest biomass. In addition, correlations between the forest biomass and HV backscatter coefficients of different forest types were higher than those between the forest biomass and HH backscatter coefficients, which is in line with previous research results [22,48]. All these factors can be considered potential variables for forest biomass estimation.

3.2. Results of Forest Biomass Model Establishment

3.2.1. Multiple Stepwise Regression Model

To avoid overfitting, the multiple stepwise regression method was used to screen variables and establish a multiple linear model. The results are as follows:
The multiple stepwise regression model of coniferous forests was:
ln ( B C ) = 3.821 + 0.226 × N 1 + 0.111 N 2 + 0.139 N 3 0.120 X 1 0.246 X 6 0.089 X 10 0.656 X 16 +   0.538 X 17 0.308 X 19  
The multiple stepwise regression model of mixed forest was:
ln ( B M ) = 3.776 + 0.291 × N 1 + 0.155 N 2 + 0.108 N 3 0.153 X 1 + 0.302 X 2 0.132 X 3 0.05 X 4 +   0.052 X 6 + 0.037 X 13
The multiple stepwise regression model of broadleaf forests was:
ln ( B B ) = 3.810 + 0.127 × N 1 + 0.137 N 2 + 0.181 N 3 + 0.110 X 1 + 0.039 X 6 0.088 X 9 + 0.024 X 13 + 0.043 X 15 0.057 X 18
Then, the biomass estimation model was obtained as follows:
B C = e 3.821 + 0.226 × N 1 + 0.111 N 2 + 0.139 N 3 0.120 X 1 0.246 X 6 0.089 X 10 0.656 X 16 +   0.538 X 17 0.308 X 19    
B M = e 3.776 + 0.291 × N 1 + 0.155 N 2 + 0.108 N 3 0.153 X 1 + 0.302 X 2 0.132 X 3 0.05 X 4 +   0.052 X 6 + 0.037 X 13    
B B = e 3.810 + 0.127 × N 1 + 0.137 N 2 + 0.181 N 3 + 0.110 X 1 + 0.039 X 6 0.088 X 9 + 0.024 X 13 +   0.043 X 15 0.057 X 18    
where X i and N i in each formula correspond to the variables in Table 3.
P values represent the probability that the sample results differ from the original hypothesis. The smaller the P value is, the more significant the results are. Generally speaking, p < 0.05 indicates a significant difference, and p < 0.01 indicates a very significant difference. Table 4 shows that the P values of the model coefficients of different forest types, which are less than 0.05, some of which are less than 0.01. These results show that the differences in the selected variables are significant.

3.2.2. Random Forest Model

First, feature variables were selected for the variables involved in the modeling. The importance of the variables was ranked according to IncMSE%, and the unimportant variables were eliminated. Generally, the number of final variables is 1/3 of the total number of input variables [49]. Table 5 shows that d 2 H and the mean age are two very important variables in the RF model. IncMSE% was more than 20% in the different forest types. NDVI, Band 2 and Band 7 were also important to the model with regard to optical data. The most influential backscattering coefficient factors were Γ H v 0 and Γ H H 0 Γ H v 0 . The IncMSE% of these factors were all higher than 10%.

3.3. Model Precision Evaluation and Comparison of Two Models

To test the goodness of fit of the model, 20% of the samples were used for validation. We analyzed the scatter diagrams of different forest types in Figure 2 and obtained the accuracy of the linear regression and RF models in Table 6.
Figure 2 shows that correlations between the estimated biomass and observed biomass in the coniferous forests, mixed forests, and broadleaf forests were all better for the RF models than for linear regression. In the linear model, as the biomass value increased, the performance of the model decreased, and most of the high-value biomass was underestimated. In particular, the high biomass values of mixed forests were greatly underestimated. RF can improve the performance of the model. When the biomass was less than 100 Mg ha−1, the difference between predicted and observed values is lower than that of the linear model; the error under higher biomass values was slightly larger, and some of the higher values were underestimated.
Table 6 shows that the Bias% values were all near 0, and the RMSE% ranged from 27.92% to 32.48% for the MLR model. For the RF model, the Bias ranged from −2.67 to −4.02 and the RMSE% ranged from 22.89%–27.24%. These results showed that the two types of models were relatively stable and could be used to estimate biomass. However, an improvement in performance was found in the RF models for coniferous forest ( R 2 = 0.66, RMSE = 13.23 Mg ha1), mixed forest ( R 2 = 0.77, RMSE = 11.09 Mg ha−1), and broadleaf forest ( R 2 = 0.64, RMSE = 11.98 Mg ha−1), in comparison to the linear regression models for coniferous forest ( R 2 = 0.59, RMSE = 14.15 Mg ha−1), mixed forest R 2 = 0.70, RMSE = 14.54 Mg ha−1), and broadleaf forest ( R 2 = 0.53, RMSE = 15.26 Mg ha−1). Generally, the RF model was characterized by a high R 2 and a low RMSE, indicating a good fitting result.
For the same model with different forest types, the fit of the mixed-forest models was better than that of the models with the other two forest types. The R 2 of the linear regression model based on mixed forests was 0.11 and 0.17 higher than the R 2 of the linear regression models based on coniferous forests and broadleaf forests, respectively. The R 2 of the RF model based on mixed forests was 0.11 and 0.13 higher than the R 2 of the RF models based on coniferous forests and broadleaf forests, respectively, and the RMSE was 4.35 and 4.13 Mg ha−1 lower, respectively. Overall, the model for mixed forests had a high estimation accuracy.

3.4. Results of Biomass for Different Forest Types and Spatial Distribution of the Forest Biomass Density in Beijing

Based on the model estimation results, the forest biomass and biomass density of coniferous forests, broadleaf forests and mixed forests were estimated, the kriging interpolation was used and biomass density distribution maps of three forest types were obtained. Biomass of different forest types are shown in Table 7 and biomass density distribution in Figure 3.
The total forest biomass obtained from the survey data was 74,746.10Mg, and the biomass density was 19.14–195.66 Mg ha−1, with an average biomass density of 52.26 Mg ha−1. Among these values, the total biomass of coniferous forest was 35,622.99 Mg, the average biomass density was 53.73 Mg ha−1; the total biomass of mixed forest was 13,926.40 Mg, the average biomass density was 51.20 Mg ha−1; the total biomass of broadleaf forest was 25,196.80Mg, and the average biomass density was 50.80 Mg ha−1.
As shown in Figure 3, the biomass distribution of arbor forests was basically consistent with the distribution of forestland in Beijing. The high-biomass area corresponded to dense forestland, and most of these forests were mature and overmature and were mainly distributed in the north and southwestern part of Beijing. The low-biomass area was mainly located in the southeast and central parts of Beijing. Because this area is urban with mostly developed land, the biomass in this area is low. The biomass density in most areas of Beijing was less than 70 Mg ha−1.
The distribution of the three types of forest was obviously different. Coniferous forests were mainly distributed to the west and south of Beijing, mixed forests were mainly distributed in the west, and broadleaf forests were mainly distributed in the north and southwest.
This study did not consider shrubs and herbs, so the estimation of biomass can be considered relatively conservative but can also represent the basic situation of biomass in Beijing. At present, China’s biomass estimation system is still not perfect. This study provides a feasible method for regional biomass estimation.

4. Discussion

4.1. Forest Biomass Estimation Model Based on Forest Inventory and Multisource Remote Sensing Data

In this paper, we propose a novel approach to modeling and mapping the biomass of forests at the regional scale that provides more detailed and accurate information than other approaches, such as estimating using only a single remote sensing data source or forest inventory data.
We combined forest inventory data with multisource remote sensing data (OLI and PALSAR) to estimate forest biomass, capturing almost all forest biomass spatial variability, and producing spatially explicit biomass estimates over regions.
According to the biomass characteristics of different forest types, it is very important to select variables with a high importance to the model [48]. The forest inventory factors selected in this study included not only the DBH and height, which are the two most relevant factors to biomass [38,39,50] but also the mean age and canopy density, which have received increasing attention in recent studies. Many previous studies have demonstrated that these two factors show a good correlation with biomass [40,51], which is consistent with our results. Landsat optical data are sensitive to forest vegetation, and their spatial resolution is suitable for the sample plot size. Zheng et al. confirmed that the red and NIR bands (Bands 4 and 5, respectively) are effective predictors of biomass [13]. Zheng et al. found that the SWIR band (Band 7) showed a satisfactory estimation ability in forests with a high canopy density [52], Foody et al. found that the NDVI and other vegetation indices are strongly correlated with biomass [53]. These results are consistent with our results. The models established by these factors fit the data relatively well. Multiple-variable PALSAR data have a higher correlation with forest biomass than individual-variable PALSAR HH and HV data because of their ability to detect canopy structure and retrieve forest biomass [54,55]. In addition, the biomass estimation model based on multisource remote sensing data combined with forest inventory data had a higher accuracy than that based on single-source data. Zhao used Landsat TM and ALOS PALSAR data to establish forest biomass models in Zhejiang Province. The R 2 of each forest type was below 0.5 [56]. Urbazaev used SAR backscatter, Landsat images and topographic factors to obtain the best R 2 , a value of only 0.62 [57], which was lower than the accuracy in this study. This indicates that our model is suitable for estimating forest biomass in Beijing.
Optical and radar data are an effective supplement to inventory data, provide spatial information for estimating regional forest biomass, and can continuously estimate forest biomass [51]. In the results, the R 2 and RMSE of the three forest types were all greater than 0.5 and less than 20 Mg ha−1. The models are reliable, but the model accuracy differed among different forest types.
In our results, mixed forests had the highest estimation accuracy, followed by coniferous forests and broadleaf forests, which is inconsistent with previous research.
Previous studies have shown that coniferous forest biomass estimation models have a high accuracy [58]. This inconsistency may be because the three types of modeling factors included in this study are highly sensitive to the structure of mixed forests. Moreover, differences in the study area location, tree species and forest types lead to different model estimation accuracies for different forest types.
However, the R 2 values of the model were all less than 0.8, which indicates the present model has less precision than the model established in a previous study [25]. A possible reason is that previous studies have mostly focused on small-scale areas. Our research mainly focuses on meso-scale areas, including a variety of terrain and environmental conditions, causing different environmental factors to have a certain impact on the modeling accuracy.

4.2. Estimation and Spatial Distribution of Forest Biomass in Beijing

The results of this study suggest that it is possible to produce spatially explicit biomass estimates over regions if adequate inventory data and remote sensing data are available. This meso-scale study was based on a relatively large sample size.
Because the data set used in this research did not contain all sample plot data from the forest inventory in Beijing, it can represent the distribution but not the total amount of biomass, while biomass density can represent the state of forest resources in Beijing well [9]. The 2016 forest biomass density range of Beijing was estimated by this model to be 19.14–195.66 Mg ha−1, and the average biomass density was 52.26 Mg ha−1. The forest biomass density in Beijing increased compared with previous studies [24].
Overall, forest resources show a pattern of more forests in mountainous areas, less forests in plains, as well as more forests outside urban areas and less forests in urban areas [59]. This is consistent with the distribution map of biomass density obtained in this study, as shown in Figure 3. However, this study shows that the biomass density in the central city is high, which is inconsistent with the distribution of forest resources in Beijing. This result may be caused by the fact that in recent years, the central city has insisted on the construction of ecological cities and increases in the area of green space, so the forest biomass has increased.
China is vigorously promoting the construction of eco-cities. At the meso-scale, forestry biomass estimation and biomass density mapping can provide decision makers with detailed information on forest resources to strengthen the management of forest resources.

4.3. Comparing Performance between MLR and RF Models

We also compared the performance of the MLR model and the RF model. We found that the prediction effect of the linear model for extreme biomass values (extremely high or very low) was not completely ideal. Low values were overestimated, and high values were underestimated, which is also common in previous studies [25,60]. The range in biomass for the multiple regression prediction is larger than that for the RF model.
This finding indicates that this model may be applicable only in the estimation of biomass values within a certain range. The use of the RF model had a positive impact on the estimation accuracy of extreme values [23]. The results showed that the RF model captured the complex nonlinear relationship between the optical and SAR data and biomass and compensated for the lack of inventory data, capitalizing on the strengths of both the forest inventory and remote sensing data. Therefore, the fit of the RF model is better than that of the linear regression model.
Linear regression and RF model contain different independent variable factors. Because of the collinearity between variables, some variables will be eliminated in linear model, while RF model can fully consider the fitting problem. The two model types include forest investigation factors such as d 2 H , crown density and the mean age, indicating that the FID variables has a greater impact on the estimation of biomass and is not affected by collinearity. The RF models contain more PALSAR variables, and the model accuracy is higher, which indicates that the PALSAR variables are more sensitive to forest biomass.
Certainly, there are still some limitations in our research. Two thirds of Beijing is plain, and one third is mountainous area. Terrain correction is a part of remote sensing image correction under rugged surface, which can offset the influence of terrain to a certain extent, and is helpful to improve the accuracy of biomass estimation. Therefore, the research of terrain correction will be strengthened in future research. In the image preprocessing stage, the inaccuracy of the biomass models based on forest types and age classes and the lack of a consideration for the impact of environmental factors such as topography, soil and hydrology on biomass, which will be strengthened in the future research. Despite these problems, this study aimed to improve the performance of the regional forest biomass model and can provide a reference and support for future plans of relevant forestry departments, which has certain practical significance.

5. Conclusions

This paper proposed an approach for establishing the forest biomass of different forest type models and calculating forest biomass in Beijing by combining forest inventory data with multisource remote sensing data. The approach can capture all spatial variability and provide a reliable method for estimating forest biomass at a meso-scale with a high efficiency and low cost. In addition, we used this model to predict the forest biomass in Beijing in 2016. Among the three studied forest types, coniferous forest had the highest biomass density. According to the distribution of forest biomass in Beijing, the northern and southwestern parts of Beijing had a high biomass, while the central and eastern parts have a low biomass density. At present, there is no perfect biomass estimation system in China. Therefore, this method can provide a basis for meso-level biomass estimation and a reference for the planning of relevant forestry decision-making departments.

Author Contributions

Conceptualization, Z.F. and Y.Z.; Methodology, Z.F.; Validation, Z.F. and Y.Z.; Formal Analysis, Y.Z.; Investigation, Z.F. and Y.Z.; Resources, Z.F.; Data Curation, Z.F. and J.L. (Jincheng Liu); Writing-Original Draft Preparation, Y.Z.; Writing-Review & Editing, Y.Z. and J.L. (Jing Lu); Visualization, Y.Z.; Supervision, Z.F.; Project Administration, Z.F.; and Funding Acquisition, Z.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was jointly supported by the medium long-term project of “Precision Forestry Key Technology and Equipment Research” (No. 2015ZCQ-LX-01) and the National Natural Science Foundation of China (No.U1710123).

Acknowledgments

We would like to acknowledge support from Beijing Key Laboratory for Precision Forestry, Beijing Forestry University, as well as all the people who have contributed to this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cheng, W.; Yang, C.; Zhou, J.; Zhou, W.; Liu, Y. Research Summary of Forest Volume Quantitative Estimation Based on Remote Sensing Technology. J. Anhui Agric. Sci. 2009, 37, 7746–7750. [Google Scholar]
  2. Wang, X.; Shao, G.; Chen, H.; Lewis, B.J. An Application of Remote Sensing Data in Mapping Landscape-Level Forest Biomass for Monitoring the Effectiveness of Forest Policies in Northeastern China. Environ. Manag. 2013, 52, 612–620. [Google Scholar] [CrossRef]
  3. Roy, P.S.; Ravan, S.A. Biomass estimation using satellite remote sensing data—An investigation on possible approaches for natural forest. J. Biosci. 1996, 21, 535–561. [Google Scholar] [CrossRef]
  4. Li, D.; Wang, C.; Hu, Y.; Liu, S. General review on remote sensing-based biomass estimation. Geomat. Inf. Sci. Wuhan Univ. 2012, 37, 631–635. [Google Scholar]
  5. Tang, X.G.; Liu, D.W.; Wang, Z.M.; Jia, M.M.; Xu, W.M. Estimation of forest aboveground biomass based on remote sensing data: A review. Chin. J. Ecol. 2012, 31, 1311–1318. [Google Scholar]
  6. Fang, J. Changes in Forest Biomass Carbon Storage in China between 1949 and 1998. Science 2001, 292, 2320–2322. [Google Scholar] [CrossRef]
  7. Zhang, H.; Feng, Z.; Chen, P.; Chen, X. Development of a Tree Growth Difference Equation and Its Application in Forecasting the Biomass Carbon Stocks of Chinese Forests in 2050. Forests 2019, 10, 582. [Google Scholar] [CrossRef] [Green Version]
  8. Liu, J.; Feng, Z.; Mannan, A.; Khan, T.; Cheng, Z. Comparing Non-Destructive Methods to Estimate Volume of Three Tree Taxa in Beijing, China. Forests 2019, 10, 92. [Google Scholar] [CrossRef] [Green Version]
  9. Lu, J.; Feng, Z.; Zhu, Y. Estimation of Forest Biomass and Carbon Storage in China Based on Forest Resources Inventory Data. Forests 2019, 10, 650. [Google Scholar] [CrossRef] [Green Version]
  10. Chave, J.; Réjou-Méchain, M.; Búrquez, A.; Chidumayo, E.; Colgan, M.S.; Delitti, W.B.; Duque, A.; Eid, T.; Fearnside, P.M.; Goodman, R.C.; et al. Improved allometric models to estimate the aboveground biomass of tropical trees. Glob. Chang. Biol. 2014, 20, 3177–3190. [Google Scholar] [CrossRef]
  11. Gu, Y.; Brown, J.F.; Verdin, J.P.; Wardlow, B. A five-year analysis of MODIS NDVI and NDWI for grassland drought assessment over the central Great Plains of the United States. Geophys. Res. Lett. 2007, 34, 6. [Google Scholar] [CrossRef] [Green Version]
  12. Gu, Y.; Wylie, B.K. Detecting Ecosystem Performance Anomalies for Land Management in the Upper Colorado River Basin Using Satellite Observations, Climate Data, and Ecosystem Models. Remote Sens.-Basel 2010, 2, 1880–1891. [Google Scholar] [CrossRef] [Green Version]
  13. Zheng, D.; Rademacher, J.; Chen, J.; Crow, T.; Bresee, M.; Le Moine, J.; Ryu, S. Estimating aboveground biomass using Landsat 7 ETM+ data across a managed landscape in northern Wisconsin, USA. Remote Sens. Environ. 2004, 93, 402–411. [Google Scholar] [CrossRef]
  14. Running, S.W.; Nemani, R.R.; Peterson, D.L.; Band, L.E.; Potts, D.F.; Pierce, L.L.; Spanner, M.A. Mapping Regional Forest Evapotranspiration and Photosynthesis by Coupling Satellite Data with Ecosystem Simulation. Ecology 1989, 70, 1090–1101. [Google Scholar] [CrossRef] [Green Version]
  15. Xiao, X.; Hollinger, D.; Aber, J.; Goltz, M.; Davidson, E.A.; Zhang, Q.; Moore, B., III. Satellite-based modeling of gross primary production in an evergreen needleleaf forest. Remote Sens. Environ. 2004, 89, 519–534. [Google Scholar] [CrossRef]
  16. Nichol, J.E.; Sarker, M.L. Improved Biomass Estimation Using the Texture Parameters of Two High-Resolution Optical Sensors. IEEE Trans. Geosci. Remote Sens. 2011, 49, 930–948. [Google Scholar] [CrossRef] [Green Version]
  17. Næsset, E. Predicting forest stand characteristics with airborne scanning laser using a practical two-stage procedure and field data. Remote Sens. Environ. 2002, 80, 88–99. [Google Scholar] [CrossRef]
  18. Hanes, J.; Zhang, X.; Nimeister, W. Biophysical Applications of Satellite Remote Sensing; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
  19. Zhang, H.; Zhu, J.; Wang, C.; Lin, H.; Long, J.; Zhao, L.; Fu, H.; Liu, Z. Forest Growing Stock Volume Estimation in Subtropical Mountain Areas Using PALSAR-2 L-Band PolSAR Data. Forests 2019, 10, 276. [Google Scholar] [CrossRef] [Green Version]
  20. Karjalainen, M.; Kankare, V.; Vastaranta, M.; Holopainen, M.; Hyypp, J. Prediction of plot-level forest variables using TerraSAR-X stereo SAR data. Remote Sens. Environ. 2012, 117, 338–347. [Google Scholar] [CrossRef]
  21. Vastaranta, M.; Holopainen, M.; Karjalainen, M.; Kankare, V.; Hyyppa, J.; Kaasalainen, S. TerraSAR-X Stereo Radar grammetry and Airborne Scanning LiDAR Height Metrics in Imputation of Forest Aboveground Biomass and Stem Volume. IEEE Trans. Geosci. Remote. Sens. 2013, 52, 1197–1204. [Google Scholar] [CrossRef]
  22. Sinha, S.; Jeganathan, C.; Sharma, L.K.; Nathawat, M.S.; Das, A.K.; Mohan, S. Developing synergy regression models with space-borne ALOS PALSAR and Landsat TM sensors for retrieving tropical forest biomass. J. Earth Syst. Sci. 2016, 125, 725–735. [Google Scholar] [CrossRef] [Green Version]
  23. Bourgoin, C.; Blanc, L.; Bailly, J.; Cornu, G.; Berenguer, E.; Oszwald, J.; Tritsch, I.; Laurent, F.; Hasan, A.; Sist, P.; et al. The Potential of Multisource Remote Sensing for Mapping the Biomass of a Degraded Amazonian Forest. Forests 2018, 9, 303. [Google Scholar] [CrossRef] [Green Version]
  24. Wang, G.; Liu, Q. Estimating Carbon Sequestration of Beijing’s Forests Based on TM Images and Forest Inventory Data. J. Basic Sci. Eng. 2013, 21, 224–235. [Google Scholar] [CrossRef]
  25. Zheng, D.; Heath, L.S.; Ducey, M.J. Forest biomass estimated from MODIS and FIA data in the Lake States: MN, WI and MI, USA. Forestry 2007, 80, 265–278. [Google Scholar] [CrossRef] [Green Version]
  26. Longo, M.; Keller, M.; Dos-Santos, M.N.; Leitold, V.; Pinagé, E.R.; Baccini, A.; Saatchi, S.; Nogueira, E.M.; Batistella, M.; Morton, D.C. Aboveground biomass variability across intact and degraded forests in the Brazilian Amazon. Glob. Biogeochem. Cycles 2016, 30, 1639–1660. [Google Scholar] [CrossRef] [Green Version]
  27. Shao, W.; Cai, J.; Wu, H.; Liu, J.; Zhang, H.; Huang, H. An Assessment of Carbon Storage in China’s Arboreal Forests. Forests 2017, 8, 110. [Google Scholar] [CrossRef] [Green Version]
  28. Qiu, Z.; Feng, Z.; Song, Y.; Li, M.; Zhang, P. Carbon sequestration potential of forest vegetation in China from 2003 to 2050: Predicting forest vegetation growth based on climate and the environment. J. Clean. Prod. 2020, 252, 119715. [Google Scholar] [CrossRef]
  29. Zhiguo, H.E.; Yang, Z. The problems and countermeasures of continuous forest inventory in China. Hunan For. Sci. Technol. 2012, 39, 80–83. [Google Scholar]
  30. Xinliang, X.U.; Mingkui, C.; Kerang, L.I. Temporal- Spatial Dynamics of Carbon Storage of Forest Vegetation in China. Prog. Geogr. 2007, 26, 1–10. [Google Scholar]
  31. Wang, H.; Niu, S.K.; Xiao, S.; Zhang, C. Study on biomass estimation methods of understory shrubs and herbs in forest ecosystem. Acta Prataculturae Sin. 2014, 23, 20–29. [Google Scholar]
  32. Crist, E.P.; Cicone, R.C. A Physically-Based Transformation of Thematic Mapper Data—The TM Tasseled Cap. IEEE Trans. Geosci. Remote Sens. 1984, GE-22, 256–263. [Google Scholar] [CrossRef]
  33. You, J. The research on High Resolution Texture Information of Remote Sensing in Forest Volume Estimate. Master’s Thesis, Xi’an University of Science and Technology, Xi’an, China, 2014. [Google Scholar]
  34. Gao, Y. Research on the Ratio Band of Remote Sensing Image in Forest Stock Volume and Choice of Texture Information. Master’s Thesis, Xi’an University of Science and Technology, Xi’an, China, 2014. [Google Scholar]
  35. Qin, Y.; Xiao, X.; Dong, J.; Zhang, G.; Shimada, M.; Liu, J.; Li, C.; Kou, W.; Moore, B. Forest cover maps of China in 2010 from multiple approaches and data sources: PALSAR, Landsat, MODIS, FRA, and NFI. ISPRS J. Photogramm. 2015, 109, 1–16. [Google Scholar] [CrossRef] [Green Version]
  36. Shimada, M.; Isoguchi, O.; Tadono, T.; Isono, K. PALSAR radiometric and geometric calibration. IEEE Trans. Geosci. Remote Sens. 2009, 47, 3915–3932. [Google Scholar] [CrossRef]
  37. Vieilledent, G.; Vaudry, R.; Andriamanohisoa, S.F.; Rakotonarivo, O.S.; Randrianasolo, H.Z.; Razafindrabe, H.N.; Rakotoarivony, C.B.; Ebeling, J.; Rasamoelina, M. A universal approach to estimate biomass and carbon stock in tropical forests using generic allometric models. Ecol. Appl. 2012, 22, 572–583. [Google Scholar] [CrossRef]
  38. Fehrmann, L.; Kleinn, C. General considerations about the use of allometric equations for biomass estimation on the example of Norway spruce in central Europe. For. Ecol. Manag. 2006, 236, 412–421. [Google Scholar] [CrossRef]
  39. Rutishauser, E.; Noor An, F.; Laumonier, Y.; Halperin, J.; Rufi, I.; Hergoualc, H.K.; Verchot, L. Generic allometric models including height best estimate forest biomass and carbon stocks in Indonesia. For. Ecol. Manag. 2013, 307, 219–225. [Google Scholar] [CrossRef]
  40. Baccini, A.; Friedl, M.A.; Woodcock, C.E.; Warbington, R. Forest biomass estimation over regional scales using multisource data. Geophys. Res. Lett. 2004, 31. [Google Scholar] [CrossRef] [Green Version]
  41. Fang, K.; Wu, J.; Zhu, J.; Xie, B. A Review of Technologies on Random Forests. Stat. Inf. Forum 2011, 26, 32–38. [Google Scholar]
  42. Zhao, T.; Yang, D.; Cai, X.; Cao, Y. Predict seasonal low flows in the upper Yangtze River using random forests model. J. Hydroelectr. Eng. 2012, 31, 18–24. [Google Scholar]
  43. Asner, G.P.; Mascaro, J.; Muller-Landau, H.C.; Vieilledent, G.; Vaudry, R.; Rasamoelina, M.; Hall, J.S.; van Breugel, M. A universal airborne LiDAR approach for tropical forest carbon mapping. Oecologia 2012, 168, 1147–1160. [Google Scholar] [CrossRef]
  44. Goetz, S.J.; Baccini, A.; Laporte, N.T.; Johns, T.; Walker, W.; Kellndorfer, J.; Houghton, R.A.; Sun, M. Mapping and monitoring carbon stocks with satellite observations: A comparison of methods. Carbon Balance Manag. 2009, 4, 2. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Saud, P.; Lynch, T.B.; KC, A.; Guldin, J.M. Using quadratic mean diameter and relative spacing index to enhance height–diameter and crown ratio models fitted to longitudinal data. Forestry 2016, 89, 215–229. [Google Scholar] [CrossRef] [Green Version]
  46. Gjertsen, A. Accuracy of forest mapping based on Landsat TM data and a kNN-based method. Remote Sens. Environ. 2007, 110, 420–430. [Google Scholar] [CrossRef]
  47. Katila, M.; Tomppo, E. Selecting estimation parameters for the Finnish multisource National Forest Inventory. Remote Sens. Environ. 2001, 76, 16–32. [Google Scholar] [CrossRef]
  48. Ma, J.; Xiao, X.; Qin, Y.; Chen, B.; Hu, Y.; Li, X.; Zhao, B. Estimating aboveground biomass of broadleaf, needleleaf, and mixed forests in Northeastern China through analysis of 25-m ALOS/PALSAR mosaic data. Forest Ecol. Manag. 2017, 389, 199–210. [Google Scholar] [CrossRef]
  49. Li, X.-H. Using “random forest”for classification and regression. Chin. J. Appl. Entomol. 2013, 50, 1190–1197. [Google Scholar]
  50. Fu, W.; Wu, Y. Estimation of aboveground biomass of different mangrove trees based on canopy diameter and tree height. Procedia Environ. Sci. 2011, 10, 2189–2194. [Google Scholar] [CrossRef] [Green Version]
  51. Avitabile, V.; Baccini, A.; Friedl, M.A.; Schmullius, C. Capabilities and limitations of Landsat and land cover data for aboveground woody biomass estimation of Uganda. Remote Sens. Environ. 2012, 117, 366–380. [Google Scholar] [CrossRef]
  52. Zheng, G.; Chen, J.M.; Tian, Q.J.; Ju, W.M.; Xia, X.Q. Combining remote sensing imagery and forest age inventory for biomass mapping. J. Environ. Manag. 2007, 85, 616–623. [Google Scholar] [CrossRef]
  53. Foody, G.M.; Cutler, M.E.; McMorrow, J.; Pelz, D.; Tangki, H.; Boyd, D.S.; Douglas, I. Mapping the Biomass of Bornean Tropical Rain Forest from Remotely Sensed Data. Glob. Ecol. Biogeogr. 2001, 10, 379–387. [Google Scholar] [CrossRef]
  54. Saatchi, S.; Halligan, K.; Despain, D.G.; Crabtree, R.L. Estimation of Forest Fuel Load from Radar Remote Sensing. IEEE Trans. Geosci. Remote Sens. 2007, 45, 1726–1740. [Google Scholar] [CrossRef]
  55. Carreiras, J.M.B.; Vasconcelos, M.J.; Lucas, R.M. Understanding the relationship between aboveground biomass and ALOS PALSAR data in the forests of Guinea-Bissau (West Africa). Remote Sens. Environ. 2012, 121, 426–442. [Google Scholar] [CrossRef]
  56. Zhao, P.; Lu, D.; Wang, G.; Liu, L.; Li, D.; Zhu, J.; Yu, S. Forest aboveground biomass estimation in Zhejiang Province using the integration of Landsat TM and ALOS PALSAR data. Int. J. Appl. Earth Obs. 2016, 53, 1–15. [Google Scholar] [CrossRef]
  57. Urbazaev, M.; Thiel, C.; Migliavacca, M.; Reichstein, M.; Rodriguez-Veiga, P.; Schmullius, C. Improved Multi-Sensor Satellite-Based Aboveground Biomass Estimation by Selecting Temporally Stable Forest Inventory Plots Using NDVI Time Series. Forests 2016, 7, 169. [Google Scholar] [CrossRef] [Green Version]
  58. Ma, J.; Bu, R.; Miao, L.; Yu, C.; Han, F.; Qin, Q.; Hu, Y. Recovery of understory vegetation biomass and biodiversity in burned larch boreal forests in Northeastern China. Scand. J. For. Res. 2015, 31, 1–12. [Google Scholar] [CrossRef]
  59. Chen, H.Y.; Li, W. Analysis on Forest Distribution and Structure in Beijing. For. Resour. Manag. 2011, 2, 32–35. [Google Scholar]
  60. Fuchs, H.; Magdon, P.; Kleinn, C.; Flessa, H. Estimating aboveground carbon in a catchment of the Siberian forest tundra: Combining satellite imagery and field inventory. Remote Sens. Environ. 2009, 113, 518–531. [Google Scholar] [CrossRef]
Figure 1. Spatial distribution map of the forest sample plots in Beijing.
Figure 1. Spatial distribution map of the forest sample plots in Beijing.
Forests 11 00163 g001
Figure 2. Graphs of the predicted versus observed values of three forest types for two models. (a1) and (a2) Coniferous forest; (b1) and (b2) Mixed forest; (c1) and (c2): Broadleaf forest.
Figure 2. Graphs of the predicted versus observed values of three forest types for two models. (a1) and (a2) Coniferous forest; (b1) and (b2) Mixed forest; (c1) and (c2): Broadleaf forest.
Forests 11 00163 g002
Figure 3. Biomass density distribution in Beijing, China (a) Coniferous forest; (b) Mixed forest; (c) Broadleaf forest; (d) all forest sampling plots.
Figure 3. Biomass density distribution in Beijing, China (a) Coniferous forest; (b) Mixed forest; (c) Broadleaf forest; (d) all forest sampling plots.
Forests 11 00163 g003
Table 1. Statistics of the main forest inventory dataset (plot number and biomass of the three forest types in Beijing, China).
Table 1. Statistics of the main forest inventory dataset (plot number and biomass of the three forest types in Beijing, China).
Forest TypesConiferous Forest (n1 = 663)Mixed Forest (n2 = 272)Broadleaf Forest (n3 = 496)
MaxMinMeanMaxMinMeanMaxMinMean
Biomass (Mg ha−1)260.455.0851.83170.4521.6351.78138.2311.8251.71
Mean H(m)32.401.007.4227.71.507.4122.801.507.41
Mean DBH (cm)51.001.0012.840.603.5012.842.503.0012.8
Mean age11512510032480324
Crown density902048902048902048
Table 2. Remote sensing factors calculated from the Landsat 8 and ALOS-2/PALSAR-2 images.
Table 2. Remote sensing factors calculated from the Landsat 8 and ALOS-2/PALSAR-2 images.
Factor TypeRemote Sensing FactorsData Source
Band valueBand 2, Band 3, Band 4, Band 5, Band 7Landsat 8 OLI
Vegetation indexNDVI, DVI, RVILandsat 8 OLI
Tasseled cap transformationGreennessLandsat 8 OLI
Texture analysisMean, Variance, Contrast, Correlation, Second moment Landsat 8 OLI
Backscattering coefficients Γ H H 0 , Γ H v 0 , Γ H H 0 + Γ H v 0 ,   Γ H H 0 Γ H v 0 , Γ H H 0 / Γ H v 0 ALOS-2/PALSAR-2
Table 3. Coefficients of correlation between forest biomass and variables.
Table 3. Coefficients of correlation between forest biomass and variables.
VariablesCodeCorrelation (r)
Coniferous ForestMixed ForestBroadleaf Forest
FID d 2 H N10.4920.5100.489
Crown densityN20.3730.4510.401
Mean ageN30.4550.4120.359
Original bandsB2X10.3590.2900.367
B3X20.1180.2570.213
B4X30.2190.1970.156
B5X40.2370.2110.181
B7X50.2350.2430.221
Vegetation indexNDVIX60.3360.3320.317
DVIX70.1050.1160.132
RVIX80.0890.1240.098
Tasseled capGreennessX90.2390.1590.224
Texture (15 × 15)MeanX100.1960.1950.174
VarianceX110.0890.0770.103
ContrastX120.1300.1140.082
CorrelationX130.1380.2110.243
Second momentX140.1450.0560.097
Backscattering coefficients Γ H H 0 X150.1350.1250.126
Γ H v 0 X160.1650.1870.173
Γ H H 0 + Γ H v 0 X170.1480.0990.106
Γ H H 0 Γ H v 0 X180.1320.0710.121
Γ H H 0 / Γ H v 0 X190.1270.1260.187
Table 4. P values of the coefficients of models for different forest types.
Table 4. P values of the coefficients of models for different forest types.
Forest TypeVariablesp-Value
Coniferous forestN10.001
N20.004
N30.010
X10.011
X60.020
X100.010
X160.015
X170.009
X190.022
Mixed forestN10.002
N20.001
N30.004
X10.016
X20.011
X30.002
X40.015
X60.008
X130.037
Broadleaf forestN10.005
N20.003
N30.010
X10.033
X60.02
X90.01
X130.025
X150.019
X180.002
Table 5. The IncMSE% of the top five most important variables in the biomass fitting of different forest types in the random forest model.
Table 5. The IncMSE% of the top five most important variables in the biomass fitting of different forest types in the random forest model.
Forest TypeVariablesIncMSE%
Coniferous forest d 2 H 40.82
The mean age19.83
Band 215.39
NDVI12.36
Γ H v 0 11.15
Mixed forestThe mean age32.28
d 2 H 25.52
Γ H H 0 Γ H v 0 18.48
B a n d 2 m e a n 14.23
Crown density11.88
Broadleaf forest d 2 H 43.46
Crown density23.02
Band 712.77
Γ H v 0 11.70
NDVI10.08
Table 6. Estimation accuracy of different models.
Table 6. Estimation accuracy of different models.
ModelForest Type R 2 RMSE (Mg ha−1)RMSE%Bias (Mg ha−1)Bias%
MLRConiferous forest0.5914.1529.650.340.71
Mixed forest0.714.5427.920.91.73
Broadleaf forest0.5315.2632.480.160.33
RFConiferous forest0.6613.2327.24−2.67−5.50
Mixed forest0.7711.0922.89−3.34−6.89
Broadleaf forest0.6411.9827.02−4.02−9.08
Table 7. Biomass and biomass density of each forest type.
Table 7. Biomass and biomass density of each forest type.
Forest TypeArea (ha−1)Biomass (Mg)Biomass Density (Mg ha−1)
Coniferous forest66335,622.9953.73
Mixed forest27213,926.4051.20
Broadleaf forest49625,196.8050.80

Share and Cite

MDPI and ACS Style

Zhu, Y.; Feng, Z.; Lu, J.; Liu, J. Estimation of Forest Biomass in Beijing (China) Using Multisource Remote Sensing and Forest Inventory Data. Forests 2020, 11, 163. https://doi.org/10.3390/f11020163

AMA Style

Zhu Y, Feng Z, Lu J, Liu J. Estimation of Forest Biomass in Beijing (China) Using Multisource Remote Sensing and Forest Inventory Data. Forests. 2020; 11(2):163. https://doi.org/10.3390/f11020163

Chicago/Turabian Style

Zhu, Yan, Zhongke Feng, Jing Lu, and Jincheng Liu. 2020. "Estimation of Forest Biomass in Beijing (China) Using Multisource Remote Sensing and Forest Inventory Data" Forests 11, no. 2: 163. https://doi.org/10.3390/f11020163

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop