Next Article in Journal
Tree-Lists Estimation for Chinese Boreal Forests by Integrating Weibull Diameter Distributions with MODIS-Based Forest Attributes from kNN Imputation
Previous Article in Journal
Topographic Controls on Vegetation Changes in Alpine Tundra of the Changbai Mountains
Previous Article in Special Issue
Spatial Variation in Canopy Structure across Forest Landscapes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Levenberg–Marquardt Backpropagation Neural Network for Predicting Forest Growing Stock Based on the Least-Squares Equation Fitting Parameters

1
Key Laboratory of Forestry Intelligent Monitoring and Information Technology Research of Zhejiang Province, Zhejiang A & F University, Lin’an 311300, Zhejiang, China
2
School of Information Engineering, Zhejiang A & F University, Lin’an 311300, Zhejiang, China
*
Author to whom correspondence should be addressed.
Forests 2018, 9(12), 757; https://doi.org/10.3390/f9120757
Submission received: 26 October 2018 / Revised: 1 December 2018 / Accepted: 3 December 2018 / Published: 5 December 2018
(This article belongs to the Special Issue Defining, Quantifying, Observing and Modeling Forest Canopy Traits)

Abstract

:
Traditional field surveys are expensive, time-consuming, laborious, and difficult to perform, especially in mountainous and dense forests, which imposes a burden on forest management personnel and researchers. This study focuses on predicting forest growing stock, one of the most significant parameters of a forest resource assessment. First, three schemes were designed—Scheme 1, based on the study samples with mixed tree species; Scheme 2, based on the study samples divided into dominant tree species groups; and Scheme 3, based on the study samples divided by dominant tree species groups—the evaluation factors are fitted by least-squares equations, and the non-significant fitted-factors are removed. Second, an overall evaluation indicator system with 17 factors was established. Third, remote sensing images of Landsat Thematic Mapper, digital elevation model, and the inventory for forest management planning and design were integrated in the same database. Lastly, a backpropagation neural network based on the Levenberg–Marquardt algorithm was used to predict the forest growing stock. The results showed that the group estimation precision exceeded 90%, which is the highest standard of total sampling precision of inventory for forest management planning and design in China. The prediction results for distinguishing dominant tree species were better than for mixed dominant tree species. The results also showed that the performance metrics for prediction could be improved by least-squares equation fitting and significance filtering of the evaluation factors.

1. Introduction

To promote the management and long-term development of forest resources, managers and researchers need to retain the latest information about forest resources and track the spatial changes of forest landscapes [1]. Forest inventory can represent the forest extent, determine species composition, and track changes from the past to the present. Traditional ground surveys effectively provide objective and reliable information for monitoring and managing forest resources [2,3]. However, traditional ground-based field measurements are expensive, time-consuming, labor-intensive, and hard to implement, particularly in mountainous and forest areas. This lay a burden on forest management personnel and researchers [4,5].
In China, two main types of surveys of large-area forest resources are available: national forest inventory (NFI), which is repeated every five years, and inventory for forest management planning and design, which is conducted at 10-year intervals. However, for both ecological monitoring and forest product utilization, the long survey cycles do not satisfy the social needs [6].
Since the 1970s, satellite images, such as Landsat Thematic Mapper (TM) and Satellite Pour l’Observation de la Terre (SPOT), have been routinely employed to estimate continuous forest parameters. By combining remote sensing image with the actual plots data, managers and researchers can estimate forest variables [7,8,9].
Scholars have been trying to find more efficient and credible prediction models [10,11,12] to partially replace the traditional and expensive measuring techniques.
Due to their adaptability and flexibility, artificial neural networks (ANNs) have become a substitutable and effective method to simulate nonlinear and complex ecosystems like forests. With robust data structures and highly interrelated relationships, ANN models have become popular. Being only slightly affected by data quality and bias, ANNs can also learn more complicated models and data tendencies [2,13]. ANNs therefore have advantages that can be used to solve difficult problems in forest resource management [13,14,15,16].
Traditional BP algorithms are sensitive to convergence into a local minimum, which may lead to instability of the training or sub-optimal results [17]. The Levenberg-Marquardt (LM) algorithm provides is less sensitive to local converges and therefore, it provides a better learning training approach for the back-propagation network. It chooses a balanced tradeoff between the training speed and the stability [18].
As one of the fundamental indicators of forest inventory [19], forest growing stock helps to determine the productive capacity of an area identified as forest available for wood production. The spatiotemporal dynamics of forest growing stock of various tree species play a key role in the balance between a continuous supply of timber and the sustainable development of the whole ecosystem [20]. By applying biomass expansion factors, the forest growing stock can be transformed to estimate the aboveground and belowground biomass [21]. Moreover, the increase or decrease in forest growing stock is essential to calculate the carbon storage of the forestry department [22]. During the estimation or prediction of forest growing stock, data from permanent plots (the basic units of NFI, set up by systematic sampling methods at the intersection of kilometer networks referring to the topographic maps with 1:50,000 map scale, usually with an area of 0.0667 ha) are frequently chosen to validate various models [9,23,24,25,26]. However, in China, the number of subcompartments (the basic unit of inventory for forest management planning and design, divided by the terrain boundaries including ridge line, valleys, roads, etc. or forest ownership boundaries, with the maximum area of 15 ha in South China and 25 ha in other parts of China) is much larger than the number of permanent plots. For example, in Zhejiang province in China (Figure 1), the number of permanent forest plots was 4252 in 2004 [6]. In contrast, the number of subcompartments was 39,377 in the city of Longquan (Figure 1) in Zhejiang province in 2007. To reduce the cost of investigating, estimation of forest growing stock based on subcompartments is more valuable than methods based on permanent forest plots.
This study focuses on predicting forest growing stock. First, three models were designed according to whether the study samples were based on mixed tree species or dominant tree species groups and whether the evaluation factors could be fitted by the least-squares equation and filtering by setting the significance p ≤ 0.05. Second, we established an overall evaluation factor system with 17 factors. Third, we integrated the remote sensing images of Landsat TM, digital elevation model (DEM), and the inventory for forest management planning and design into a database. Last, the Levenberg–Marquardt backpropagation (LM-BP) model was used to predict the forest growing stock.

2. Materials

2.1. Study Area

The study area, Longquan city, located in the southwest of Zhejiang province, has a total area of 3059 km2, which extends from 118°42′ E to 119°25′ E, and 27°42′ N to 28°20′ N. Longquan includes 22 basic forestry management departments that contain 19 townships, two forest farms, and one scenic area (Figure 1). There are 265,667 ha of forest resources, with 14.56 million cubic meters of forest growing stock and 84.2% forest coverage [27].

2.2. Evaluation Factors

Forest growth is comprehensively influenced by multiple environments that primarily contain three categories: climate, topography, and soil.
Due to the costly data, not all of the environmental factors are suitable for estimating forest growth [28], which has resulted in many researchers trying to choose some factors for their experiments and accordingly make some harvest predictions [29,30,31,32].
To balance the costs and the richness of information, we chose comprehensive evaluation factors (Table 1), which had been applied in a previous paper [3].

2.3. Research Data

The research data included the administrative map Landsat TM, DEM, and inventory data for forest management planning and design.
The administrative map of the study area was provided by the Forestry Bureau of Longquan.
The first version data of the global digital elevation model (GDEM) were generated from the Advanced Spaceborne Thermal Emission and Reflection Radiometer (a Japanese sensor which is one remote sensory device on board the Terra satellite launched into Earth orbit by NASA in 1999 and has been collecting data since February 2000) in 2009, with 30-m resolution, IMG data type, Universal Transverse Mercator (UTM) projection, and World Geodetic System 1984 (WGS84). The data set was supplied by the International Scientific & Technical Data Mirror Site, Computer Network Information Center, Chinese Academy of Sciences (http://www.gscloud.cn).
The inventory data for forest management planning and design, containing 39,377 forest subcompartments with 83,078 subplots, were provided by the Forestry Bureau of Longquan in 2007. In order to eliminate erroneous and incorrect data, nonforest, zero-volume, and zero-canopy density subplot samples were removed by preliminary processing. Therefore, the survey data of forest resources planning finally included 28,707 forest subcompartments that consisted of 38,898 subplots. In the data, four dominant tree species (Chinese fir (Cunninghamia lanceolata (Lamb.) Hook.), Masson pine (Pinus massoniana Lamb.), Taiwan pine (Pinus taiwanensis Hayata), and hard broadleaves accounted for the vast majority of the dominant tree species (Table 2). We separated the research data into four groups (Table 2): Group 1, Chinese fir; Group 2, Masson pine; Group 3, Taiwan pine; and Group 4, hard broadleaves. The remaining 613 subplots, which consisted of 26 dominant tree species that included soft broadleaves, fruit trees, Chinese cryptomeria, etc., accounted for a very small proportion (1.58%) of the total data. Thus, the 613 subplots were removed because the small size and variety of dominant tree species might have caused the experiment results to be unstable and unreliable.

3. Methods

3.1. Study Scheme Design

For modeling and prediction, the remaining subcompartments were divided into two parts in which there were 11 basic forestry management departments for modeling and another 11 for predicting. Specifically, the modeling area included Badu town, Baoxi township, Chatian town, Jianchi town, Lanju township, Pingnan town, Shangyang town, city forest farm, Tashi township, Xiaomei town, and Zhuyang township (Figure 1); the remaining 11 basic forestry management departments were used for predicting (Figure 1). We then designed three schemes: Scheme 1 involved modeling and predicting for all mixed dominant tree species; Scheme 2 involved modeling and predicting for the four groups of dominant tree species, i.e., Chinese fir, Masson pine, Taiwan pine, and hard broadleaves; and Scheme 3 involved modeling and predicting for the four groups of dominant tree species based on the evaluation factors fitted by the least-squares equation.

3.2. Data Integration

For modeling and prediction, we integrated all the required forest resource survey data into the same relational database. The soil depth, A-layer depth of soil, tree age, canopy density, and forest growing stock data were obtained from the inventory for forest management planning and design. The elevation, slope, aspect, surface curvature, solar radiation index, and topographic humidity index data were first obtained from DEM. In addition, the normalized difference vegetation index (NDVI) and the characteristic values of the bands (Bands 1 to 5 and 7) were initially obtained from Landsat TM remote sensing images. The average unit volume (m3/ha) of the forest resources was the only predicted parameter.

3.3. Evaluation Factors Fitting

In Scheme 3, we pre-fitted the evaluation factors and the average volume per unit by the least-squares equation. The three evaluation factors—soil depth, A-layer depth of soil, and canopy density—did not need to be fit in advance because their increase in values usually indicates the increase in forest growing stock.
The detailed operating procedure was as follows:
Step 1: The normalized values for each evaluation factor of each sample were calculated with the following formula:
y i , j = x i , j min ( x i ) max ( x i ) min ( x i )
where xi,j is the original value of the ith evaluation factor of the jth sample, max(xi) is the maximum of all values for the ith evaluation factor, min(xi) is the minimum of all values for the ith evaluation factor, and yi,j is the normalized value of xi,j.
Step 2: The average volume per unit (zi,j) was grouped by rounding the values of yi,j × 100, except for the three evaluation factors of soil depth, A-layer depth of soil, and canopy density. Then, their least-squares fitting equations between zi,j and rounded yi,j × 100 were established (Table 3).
Step 3: In Table 3, R denotes the correlation coefficient between each evaluation and the average volume per unit, and p denotes the significance of the least square fitting regression. There were some evaluation factors with lower values of correlation coefficient (R) and higher values of significance (p) in Table 3. Usually, P has three values, 0.001, 0.01, and 0.05. By letting p equal to 0.05 in this paper, the evaluation factor would be retained if its p ≤ 0.05. Therefore, for the dominant tree species Chinese fir, the remained factors were tree ages, slope, aspect, and band 1; for Masson pine, the remained factors were tree ages, slope, elevation, solar radiation index, NDVI, and band 1; for Taiwan pine, the remained factors were tree ages, curvature, and band 7; and for hard broadleaves, the remained factors were tree ages, slope, band 2, and band 3. Including the 3 retained-factors mentioned above, soil depth, A-layer depth of soil, and canopy density, there were eight evaluation factors remained in modelling and prediction for Chinese fir, ten evaluation factors remained for Masson pine, seven evaluation factors remained for Taiwan pine, and eight evaluation factors remained for hard broadleaves. To all remained evaluation factors, the fitted values of xi,j were recalculated by the fitting equations (Table 3) and were renormalized by Equation (1).

3.4. Improved BP Neural Network Model Based on LM Algorithm

Although there are many feasible network models, the backpropagation neural network (BPNN) has outstanding assortment ability and pattern identification in practical applications [33]. The BPNN model minimizes the mean square error between the predicted values and the expected values by modifying the connection weight of network [34]. Usually, a BPNN comprises three types of successive layers: input layer, hidden layer, and output layer. Its operating processes can be described in two steps. First, the input signal is forward-propagated from the input layer, via the hide layer, to the output layer. During this process, the weight value and offset value of the network are maintained constant, and the status of each layer of neurons will only affect the next layer of neurons. The difference between the predicted output and expected output of the network is defined as the error signal. Second, the error signal is backpropagated from the output layer to the input layer in a layer-by-layer manner. During this process, the weight value of network is regulated by the error feedback to make the predicted output of network more closer to the expected one [34].
The LM algorithm is essentially an evolution of the gradient descent algorithm and Newton algorithm, which can considerably speed up the convergence rate by reducing the iteration process and then produce more accurate data. Compared with the disadvantages of traditional BPNNs, such as slow convergence speed and local minimum problems, the convergence rate of the LM algorithm is the fastest of all traditional or improved networks. The improved BPNNs with the LM algorithm have been shown to achieve excellent results, especially when applied to evaluation and prediction [3,35,36,37,38,39].
Therefore, taking the comprehensive evaluation factors (Table 1) as input variables and the forest growing stock as the output variable, the LM-BP was selected as the modeling and prediction model in this paper. As mentioned above, the values of comprehensive evaluation factors were normalized to inputs by Equation (1), both in Scheme 1 and Scheme 2. In Scheme 3, the inputs were preprocessed by three steps. First, the values of input variables were pre-fitted with the evaluation factors and the average volume per unit from the least-squares method, except for the three evaluation factors of soil depth, A-layer depth of soil, and canopy density. Second, the evaluation factor would be retained if its significance p ≤ 0.05 when fitted by the least square method. Last, the inputs were normalized by Equation (1).
The LM-BP computer program was coded by MATLAB R2011b, and its main steps were as follows:
Step 1: The parameters were set as follows:
  • Max_Epochs = 1000
  • Input_Num = 17
  • Output_Num = 1
  • Hidden_Neuron_Num = 2 × Input_Neuron_Num + Output_Neuron_Num
  • TransferFcn = {‘tansig’ ‘purelin’}
  • TrainFcn = ‘trainlm’
  • PerformFcn = ‘mse’
where Max_Epochs is the maximum of training epochs, Hidden_Neuron_Num is the number of nodes in the hidden layer, Input_Neuron_Num is the number of nodes in the input layer, Output_Neuron_Num is number of nodes in the output layer, tansig is the transfer function that transfers values to the hidden layer from the input layer, purelin is the transfer function that transfers values to the output layer from the hidden layer, trainlm is the training function corresponding to the LM algorithm, and mse (mean square error) is the performing function.
Step two: Net creation.
  • Net = newff (I, O, Hidden_Neuron_Num)
where I is the input matrix and O is the output vector.
Step three: Net training.
  • [Net TR] = train (Net, I, O)
  • Step 4: Prediction.
  • y = sim (Net, I_test)
where Net is the trained net, I_test is the input matrix of predicting samples, and y is the prediction result.

3.5. Model Performance Metrics

The following equations were used to calculate the accuracy of each model. By letting ti represent the measured values, yi represent the predicted values at sample i, t denote the observed mean, and y denote the predicted mean, the first statistical measurement, i.e., the group absolute percentage error (GAPE), was defined with Equation (2):
GAPE = | i = 1 n ( t i y i ) i = 1 n t i | × 100 %
where n is the number of samples in the test dataset.
The second criterion is the mean absolute error (MAE) as defined in Equation (3):
MAE = 1 n | i = 1 n ( t i y i ) |
Then, the mean absolute percentage error (MAPE) is defined by Equation (4):
MAPE = 1 n i = 1 n | t i y i | | t i | × 100 %
The root mean squared error (RMSE) is one of the most frequently used factors for assessing the capability of ANNs, which can be obtained with Equation (5):
RMSE = 1 n i = 1 n ( t i y i ) 2
The factor of agreement (IA) is a nondimensional factor of calculation that can be obtained with Equation (6):
IA = i = 1 n ( t i y i ) 2 i = 1 n ( | t i t ¯ | + | y i y ¯ | ) 2
The IA distinguishes between the expected and measured values, and it is restricted from 0 to 1. High value of IA means good consistency between values.
The correlation coefficient R2 is defined with Equation (7):
R 2 = [ i = 1 n ( t i t ¯ ) ( y i y ¯ ) ] 2 i = 1 n ( t i t ¯ ) 2 × i = 1 n ( y i y ¯ ) 2

4. Results

4.1. Modeling

During the modeling period, the optimal model was chosen from 10 experiments performed for each scheme. The results are presented in Table 4.
In the three modeling schemes, the GAPE values were much less than 10%, meaning the accuracies of the group estimation were above 90%. The performance metrics MAE, RMSE, IA, and R2 in Scheme 1 were all significantly inferior to those in Schemes 2 and 3, and the MAPE in Scheme 1 was generally inferior to that in Schemes 2 and 3. Through a pooled analysis of Scheme 1 to Schemes 2 and 3, the R2 were significantly improved from 0.6388 to 0.7427 and 0.7272, respectively, and MAPE decreased from 41.0680% to 32.0223% and 33.6355%, respectively. Schemes 2 and 3 were much better than Scheme 1. Except GAPE, other performance metrics of Scheme 3 were slightly worse than those of Scheme 2 in the modelling results (Table 5).

4.2. Predicting

The prediction results are presented in Table 6 and Figure 2. Table 6 shows the sample size and performance metrics of each scheme. Figure 2 shows the scatter plots between predicted values and measured values of forest growing stock.
During prediction, except the scheme for Taiwan pine as the dominant tree, the GAPE values of the other schemes were less than 10%, which means that the accuracies of the group estimation were above 90%. In Scheme 2, the GAPE for predicting Taiwan pine was as much as 15.5888%, so its group estimation precision was 84.4112%. However, in Scheme 3, the GAPE for predicting Taiwan pine has been improved greatly that in Scheme 2.
The pooled analysis for Schemes 2 and 3, achieved by merging the prediction results of the four dominant tree species into a whole of prediction result, and the total performance metrics were recalculated and are shown in Table 5. From Scheme 1 to Schemes 2 and 3, the R2 were improved from 0.5279 to 0.6460 and 0.6823, respectively, and MAPE decreased from 50.2675% to 41.7724% and 37.5268%, respectively. The other performance metrics, including GAPE, MAE, RMSE, and IA, all improved to some extent from Scheme 1 to Schemes 2 and 3.
The GAPEs in all schemes were less than 5% (Table 5), which indicated that the total group estimation precisions were much greater than the highest standard of the total sampling precision of forest resource inventory in China (GB/T 26424-2010).

5. Discussion

The results of modeling indicated that the performance metrics for distinguishing dominant tree species were much better than that for determining mixed dominant tree species. However, except GAPE, other performance metrics of Scheme 3 were slightly worse than those of Scheme 2 in the modelling results. This probably occurred because we used the values fitted between the evaluation factors and the average volume per unit using the least-squares equations instead of the original values of the evaluation factors before modeling, and remove the non-significant evaluation factors when their P-values were greater than 0.05, meaning the fitted values were closer to their averages than their original values; meanwhile, some detail in the information could have also been lost.
The prediction results indicated that the performance metrics for distinguishing dominant tree species were also better than for determining mixed dominant tree species, which is similar to most research results [40,41]. This result was apparently different from the published forest growing stock simulation [8], which used the k-nearest-neighbor estimation method to estimate the forest growing stock based on Landsat TM images and stand-level field inventory data. The results of that study showed that the volumetric estimation errors of different tree species were significantly higher than those of the overall estimations.
Regardless of the modeling results, all the prediction performance metrics of Scheme 3 were better than those of Scheme 2, implying that the prediction results that were based on the least-squares equation fitting and significance p filtering for evaluation factors were more stable and reliable.
In Schemes 1 and 2, the predicted values were systematically lower than the measured values from the modeling results and the prediction results. This may be due to the modeling areas being chosen manually rather than randomly. However, in Scheme 3, for both modeling and prediction, the average predicted values were very close to the average measured values. This may be explained by the fitted factor values obtained using the least-squares equations being closer to their averages than their original values and removing the non-significant evaluation factors to reduce the random noise, meaning the predicted results would be closer to the expected values.
Despite high MAPE values, the group estimation accuracy from our study may be good enough for some purposes. For example, for forest management planning and design at a regional level, the estimated results should be a partial substitution for field investigation. An acceptable level of estimation error for total forest volume is about 20% when the inventory area is larger than 30 ha [8,42]. The total performance metrics indicate that all GAPE values were lower than 5%, meaning the overall accuracies were over 95%, which exceed the highest standard of total sampling precision (90%) of inventory for forest management planning and design in China. Especially in Scheme 3, both the modeling results and the predicting results had GAPE values lower than 2%, meaning the overall accuracies were over 98%. Therefore, the results should be used in forest management planning and design in situations where there is a lack of available measured data.
The prediction results could likely be further improved with further research. There are, however, two limitations to our study: overfitting and limited network layers. In our study, we found that higher modeling accuracy did not necessarily mean higher prediction accuracy, which might have been caused by overfitting in the modeling procedure. Due to the full connection network structure, the number of the layers was very limited, therefore, the prediction accuracy was also limited. The next study plan may depend on a new model, such as the deep-learning model.

6. Conclusions

Three conclusions were drawn from the data analyses in this study. First, the prediction results of all schemes showed that their average predicted values were close to the average measured values and all GAPE values were lower than 5% (Table 5), indicating that group estimation precision of all predicting schemes were much greater than 90%, which is the highest standard of the total sampling precision for volume of forest resource inventory in China. Second, the prediction results for distinguishing dominant tree species were better than for mixed dominant tree species. Lastly, the results showed that the performance metrics for prediction could be improved using least-squares equation fitting for the evaluation factors and by using the significance P filtering the evaluation factors.
We chose one part of an area in the research region as the modeling area and the other as the prediction area instead of randomly choosing modeling samples [3,43]. This procedure is more convenient in practice because the field measurements in the inventory for forest management planning and design are recorded in some contiguous areas while others are predicted. This research also reflects the characteristics of sufficient samples and reduced data costs during the prediction in which the canopy density is the only factor that should be remeasured by observation or a line-intercept method in the plot. Therefore, the costs of data acquisition are much lower than those of traditional models that are used to survey the diameter or height of each tree [3,44,45]. In addition, the pattern is more suitable for the prediction of volume for forest under the state of natural growth or with minimal human interference. This model offers the possibility to monitor forest biomass within a short period based on a conversion of volume to biomass.

Author Contributions

Conceptualization, D.W.; Formal analysis, D.W.; Resources, L.F., A.X., and X.L.; Writing—original draft, R.Z.

Funding

This work was financially supported by Zhejiang provincial key science and technology project (2018C02013), Zhejiang provincial major science and technology project (2011C12047), and Zhejiang A & F University’s professionals supporting project (2014FR078).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mohammadi, J.; Joibary, S.S.; Yaghmaee, F.; Mahiny, A.S. Modeling forest stand volume and tree density using Landsat ETM+ data. Int. J. Remote Sens. 2010, 31, 2959–2975. [Google Scholar] [CrossRef]
  2. Gianfranco, S.; Laura, M.; David, G. Development of a neural network model to update forest distribution data for managed alpine stands. Ecol. Model. 2007, 206, 331–346. [Google Scholar]
  3. Wu, D.S.; Ji, Y.Q. Dynamic estimation of forest volume based on multi-source data and neural network model. J. Agric. Sci. 2015, 7, 18–31. [Google Scholar] [CrossRef]
  4. Buckley, D.S.; Isebrands, J.; Sharik, T.L. Practical field methods of estimating canopy cover, PAR, and LAI in Michigan oak and pine stands. North. J. Appl. For. 1999, 16, 25–32. [Google Scholar]
  5. Chowdhury, T.A.; Thiel, C.; Schmullius, C. Growing stock volume estimation from L-band ALOS PALSAR polarimetric coherence in Siberian forest. Remote Sens. Environ. 2014, 155, 129–144. [Google Scholar] [CrossRef]
  6. Liu, A.X. Annual Monitoring Theory and Methods of Forest Resources in Zhejiang Province. Ph.D. Thesis, Nanjing Forestry University, Nan Jing, China, June 2006. [Google Scholar]
  7. Hudak, A.T.; Lefsky, M.A.; Cohen, W.B.; Berterretche, M. Integration of LIDAR and Landsat ETM+ data. Int. Arch. Photogramm. Remote Sens. 2001. [Google Scholar]
  8. Mäkelä, H.; Pekkarinen, A. Estimation of forest stand volumes by Landsat TM imagery and stand-level field-inventory data. For. Ecol. Manag. 2004, 196, 245–255. [Google Scholar] [CrossRef]
  9. Shataeea, S.; Weinaker, H.; Babanejad, M. Plot-level forest volume estimation using Airborne Laser Scanner and TM data, comparison of boosting and random forest tree regression algorithms. Procedia Environ. Sci. 2011, 7, 68–73. [Google Scholar] [CrossRef]
  10. Robinson, A.P.; Monserud, R.A. Criteria for comparing the adaptability of forest growth models. For. Ecol. Manag. 2003, 172, 53–67. [Google Scholar] [CrossRef]
  11. Mohammadi, Z.; Limaei, S.M.; Lohmander, P.; Olsson, L. Estimation of a basal area growth model for individual trees in uneven-aged Caspian mixed species forests. J. For. Res. 2018, 29, 1205–1214. [Google Scholar] [CrossRef]
  12. Lamb, S.M.; Maclean, D.A.; Hennigar, C.R.; Pitt, D.G. Forecasting forest inventory using imputed tree lists for LiDAR grid cells and a tree-list growth model. Forests 2018, 9, 167. [Google Scholar] [CrossRef]
  13. Haykin, S. Neural Networks and Learning Machines, 3rd ed.; Pearson: Upper Saddle River, NJ, USA, 2008. [Google Scholar]
  14. Castaño-Santamaría, J.; Crecente-Campo, F.; Fernández-Martínez, J.L.; Barrio-Anta, M.; Obeso, J.R. Tree height prediction approaches for uneven-aged beech forests in northwestern Spain. For. Ecol. Manag. 2013, 307, 63–73. [Google Scholar] [CrossRef]
  15. Wang, S.H.; Zhang, M.Z.; Zhao, P.A.; Chen, J.X. Modelling the spatial distribution of forest carbon stocks with artificial neural network based on TM images and forest inventory data. Acta Ecol. Sin. 2011, 31, 998–1008. [Google Scholar]
  16. Shataee, S. Non-Parametric Forest Attributes Estimation Using Lidar and TM Data. In Proceedings of the 32nd Asian Conference on Remote Sensing, Taipei, China, 3–7 Octorber 2011; pp. 887–893. [Google Scholar]
  17. Zhang, D.F. The Design of Neural Network Application based on MATLAB; Machinery Industry Press: Beijing, China, 2009. [Google Scholar]
  18. Ma, L.; Xu, F.; Wang, X.; Tang, L. Earthquake Prediction Based on Levenberg-Marquardt Algorithm Constrained Back-Propagation Neural Network Using DEMETER Data. In Proceedings of the 4th International Conference on Knowledge Science, Engineering and Management, Belfast, Northern Ireland, UK, 1–3 September 2010; pp. 591–596. [Google Scholar]
  19. Tomppo, E.; Gschwantner, T.; Lawrence, M.; McRoberts, R.E. National Forest Inventories: Pathways for Common Reporting; Springer: Berlin, Germany, 2010; p. 612. [Google Scholar]
  20. Debeljaka, M.; Poljanecbk, A.; Ženko, B. Modelling forest growing stock from inventory data: A data mining approach. Ecol. Indic. 2014, 41, 30–39. [Google Scholar] [CrossRef]
  21. Brown, S.L.; Schroder, P.; Kern, J.S. Spatial distribution of biomass in forests of the eastern USA. For. Ecol. Manag. 1999, 123, 81–90. [Google Scholar] [CrossRef]
  22. Karjalainen, T.; Nabuurs, G.J.; Eggers, T.; Lapveteläinen, T.; Kaipainen, T. Sce-nario analysis of the impacts of forest management and climate change on the European forest sector carbon budget. For. Policy Econ. 2003, 5, 141–155. [Google Scholar] [CrossRef]
  23. Franco-Lopeza, H.; Ek, A.R.; Bauer, M.E. Estimation and mapping of forest stand density, volume, and cover type using the k-nearest neighbors method. Remote Sens. Environ. 2001, 77, 251–274. [Google Scholar] [CrossRef] [Green Version]
  24. Mohammadi, J.; Shataee, S.; Babanezhad, M. Estimation of forest stand volume, tree density and biodiversity using Landsat ETM+ Data, comparison of linear and regression tree analyses. Procedia Environ. Sci. 2011, 7, 299–304. [Google Scholar] [CrossRef]
  25. McRoberts, R.E.; Gobakken, T.; Næsset, E. Post-stratified estimation of forest area and growing stock volume using lidar-based stratifications. Remote Sens. Environ. 2012, 125, 157–166. [Google Scholar] [CrossRef]
  26. McRoberts, R.E.; Næsset, E.; Gobakken, T. Inference for Lidar-assisted estimation of forest growing stock volume. Remote Sens. Environ. 2013, 128, 268–275. [Google Scholar] [CrossRef]
  27. Hong, R.F. Research on the Reform of Foresty Property Rights System in Longquan City. Master’s Thesis, Zhejiang A & F University, Lin’an, China, May 2012. [Google Scholar]
  28. Zhao, Z. Afforestation Planning and Design Tutorial; China Forestry Press: Beijing, China, 2007. [Google Scholar]
  29. Deng, L.B.; Li, J.P. Variable density volume prediction model for Chinese fir based on artificial neural network. J. Northwest For. Coll. 2002, 17, 87–89. [Google Scholar]
  30. Xie, H.Q. Establishment on growth model of Chinese fir and application of multilinear regression. J. Fujian For. Sci. Technol. 2004, 1, 34–37. [Google Scholar]
  31. Hong, W.; Wu, C.Z.; He, D.J. A study on the model of forest resources management based on the artificial neural network. J. Nat. Res. 1998, 13, 69–72. [Google Scholar]
  32. Xu, W.M.; Chen, Y.F.; Lin, G.F.; Chen, M.H. Dynamic visualization of Chinese fir volume driven by site condition and growth model. J. Fujian Coll. For. 2011, 31, 151–155. [Google Scholar]
  33. Lek, S.; Guégan, J.F. Artificial neural networks as a tool in ecological modelling: An introduction. Ecol. Model. 1999, 120, 65–73. [Google Scholar] [CrossRef]
  34. Li, J.; Cheng, J.H.; Shi, J.Y. Brief introduction of back propagation (BP) neural network algorithm and its improvement. In Advances in Computer Science and Information Engineering; Springer: Berlin/Heidelberg, Germany, 2012; pp. 553–558. [Google Scholar]
  35. Zheng, C.L.; Jiang, H.Y. K/S value prediction of reactive dyes with improved LM-BP algorithm. J. Text. Res. 2010, 31, 82–85. [Google Scholar]
  36. Wang, Z.P. Application of LM-BP neural network in lake trophic evaluation. Environ. Sci. Surv. 2013, 32, 98–101. [Google Scholar]
  37. Hua, Z.; Qian, W.; Gu, L. Application of improved LM-BP neural network in water quality evaluation. Water Resour. Prot. 2008, 24, 22–25. [Google Scholar]
  38. Jian, X.C.; Wang, L.W.; Min, F. BP neural network based on LM algorithm for the forecasting of vehicle emission. J. Chongqing Univ. Technol. 2012, 26, 11–16. [Google Scholar]
  39. Miao, X.Y.; Chu, J.K.; Du, X.W. Application of LM-BP neural network in predicting dam deformation. Comput. Eng. Appl. 2011, 47, 220–222. [Google Scholar]
  40. Gherardo, C.; Anna, B.; Piermaria, C.; Marco, M.; Davide, T.; Fabio, M.; Roberta, B. Non-parametric and parametric methods using satellite images for estimating growing stock volume in Alpine and Mediterranean forest ecosystems. Remote Sens. Environ. 2008, 112, 2686–2700. [Google Scholar]
  41. Chirici, G.; Corona, P.; Marchetti, M.; Maselli, F.; Bottai, L. Spatial distribution modelling of forest attributes coupling remotely sensed imagery and GIS techniques. In Modelling Forest Systems; Amaro, A., Reed, D., Soares, P., Eds.; CABI Publishing: New York, NY, USA, 2003; pp. 41–50. [Google Scholar]
  42. Tokola, T.; Heikkila, J. Improving satellite based forest inventory by using a priori site quality information. Silv. Fenn. 1997, 1, 67–78. [Google Scholar] [CrossRef]
  43. Wu, D.S. Estimation of forest volume based on LM-BP neural network model. Comput. Model. New Technol. 2014, 18, 131–137. [Google Scholar]
  44. Wei, X.H.; Sun, Y.J.; Ma, W. A height growth model for Cunninghamia lanceolata based on Richards’ equation. J. Zhejiang A & F Univ. 2012, 29, 661–666. [Google Scholar]
  45. Lin, C.L.; Hong, W.; Wu, C.Z.; He, D.J.; Lan, B. A study on the model of individual accretion of pinusmassoniana forest. J. Fujian Coll. For. 2000, 20, 227–230. [Google Scholar]
Figure 1. Administrative map of the study area.
Figure 1. Administrative map of the study area.
Forests 09 00757 g001
Figure 2. Scatter plots between predicted values and measured values for Schemes 1, 2 and 3. Subgraph (a) represents Scheme 1 with mixed dominant tree species; subgraphs (be) represent Scheme 2 for Chinese fir, Masson pine, Taiwan pine, and hard broadleaves, respectively; and subgraphs (fi) represent Scheme 3 for Chinese fir, Masson pine, Taiwan pine, and hard broadleaves, respectively.
Figure 2. Scatter plots between predicted values and measured values for Schemes 1, 2 and 3. Subgraph (a) represents Scheme 1 with mixed dominant tree species; subgraphs (be) represent Scheme 2 for Chinese fir, Masson pine, Taiwan pine, and hard broadleaves, respectively; and subgraphs (fi) represent Scheme 3 for Chinese fir, Masson pine, Taiwan pine, and hard broadleaves, respectively.
Forests 09 00757 g002
Table 1. Comprehensive evaluation factors for forest growth.
Table 1. Comprehensive evaluation factors for forest growth.
NumberEvaluation FactorNumberEvaluation Factor
1Tree Age2Slope
3Canopy Density4Soil Depth
5A-layer Depth of Soil6Aspect
7Elevation8Curvature
9Solar Radiation Index10Topographic Humidity Index
11NDVI12Band 1
13Band 214Band 3
15Band 416Band 5
17Band 7
Note: NDVI is normalized differential vegetation index; bands 1 through 5, and band 7, are the bands of the images from Landsat Thematic Mapper.
Table 2. Groups divided by the dominant tree species and the size of subplots.
Table 2. Groups divided by the dominant tree species and the size of subplots.
Group NumberDominant Tree SpeciesNumber of SubplotTotal Number of SubplotProportion (%)
Group 1Chinese fir20,29638,89852.18%
Group 2Masson pine598915.40%
Group 3Taiwan pine24186.22%
Group 4Hard broadleaves958224.63%
The other dominant tree species6131.58%
Table 3. The least-squares fitting equations for the evaluation factors for Scheme 3.
Table 3. The least-squares fitting equations for the evaluation factors for Scheme 3.
NumberDominant Tree SpeciesEvaluation FactorNumber of GroupLeast-Squares Fitting EquationCorrelation Coefficient (R)Significance (p)
1Chinese firTree Age44−0.0427890198 × x2 + 6.4914989075 × x + −10.89990694290.87540.0000
2Slope86−0.0115959263 × x2 + 0.6038878174 × x + 82.1824866639−0.53360.0000
3Aspect99−0.0033003048 × x2 + 0.4839666164 × x + 72.05591528220.32460.0010
4Elevation850.0023254337 × x2 + −0.2572556056 × x + 89.0182097406−0.09450.3896
5Curvature640.0065366953 × x2 + −0.5371817451 × x + 90.5614723394−0.10030.4305
6Solar Radiation Index82−0.0026200461 × x2 + 0.3467036840 × x + 73.39217067990.09180.4120
7Topographic Humidity Index910.0043653961 × x2 + −0.2512069504 × x + 88.21550511430.16320.1221
8NDVI980.0022565071 × x2 + −0.1541807892 × x + 86.16255874140.15090.1380
9Band 135−0.0118922876 × x2 + 1.5080606424 × x + 43.21615102340.46370.0050
10Band 222−0.0127550587 × x2 + 1.4717083307 × x + 48.62422245270.33890.1228
11Band 332−0.0090806959 × x2 + 0.7815481904 × x + 70.11962858830.00420.9818
12Band 4530.0020084727 × x2 + −0.0805199224 × x + 84.56225908440.21990.1136
13Band 580−0.0042838563 × x2 + 0.4542927464 × x + 74.81580270090.08840.4355
14Band 7410.0004892917 × x2 + −0.0031706472 × x + 84.18872493100.09400.5586
15Masson pineTree Age380.0331501479 × x2 + 2.8278931378 × x + 15.23281734650.94820.0000
16Slope78−0.0148356773 × x2 + 0.9016394596 × x + 67.9666405506−0.33310.0029
17Aspect94−0.0032729759 × x2 + 0.2348500208 × x + 75.6303889833−0.16680.1081
18Elevation70−0.0206467958 × x2 + 1.1473753074 × x + 65.8295224974−0.45600.0001
19Curvature47−0.0065277438 × x2 + 0.3054885854 × x + 74.8842350529−0.02340.8758
20Solar Radiation Index67−0.0116390188 × x2 + 1.7344282575 × x + 16.78083521900.50740.0000
21Topographic Humidity Index790.0068459173 × x2 + −0.6174814023 × x + 82.08516848750.00860.9402
22NDVI900.0040823038 × x2 + −0.1607289597 × x + 72.48709531810.24130.0220
23Band 1300.0215449663 × x2 + −1.4478075547 × x + 83.42983329360.49540.0054
24Band 219−0.0107774671 × x2 + 1.1612851686 × x + 43.96768593780.10360.6731
25Band 328−0.0056810995 × x2 + 0.6748729435 × x + 57.98925946910.15940.4178
26Band 4490.0051865652 × x2 + −0.4677663146 × x + 83.95013636690.01030.9442
27Band 5720.0010562299 × x2 + −0.2748031246 × x + 86.6400382977−0.15670.1888
28Band 7390.0121557341 × x2 + −0.9500200238 × x + 83.62783166880.14220.3880
29Taiwan pineTree Age460.0108691950 × x2 + 2.8041406197 × x + 0.75483794200.91530.0000
30Slope710.0007210346 × x2 + −0.1981166266 × x+80.5153065313−0.09850.4140
31Aspect92−0.0033269117 × x2 + 0.4194282364 × x + 62.05072449670.13300.2064
32Elevation67−0.0170165295 × x2 + 2.0442044776 × x + 13.84620320020.18010.1447
33Curvature52−0.0089471129 × x2 + 0.1602318306 × x + 74.2890687778−0.28530.0404
34Solar Radiation Index770.0024392679 × x2 + −0.1965197674 × x + 72.50257123580.10110.3816
35Topographic Humidity Index33−0.0162849784 × x2 + 1.1347361067 × x + 56.47564799990.05680.7537
36NDVI85−0.0018733247 × x2 + 0.3472925064 × x + 60.48534642350.20630.0581
37Band 1150.0584268211 × x2 + −2.8925728373 × x + 102.42136450200.13720.6259
38Band 2110.0462181612 × x2 + −2.0249896794 × x + 87.14078027970.43390.1824
39Band 3170.0520556446 × x2 + −1.8273924977 × x + 82.65890156040.34480.1753
40Band 445−0.0026199728 × x2 + 0.2061151757 × x + 69.9711400898−0.01550.9195
41Band 5630.0012177555 × x2 + −0.1554842686 × x + 74.2397400623−0.08150.5255
42Band 7320.0142484577 × x2 + −0.6341009191 × x + 75.55281815150.36880.0378
43Hard broadleavesTree Age620.0163003073 × x2 + 1.1534510399 × x + 17.15754987420.91630.0000
44Slope850.0036011768 × x2 + −0.6254997771 × x + 74.8775043187−0.34500.0012
45Aspect960.0039735998 × x2 + −0.3577728963 × x + 58.25015797020.09600.3520
46Elevation830.0180034120 × x2 + −1.3930520666 × x + 71.49958880640.18530.0935
47Curvature600.0286950221 × x2 + −1.5689644141 × x + 64.87354224190.16060.2203
48Solar Radiation Index870.0040370576 × x2 + −0.2975027144 × x + 55.52628528860.18290.0900
49Topographic Humidity Index620.0277373791 × x2 + −1.8961270968 × x + 72.81653606500.24340.0566
50NDVI93−0.0027252454 × x2 + 0.2307572964 × x + 48.9924512551−0.04680.6558
51Band 1290.0626190912 × x2 + −4.5932489190 × x + 123.02935453830.34390.0678
52Band 2180.0292226631 × x2 + −2.1099893402 × x + 81.37535296580.54530.0192
53Band 3280.0226705539 × x2 + −1.2930119073 × x + 66.08575184100.48850.0084
54Band 4520.0029985089 × x2 + −0.4153878341 × x + 62.4330118645−0.21180.1317
55Band 5690.0133684136 × x2 + −0.9407788330 × x + 62.49654265730.18850.1210
56Band 7330.0297143535 × x2 + −1.7693736017 × x + 67.62235612100.32810.0623
Note: NDVI is normalized differential vegetation index; bands 1 through 5, and band 7, are the bands of the images from Landsat Thematic Mapper.
Table 4. Detailed performance metrics of the modeling results for each scheme.
Table 4. Detailed performance metrics of the modeling results for each scheme.
SchemeDominant Tree SpeciesNumbers of SampleAverage Measured Value (m3)Average Predicted Value (m3)GAPE (%)MAPE (%)MAE (m3)RMSE (m3)IAR2
Scheme 1Mixed1611276.837575.20402.125141.068020.449528.1190.87730.6388
Scheme 2Chinese fir952985.035083.41201.909729.476118.045024.65250.91170.7212
Masson pine187279.152076.23003.691630.150018.114023.94150.90800.7107
Taiwan pine145770.551066.82355.282225.096114.926520.41950.92760.7605
Hard broadleaves301454.538554.33900.367344.583614.929521.54900.91810.7372
Scheme 3Chinese fir952985.035486.09791.249531.339918.305825.03760.90870.7116
Masson pine187279.152479.96501.026731.483718.471124.41940.90410.6949
Taiwan pine145770.550570.40160.211027.209315.422121.43460.91530.7261
Hard broadleaves301454.539154.27950.476045.336215.678023.05080.90490.6991
Note: GAPE is the group absolute percentage error defined by Equation (2); MAPE is the mean absolute percentage error defined by Equation (4); MAE is the mean absolute error defined by Equation (3); RMSE is the root mean squared error defined by Equation (5); IA is the factor of agreement defined by Equation (6); R2 is the correlation coefficient defined by Equation (7).
Table 5. The total performance metrics for each scheme for modeling and predicting.
Table 5. The total performance metrics for each scheme for modeling and predicting.
Modeling or PredictingScheme NameNumber of SampleAverage Measured Value (m3)Average Predicted Value (m3)GAPE (%)MAPE (%)MAE (m3)RMSE (m3)IAR2
ModelingScheme 116,11276.837575.20402.125141.068020.449528.11900.87730.6388
Scheme 215,87277.221575.52052.201132.022317.175023.64150.92020.7427
Scheme 315,87277.221577.89160.868733.635517.561624.28500.91570.7272
PredictingScheme 122,78672.801069.36904.714950.267524.150033.65550.82960.5279
Scheme 222,41373.213571.29502.621741.772420.919028.99500.88830.6460
Scheme 322,41373.213573.34150.173937.526819.568527.49080.90360.6823
Note: GAPE is the group absolute percentage error defined by Equation (2); MAPE is the mean absolute percentage error defined by Equation (4); MAE is the mean absolute error defined by Equation (3); RMSE is the root mean squared error defined by Equation (5); IA is the factor of agreement defined by Equation (6); R2 is the correlation coefficient defined by Equation (7).
Table 6. Detailed performance metrics of the prediction results for each scheme.
Table 6. Detailed performance metrics of the prediction results for each scheme.
SchemeDominant Tree SpeciesNumber of SampleAverage Measured Value (m3)Average Predicted Value (m3)GAPE (%)MAPE (%)MAE (m3)RMSE (m3)IAR2
Scheme 1Mixed22,78672.801069.36904.714950.267524.150033.65550.82960.5279
Scheme 2Chinese fir10,76783.025080.95202.497131.564120.827528.65150.88620.6455
Masson pine411787.351080.25908.119237.889024.633034.02600.86570.5979
Taiwan pine96182.464069.609015.588835.995225.537536.12900.81050.4979
Hard broadleaves656846.917050.09106.767561.786718.067524.62550.86040.5647
Scheme 3Chinese fir10,76783.025083.63620.736132.899720.366828.20050.88970.6544
Masson pine411787.351084.81612.901934.470322.560031.41440.89420.6523
Taiwan pine96182.464076.07337.750335.452922.644031.82990.83320.5601
Hard broadleaves656846.917048.87304.170447.331515.934622.54830.88930.6375
Note: GAPE is the group absolute percentage error defined by Equation (2); MAPE is the mean absolute percentage error defined by Equation (4); MAE is the mean absolute error defined by Equation (3); RMSE is the root mean squared error defined by Equation (5); IA is the factor of agreement defined by Equation (6); R2 is the correlation coefficient defined by Equation (7).

Share and Cite

MDPI and ACS Style

Zhou, R.; Wu, D.; Fang, L.; Xu, A.; Lou, X. A Levenberg–Marquardt Backpropagation Neural Network for Predicting Forest Growing Stock Based on the Least-Squares Equation Fitting Parameters. Forests 2018, 9, 757. https://doi.org/10.3390/f9120757

AMA Style

Zhou R, Wu D, Fang L, Xu A, Lou X. A Levenberg–Marquardt Backpropagation Neural Network for Predicting Forest Growing Stock Based on the Least-Squares Equation Fitting Parameters. Forests. 2018; 9(12):757. https://doi.org/10.3390/f9120757

Chicago/Turabian Style

Zhou, Ruyi, Dasheng Wu, Luming Fang, Aijun Xu, and Xiongwei Lou. 2018. "A Levenberg–Marquardt Backpropagation Neural Network for Predicting Forest Growing Stock Based on the Least-Squares Equation Fitting Parameters" Forests 9, no. 12: 757. https://doi.org/10.3390/f9120757

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop