**Evaluation of Di**ff**erent Algorithms for Estimating the Growing Stock Volume of** *Pinus massoniana* **Plantations Using Spectral and Spatial Information from a SPOT6 Image**

**Jingjing Zhou 1,2, Zhixiang Zhou 1,2, Qingxia Zhao 3, Zemin Han 1,2, Pengcheng Wang 1,2, Jie Xu <sup>4</sup> and Yuanyong Dian 1,2,\***


Received: 28 March 2020; Accepted: 9 May 2020; Published: 12 May 2020

**Abstract:** Precise growing stock volume (GSV) estimation is essential for monitoring forest carbon dynamics, determining forest productivity, assessing ecosystem forest services, and evaluating forest quality. We evaluated four machine learning methods: classification and regression trees (CART), support vector machines (SVM), artificial neural networks (ANN), and random forests (RF), for their reliability in the estimation of the GSV of *Pinus massoniana* plantations in China's northern subtropical regions, using remote sensing data. For all four methods, models were generated using data derived from a SPOT6 image, namely the spectral vegetation indices (SVIs), texture parameters, or both. In addition, the effects of varying the size of the moving window on estimation precision were investigated. RF almost always yielded the greatest precision independently of the choice of input. ANN had the best performance when SVIs were used alone to estimate GSV. When using texture indices alone with window sizes of 3 × 5 × 5 or 9 × 9, RF achieved the best results. For CART, SVM, and RF, *R*<sup>2</sup> decreased as the moving window size increased: the highest *R*<sup>2</sup> values were achieved with 3 × 3 or 5 × 5 windows. When using textural parameters together with SVIs as the model input, RF achieved the highest precision, followed by SVM and CART. Models using both SVI and textural parameters as inputs had better estimating precision than those using spectral data alone but did not appreciably outperform those using textural parameters alone.

**Keywords:** machine learning algorithms; forest growing stock volume; SPOT6 imagery; *Pinus massoniana* plantations

#### **1. Introduction**

Forest growing stock volume (GSV) is one of the most important forest characteristics, both economically and environmentally, because it is a key determinant of forest productivity [1]. Precisely estimating forest GSVs on large scales is crucial for monitoring forest carbon dynamics, assessing forest ecosystem services, and evaluating forest quality [2–5]. Traditional ground-based GSV estimation strategies that rely on field measurements of tree height and diameters at breast height (DBH) are time-consuming and labor-intensive [1]. GSV is estimated over large areas almost everywhere in the world, and estimates are often precise. The (additional) benefits of including remote

sensing (RS) data in this process are that spatially explicit information can be produced (i.e., maps) and that the precision of population parameter estimates can be improved.

Several modelling approaches, both parametric and non-parametric, have proven capable of precisely estimating forest variables such as the leaf area index, canopy cover, height, basal area, and stock volume based on RS data [4,6–10]. The most popular parametric regression methods for modelling forest attributes are simple or multiple linear regression models. Commonly used non-parametric models include classification and regression trees (CART), k-nearest neighbor (k-NN), random forest (RF), and support vector machine (SVM). Although many researchers have performed comparative evaluations of different machine learning algorithms (MLAs), no single technique has been revealed as universally superior for predicting forest inventory attributes [8,11,12]. Different machine learning algorithms (MLAs) have different estimating precisions; for example, CART achieves the lowest precision, SVM and ANN tend to achieve moderate performance, and RF has achieved the highest precision and lowest error for the forest variables under consideration [4]. Zhang et al. [12] found that ANN and SVM were similarly effective at estimating sawgrass aboveground biomass (AGB), with correlation coefficients (r values) exceeding 0.9. However, ANN offered the most precise total biomass estimates (r = 0.94). Wang et al. [13] found that the MLR method generally outperformed SVM and RF at predicting site-level AGB. The GSV of *Pinus massoniana* plantations has not been estimated extensively in China's northern subtropical regions. Therefore, there is a need to evaluate and compare the performance of different MLAs in this specific task.

The spectral information in optical remote sensing images generally provides insufficient information to precisely assess the state of vegetation in dense forests with complex structures. Therefore, texture parameters derived from remote sensing images are commonly analyzed to assess the spatial distribution of image tone variance and thereby acquire spatial information on vegetation cover, especially in complex and dense forests [14]. Several recent studies have concluded that textural parameters from high-resolution satellite images are more useful than spectral information for estimating forest variables because the interpretation of spectral data is complicated by saturation when studying multi-layer forests with full canopy cover [6,12,14–19]. However, it is not yet clear whether combining spectral and texture-based features can enable more precise estimation of forest variables than analyses based on textural information alone. Chrysafis et al. [5] argued that combining spatial and textural information yielded at most marginal improvements in precision over texture-only models when estimating GSV in a Mediterranean forest. However, others have reported that forest variables such as the leaf area index (LAI) can be estimated more precisely by combining spectral and textural features than by relying solely on one type or the other [6,15,17]. Texture is a complex parameter and obtaining precise texture information over the entire forest area is challenging. This is because texture values are highly sensitive to texture measures, landscape type, physiological growth period, associated parameters (i.e., moving window size, direction), and the type of remote sensing image [5,20]. It is still worthwhile to explore whether a large or small moving window size should be used in various forest types, and with an inversion algorithm.

*Pinus massoniana* is a major coniferous tree that is widely distributed in the subtropical forests of South China and exhibits relatively high tolerance to acid rain, drought, and phosphorus deficiency [21,22]. It is central to forest ecosystems and is also an economically important source of timber and wood pulp [23]. Few efforts have been made to evaluate the capability of modern remote sensing techniques for mapping *Pinus massonian* GSV in the subtropical area of China [24,25]. A growing stock volume map of *Pinus massoniana* trees in in the subtropical area of China has never been generated and published. The specific objectives of this study were therefore to: (1) examine the ability of four MLAs (CART, SVM, ANN, and RF) to predict the GSV of *Pinus massoniana* plantations, (2) compare the effectiveness of spectral vegetation indices (SVIs) and texture data derived from SPOT6 images for forest GSV quantification, and (3) assess the influence of texture index window size on the precision of forest GSV estimation.

#### **2. Materials and Methods**

#### *2.1. Study Area*

This study focused on an approximately 7600 ha region of *Pinus massoniana* plantations near Taizi mountain in Jingshan County of China's Hubei Province (30◦48 –31◦02 N and 112◦48 –113◦03 E) (Figure 1). Topographically, the area is characterized by a hilly landscape with moderate slope and its elevation is between 40.3 and 467.4 m above sea level. It has a subtropical monsoon climate with a mean temperature of 16.4 ◦C. The average annual precipitation is 1094.6 mm, of which over 53% falls between April and August. The dominant soil type at the site is yellow-brown soil. Yellow-brown soil is a transitional soil between yellow, red, and brown soils, which are common in mixed subtropical evergreen broad-leaved and deciduous broad-leaved forests widely. The landscape of the study area is characterized by pure forests, mixed forests, shrub land, water, cultivations, and forest openings. The percentage of forest cover is 85%. The main tree species in the area are *Pinus massoniana* Lamb. and *Quercus acutissima* Carr. Less abundant species include *Cunninghamia lanceolata (Lamb.)* Hook., *Ilex chinensis* Sims, *Platycarya strobilacea* Sieb.et Zucc., *Liquidambar formosana* Hance, *Dalbergia hupeana* Hance, *Pistacia chinensis* Bunge., and *Celtis sinensis* Pers. [26].

**Figure 1.** A portion of the study area and the location of the sample plots over a SPOT6 Multispectral Image (MSI) image acquired on 6 August 2015.

#### *2.2. Data*

#### 2.2.1. Field Data and Stand Volume Estimation

The field sample plots were established in August 2015, August 2018, and November 2019. In total, 68 square plots of the size 20 × 20 m each were established in pure *Pinus massoniana* plantations by applying the method of combining stratified sampling and random sampling. Considering the standards for division of *Pinus massoniana* plantation age, there were 17 plots of young forest, 11 of middle-age forest, 15 of near-mature forest, 11 of mature forest, and 14 of over-mature forest. Sample plots in every group of forest ages were distributed randomly in terms of geography. In each plot, trees with DBH over 5 cm were tallied. DBH, tree height (H), and crown diameter (CD) of the individual tree in the plot were all measured. Trees with a DBH below 5 cm were excluded. The LAI-2200 instrument (LI-COR Inc., Li-Cor, Lincoln, NE, USA, 2010) was used to indirectly measure the leaf area index (LAI) of each plot. Measurements were taken either under overcast conditions, or alternatively within two hours after sunrise or before sunset. At each site, two above-canopy and nine low-canopy readings were taken with an opaque, 180◦, view-restricting cap placed over the sensor in order to mask out the operator [6]. After finding the total number of all trees in a plot, the stand density (Density) was estimated in trees per hectare (trees ha<sup>−</sup>1). The survey also provided information about aspect, slope, and slope position. A differential global positioning unit (Trimble GeoXH6000 GPS units) was used to locate the center and the four corners of each plot, allowing the plots to be geo-referenced against satellite data. The summarized characteristics of the 68 plots are documented in Table 1.

The volumes of each individual growing trees were predicted by applying the allometric model (Equation (1)) based on DBH and H. These models yielded the following expression [27]:

$$\text{V} = 0.00006228789 \times \text{DBH}^{1.849839} \times \text{H}^{0.9843411} \tag{1}$$

where, V is the individual tree volume, DBH is the tree diameter at a height of 1.3 m, and H is the total tree height. The volumes estimated from all individual trees within a sample plot were summed to obtain the GSV of the sample plot, which was then up-scaled to per hectare. The GSV of the plot was measured in cubic meter per hectare (m3 ha<sup>−</sup>1). The field data revealed that the plot-level GSVs varied widely, ranging from 10.20 to 319.82 m<sup>3</sup> ha<sup>−</sup>1, with a mean of 140.84 m3 ha<sup>−</sup>1. The inconsistency between the time of sample plot data collected in 2018 and 2019 and remote sensing (RS) image was an important factor which contributed to low precision of forest parameter estimation. We used the growth model of *Pinus massoniana* tree for conversing volume values of 2018 and 2019 to V values of 2015. The formula was as follows [28]:

$$\mathbf{V} = 0.000025058 \times \mathbf{A}^{2.5983243} \text{ (R} = 0.996) \tag{2}$$

where, V is volume of a single tree and A is the stand age.

**Table 1.** The summarized characteristics of the 68 plots surveyed in August 2015, August 2018, and November 2019.



**Table 1.** *Cont.*

#### 2.2.2. SPOT 6 Image and Processing

This study was based on a single SPOT 6 panchromatic-multispectral image that was acquired on 6 August 2015 under clear sky conditions. The panchromatic image had a spatial resolution of 1.5 m while the resolution of the multispectral image was 6 m. The panchromatic image was orthorectified using ground control points and digital elevation model (DEM) data. Then, the corrected panchromatic image was used to rectify the multispectral data. The raw digital number values for the multispectral data were converted to spectral radiance values and then into top of atmosphere (TOA) reflectance. Atmospheric correction was performed using the Fast Line-of-sight Atmospheric Analysis of Spectral Hypercubes (FLAASH) approach [29]. All image processing was performed using the ENVI 5.1 software package.

#### 2.2.3. Spectral Vegetation Indices and Texture Parameters

The average surface reflectance values from the multispectral data were used to compute eight vegetation indices (VIs) (Table 2) that are widely used in forest studies: the Atmospherically Resistant Vegetation Index (ARVI), Difference Vegetation Index (DVI), Enhanced Vegetation Index (EVI), Modified Soil Adjusted Vegetation Index (MSAVI), Normalized Difference Vegetation Index (NDVI), Non-Linear Vegetation Index (NLI), Soil Adjusted Vegetation Index (SAVI), and Simple Ratio (SR). Eight texture parameters—the Mean (MEAN), Homogeneity (HOM), Contrast (CON), Dissimilarity (DIS), Entropy (ENT), Variance (VAR), Angular Second Moment (ASM), and Correlation (COR)—were calculated (see Table 3) based on the SPOT6 panchromatic band at the maximum spatial resolution of 1.5 m using the GLCM method [30].

**Table 2.** Spectral Vegetation index and its calculation formula.


Notes: B, R, and NIR represent SPOT6 reflectance in the blue, red, and near-infrared wavelengths, respectively. Parameters L and γ represent the SAVI term (set to 0.5) and the ARVI term (set to 1), respectively. The coefficients used in the EVI algorithm are C1 = 6.0, C2 = 7.5, C3 = 1, and G = 2.5 [17].


**Table 3.** Formula of texture measurements used in this study [30].

pixels that are summed. μ*i*, μ*j*, σ<sup>2</sup> *<sup>i</sup>* , and <sup>σ</sup><sup>2</sup> *<sup>j</sup>* are the means and standard deviations of *Pi* and *Pj*. *P(i, j)* is the normalized cooccurrence matrix.

#### 2.2.4. Optimum Window Selection

Image texture is a function of the image's spatial resolution. Texture parameters derived with the GLCM method are highly sensitive to the moving window size [20]. Small window sizes are known to exaggerate differences within the window but retain a high spatial resolution, whereas larger windows may cause inefficient extraction of texture information due to over-smoothing of textural variations [39]. There is no consensus as to what moving window size is optimal, so it is necessary to test a range of window sizes to determine which provides the best speed and precision when estimating GSV. Therefore, seven window sizes were used to estimate the stand volume in this work: 3 × 3, 5 × 5, 7 × 7, 9 × 9, 11 × 11, 13 × 13, and 15 × 15.

#### *2.3. Machine Learning Algorithms (MLAs)*

#### 2.3.1. Classification and Regression Tree (CART)

The CART algorithm is a basic machine learning method proposed by Breiman et al. [40]. The regression tree algorithm was used to build models because the response variable in this study was the forest GSV. The CART algorithm was implemented with the "rpart" in the R software package (version 3.4.3; R Core Team, Viennna, Austria, 2017). Feature selection and pruning are the two core issues in decision tree algorithms. The Gini index and post-pruning were used as the binary split criterion for the selection of split features and to modify the tree, respectively.

#### 2.3.2. Support Vector Machine (SVM)

Support vector machines (SVM) use a nonlinear kernel function to project input data onto a high-dimensional feature space, where complex non-linear patterns can be simply represented [12,41]. In the new hyperspace, the SVM aims to identify an optimal hyperplane that fits the data and minimizes the training error and the complexity of the model [42]. The Gaussian radial basis kernel function of the form was used in this study [43,44]. The best regularization and bandwidth parameter were determined using the training data. The SVM model was implemented with "e1071" packages in the R software package.

#### 2.3.3. Artificial Neural Network (ANN)

A multi-layer neural network with a back-propagation algorithm, which may be the most popular type, was used in this study [42,45]. In this context, the neurons in the input layer were the predicted variables selected by the Boruta algorithm, while the number of neurons in the hidden layer, which carried weights representing the linkages between the predictors and GSV data, was determined using both the training and validation data. The output layer was a single neuron that represented the output values of the forest GSV. We examined the number of neurons ranging from 1 to 10 in the hidden layer to build different models. The ANN algorithm was implemented with the "neuralnet" package in the R software package.

#### 2.3.4. Random Forest (RF)

Ntree (i.e., to the number of variables) and Mtry (i.e., to the number of variables to randomly sample as candidates at each split) parameters are highly sensitive to prediction robustness of RF algorithms [12]. Wang et al. [46], Breiman [47], and Zhao et al. [4] have all indicated that the default values are often a good choice, and that the influence of user-defined parameters on the sensitivity of prediction is very small. Thus, the Ntree value was set to 500 and the Mtry value was set to one-third of the predictive variables. The RF algorithms were implemented with the "randomForest" package in the R software package.

#### *2.4. Model Testing and Comparison*

The field data was randomly divided into training (70%) and testing (30%) data. The testing data (*n* = 20) was used to validate the CART, SVM, ANN, and RF models. A random selection of training samples was repeated 100 times to reduce variability. Three common statistical parameters, namely the coefficient of determination (*R*2), root mean square error (RMSE), and relative RMSE (rRMSE) were calculated to evaluate the models' performances.

$$\text{RMSE} = \sqrt{\frac{\sum\_{i=1}^{n} \left(\hat{y}\_i - y\_i\right)^2}{n}} \tag{3}$$

$$\text{rRMSE} = \text{RMSE} / \overline{y} \tag{4}$$

where *y*ˆ*<sup>i</sup>* was the predicted forest GSV and *yi* was the forest GSV observed in the field.

#### **3. Results**

#### *3.1. E*ff*ects of Moving Window Size on the Precision of GSV*

The effect of varying the moving window size on the precision of GSV estimation depended on the algorithm used for variable selection. The *R*<sup>2</sup> decreased with increasing moving window size when CART, SVM, and RF were used to estimate the GSV. In other words, 3 × 3 and 5 × 5 window yielded the highest *R*<sup>2</sup> and lowest RMSE and rRMSE when using these methods (Table 4). For the ANN method, the *R*<sup>2</sup> initially decreased with the moving window size and then increased, the highest *R*<sup>2</sup> values (0.83) was achieved with this model when using a window size of 15 × 15 in this case (Table 4).



CART = classification and regression tree, SVM = support vector machine, ANN = artificial neural network (ANN), RF = random forest, *R*<sup>2</sup> = the coefficient of determination, RSME = root mean square error, and rRMSE = the relative RMSE.

#### *3.2. Performance of CART, SVM, ANN, and RF*

To evaluate the performance of CART, SVM, ANN, and RF for predicting the forest GSV when using only textural parameters, four machine learning algorithms were compared in terms of their precision (*R*2) and RMSE (Figure 2). When the window size was set as 3 <sup>×</sup> 3, 5 <sup>×</sup> 5, and 9 <sup>×</sup> 9, RF demonstrated the best regression performance in terms of model precision and RMSE. The ANN achieved the highest precision when the window size was 11 × 11, 13 × 13, or 15 × 15 (Figure 2).

**Figure 2.** The coefficient of determination (R2) and root mean square error (RMSE) the forest GSV estimated by different regression models (classification and regression tree, CART; support vector machine, SVM; artificial neural network, ANN; random forest, RF) with different textural information calculated with different window sizes; (**a**) 3 × 3 window size, (**b**) 5 × 5 window size, (**c**) 7 × 7 window size, (**d**) 9 × 9 window size, (**e**) 11 × 11 window size, (**f**) 13 × 13 window size, and (**g**) 15 × 15 window size.

On the other hand, a comparative analysis based on texture parameters and textural parameters together with SVIs demonstrated that RF was the best method for estimating GSV. The *R*<sup>2</sup> values for models generated using this method ranged from 0.82 to 0.86 (Figure 3). For models using only SVI data as inputs, ANN yielded the highest precision followed by SVM and RF. When using textural parameters together with SVIs as inputs, SVM achieved the second highest precision (Figure 3).

**Figure 3.** Scatter plots of predicted versus observed growing stock volume (GSV) using classification and regression tree (CART), support vector machine (SVM), artificial neural network (ANN), and random forest (RF) models for (**a**) spectral vegetation indices (SVIs), (**b**) texture features, and (**c**) SVIs plus texture features. *R*<sup>2</sup> = the coefficient of determination, RSME = root mean square error, and rRMSE = the relative RMSE.

#### *3.3. Performance of Spectral, Texture, and Fusion of Spectral and Texture Information*

Regression models using SVIs as inputs achieved worse estimating precision than those using textural parameters independent of the choice of regression method. The *R*<sup>2</sup> value for the model based on texture data alone (the *R*<sup>2</sup> values were 0.76, 0.75, 0.66, and 0.82 for CART, SVM, ANN, and RF, respectively) was substantially higher than that for the model based on SVIs alone (Figure 3). Likewise, the RMSE and rRMSE values for the model based on textural parameters alone were considerably lower than those for the model based on SVIs alone. The RMSE values were 36.61 m3/ha, 37.96 m3/ha, 45.44 m3/ha, and 32.36m3/ha, and the rRMSE values was 29.29%, 30.37%, 31.45%, and 25.89% for CART, SVM, ANN, and RF, respectively. Fusing spatial and textural information did not greatly increase the estimating precision relative to that achieved using textural parameters alone, although minor improvements were observed with the SVM and RF methods (Figure 3). The *R*<sup>2</sup> values achieved with the CART methods when using textural parameters alone as inputs were almost

equal to those achieved using textural parameters together with SVIs (Figure 3). Figure 4 indicates the importance of explanatory variables when the RF algorithm is used. The NDVI and MEAN calculated with 3 × 3 and 5 × 5 window sizes, were of higher importance than the other variables.

**Figure 4.** The importance of explanatory variables: (**a**) spectral vegetation indices (SVIs), (**b**) texture features, (**c**) SVIs plus texture features when the Random Forest (RF) algorithm was used. VImp = the variable's importance values, NDVI = Normalized Difference Vegetation Index, ARVI = Atmospherically Resistant Vegetation Index, DVI = Difference Vegetation Index, EVI = Enhanced Vegetation Index, SAVI = Soil Adjusted Vegetation Index, MSAVI= Modified Soil Adjusted Vegetation Index, NLI = Non-linear Vegetation Index, SR = Simple Ratio. Texture (3 × 3 + 5 × 5) was the value of textural parameters when window size was set as 3 × 3 or 5 × 5. MEAN 3 × 3, VAR 3 × 3, DIS 3 × 3, ASM 3 × 3, CON 3 × 3, COR 3 × 3, ENT 3 × 3, and HOM 3 × 3 were Mean, Variance, Dissimilarity, Angular Second Moment, Contrast, Correlation, Entropy, and Homogeneity values when the moving window size was set as 3 × 3. MEAN 5 × 5, VAR 5 × 5, DIS 5 × 5, ASM 5 × 5, CON 5 × 5, COR 5 × 5, ENT 5 × 5, and HOM 5 × 5 were Mean, Variance, Dissimilarity, Angular Second Moment, Contrast, Correlation, Entropy, and Homogeneity values when the moving window size was set as 5 × 5.

#### **4. Discussion**

The GSV of *Pinus massoniana* plantations was estimated using four machine learning algorithms (CART, SVM, ANN, and RF) together with spectral and textural information from a SPOT6 image. The effect of varying the window size on the precision of GSV estimation was investigated, and the usefulness of textural parameters from SPOT6 images for forest GSV estimation was assessed.

Image texture is a complex aspect of visual perception [48]. The optimal moving window size for extracting textural parameters from the SPOT6 image generally depends on the choice of regression method. In this study, the CART, RF, and SVM methods performed best with 3 × 3 or 5 × 5 moving windows, while 15 × 15 windows were best for ANN. These results are consistent with those of previous studies [6,49], in which small windows (3 × 3 or 5 × 5) were found to be best when using the multiple linear regression method to estimate forest variables. Small window sizes increase sensitivity to interpixel differences in the proportions of tree crown and shadow and may better detect fine-scale variation in pixel brightness, whereas larger windows may extract texture information inefficiently due to over-smoothing of textural variation [50–52]. Some researchers have suggested that the window size should match that of the sample plots to increase precision [7,53,54]. Moreover, Chryasfis et al. [5] concluded that no single window size could adequately characterize the texture conditions in different phenological scenes when using the bagging least absolute shrinkage and selection operator (LASSO) algorithm. One of the reasons ANN performed well at 15 × 15 windows, while CART, RF, and SVM

performed better at 3 × 3 or 5 × 5 windows, may be related to the autocorrelation of texture parameters from different window sizes.

MLAs have fewer assumptions, higher methodological accuracy, and high non-linear adaptation, and can efficiently model the complex non-linear relationships between forest biophysical parameters and RS data [55–59]. Four machine learning regression algorithms, CART, SVM, ANN, and RF were explored in this study. The RF method yielded more precise GSV estimates than the other tested methods (CART, SVM, and ANN) when texture parameters alone, or a fusion of spectral and texture information, were used. This is consistent with the results of Zhao et al. [4], who found that RF achieved the greatest precision when estimating forest variables in black locust plantations on the Loess Plateau. Similarly, Fallah et al. [60] concluded that SVR and RF outperformed k-NN in volume/ha estimation. These results may be due to the fact that RF is more sensitive to overfitting during model training, and can handle high data dimensionality [47,61,62]. The algorithm of the random forest method naturally includes the interactions of variables which are often ignored in other models because of their complexity [63]. However, Zhang et al. [12] found that ANN and SVM were the best methods for estimating sawgrass marsh AGB, with both methods achieving correlation coefficients (r) above 0.9, while ANN was the best method for total biomass estimation (r = 0.94). ANN achieved the best estimating precision independently of the SVIs of input data in this study. This result may be explained by the fact that ANN has advantages in complex pattern learning and generalizing in noisy environments. For CART, it was observed that for larger values of observed GSV, the predicted values of GSV were systematically lower (i.e., always below the 1:1 line), although high *R*<sup>2</sup> was obtained when texture parameters or a combination of texture parameters and SVIs were used. This may be contributed to by over-fitting. Another reason for underestimation may be due to optical saturation under the conditions of high biomass in dense forests [64]. Overall, CART, SVM, ANN, and RF each have their own advantages. The selection of MLAs may be related to the type of RS information applied. For example, in this study, ANN could be used when applying SVIs alone for GSV estimation, and RF was a good choice when texture parameters or a fusion of textural and spectral information were used. The main drawback of these MLAs was that they did not reveal the functional relationships between the target and predictor variables. MLAs are often referred to as "black box" approaches [7,12,42,65].

Models using textural parameters as inputs achieved greater precision than spectral models, confirming that high-resolution textural information is more useful than spectral data for forest parameter estimation, especially in dense and complex forests characterized by a mosaic of dense and sparse vegetation cells [6,7,66]. Our results indicate that combining spectral and textural parameters yields, at best, marginal improvements in estimating precision compared to using textural parameters alone when estimating forest GSV. This is consistent with the conclusions of Chrysafis et al. [5], based on their work on Mediterranean forests. Textural information was indispensable in the estimation of forest parameters, especially when a high-resolution image was adopted. However, there were many uncertainties that needed to be considered, such as window size. It may be necessary to determine the optimal moving window size for extracting textural parameters.

In this research, forest GSV tended to be overestimated at lower values and underestimated at higher values (Figure 3). This uncertainty may be contributed to the inconsistency between the time of in-site data collection and RS image, although we applied the growth model of *Pinus massoniana* tree for stand volume conversion from 2018 or 2019 to 2015. The growth was calculated from the stand age of forest, which decreased the precision in our study. We suggested that it was better to keep the time of field data surveyed consistent with time of RS acquisition in estimation of forest parameters.

#### **5. Conclusions**

Four machine learning regression algorithms including classification and regression tree (CART), support vector machine (SVM), artificial neural network (ANN), and random forest (RF) algorithms were used to estimate the growing stock volume (GSV) of *Pinus massoniana* plantations in the northern subtropical area of China based on spectral vegetation indices (SVIs) and textural parameters extracted

from a SPOT6 remote sensing image. The RF method was found to offer the best performance for GSV estimation, although SVM, CART, and ANN also performed reasonably well. Increasing the size of the moving window used when extracting data from the SPOT6 image reduced the predictive performance of the CART, SVM, and RF methods, and the highest coefficient of determination (*R*2) values in these cases were achieved with 3 × 3 or 5 × 5 windows. Using both spatial and textural parameters as model inputs improved estimating precision relative to using spectral data alone. However, the combined approach did not appreciably outperform models using only textural parameters as inputs.

**Author Contributions:** Y.D. supervised the work and provided advice for preparing and revising the paper. J.Z. proposed the idea, collected field survey data, organized the writing, and revised the paper. Z.Z. contributed to the programming and experiments, and drafted the manuscript. Q.Z. and Z.H. analyzed the data and prepared figures. J.X. collected field survey data and did image preprocessing. P.W. helped to collect field survey data and revise the manuscript. All authors read and approved the final manuscript.

**Funding:** The research was funded by the national key research and development plan (grant number, 2017YFC050550404) and project 2662020YLPY020 supported by the fundamental research funds for the central university.

**Acknowledgments:** We would like to thank Jie Luo from Huazhong Agricultural University for providing constructive comments, which greatly improved the manuscript.

**Conflicts of Interest:** The authors declare no conflict of interest.

#### **References**


© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

MDPI St. Alban-Anlage 66 4052 Basel Switzerland Tel. +41 61 683 77 34 Fax +41 61 302 89 18 www.mdpi.com

*Forests* Editorial Office E-mail: forests@mdpi.com www.mdpi.com/journal/forests

MDPI St. Alban-Anlage 66 4052 Basel Switzerland

Tel: +41 61 683 77 34 Fax: +41 61 302 89 18