Next Article in Journal
Moisture Content of Fresh Scots Pine Wood in Areas near Damage Caused by Harvester Head Feed Rollers
Previous Article in Journal
Experimental Study of Surface Roughness of Pine Wood by High-Speed Milling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Alternative Method for Estimation of Stand-Level Biomass for Three Conifer Species in Northeast China

1
Department of Forest Management, School of Forestry, Northeast Forestry University, Harbin 150040, China
2
Wildlife and Fisheries Department, Punjab Forestry, Multan 60000, Pakistan
3
Key Laboratory of Sustainable Forest Ecosystem Management-Ministry of Education, School of Forestry, Northeast Forestry University, Harbin 150040, China
*
Authors to whom correspondence should be addressed.
Forests 2023, 14(6), 1274; https://doi.org/10.3390/f14061274
Submission received: 20 May 2023 / Revised: 6 June 2023 / Accepted: 18 June 2023 / Published: 20 June 2023
(This article belongs to the Section Forest Ecology and Management)

Abstract

:
Accurate large-scale biomass prediction is crucial for assessing forest carbon storage and dynamics. It can also inform sustainable forest management practices and climate change mitigation efforts. However, stand-level biomass models are still scarce worldwide. Our study aims to introduce the generalized additive model (GAM) as a convenient and efficient approach for forest biomass estimation. Data from 311 sample plots of three conifer species in northeastern China were used to evaluate the performance of the GAM model and compare it with traditional nonlinear seemingly unrelated regression (NSUR) models in predicting stand biomass, including total, aboveground, and component biomass. The results indicated that the goodness of fit of GAM was better than that of NSUR in two model systems. In the majority of cases, the scatter plots and prediction performance revealed that the stand total and component biomass models utilizing GAM outperformed those based on NSUR. Disregarding heteroscedasticity and requiring fewer statistical assumptions provide additional support for the replacement of NSUR-based models with GAM-based models. This study implies that the GAM approach has greater potential for developing a system of stand biomass models.

1. Introduction

Forests play a critical role in terrestrial ecosystems, covering a substantial portion of the terrestrial surface of the Earth. They are crucial in mitigating the human-induced greenhouse effect by absorbing carbon dioxide from the atmosphere [1,2,3]. Since forest biomass is a fundamental characteristic of forest ecosystems, its accurate estimation is essential for experts and policymakers interested in the exchange and storage of carbon on a global scale, nutrient cycling, and energy flow [4,5,6,7]. The global focus on carbon storage estimation was further heightened with the implementation of the Kyoto Protocol under the United Nations Framework Convention on Climate Change. To quantify carbon, assessing forest biomass is a prerequisite. As a result, researchers have been striving to enhance the precision of biomass prediction at the individual tree level [8,9,10,11], and at a regional scale [12,13,14,15].
Generally, forest biomass is estimated using the scaling-up approach, which obtains stand biomass or local scale biomass by adding up the estimated tree biomass [16]. The scale-up strategy uses easily measured variables of the tree, i.e., diameter at breast height, height of the tree, age, and crown factors [17,18]. However, individual tree information is occasionally available for broader management landscapes. An efficient and direct approach to estimating forest biomass involves establishing a correspondence between forest monitoring data and biomass projections at a comparable spatial level. A link provides easily accessible stand variables to develop the total stand and component biomass models [19]. The models are usually divided into two types, including stand variables and volume-derived biomass. The volume-derived biomass models use the stand volume as a single predictor, while multiplying the obtained stand volume with biomass expansion factors allows for the calculation of a stand’s total or component biomass [20,21,22]. The establishment of additive individual tree biomass models in different regions and stand conditions worldwide has been crucial in supporting the development of stand biomass models [23,24,25,26,27,28,29].
Furthermore, national monitoring of forests is based on national forest inventories (NFIs), whose data generally summarize stand attributes, such as basal area and stand density. This approach offers rapid and convenient conditions to facilitate the development of stand biomass models. Nonetheless, compared to studies on individual tree biomass models, there are still relatively few tree species and areas available [30,31].
An essential and logical feature for predicting biomass is additivity, i.e., when multiple component models are constructed for individual trees or stands, the total predicted biomass should be equal to the sum of the predicted values of each component, ensuring consistency in the estimation process [32]. Therefore, various estimation methods have been proposed by researchers to ensure the additivity and statistical efficiency of biomass system parameters [19,33,34,35,36]. Among the methods suggested, nonlinear seemingly unrelated regression (NSUR) is a popular parameter estimation method because it can ensure additivity by fitting the system simultaneously and considering the constraints of linear correlation between the components. The flexibility and general nature of NSUR allow the component model to use different independent variables, and each component model may contain a particular weight function for heteroscedasticity in the system. As the result, the total biomass produces a low variance [37].
Additionally, if NSUR is expected to estimate subtotal biomass (crown, aboveground, ect.), Parresol [35] proposes the addition of one or more constraints to maintain the additivity among the components while guaranteeing the additivity to the total biomass. Therefore, NSUR has been widely applied in crown width models, individual tree models, and stand biomass models [11,38,39,40].
In cases where prediction is the primary goal of developing a model, parametric models can deliver accurate future predictions [41]. However, sometimes the relationship between explanatory and response variables might be too complex to find an appropriate mathematical function [42]. Semiparametric models, e.g., generalized additive models (GAM), provide an attractive substitute for such scenarios.
The advantage of GAM is the automatic identification of the amicable relationships between the dependent and predictor variables [43]. Thus, GAM does not face the problem of comparing several potential alternative model forms. Another merit of GAMs is their flexibility and robustness [44], as GAMs are more relaxed toward the basic assumptions required in developing linear or nonlinear models (i.e., independent, normality, and equal variance). Therefore, GAMs have been used extensively in forestry studies like taper modeling [45], insect pests [46], increment of basal area [47], taper functions [41,48], and tree biomass [9].
Korean pine (Pinus koraiensis Siebold & Zucc.), a multipurpose tree species, has substantial economic value. It mainly produces good-quality timber and edible nuts. The nuts of Korean pine are highly priced in the international market [49]. Thus, Korean pine contributes to the maintenance of ecosystem diversity as well. Korean larch (Larix olgensisi A. Henry) is highly valued for its luxuriant growth and resistance to diseases and insects, making it an important commercial tree species in the north, northeast, and south subalpine regions of China as well as a valuable indigenous species. The wood is highly valued for its excellent mechanical properties and is widely used for various applications, including furniture, flooring, housing, and plywood. It is among the four pulp-producing coniferous species in China [50]. Mongolian pine (Pinus sylvestris var. mongolica Lity.), is highly resistant to soil infertility, drought, and cold. It has been extensively planted in the successful afforestation project in the Three-North regions of China for soil and water conservation, windbreaks, and timber production [51,52]. The proportion of planted forests in Jilin Province is around 12%, covering an area of about 1,759,400 hectares, and continues to expand [53]. Planted forests have relatively high productivity, which is also essential for increasing forest cover and fixing carbon dioxide in the atmosphere. Therefore, accurately estimating the biomass of plantation forests is important for understanding the mechanisms of the global carbon cycle and formulating policies to mitigate global warming [54].
The aims of this study were (1) to introduce GAM to establish stand biomass prediction systems, i.e., stand biomass models including observed stand variables (Model 1) and volume-derived models (Model 2) for three types of plantation species in the northeastern region of China; and (2) to evaluate the predictive ability of two modeling techniques (GAM and NSUR) for stand total and component biomass prediction.

2. Materials and Methods

2.1. Experimental Site and Data Source

The research was fulfilled in a forest plantation that is considered representative (125°3′–125°50′ E, 42°18′–43°14′ N), administered by the Dongfeng County Forestry Bureau, Jilin Province, Northeast China. The study area covers 48,486.79 hectares of forest plantation, dominated by Korean pine (4396.73 ha), Korean larch (27,021.1 ha), and Mongolian pine (17,068.96 ha). Dongfeng County is situated in the southern region of the Haddaling Mountains, with an elevation ranging from 300 to 914 m. The topographic features are mainly hilly terraces, accounting for 61.4% of the county’s area. The region is situated in the monsoon zone and classified as having a humid, mid-temperate climate. The mean annual temperature ranges from −37 °C to 35 °C, with a mean of 4.5 °C. The annual precipitation fluctuates from 451.9 mm to 867.5 mm, and the frost-free period is about 128 days. There are twelve forest farms in the Dongfeng Forestry Bureau. The study utilized data collected from seven forest farms, namely: Daxing, Hengdaohe, Renhe, Shahezhen, Yangmulin, Yimianshan, and Zhongyu. The experiment site and the spatial distribution of the sample plots are shown in Figure 1.
In the summer and autumn of 2021, a total of 311 temporary plantation plots were established using a stratified sampling method, with site types as the stratification variable. Specifically, 121 plots for Korean pine, 90 for Korean larch, and 100 for Mongolian pine. Out of the total number of plots, 214 were classified as pure plantation forests, with the dominant species accounting for 100%. The 97 plots are mixed forests, with 20 plots having dominant tree species accounting for 50%–80% of the composition, while the remaining 77 plots have dominant tree species accounting for over 80% of the composition. The main species included in the mixed forest plots are Quercus mongolica, Fisch. ex Ledeb., Betula platyphylla Sukaczev, Fraxinus mandshurica Rupr., Ulmus pumila L., Juglans mandshurica Maxim., and Acer elegantulum W.P. Fang and P.L. Chiu. The plot size varied from 100 to 900 m2.
Site information, including elevation, longitude, latitude, slope, and aspect, was recorded for each plot. All tree information was measured and recorded in each plot, except for those with a diameter at breast height less than 5 cm, a height less than 1.3 m, or those that were dead. The forked trees were considered a single tree when the fork’s location was above 1.3 m or two trees when the fork’s location was below 1.3 m [55]. The measurements taken for sampled trees in each plot included diameter at breast height, tree height, the spatial location of the tree, the height of the first living whorl, and crown width in four directions (east, west, north, and south). In a total of 311 sample plots, we measured 13,140 trees, with 4976 trees measured in Korean pine plots, 3518 trees measured in Korean larch plots, and 4646 trees measured in Mongolian pine plots.

2.2. Stand Characteristics and Biomass Estimation

Stand characteristics were calculated using the individual tree variables. For example, the basal area (G, m2·ha−1) and stand quadratic mean diameter (Dg, cm) were calculated through π 4 D 2   A and D 2 / n   . Where D is diameter at breast height, A is the area of the plot, and n is the number of trees per plot, respectively. The stand mean height (Hm, m) was calculated as mean tree height measurements within the plot, while stand density (N, trees·ha−1) was obtained dividing the number of trees by the plot area. The stand volume was derived through adding up individual volume of all trees within a plot, and then dividing that sum by the area of plot. The individual stem volume was estimated employing the model of the tree volume table of China (Table S1), which contains tree volume models for three species in Dongfeng County. The next step involved calculating the total and component biomass of the stand. Firstly, the biomass of each tree was obtained using a previously reported tree biomass model (Table S2). To obtain the total biomass of a plot, we sum up the individual tree biomass within the plot. Secondly, the total biomass of a plot was determined by dividing the plot area by the stand biomass. Table 1 summarizes the statistical information for the variables as well as the total and component biomass.

2.3. Stand Biomass Model Specification

Some previous researchers have indicated a strong relationship between stand biomass and stand variables [56,57,58]. Furthermore, Castedo-Dorado et al. [12] included the variable of stand volume to explore efficient stand biomass modeling for large management units. Therefore, we evaluated two alternate models for stand biomass prediction: (1) The system of stand total and component biomass models that used observed stand variables (Model 1), and (2) The model system that used stand volume as a predictor (Model 2). The base models are as follows:
B = α 0 X 1 α 1 X 2 α 2 X 3 α 3 X n α n + ε
B = α 0 V α 1 + ε
where B is the total or component biomass, α 0     α n are the estimated parameters of the models, X1Xn are stand variables (stand age, stand density, basal area, and stand mean height); V is standing volume; and ε is error term of the model.

2.3.1. NSUR-Based Stand Biomass Model

An attempt was made to develop a stand biomass model using stand variables, and scatter plots of stand variables and stand biomass were plotted (Figure 2). The results showed that the scatter trend of biomass with G, Hm, and N was more obvious than with other variables. The modeling attempts using stand variables showed that the former two were the best predictor variables for estimating stand total and component biomass. To ensure the additivity of the models, an additive model system was applied, containing six equations that simultaneously fit the system with cross-equation constraints by Nonlinear Seemingly Unrelated Regression (NSUR) [35]. Therefore, Model 1 employs the basal area and stand mean height as predictor variables, as shown in Equation (3), while Model 2 uses stand volume as the only predictor variable, as shown in Equation (4).
{ B i = c i j G k i j H m m i j + ε i B a = B s + B b + B n + ε a B t = B r + B s + B b + B n + ε t
{ B i = c i j V k i j + ε i B a = B s + B b + B n + ε a B t = B r + B s + B b + B n + ε t
where Bi is the biomass of i components; G is the basal area; Hm is stand mean height; V is standing volume; cij, kij, and mij are parameters of the model system; i indicates r, s, b, or n (belowground, stem, branch, or needle); a and t are aboveground and total; and j indicates Korean pine, Korean larch, or Mongolian pine.
The NSUR-based stand biomass models were developed by the SAS/ETS MODEL procedure.

2.3.2. GAM-Based Stand Biomass Model

The generalized additive model (GAM) is a semi-parametric expansion of the generalized linear model [48,59,60]. GAM, a flexible semi-parametric method, explores complex nonlinear and non-monotonic relationships between the response and predictor variables. The model is generally expressed as:
Y = β + n = 1 p S n ( X n ) + ε
where Y is the response variable with an exponential family distribution; Xn (n = 1…p) are predictor variables; Sn (·) (n = 1…p) are smoothing functions; and β is the intercept. The GAM follows the basic assumption that smoothing functions are additive to each other. It implies that the implementation of GAM is similar to the addition of multivariate linear regression functions. The use of smooth functions may fit better, tapping the potential of the data better than a purely parametric model because it is sometimes data-driven rather than model-driven. The GAM is usually fitted by a numerical algorithm, i.e., backfitting. The minimization of the smoothing function is sought through Equation (6):
ε 2 = [ Y ( β + n = 1 p S ( X n ) ) ] 2
Various smooth functions are available for GAM, allowing different smooth functions to be combined by the additive. Different smooth functions include thin-plate regression spline (TP), B-spline (BS), Gaussian process smooths (GP), cubic spline (CS), polynomial cubic spline (PS), and locally weighted scatter plot smoothing (LOESS) [61]. If the spline function is not set at modeling, then the default thin-plate regression spline will be used [62]. This ensures consistency with the simultaneous fitting of the stand total and component biomass models using NSUR. Two types of GAM-based stand biomass modeling systems were established. We chose the same stand variables, i.e., basal area (G), stand mean height (Hm), and stand volume (V), to develop the semi-parametric stand biomass model systems. Therefore, the prediction systems (Model 1) utilizing basal area (G) and stand mean height (Hm) are established according to Equations (7)–(9) and have the following format:
B ^ i = β ^ i j + S ( G ) + S ( H m )
B ^ a = B ^ s + B ^ b + B ^ n
B ^ t = B ^ r + B ^ s + B ^ b + B ^ n
The second system (Model 2) used stand volume (V) and comprised the following Equations (10)–(12).
B ^ i = β ^ i j + S ( V )
B ^ a = B ^ s + B ^ b + B ^ n
B ^ t = B ^ r + B ^ s + B ^ b + B ^ n
where B ^ i is i component biomass estimates, β ^ i j are estimated parameters, S (·) are smoothing functions; other variables have been previously defined.
The GAM-based stand biomass models were fitted using the R software (Version 4.2.0, Vienna, Austria) package mgcv [61].

2.4. Heteroskedasticity

Heteroscedasticity is often present in the tree or stand biomass modeling process, which reflects a particular pattern in the residuals of the estimated base model, i.e., the variance of the residuals increases or decreases as the predicted values increase [63]. The issue of heteroscedasticity does not hinder the development of GAM-based stand biomass models since the underlying assumptions of GAM are not constrained by this problem. However, the presence of heteroscedasticity can compromise parameter estimates [64], lead to inaccurate confidence intervals [65], and introduce systematic errors in predictions [66,67]. The weight functions were introduced to overcome the problem. Firstly, stand biomass models were initially fitted using NSUR [68]. In the current state, the residuals with heteroscedasticity were obtained for each model. The variance of residuals could be modeled with one or more stand-alone variables [16,35]. Accordingly, the assumption was as follows:
E ( ε i 2 ) = σ 2 ( X 1 φ 1 X n φ n )
Secondly, the estimated residuals squared ( ε ^ i 2 ) of each model were transformed using the natural logarithm, as well as the stand variables. Then they were fitted by stepwise regression of the following form:
ln ( ε ^ 2 ) = ln ( σ 2 ) + φ 1 ln ( X 1 ) + + φ n ln ( X n )
where ε ^ represents unweighted residuals for each model, φ 1   φ n are parameters, σ 2 represents the variance of residuals, and X1Xn represents stand variables.
Thirdly, the parameters of Equation (14) were selected based on a significance level of α = 0.05. The weight functions were 1 / ( X 1 φ ^ 1 X n φ ^ n ) for different stand biomass models. Finally, the model system was refitted using the NSUR in SAS Procedure, and the r e s i d .   B i = r e s i d . B i / ( X 1 φ ^ 1 X n φ ^ n ) (where resid. Bi represents the residual of ith model) was added in the process [69,70].

2.5. Evaluation of Systems of Stand Biomass Models

The performance of Model 1 and Model 2 based on GAM and NSUR methods was evaluated using coefficient of determination (R2) and root mean square error (RMSE) as metrics for evaluating goodness-of-fit. Additionally, the models were validated using mean error (ME), RMSE, and relative root mean square error (RRMSE). The mathematical expressions for these statistics are as follows:
R 2 = 1 i = 1 n ( B i B ^ i ) 2 i = 1 n ( B i B ¯ ) 2
R M S E = i = 1 n ( B i B ^ i ) 2 n 1
M E = 1 n i = 1 n ( B i B ^ i )
R R M S E = i = 1 n ( B i B ^ i ) 2 n 1 B ¯
where Bi represents observed values, B ^ i represents predicted values, B ¯ represents the mean of the observed values, and n represents the number of samples. The units for RMSE and ME are Mg ha−1, while R2 and RRMSE are dimensionless.
To validate the optimal model system, we employed the leave-one-out cross-validation (LOOCV) technique. This technique leaves individual sample plots in each step. We deleted a sample plot from the data and executed the model fitting with the remaining dataset. In each stage, the calculated parameters were employed to forecast the biomass in the removed sample plot. The same procedure was carried out for all 311 sample plots. Then, we used the measured and predicted values to calculate validation statistics and verify the performance of the models [48,71,72].
In addition, to evaluate the performance of the NSUR and GAM models in predicting the outcome, we utilized scatter plots to compare the predicted values with the observed values. Furthermore, we employed a simple linear model to fit the predicted values against the observed values, facilitating a clear comparison between the two methods. The linear model adopted the form y = kx + b, where y is the observed value, x is the predicted value, k is the slope, and b is the intercept. When the intercept of the linear model approaches zero and the slope approaches 45 degrees, it indicates a closer match between the predicted values and the observed values, signifying a more accurate prediction by the model. This approach offers the advantage of visually identifying the differences in predictions between the methods, particularly for subtle variations. As a result, it has found practical applications among forestry researchers [73,74].

3. Results

3.1. Stand Biomass Models Fitting

While fitting Model 1 and Model 2 using the NSUR method, the inherent correlation among stand biomass components was considered. Additionally, two constraints were considered: The total biomass is the combined biomass of its individual components, whereas the aboveground biomass is comprised of the collective biomass of its stems, branches, and needles. Thus, three 6 × 6 matrices were assumed for three conifer species. Figure 3 displays the matrices showing the correlation of residuals between the total biomass and the individual component biomasses for Model 1 and Model 2. The residual correlations of stand totals and components of Korean larch were higher than those of the other two species. For Korean pine, significant correlations existed between the total biomass and its aboveground biomass, as well as between the total and branch biomass. For Mongolian pine, the correlations were stronger between the stand total and aboveground biomass, stem biomass, and root biomass.
By employing all available data, Model 1 and Model 2 were fitted to ensure that the aggregate of the predicted values for biomass components equaled the predicted values of the total biomass. In order to address the heteroscedasticity in Model 1, different weight functions were employed (Table 2).
Table 2 presents indicators of model fitness for stand biomass models developed with both NSUR and GAM for three species. The GAM-based model systems showed superior fitting to the NSUR-based model systems. As a whole, Model 2 reflected larger values of R2 (0.7303–0.9906) and smaller values of RMSE (0.4136–7.1482). Although Model 1 showed competitive values of R2 (0.6618–0.9901), it reflected significant variations in terms of RMSE (0.2363–12.0525). However, the models behaved differently in terms of the component biomass of the species analyzed. With both NSUR and GAM methods, the performance of Model 1 was better for branch biomass of Korean pine and Korean larch. Similarly, Model 1 showed better results for the belowground and needle biomass of Mongolian pine. The results also revealed that Model 1 and Model 2 delivered the highest RMSEs for total, aboveground, belowground, and stem biomass of Korean larch.

3.2. Stand Biomass Model Validation

Table 3 exhibits the assessment measures for the stand biomass models through the application of LOOCV. The values of ME for GAM-based models were noticeably lower than those for NSUR-based models. The only exception was the branch component in NSUR-based Model 2 for Mongolian pine. As a whole, the GAM-based stand total and the majority of the component biomass models delivered lower values of RMSE and RRMSE compared with the NSUR-based models. However, for the NSUR method, Model 1 performed slightly better in the branch component for Korean larch. Additionally, the RMSE and RRMSE of Model 1 and Model 2 (NSUR) were slightly lower in belowground and needle and belowground and branch components, respectively, for Mongolian pine. It illustrated that Model 1 and Model 2 systems based on GAM could provide reliable prediction accuracy for stand total and component biomass. Like the fitting outcomes, the overall prediction accuracy of Model 2 was noticeably better than Model 1. Moreover, the accuracy of the models for predicting the stand total and aboveground, belowground, and stem components was lower for Korean larch than for Korean pine and Mongolian pine. This results also showed larger values of RRMSE for stand branch or needle components of Model 1 and Model 2 with both NSUR and GAM methods. The larger values of RRMSE were generated because of smaller average values of stand branch and needle than other components, according to Equation (18) above.

3.3. Prediction Accuracy of GAM and NSUR Methods

The NSUR and GAM modeling techniques were evaluated using scatter plots of the predicted values against the observed values for Model 1 and Model 2 (Figure 4 and Figure 5). In terms of the stand biomass models for Korean pine, the GAM-based Model 1 outperformed the NSUR method, with an intercept closer to 0 and a slope closer to 45 degrees (Figure 4). For Korean larch, the GAM-based Model 1 showed better performance for the needle and belowground components. Although the NSUR method had a slightly better slope for the remaining components, the GAM method maintained its superiority in terms of the intercept. In the case of Mongolian pine, the GAM method exhibited slightly poorer performance, except for the belowground and needle components. When considering Model 2, the GAM-based models outperformed the NSUR method in terms of both the intercept and slope for all three species (Figure 5).
The predicted biomass values of stand total and aboveground acquired through LOOCV were further evaluated using box plots to compare the residuals of the stand component models’ sum (Figure 6). Both GAM and NSUR indicated larger prediction errors for Korean larch. However, GAM performed better in mean prediction errors, and NSUR showed lower variance in Model 1. The residual variance was almost similar in GAM and NSUR for Mongolian pine. Regarding Korean pine, GAM residuals were closer to 0 in Model 2, and its overall error distribution was similar. As a whole, GAM and NSUR reflected similar results for stand total and aboveground biomass predictions.

3.4. Parameter Estimates

Table 4 and Table 5 present the estimated parameters and corresponding standard errors (SE) for Model 1 and Model 2 based on NSUR and GAM for the species analyzed. The majority of estimated parameters were statistically significant at p < 0.05. After testing various smooth functions, the final selection for the GAM-based models included Gaussian process smooths (GP) for the components of Korean pine, thin-plate regression splines (TP) for the components of Korean larch, and B-splines (BS) for the components of Mongolian pine.
Model 1 showed positive powers of G for all stand biomass components across the species, and positive powers of Hm for all components except for belowground biomass in Mongolian pine. These results suggest that an increase in G for the same Hm led to an increase in stand biomass. Likewise, the increase in Hm for the same G increased the biomass for all components of the species, excluding the belowground biomass of Mongolian pine, which decreased. In Model 2, all parameters for stand biomass components were positive. This indicates that there is a positive relationship between V and both stand total and component biomass, meaning that an increase in V leads to an increase in biomass (Table 4).

4. Discussion

Forest management and assessment studies require quantitative information on trees and stands. In forestry, a forest stand refers to a contiguous cluster of trees that share similar attributes such as age distribution, species composition, site quality, and structure. It is considered a fundamental management unit in forestry [19,75]. Therefore, precise estimation of stand biomass is of utmost importance for carrying out scientific research on ecosystem productivity, carbon cycles, and nutrient and energy transfer at the local, regional, or global scale.
This study developed two alternate models (Model 1 and Model 2) for predicting the stand biomass of Korean pine, Korean larch, and Mongolian pine. Model 1 contained commonly used stand variables, while Model 2 was based on stand volume. Both models were evaluated using GAM and NSUR methods for predicting total biomass and its components. We employed the LOOCV to verify the performance of models in estimating the stand’s total and component biomass. Overall, the modeling technique of GAM was better than NSUR, and Model 2 was superior to Model 1 for the species analyzed.

4.1. The Development of Stand Biomass Model Systems

Stand biomass models link different biomass components with commonly measured variables in forest inventories and, thus, serve as an essential tool for forest biomass estimation. The variables documented in such inventories generally include basal area, stand volume, and stand density. This study found basal area (G) as an essential variable in the model system of stand variables for the three conifer species, which supports previous studies [12,76]. However, to improve the accuracy of stand biomass models, Xin et al. [77] have recommended incorporating stand mean height (Hm) as a secondary variable. In agreement with their findings, stand mean height was significant statistically for the majority of the stand components models in this study (Table 4).
Introducing additional stand variables may improve the prediction accuracy of the two model systems in this study, but a simplified model system may be more practical and applicable for forest managers, even though introducing additional stand variables may enhance the prediction accuracy of the two model systems in this study [78]. Thus, the biomass prediction selects the appropriate number of variables with reasonable accuracy to fulfill management requirements and minimize the workload and cost of field surveys [79,80]. Moreover, introducing multiple independent variables in a model might create complexity in the modeling process. Therefore, two parsimonious model systems were developed in this study.
Researchers have used different linear and non-linear methods to develop additive biomass model systems in recent decades. The additive biomass model includes simultaneous equations and separate modeling. Some methods used in this model involve predicting based on component proportions, such as proportion adjustment, fractional multinomial logit regression, Dirichlet, three-step proportional weighting system, and log-ratio regression approach. [33,37,81,82]. The parameter estimation methods used include ordinary least squares, seemingly unrelated regression, Dirichlet regression, and two error-in-variable models [25,38,83]. When using these methods, it is important to consider using the same variables for fractional multinomial logit regression [81].Using different variables may prevent the simplification of the model structure in the three-step proportional weighting system. The NSUR-based simultaneous equations were chosen for stand biomass modeling and compared with the GAM-based system due to its applicability, flexibility, and popularity. We excluded the linear model that logarithmically transforms stand biomass and stand variables due to the potential systematic bias introduced by anti-log transformations [84,85]. Additionally, the use of correction factors can make it challenging for the system to maintain additivity in predicted values.
Based on the statistical criteria, Model 2 exhibited superior predictive power compared to Model 1 in estimating all biomass components across three conifer species (Table 3). This outcome can primarily be attributed to the strong correlation between volume and stand biomass (Figure 2). Another factor contributing to this superiority is the higher proportion of stand stem biomass to total biomass, as observed by Bi [86]. Additionally, He et al. [56] discovered that the stand biomass model utilizing stand volume outperformed models based on other stand variables for Larix plantations in China. Regarding stand volume as the only predictor. Similarly, researchers have reported findings for various tree species in Northeast China, with stand volume as the sole predictor [57].
It is crucial to note that the stand volume was estimated using the individual tree volume model, which means that propagation errors were expected in the estimation of stand biomass [87]. In this study, it was observed that the needle component of Korean larch had the lowest R2 value (0.65–0.73) among the species. This occurred because the leaf biomass was smaller compared to the other two species at the individual tree level (Figure 2). The relatively smaller leaf biomass could lead to more discrete observed values if there were variations in the number of trees within the sample plot (Figure 4 and Figure 5).

4.2. GAM vs. NSUR

The comparison of GAM and NSUR methods in this study showed that the former had better performance for three conifer species and can be widely applied for stand biomass estimation. The goodness-of-fit and validation statistics showed that, in terms of RMSE and RRMSE values, the GAM-based Model 1 and Model 2 outperformed NSUR in most biomass models for all species (Table 2 and Table 3). In particular, the GAM-based models delivered the lowest ME values for stand total and component biomass (Table 3). The minimizing loss function in NSUR simultaneously minimizes the loss of nonlinear biomass models of the system and accounts for the inherent correlations between total, subtotal, and component biomass. However, the minimizing loss function might sacrifice the accuracy of the component models, although it ensures additivity [37,88]. Even if the GAM-based component model was developed separately, the predicted values of the total or subtotal were acquired through aggregating the predicted values of the components. However, the prediction accuracy of GAM’s stand total and subtotal biomass was better than NSUR, and most GAM-based component models had better statistical indicators in Model 1 and Model 2 for the species analyzed (Table 3). The data-driven GAM-based stand biomass models indicate that it can fit complex trends in the data than model-driven, which is supported by tree biomass prediction and taper modeling [9,41,48].
The scatter plots reveal slight variations between the NSUR and GAM techniques, as evidenced by the values of the slope and intercept. In general, the GAM-based models demonstrated greater accuracy in estimating the total and component biomass of the analyzed species (Figure 4 and Figure 5). At the same time, the variance of the stand total residuals was similar for the NSUR and GAM methods (Figure 6), although the latter performed more consistently in the overall evaluation.
Stand biomass modeling is convenient with the GAM technique due to fewer restrictions on statistical assumptions. The NSUR may face problems of model selection and heteroskedasticity. In such cases, the GAM can serve as an alternative that is flexible and robust for modeling non-linear and non-constant variance structures [89]. The data-driven GAM deals with the relationship between the response and predictor variables a suitable relationship through smoothing functions. One concern about GAMs is over-fitting, which happens when a model predicts noise as a pattern. GAM automatically sets a suitable degree of smoothness for splines during the modeling process [9]. Furthermore, LOOCV was applied to minimize the approximate out-of-sample prediction error and prevent over-fitting [90]. The GAM predictions should be biologically realistic to avoid flagrant errors such as negative values for stand biomass predictions. We did not find such a deviation in this study (Figure 4 and Figure 5).
The NSUR method, unlike GAM, considers the inherent correlation between biomass components in the additivity system (Figure 3). Therefore, NSUR has better statistical efficiency in the component model estimation of the system [34,35]. Moreover, the error term of total and aboveground biomass was acquired by summing up the error terms of the components (Equations (3) and (4)). It should be noted that the error terms may result in a singular variance and covariance matrix across equations. This situation may occur when either no weight function was applied or the same weight function was utilized for the entire system [91,92]. Due to the use of an optimization iterative algorithm (Gauss-Newton) in NSUR, this algorithm heavily relies on the initial parameter values provided. If the initial iterated values for the parameters differ significantly from the estimated model parameters, it is possible that the parameter estimation may not converge, especially when dealing with a large number of model parameters. Therefore, when using NSUR, it is necessary to carefully set the initial iterated values based on experience or literature to ensure parameter convergence [35,63].
Finally, this study demonstrated that GAM-based prediction systems could reduce the slight uncertainties in estimating stand total and component biomass. However, the slight uncertainties might be amplified at the stand level when stand biomass is extrapolated to forest or national levels. Therefore, with the main aim of stand biomass prediction, GAM could serve as a flexible and promising alternative modeling approach.

5. Conclusions

Two stand biomass models (Model 1 and Model 2) were developed for three conifer species in Northeast China using the GAM and NSUR methods in this study. The developed stand biomass models included estimates of the total, aboveground, and component biomass. The fitting and validation statistics and the graphical analysis showed that the GAM-based models were more appropriate in predicting stand total and component biomass and provided a reliable alternative to the traditional NSUR method. In addition to better predictions, the GAM requires fewer statistical assumptions that induce efficient forestry data modeling and convenient predictions. The volume-derived model (Model 2) was superior to the stand biomass model, including observed stand variables (Model 1). Therefore, the GAM-based Model 2 was recommended for the biomass estimation of three conifer species. However, the GAM-based Model 1 also provides an alternative where stand volume is unavailable. Future work focusing on different species in the region would broaden the scope of this research to a larger landscape.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/f14061274/s1, Table S1: Individual tree volume models for three tree species [93]; Table S2: Allometric equations for the calculation of stand total and component biomass for each tree species [94,95].

Author Contributions

Conceptualization, S.X. and L.J.; methodology, S.X. and L.J.; software, S.X., M.K.S. and S.B.M.; validation, S.X.; formal analysis, S.X.; investigation, S.X. and M.K.S.; resources, L.J.; data curation, S.X.; writing—original draft preparation, S.X.; writing—review and editing, M.K.S., W.W. and S.B.M.; visualization, S.X., M.K.S. and S.B.M.; supervision, L.J.; project administration, L.J.; funding acquisition, L.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was financially supported by the Heilongjiang Province Applied Technology Research and Development Plan Project of China (GA19C006).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

The authors would like to thank the faculty and students of the Department of Forest Management, Northeast Forestry University (NEFU), China, who collected and provided the data for this study.

Conflicts of Interest

The authors declare there are no competing interests.

References

  1. Dixon, R.K.; Solomon, A.M.; Brown, S.; Houghton, R.A.; Trexier, M.C.; Wisniewski, J. Carbon pools and flux of global forest ecosystems. Science 1994, 263, 185–190. [Google Scholar] [CrossRef] [PubMed]
  2. Baldocchi, D.; Penuelas, J. The physics and ecology of mining carbon dioxide from the atmosphere by ecosystems. Global. Change Biol. 2019, 25, 1191–1197. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Usoltsev, V.A.; Shobairi, S.O.R.; Tsepordey, I.S.; Chasovskikh, V.P. Modeling the additive structure of stand biomass equations in climatic gradients of Eurasia. Environ. Qual. Manag. 2018, 28, 55–61. [Google Scholar] [CrossRef]
  4. Augusto, L.; Ranger, J.; Ponette, Q.; Rapp, M. Relationships between forest tree species, stand production and stand nutrient amount. Ann. Forest. Sci. 2000, 57, 313–324. [Google Scholar] [CrossRef] [Green Version]
  5. Goodale, C.L.; Apps, M.J.; Birdsey, R.A.; Field, C.B.; Heath, L.S.; Houghton, R.A.; Jenkins, J.C.; Kohlmaier, G.H.; Kurz, W.; Liu, S. Forest carbon sinks in the Northern Hemisphere. Ecol. Appl. 2002, 12, 891–899. [Google Scholar] [CrossRef]
  6. Houghton, R.A.; Hall, F.; Goetz, S.J. Importance of biomass in the global carbon cycle. J. Geophys. Res. 2009, 114, G00E03. [Google Scholar] [CrossRef]
  7. Zheng, G.; Chen, J.M.; Tian, Q.J.; Ju, W.M.; Xia, X.Q. Combining remote sensing imagery and forest age inventory for biomass mapping. J. Environ. Manag. 2007, 85, 616–623. [Google Scholar] [CrossRef]
  8. Fu, L.; Zeng, W.; Tang, S. Individual tree biomass models to estimate forest biomass for large spatial regions developed using four pine species in China. For. Sci. 2017, 63, 241–249. [Google Scholar]
  9. Levine, J.; de Valpine, P.; Battles, J. Generalized additive models reveal among-stand variation in live tree biomass equations. Can. J. For. Res. 2021, 51, 546–564. [Google Scholar] [CrossRef]
  10. Xu, Q.; Lei, X.; Zhang, H. A novel method for approaching the compatibility of tree biomass estimation by multi-task neural networks. For. Ecol. Manag. 2022, 508, 120011. [Google Scholar] [CrossRef]
  11. Li, H.; Zhao, P. Improving the accuracy of tree-level aboveground biomass equations with height classification at a large regional scale. For. Ecol. Manag. 2013, 289, 153–163. [Google Scholar] [CrossRef]
  12. Castedo-Dorado, F.; Gómez-García, E.; Diéguez-Aranda, U.; Barrio-Anta, M.; Crecente-Campo, F. Aboveground stand-level biomass estimation: A comparison of two methods for major forest species in northwest Spain. Ann. For. Sci. 2012, 69, 735–746. [Google Scholar] [CrossRef] [Green Version]
  13. Somogyi, Z.; Cienciala, E.; Mäkipää, R.; Muukkonen, P.; Lehtonen, A.; Weiss, P. Indirect methods of large-scale forest biomass estimation. Eur. J. For. Res. 2007, 126, 197–207. [Google Scholar] [CrossRef]
  14. Jagodziński, A.M.; Dyderski, M.K.; Gęsikiewicz, K.; Horodecki, P.; Cysewska, A.; Wierczyńska, S.; Maciejczyk, K. How do tree stand parameters affect young Scots pine biomass?–Allometric equations and biomass conversion and expansion factors. For. Ecol. Manag. 2018, 409, 74–83. [Google Scholar] [CrossRef]
  15. Xin, S.; Wang, J.; Mahardika, S.B.; Jiang, L. Sensitivity of Stand-Level Biomass to Climate for Three Conifer Plantations in Northeast China. Forests 2022, 13, 2022. [Google Scholar] [CrossRef]
  16. Balboa-Murias, M.Á.; Rodríguez-Soalleiro, R.; Merino, A.; Álvarez-González, J.G. Temporal variations and distribution of carbon stocks in aboveground biomass of radiata pine and maritime pine pure stands under different silvicultural alternatives. For. Ecol. Manag. 2006, 237, 29–38. [Google Scholar] [CrossRef]
  17. Xie, L.; Fu, L.; Widagdo, F.R.A.; Dong, L.; Li, F. Improving the accuracy of tree biomass estimations for three coniferous tree species in Northeast China. Trees 2021, 62, 129. [Google Scholar] [CrossRef]
  18. Luo, Y.; Wang, X.; Ouyang, Z.; Lu, F.; Feng, L.; Tao, J. A review of biomass equations for China’s tree species. Earth Syst. Sci. Data 2020, 12, 21–40. [Google Scholar] [CrossRef] [Green Version]
  19. Bi, H.; Long, Y.; Turner, J.; Lei, Y.; Snowdon, P.; Li, Y.; Harper, R.; Zerihun, A.; Ximenes, F. Additive prediction of aboveground biomass for Pinus radiata (D. Don) plantations. For. Ecol. Manag. 2010, 259, 2301–2314. [Google Scholar] [CrossRef]
  20. Fang, J.; Chen, A.; Peng, C.; Zhao, S.; Ci, L. Changes in forest biomass carbon storage in China between 1949 and 1998. Science 2001, 292, 2320–2322. [Google Scholar] [CrossRef]
  21. Lehtonen, A.; Mäkipää, R.; Heikkinen, J.; Sievänen, R.; Liski, J. Biomass expansion factors (BEFs) for Scots pine, Norway spruce and birch according to stand age for boreal forests. For. Ecol. Manag. 2004, 188, 211–224. [Google Scholar] [CrossRef]
  22. Soares, P.; Tomé, M. Biomass expansion factors for Eucalyptus globulus stands in Portugal. For. Syst. 2012, 21, 141–152. [Google Scholar] [CrossRef] [Green Version]
  23. Ter-Mikaelian, M.T.; Korzukhin, M.D. Biomass equations for sixty-five North American tree species. For. Ecol. Manag. 1997, 97, 1–24. [Google Scholar] [CrossRef] [Green Version]
  24. Návar, J. Allometric equations for tree species and carbon stocks for forests of northwestern Mexico. For. Ecol. Manag. 2009, 257, 427–434. [Google Scholar] [CrossRef]
  25. Zeng, W.; Duo, H.; Lei, X.; Chen, X.; Wang, X.; Pu, Y.; Zou, W. Individual tree biomass equations and growth models sensitive to climate variables for Larix spp. in China. Eur. J. For. Res. 2017, 136, 233–249. [Google Scholar] [CrossRef]
  26. Romero, F.M.B.; Jacovine, L.A.G.; Ribeiro, S.C.; Torres, C.M.M.E.; Silva, L.F.d.; Gaspar, R.d.O.; Rocha, S.J.S.S.d.; Staudhammer, C.L.; Fearnside, P.M. Allometric equations for volume, biomass, and carbon in commercial stems harvested in a managed forest in the southwestern Amazon: A case study. Forests 2020, 11, 874. [Google Scholar] [CrossRef]
  27. Bi, H.; Murphy, S.; Volkova, L.; Weston, C.; Fairman, T.; Li, Y.; Law, R.; Norris, J.; Lei, X.; Caccamo, G. Additive biomass equations based on complete weighing of sample trees for open eucalypt forest species in south-eastern Australia. For. Ecol. Manag. 2015, 349, 106–121. [Google Scholar] [CrossRef]
  28. Kenzo, T.; Himmapan, W.; Yoneda, R.; Tedsorn, N.; Vacharangkura, T.; Hitsuma, G.; Noda, I. General estimation models for above-and below-ground biomass of teak (Tectona grandis) plantations in Thailand. For. Ecol. Manag. 2020, 457, 117701. [Google Scholar] [CrossRef]
  29. Güner, Ş.T.; Diamantopoulou, M.J.; Poudel, K.P.; Çömez, A.; Özçelik, R. Employing artificial neural network for effective biomass prediction: An alternative approach. Comput. Electron. Agr. 2022, 192, 106596. [Google Scholar] [CrossRef]
  30. Usoltsev, V.A.; Shobairi, S.O.R.; Tsepordey, I.S.; Chasovkikh, V.P. Modelling forest stand biomass and net primary production with the focus on additive models sensitive to climate variables for two-needled pines in Eurasia. J. Clim. Change 2019, 5, 41–49. [Google Scholar] [CrossRef]
  31. Usoltsev, V.A.; Shobairi, S.O.R.; Chasovskikh, V.P. Modeling the additive stand biomass of Larix spp. for Eurasia. Ecol. Quest. 2019, 30, 35–46. [Google Scholar]
  32. Kozak, A. Methods for ensuring additivity of biomass components by regression analysis. For. Chron. 1970, 46, 402–405. [Google Scholar] [CrossRef]
  33. Tang, S.; Zhang, H.; Xu, H. Study on establish and estimate method of compatible biomass model. Sci. Silivae Sin. 2000, 36, 19–27, (In Chinese with English an abstract). [Google Scholar]
  34. Parresol, B.R. Assessing tree and stand biomass: A review with examples and critical comparisons. For. Sci. 1999, 45, 573–593. [Google Scholar]
  35. Parresol, B.R. Additivity of nonlinear biomass equations. Can. J. For. Res. 2001, 31, 865–878. [Google Scholar] [CrossRef]
  36. Affleck, D.L.R.; Diéguez-Aranda, U. Additive nonlinear biomass equations: A likelihood-based approach. For. Sci. 2016, 62, 129–140. [Google Scholar] [CrossRef]
  37. Dong, L.; Zhang, L.; Li, F. A three-step proportional weighting system of nonlinear biomass equations. For. Sci. 2015, 61, 35–45. [Google Scholar] [CrossRef]
  38. Fu, L.; Sharma, R.P.; Wang, G.; Tang, S. Modelling a system of nonlinear additive crown width models applying seemingly unrelated regression for Prince Rupprecht larch in northern China. For. Ecol. Manag. 2017, 386, 71–80. [Google Scholar] [CrossRef]
  39. González-García, M.; Hevia, A.; Majada, J.; Barrio-Anta, M. Above-ground biomass estimation at tree and stand level for short rotation plantations of Eucalyptus nitens (Deane & Maiden) Maiden in Northwest Spain. Biomass Bioenergy 2013, 54, 147–157. [Google Scholar]
  40. Jagodziński, A.M.; Dyderski, M.K.; Gęsikiewicz, K.; Horodecki, P. Tree-and stand-level biomass estimation in a Larix decidua Mill. Chronosequence. Forests 2018, 9, 587. [Google Scholar] [CrossRef] [Green Version]
  41. Robinson, A.P.; Lane, S.E.; Thérien, G. Fitting forestry models using generalized additive models: A taper model example. Can. J. For. Res. 2011, 41, 1909–1916. [Google Scholar] [CrossRef]
  42. Picard, N.; Rutishauser, E.; Ploton, P.; Ngomanda, A.; Henry, M. Should tree biomass allometry be restricted to power models? For. Ecol. Manag. 2015, 353, 156–163. [Google Scholar] [CrossRef]
  43. Hastie, T.; Tibshirani, R. Generalized additive models: Some applications. J. Am. Stat. Assoc 1987, 82, 371–386. [Google Scholar] [CrossRef]
  44. Adamec, Z.; Drápela, K. Generalized additive models as an alternative approach to the modelling of the tree height-diameter relationship. J. For. Sci 2015, 61, 235–243. [Google Scholar] [CrossRef] [Green Version]
  45. Li, J.; Mao, X. Comparison of canopy closure estimation of plantations using parametric, semi-parametric, and non-parametric models based on GF-1 remote sensing images. Forests 2020, 11, 597. [Google Scholar] [CrossRef]
  46. Munro, H.L.; Montes, C.R.; Kinane, S.M.; Gandhi, K.J.K. Through space and time: Predicting numbers of an eruptive pine tree pest and its predator under changing climate conditions. For. Ecol. Manag. 2021, 483, 118770. [Google Scholar] [CrossRef]
  47. Vospernik, S. Basal area increment models accounting for climate and mixture for Austrian tree species. For. Ecol. Manag. 2021, 480, 118725. [Google Scholar] [CrossRef]
  48. He, P.; Hussain, A.; Khurram Shahzad, M.; Jiang, L.; Li, F. Evaluation of four regression techniques for stem taper modeling of Dahurian larch (Larix gmelinii) in Northeastern China. For. Ecol. Manag. 2021, 494, 119336. [Google Scholar] [CrossRef]
  49. Jin, X.; Li, F.; Pukkala, T.; Dong, L. Modelling the cone yields of Korean pine. For. Ecol. Manag. 2020, 464, 118086. [Google Scholar] [CrossRef]
  50. Peng, W.; Pukkala, T.; Jin, X.; Li, F. Optimal management of larch (Larix olgensis A. Henry) plantations in Northeast China when timber production and carbon stock are considered. Ann. For. Sci. 2018, 75, 513. [Google Scholar] [CrossRef] [Green Version]
  51. Zheng, L.; Zhao, Q.; Sun, Q.; Liu, L.; Zeng, D. Nitrogen addition elevated autumn phosphorus retranslocation of living needles but not resorption in a nutrient-poor Pinus sylvestris var. Mongolica plantation. For. Ecol. Manag. 2020, 468, 118174. [Google Scholar] [CrossRef]
  52. Zhang, X.; Zhang, X.; Han, H.; Shi, Z.; Yang, X. Biomass Accumulation and Carbon Sequestration in an Age-Sequence of Mongolian Pine Plantations in Horqin Sandy Land, China. Forests 2019, 10, 197. [Google Scholar] [CrossRef] [Green Version]
  53. State Forestry and Grassland Administration. The Ninth Forest Resource Survey Report 2014–2018; China Forestry Publishing House: Beijing, China, 2019. (In Chinese) [Google Scholar]
  54. Peichl, M.; Arain, M.A. Above-and belowground ecosystem biomass and carbon pools in an age-sequence of temperate pine plantation forests. Agr. Forest. Meteorol 2006, 140, 51–63. [Google Scholar] [CrossRef]
  55. State Forestry and Grassland Administration. Forest Mensuration, 4th ed.; China Forestry Publishing House: Beijing, China, 2019. (In Chinese) [Google Scholar]
  56. He, X.; Lei, X.; Dong, L. How large is the difference in large-scale forest biomass estimations based on new climate-modified stand biomass models? Ecol. Indic 2021, 126, 107569. [Google Scholar] [CrossRef]
  57. Dong, L.; Zhang, L.; Li, F. Evaluation of stand biomass estimation methods for major forest types in the eastern Da Xing’an Mountains, northeast China. Forests 2019, 10, 715. [Google Scholar] [CrossRef] [Green Version]
  58. Xin, S.; Mahardika, S.B.; Jiang, L. Stand-level biomass estimation for Korean pine plantations based on four additive methods in Heilongjiang province, northeast China. CERNE 2022, 28, 129. [Google Scholar] [CrossRef]
  59. Zhang, L.; Ma, Z.; Guo, L. Spatially assessing model errors of four regression techniques for three types of forest stands. Forestry 2008, 81, 209–225. [Google Scholar] [CrossRef] [Green Version]
  60. Guisan, A.; Edwards Jr, T.C.; Hastie, T. Generalized linear and generalized additive models in studies of species distributions: Setting the scene. Ecol. Model 2002, 157, 89–100. [Google Scholar] [CrossRef] [Green Version]
  61. Wood, S.N. Generalized Additive Models: An Introduction with R, 2nd ed.; Chapman & Hall/CRC texts in statistical science series; CRC Press: Boca Raton, FL, USA, 2017. [Google Scholar]
  62. Wood, S.N. Thin plate regression splines. J. Roy. Stat. Soc. B. 2003, 65, 95–114. [Google Scholar] [CrossRef]
  63. Fu, L.; Lei, Y.; Wang, G.; Bi, H.; Tang, S.; Song, X. Comparison of seemingly unrelated regressions with error-in-variable models for developing a system of nonlinear additive biomass equations. Trees 2016, 30, 839–857. [Google Scholar] [CrossRef]
  64. Tang, S.; Lang, K.; Li, H. Statistics and Computation of Biomathematical Models (ForStat Course); Science Press: Beijing, China, 2008. (In Chinese) [Google Scholar]
  65. Saint-André, L.; M’Bou, A.T.; Mabiala, A.; Mouvondy, W.; Jourdan, C.; Roupsard, O.; Deleporte, P.; Hamel, O.; Nouvellon, Y. Age-related equations for above- and below-ground biomass of a Eucalyptus hybrid in Congo. For. Ecol. Manag. 2005, 205, 199–214. [Google Scholar] [CrossRef]
  66. Mascaro, J.; Litton, C.M.; Hughes, R.F.; Uowolo, A.; Schnitzer, S.A. Minimizing bias in biomass allometry: Model selection and log-transformation of data. Biotropica 2011, 43, 649–653. [Google Scholar] [CrossRef] [Green Version]
  67. Dutcă, I.; McRoberts, R.E.; Næsset, E.; Blujdea, V.N.B. Accommodating heteroscedasticity in allometric biomass models. For. Ecol. Manag. 2021, 34, 119865. [Google Scholar] [CrossRef]
  68. SAS Institute Inc. SAS/ETS® 9.3. User’s Guide 2011; SAS Institute Inc.: Cary, NC, USA, 2011; p. 3302. [Google Scholar]
  69. Zhao, D.; Kane, M.; Markewitz, D.; Teskey, R.; Clutter, M. Additive tree biomass equations for midrotation loblolly pine plantations. For. Sci. 2015, 61, 613–623. [Google Scholar] [CrossRef] [Green Version]
  70. Harvey, A.C. Estimating regression models with multiplicative heteroscedasticity. Econometrica 1976, 44, 461–465. [Google Scholar] [CrossRef]
  71. Kozak, A.; Kozak, R. Does cross validation provide additional information in the evaluation of regression models? Ann. Forest. Sci. 2003, 33, 976–987. [Google Scholar] [CrossRef]
  72. Timilsina, N.; Staudhammer, C.L. Individual tree-based diameter growth model of slash pine in Florida using nonlinear mixed Modeling. For. Sci. 2013, 59, 27–37. [Google Scholar] [CrossRef]
  73. He, P.; Jiang, L.; Li, F. Evaluation of parametric and non-parametric stem taper modeling approaches: A case study for Betula platyphylla in Northeast China. For. Ecol. Manag. 2022, 525, 120535. [Google Scholar] [CrossRef]
  74. Diamantopoulou, M.J.; Özçelik, R.; Yavuz, H. Tree-bark volume prediction via machine learning: A case study based on black alder’s tree-bark production. Comput. Electron. Agric. 2018, 151, 431–440. [Google Scholar] [CrossRef]
  75. Burkhart, H.E.; Tomé, M. Modeling Forest Trees and Stands; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012; ISBN 9048131707. [Google Scholar]
  76. Snowdon, P. Ratio methods for estimating forest biomass. N. Z. J. For. Sci 1992, 22, 54–62. [Google Scholar]
  77. Xin, S.; Yan, Y.; Jiang, L. Stand biomass model for Pinus koraiensis plantation based on different additive methods in Heilongjiang Province, China. Chin. J. Appl. Ecol. 2020, 31, 3322–3330, (In Chinese with an English abstract). [Google Scholar]
  78. Kiernan, D.H.; Bevilacqua, E.; Nyland, R.D. Individual-tree diameter growth model for sugar maple trees in uneven-aged northern hardwood stands under selection system. For. Ecol. Manag. 2008, 256, 1579–1586. [Google Scholar] [CrossRef]
  79. Adame, P.; del Río, M.; Canellas, I. A mixed nonlinear height–diameter model for pyrenean oak (Quercus pyrenaica Willd.). For. Ecol. Manag. 2008, 256, 88–98. [Google Scholar] [CrossRef]
  80. Calama, S.R.A.; Montero, G. Multilevel linear mixed model for tree diameter increment in stone pine (Pinus pinea): A calibrating approach. Silva. Fenn. 2005, 39, 37–54. [Google Scholar] [CrossRef] [Green Version]
  81. Zhao, D.; Kane, M.; Teskey, R.; Markewitz, D. Modeling aboveground biomass components and volume-to-weight conversion ratios for Loblolly Pine trees. For. Sci. 2016, 62, 463–473. [Google Scholar] [CrossRef]
  82. Poudel, K.P.; Temesgen, H. Methods for estimating aboveground biomass and its components for Douglas-fir and lodgepole pine trees. Can. J. For. Res. 2016, 46, 77–87. [Google Scholar] [CrossRef] [Green Version]
  83. Poudel, K.P.; Temesgen, H.; Radtke, P.J.; Gray, A.N. Estimating individual-tree aboveground biomass of tree species in the western USA. Can. J. For. Res. 2019, 49, 701–714. [Google Scholar] [CrossRef]
  84. Baskerville, G.L. Use of logarithmic regression in the estimation of plant biomass. Can. J. For. Res. 1972, 2, 49–53. [Google Scholar] [CrossRef]
  85. Zianis, D.; Xanthopoulos, G.; Kalabokidis, K.; Kazakis, G.; Ghosn, D.; Roussou, O. Allometric equations for aboveground biomass estimation by size class for Pinus brutia Ten. trees growing in North and South Aegean Islands, Greece. Eur. J. For. Res. 2011, 130, 145–160. [Google Scholar] [CrossRef]
  86. Bi, H. Converting stem volume to biomass with additivity bias correction sand confidence bands for two Australian tree species. N. Z. J. For. Sci. 2001, 31, 298–319. [Google Scholar]
  87. McRoberts, R.E.; Westfall, J.A. Propagating uncertainty through individual tree volume model predictions to large-area volume estimates. Ann. For. Sci. 2016, 73, 625–633. [Google Scholar] [CrossRef] [Green Version]
  88. Reed, D.D.; Green, E.J. A method of forcing additivity of biomass tables when using nonlinear models. Can. J. For. Res. 1985, 15, 1184–1187. [Google Scholar] [CrossRef]
  89. Moisen, G.G.; Frescino, T.S. Comparing five modelling techniques for predicting forest characteristics. Ecol. Model. 2002, 157, 209–225. [Google Scholar] [CrossRef] [Green Version]
  90. Wood, S.N. mgcv: GAMs and generalized ridge regression for R. R News 2001, 1, 20–25. [Google Scholar]
  91. Bi, H.; Turner, J.; Lambert, M.J. Additive biomass equations for native eucalypt forest trees of temperate Australia. Trees 2004, 18, 467–479. [Google Scholar] [CrossRef]
  92. Zhao, D.; Westfall, J.; Coulston, J.W.; Lynch, T.B.; Bullock, B.P.; Montes, C.R. Additive biomass equations for slash pine trees: Comparing three modeling approaches. Can. J. For. Res. 2019, 49, 27–40. [Google Scholar] [CrossRef]
  93. Liu, J.; Meng, S.; Zhou, H.; Zhou, G.; Li, Y. Tree Volume Tables of China; China Forestry Publishing House: Beijing, China, 2017; pp. 173–180. (In Chinese) [Google Scholar]
  94. Dong, L. Developing individual and stand-level biomass equations in Northeast China forest area. Ph.D. Thesis, Northeast Forestry University, Harbin, China, 2015. (In Chinese with an English abstract). [Google Scholar]
  95. Wang, C. Biomass allometric equations for 10 co-occurring tree species in Chinese temperate forests. For. Ecol. Manag. 2006, 222, 9–16. [Google Scholar] [CrossRef]
Figure 1. Geographic distribution of sample plots in Dongfeng County, Jilin Province, Northeast China.
Figure 1. Geographic distribution of sample plots in Dongfeng County, Jilin Province, Northeast China.
Forests 14 01274 g001
Figure 2. Scatter plots of stand variables versus biomass components; stem (Bs), branch (Bb), needle (Bn), and belowground (Br).
Figure 2. Scatter plots of stand variables versus biomass components; stem (Bs), branch (Bb), needle (Bn), and belowground (Br).
Forests 14 01274 g002
Figure 3. The residual correlation matrices for the two types of model systems and the three conifer species based on NSUR fitting.
Figure 3. The residual correlation matrices for the two types of model systems and the three conifer species based on NSUR fitting.
Forests 14 01274 g003
Figure 4. Comparisons of component biomass predictions from Model 1 associated with the NSUR and GAM methods for the three conifer species. The red line indicates the slope is 45 degrees, and the intercept is 0.
Figure 4. Comparisons of component biomass predictions from Model 1 associated with the NSUR and GAM methods for the three conifer species. The red line indicates the slope is 45 degrees, and the intercept is 0.
Forests 14 01274 g004
Figure 5. Comparisons of component biomass predictions from Model 2 with NSUR and GAM for the three conifer species. The red line indicates a slope of 45 degrees, and the intercept is 0.
Figure 5. Comparisons of component biomass predictions from Model 2 with NSUR and GAM for the three conifer species. The red line indicates a slope of 45 degrees, and the intercept is 0.
Forests 14 01274 g005
Figure 6. The box plot of stand total and aboveground biomass errors for Model 1 and Model 2 against three conifer species for NSUR and GAM. The blue line means the error is equal to zero.
Figure 6. The box plot of stand total and aboveground biomass errors for Model 1 and Model 2 against three conifer species for NSUR and GAM. The blue line means the error is equal to zero.
Forests 14 01274 g006
Table 1. Descriptive statistics for variables and biomass components.
Table 1. Descriptive statistics for variables and biomass components.
VariablesKorean PineKorean LarchMongolian Pine
Min.Max.MeanSDMin.Max.MeanSDMin.Max.MeanSD
G (m2·ha−1)18.8242.5229.655.1811.0035.7624.715.1920.1845.8031.205.41
Dg (cm)7.6135.7821.074.868.4227.5018.994.0911.9523.9018.612.80
Hm (m)5.0818.1813.292.417.0224.9017.393.638.9218.9614.802.18
N (trees·ha−1)350.004375.00988.77561.95400.002625.00981.34452.92450.003200.001244.33481.30
V (m3·ha−1)78.97266.49172.5736.3855.94284.11185.8348.66109.92264.86183.6132.54
Age (a)19.0065.0045.7211.1214.0058.0039.7314.6018.0049.0033.967.58
Bt (Mg·ha−1)65.82224.97140.8432.2535.10235.74144.5344.3283.21208.13143.7226.06
Ba (Mg·ha−1)50.28181.12111.9126.5828.58185.76114.4534.6365.83171.71118.6722.12
Br (Mg·ha−1)15.5443.8528.925.826.5349.9730.099.7015.8436.4225.044.40
Bs (Mg·ha−1)40.33117.9277.5515.6124.02167.15101.9731.7351.36140.1395.3918.17
Bb (Mg·ha−1)4.8748.4922.418.624.3615.6310.722.388.4921.2014.612.67
Bn (Mg·ha−1)4.3220.1311.953.240.193.921.760.745.2112.848.671.55
Note: G represents basal area (m2·ha−1); Dg represents stand quadratic mean diameter (cm); Hm represents stand mean height (m); N represents stand density (trees·ha−1); V represents stand volume (m3·ha−1); Age represents stand age (a); and Bt, Ba, Br, Bs, Bb, and Bn represent the component biomass of total, aboveground, belowground, stem, branch, and needle (Mg·ha−1).
Table 2. Goodness-of-fit statistics and weight functions of stand biomass models for three species.
Table 2. Goodness-of-fit statistics and weight functions of stand biomass models for three species.
Model TypeMethodComponentsKorean PineKorean LarchMongolian Pine
R2RMSEWeight FunctionR2RMSEWeight FunctionR2RMSEWeight Function
Model-1NSURTotal0.94287.7168G2.67650.926112.0525G1.12280.92507.1377G4.4215
Aboveground0.92997.0347G2.74110.93029.1551G0.98100.90026.9891G4.3542
Belowground0.97380.9429G4.2025Hm−3.35740.91042.9025G0.28320.95950.8846G−2.2436Hm5.7752
Stem0.95153.4382G2.07900.92488.7041G0.93810.87746.3642G4.8456
Branch0.80583.7971G4.94910.98990.2387G2.6160Hm−2.68100.92600.7265G4.3753
Needle0.88391.1051G2.96620.64510.4435G3.7972Hm−3.19570.96800.2777G−3.0660Hm4.9095
Model-1GAMTotal0.95107.13760.927811.90500.93046.8725
Aboveground0.94006.50880.93189.04030.90796.7126
Belowground0.97670.88950.91242.86980.96030.8766
Stem0.95543.29720.92638.61450.88656.1239
Branch0.83453.50580.99010.23630.93170.6981
Needle0.89531.04910.66180.43290.96830.2766
Model-2NSURTotal0.98533.9155V3.34610.97407.1482V1.85830.98952.6644V0.1614
Aboveground0.97524.1908V2.62080.97735.2144V1.54990.97083.7786V0.3408
Belowground0.97930.8390V−0.44160.96001.9403V1.64940.87831.5342V−0.0474
Stem0.97652.3933V−0.05360.97145.3676V1.38320.95124.0169V0.3475
Branch0.79863.8668V−2.41780.95210.5207V−0.45250.97600.4135V−0.6777
Needle0.91970.9197V−1.52080.72280.3920V−0.64290.85330.5946V−0.4920
Model-2GAMTotal0.98793.55040.97497.02530.99062.5300
Aboveground0.97913.84070.97815.13060.97323.6238
Belowground0.97980.82850.96161.90110.87901.5297
Stem0.97742.34810.97255.26440.95483.8641
Branch0.82033.65240.95550.50170.97600.4136
Needle0.92490.88870.73030.38660.85350.5942
Table 3. Validation of stand biomass models for three conifer species.
Table 3. Validation of stand biomass models for three conifer species.
Model TypeMethodComponentsKorean PineKorean LarchMongolian Pine
MERMSERRMSEMERMSERRMSEMERMSERRMSE
Model-1NSURTotal−0.55047.85635.57830.952412.43458.60330.11557.29555.0763
Aboveground−0.41187.16476.40200.72509.44678.25440.03407.15036.0252
Belowground−0.13860.95483.30110.22742.99319.94820.08140.89513.5742
Stem−0.02733.50934.52520.70448.98348.81010.06896.50926.8235
Branch−0.27453.862217.23460.00430.24672.3020−0.01540.74375.0897
Needle−0.10991.12319.39480.01620.456425.8887−0.01940.28123.2436
Model-1GAMTotal−0.00997.56685.37280.022212.33068.5314−0.02297.24905.0439
Aboveground−0.01026.91196.17610.01789.36668.1844−0.02127.08725.9721
Belowground−0.00030.92833.20950.00442.96939.8691−0.00170.90773.6247
Stem−0.00853.50474.51940.01548.91988.7477−0.01986.46556.7768
Branch−0.00113.724516.62000.00260.25062.3384−0.00330.73755.0470
Needle−0.00061.10199.2177−0.00020.447725.39930.00190.28953.3393
Model-2NSURTotal−0.01733.98022.82610.24647.27275.0319−0.05362.71901.8919
Aboveground−0.02984.25523.80220.18485.30454.6350−0.04793.86063.2532
Belowground0.01240.85402.95280.06161.97486.5636−0.00561.56886.2644
Stem−0.26302.43983.14610.24475.46195.3565−0.05294.10054.2986
Branch0.19043.927417.5256−0.03880.53064.9521−0.00010.42582.9139
Needle0.04290.93547.8252−0.02110.398222.58690.00510.61037.0403
Model-2GAMTotal−0.00163.75322.66490.00447.17274.9627−0.01162.69351.8742
Aboveground−0.00254.04153.61130.00365.23814.5770−0.01603.82903.2265
Belowground0.00100.84292.91420.00071.94126.45190.00441.58296.3207
Stem0.00742.41073.10850.00295.37505.2713−0.01514.06874.2652
Branch−0.00973.820017.04630.00110.51284.7852−0.00360.43712.9913
Needle−0.00030.92077.7021−0.00030.394322.36550.00260.61417.0838
Table 4. Parameter estimates and standard errors (SE) of the NSUR-based stand biomass models for three conifer species.
Table 4. Parameter estimates and standard errors (SE) of the NSUR-based stand biomass models for three conifer species.
Model TypeComponentsKorean PineKorean LarchMongolian Pine
ci1ki1mi1ci2ki2mi2ci3ki3mi3
Model 1Belowground0.5676 **
(0.0122)
1.0160 **
(0.0068)
0.1898 **
(0.0060)
0.1378 **
(0.0121)
1.0948 **
(0.0284)
0.6488 **
(0.0347)
0.8164 **
(0.0223)
1.0083 **
(0.0082)
−0.0180 *
(0.0067)
Stem1.6851 **
(0.1178)
0.9574 **
(0.0227)
0.2259 **
(0.0210)
0.5606 **
(0.0437)
1.0805 **
(0.0260)
0.6020 **
(0.0311)
1.4037 **
(0.1161)
0.8019 **
(0.0195)
0.5426 **
(0.0240)
Branch0.0425 **
(0.0052)
1.0562 **
(0.0378)
1.0357 **
(0.0359)
0.3374 **
(0.0121)
1.0075 **
(0.0104)
0.0791 **
(0.0112)
0.2616 **
(0.0189)
0.8657 **
(0.0174)
0.3889 **
(0.0196)
Needle0.0923 **
(0.0058)
1.0199 **
(0.0204)
0.5448 **
(0.0206)
0.0033 *
(0.0010)
1.2843 **
(0.0734)
0.7405 **
(0.1026)
0.2676 **
(0.0085)
1.0555 **
(0.0095)
0.0078 ns(0.0076)
Model 2Belowground0.2053 **
(0.0095)
0.9607 **
(0.0088)
0.0438 **
(0.0028)
1.2478 **
(0.0125)
0.2266 **
(0.0269)
0.9029 **
(0.0223)
Stem0.6402 **
(0.0218)
0.9322 **
(0.0065)
0.1815 **
(0.0094)
1.2096 **
(0.0101)
0.3665 **
(0.0260)
1.0668 **
(0.0135)
Branch0.0110 *
(0.0032)
1.4739 **
(0.0560)
0.1553 **
(0.0082)
0.8121 **
(0.0101)
0.0692 **
(0.0038)
1.0266 **
(0.0104)
Needle0.0267 **
(0.0032)
1.1837 **
(0.0230)
0.0014 *
(0.0004)
1.3603 **
(0.0546)
0.0753 **
(0.0087)
0.9106 **
(0.0220)
Note: (**): parameter estimates significant at p < 0.0001; (*), parameter estimates significant at p < 0.05; (ns), non-significant parameter estimates.
Table 5. Parameter estimates, and standard errors (SE) of the GAM-based stand biomass models for three conifer species.
Table 5. Parameter estimates, and standard errors (SE) of the GAM-based stand biomass models for three conifer species.
Model TypeComponentsKorean PineKorean LarchMongolian Pine
β β β
Model-1Belowground28.9231 ** (0.0816)30.0867 ** (0.3025)25.0433 ** (0.0877)
Stem77.5495 ** (0.3035)101.9671 ** (0.9081)95.3928 ** (0.6195)
Branch22.4097 ** (0.3237)10.7153 ** (0.0249)14.6117 ** (0.0705)
Needle11.9541 ** (0.0966)1.7628 ** (0.0456)8.6684 ** (0.0277)
Model-2Belowground28.9231 ** (0.0753)30.0867 ** (0.2004)25.0430 ** (0.1530)
Stem77.5495 ** (0.2135)101.9671 ** (0.5549)95.3928 ** (0.3911)
Branch22.4100 ** (0.3360)10.7153 ** (0.0529)14.6117 ** (0.0414)
Needle11.9541 ** (0.0816)1.7628 ** (0.0408)8.6684 ** (0.0594)
Note: (**): parameter estimates significant at p < 0.0001.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xin, S.; Shahzad, M.K.; Mahardika, S.B.; Wang, W.; Jiang, L. An Alternative Method for Estimation of Stand-Level Biomass for Three Conifer Species in Northeast China. Forests 2023, 14, 1274. https://doi.org/10.3390/f14061274

AMA Style

Xin S, Shahzad MK, Mahardika SB, Wang W, Jiang L. An Alternative Method for Estimation of Stand-Level Biomass for Three Conifer Species in Northeast China. Forests. 2023; 14(6):1274. https://doi.org/10.3390/f14061274

Chicago/Turabian Style

Xin, Shidong, Muhammad Khurram Shahzad, Surya Bagus Mahardika, Weifang Wang, and Lichun Jiang. 2023. "An Alternative Method for Estimation of Stand-Level Biomass for Three Conifer Species in Northeast China" Forests 14, no. 6: 1274. https://doi.org/10.3390/f14061274

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop