Next Article in Journal
A Comparison of the Qualitative Characteristics of Pellets Made from Different Types of Raw Materials
Previous Article in Journal
The Temperature-Dependent Functional Response and Mutual Interference of Cyanopterus ninghais (Hymenoptera: Braconidae) Parasitizing Monochamus alternatus (Coleoptera: Cerambycidae)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessment of Potential Prediction and Calibration Methods of Crown Width for Dahurian Larch (Larix gmelinii Rupr.) in Northeastern China

1
Department of Forest Management, School of Forestry, Northeast Forestry University, Harbin 150040, China
2
Key Laboratory of Sustainable Forest Ecosystem Management-Ministry of Education, School of Forestry, Northeast Forestry University, Harbin 150040, China
*
Author to whom correspondence should be addressed.
Forests 2023, 14(10), 2022; https://doi.org/10.3390/f14102022
Submission received: 1 September 2023 / Revised: 2 October 2023 / Accepted: 6 October 2023 / Published: 9 October 2023
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)

Abstract

:
Crown width (CW) is an important indicator for assessing tree health, vitality, and stability, as well as being used to predict forestry models and evaluate forest dynamics. However, acquiring CW data is laborious and time-consuming, making it crucial to establish a convenient and accurate CW prediction model for forest management. In this study, we developed three models capable of conducting calibration: generalized models (GM), quantile regression models (QR), and mixed-effects models (MIXED). The aim was to effectively improve the prediction accuracy of CW using data from Dahurian larch (Larix gmelinii Rupr.) in Northeastern China. Different sampling designs were applied, including selecting the thickest, thinnest, intermediate, and random trees, with 1 to 10 sample trees for each design. The results showed that all models achieved accurate CW predictions. MIXED displayed the most superior fitting statistics than GM and QR. In model validation, with the increase in the number of sample trees, the model prediction accuracy gradually improved and the model differences gradually reduced. MIXED produced the smallest RMSE, MAE, and MAPE across all sampling designs. The intermediate tree sampling design with the best validation statistics for the given sample size was selected as the final sampling design. Under intermediate tree sampling design, MIXED required a minimum of five sample trees, while GM and QR required at least five and six sample trees for calibration, respectively. Generally, we suggested selecting MIXED as the final CW prediction model and using the intermediate tree sampling design of five trees per plot. This study could provide ideas and support for forest managers to accurately and efficiently predict CW.

1. Introduction

The exchange and conversion of material and energy between forests and the environment primarily occur through the interaction of light, water, and gases by trees [1]. The tree crown serves as the primary site for carrying out this crucial task, collectively influencing tree growth and biogeochemical cycles through the interception of light and regulation of gas and water exchange [2]. The amount of light intercepted by a tree is closely related to the distribution of leaves and branches in the crown, which holds significant importance for forest productivity and nutrient cycling [3,4,5]. Therefore, the crown width (CW), as one of the characteristics of leaf and branch distribution in horizontal space, has important research value for understanding tree architecture and forest ecosystem dynamics [6].
CW serves as a significant variable in various forestry models, including biomass models [7,8], volume models [9,10], taper equations [11,12], and basal area increment models [13], etc. Additionally, CW influences the health and vigor of trees [14], competition [15,16], and forest mortality rates [13], etc. Nevertheless, traditional manual methods of CW measurement demand substantial time and effort, which are constrained by terrain-vegetation occlusion, human perspective, and steel measuring tape deformations. Better methods of obtaining CW have also emerged with technological advances, such as LiDAR that can scan the morphology of trees throughout an entire plot [17] or remote sensing techniques that can identify tree crowns through algorithms [18,19]. However, these technologies require a high level of expertise and expensive software and hardware support to achieve accurate CW measurements. Therefore, the development of CW models using easily measurable variables for estimating CW remains a common and cost-effective choice.
Researchers had already developed numerous CW models, which served as important references for developing CW models [20,21,22,23]. But carelessly using these models may lead to substantial deviations in predictions due to the inherent variations in different tree species and established environments [24]. Therefore, it is necessary to build a model that can adapt to the distribution and changes in CW data over a broad region. It is advantageous to develop generalized CW models (GM) containing variables that reflect the attributes of the study trees and the environment in the research area, such as tree size, stand characteristics, and site characteristics to enhance predictive accuracy [25,26].
In addition to adding variables, the predictive accuracy of the CW model can be improved by using more advanced methods. Mixed-effects models (MIXED) with unique advantages usually achieve the highest predictive accuracy and are widely used [1,20,21]. On the one hand, MIXED can effectively analyze hierarchical nested data with spatial or temporal dependencies, making it less affected by heteroscedasticity and autocorrelation problems [27,28]. On the other hand, MIXED uses the empirical best linear unbiased prediction theory (EBLUP) and prior information from new stands (i.e., collecting several sample trees) to calibrate the random effects of individual stands for more accurate CW predictions [29].
Furthermore, calibration techniques applying GM and quantile regression (QR) have become prevalent in recent years to improve model prediction accuracy [30,31]. GM using the ordinary least squares method (OLS) [26] to estimate the model parameters is widely employed in modeling, and therefore research on calibrating GM has a generalizable value. However, research on GM calibration, aside from studies conducted in recent years, is limited to studies involving Hanus et al. [32] and Temesgen et al. [33]. QR is a flexible statistical method that offers several advantages, including estimates of the conditional quantile of the dependent variable distribution, not requiring further assumptions about the distribution of the error term, stronger robustness and insensitivity to outliers, and the fitted model not needing to satisfy equal variances [34,35]. Consequently, QR has been widely applied in forestry studies in recent years [30,31,36,37,38,39], and the method of combining multiple QR curves for calibration has shown promising results [30,31,38,39].
Different sampling strategies can change the prior information obtained, which may affect the accuracy of the predictions after calibration [40]. Therefore, the design and selection of sampling strategies need to strike a balance to ensure sufficiently high prediction accuracy while keeping the time and effort expended by surveyors within a reasonable range [29]. Currently, research on comparing sampling strategies for model calibration generally focuses on studying MIXED or QR separately, with relatively few comparisons between the sampling strategies of MIXED and QR [30,41]. And there is a lack of research comparing the three calibrations of models, namely MIXED, QR, and GM, in the context of CW models. Therefore, investigating the differences in results obtained from different sampling strategies for these three models during calibration would be highly meaningful.
Dahurian larch, a highly adaptable tree species to harsh climatic and soil conditions, plays a crucial role in climate change in the Far East region of the Eurasian continent [42,43]. Currently, Dahurian larch has been introduced in several countries such as Sweden and Finland [44]. Moreover, Dahurian larch plantation forests are the most common and typical research objects in Northeastern China [45]. Research on the health and stability status of Dahurian larch based on indicators such as CW is receiving increasing attention; however, there are few reports on developing CW models for Dahurian larch [21]. Therefore, the object of this study is the plantation forest of Dahurian larch, which makes up for the lack of current forestry research. Our primary goal is to develop and compare high-precision CW prediction models for Dahurian larch using different modeling methods including the generalized model (GM), mixed-effects model (MIXED), quantile regression model (QR), the corresponding calibration techniques, and a variety of sampling strategies. This study is intended to provide a broad range of ideas for CW studies of other tree species and forest management planners around the globe to ensure highly accurate and cost-effective CW predictions.

2. Materials and Methods

2.1. Study Area

This study focused on a sample of forests from the Weixing and Mayongshun forest farms, which are located in Tieli and Qingan, Heilongjiang Province, China (127°57′–128°12′ E, 47°02′–47°36′ N) (Figure 1). The predominant vegetation type is mixed broadleaf-conifer forest and plantations, with primary tree species including Korean pine (Pinus koraiensis Siebold and Zucc.), Dahurian larch, Scots pine (Pinus sylvestris L.), Chinese fir (Cunninghamia lanceolata (Lamb.) Hook.), ash (Fraxinus mandshurica Rupr.), elm (Ulmus pumila L. cv. Tenue), and white birch (Betula platyphylla Sukaczev). Forested lands cover an area of 0.2 million ha, with a total forest growing stock volume of 20.27 million m3, of which 11% is comprised of plantations. The average altitude is 513 m, with an annual precipitation of 831.2 mm and a mean annual temperature of 2.4 °C. The mean annual maximum and minimum temperatures are 8.8 °C and −3.4 °C, respectively. The climate in the study area is characterized by short, hot summers and predominantly long, cold, and dry winters with an extended duration of snow cover. The topography of the study site is defined by a small and flat plain in the west and low mountainous terrain in the east, which contains most of the forested areas. In China, the afforestation of Dahurian larch has been carried out for over 60 years since the founding of the country. Due to its excellent adaptability to cold climates and relatively fast growth rate, Dahurian larch has become a prominent species for reforestation and serves as a significant timber resource in China [45].

2.2. Data

In this study, we analyzed data collected in 2018 and 2019 from 62 temporary sample plots, comprising a total of 5059 Dahurian larch. Each plot had a square shape with an area of 0.09 ha. These plots were Dahurian larch plantations and exhibited no evidence of thinning, pruning, burning, severe disease, or other damage. All living trees without top damage were included in the data set for statistical analysis. Other tree species, such as ash, elm, and white birch, were present in the plots, but their numbers were too small to be included as target species in the analysis. However, they were used to calculate stand-level variables [46]. For each tree, the diameters (outside-bark 1.3 m above ground level) were measured to the nearest 0.1 cm to obtain diameter at breast height (D, cm). We retained all trees with D ≥ 5 cm and measured other variables, including total tree height (H, m) and height to crown base (HCB, m), with a precision of 0.1 m (measured using a Vertex IV ultrasonic hypsometer). We used the vertical sighting method [47] by looking up and walking to the edge of the crown in four different directions (north, east, south and west). The horizontal distance from the center of the trunk to the location (CRn, CRe, CRs, and CRw) was then measured with the help of a tape measure and a laser range finder. For the same crown radius, three or more measurers were asked to repeat the measurements and then the average value was calculated as the final value of the crown radius. CW was calculated by (CRe + CRs + CRw + CRn)/2 [1,28].
Several stand variables were counted and calculated to facilitate the development of generalized CW models. These variables included the dominant tree height (HDOM, m), dominant tree diameter (DDOM, cm), quadratic mean diameter (Dq, cm), basal area of the stand (BA, m2·ha−1), basal area of trees larger than the subject tree (BAL, m2·ha−1), and stem numbers (N, trees·ha−1). According to the IUFRO (International Union of Forest Research Organizations), HDOM was calculated as the average height of the 100 largest trees per hectare based on the plot area [48]. Additionally, the H-to-D ratio (HDR) and relative spacing index (RSI), D-to-Dq ratio (DQMD), and Reineke’s stand density index (SDI) have all been included as important candidate predictor variables in this paper [20,49]. Due to incomplete data collection during the data gathering process, information regarding stand age for various sites was not fully obtained. Therefore, stand age was not included in the modeling process. All the above variables were easily accessible and calculated through conventional methods, making the models user-friendly and practical for application. Table 1 provides a detailed summary of the data used in the analyses.

2.3. Generalized Model

2.3.1. Basic Model

The study utilized the commonly used nine CW-D models in the literature as candidates (Table 2) [26,50,51]. The data were input into these models to obtain CW predictions. The coefficient of determination (R2), root mean square error (RMSE), and Akaike information criterion (AIC) were calculated for each model and compared to select the optimal basic model.

2.3.2. Variable Selection

Tree size, site characteristics, and stand characteristics variables were included in the optimal basic model through the two-stage approach [27,28]. In the first stage, we fitted the basic model to the data for each plot separately; however, the model for some plots could not converge. Therefore, we fitted the mixed-effects model of the basic model to obtain random-effect parameters for each plot. The location of the random-effect parameters of the best-fitting model was used as the location of the added variable. The Pearson correlation plots of the random effect parameters for each plot were then plotted against all variables. In the second stage, the basic model was expanded through reparameterization to incorporate variables that were closely correlated with the parameters.
A priori analysis showed no variance heterogeneity across the range of predicted values, and therefore no variance function was applied for GM and MIXED. The effect of a covariate on variations in CW was evaluated graphically by incrementally changing a covariate variable at equal intervals while keeping the other variables at their mean values. The GM was fitted using the “nls” function in R software version 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria) [52].

2.4. Mixed-Effects Model

We incorporated plot-level random effects into GM using the nonlinear mixed-effects modeling approach. The variation in random effect parameters in mixed-effects models can partially explain the differences in covariate values between groups or represent inter-group variability that cannot be explained in some applications [28,53]. The matrix form of the expression for the nonlinear mixed-effects model (MIXED) is defined as follows:
CW i j = f ( φ i , x i j ) + ε i j , φ i = A i β + B i b i ε i j ~ N 0 , R i , R i = σ 2 G i 0.5 Γ i G i 0.5 b i ~ N 0 , D
where CWij is the CW of the jth sample tree in the ith sample plot, xij is a vector for measurements of predictor variables on plot i, φi is the particular parameter of plot i, εij is the error term of the normal distribution within the sample plot. Ai and Bi are the design matrices, β is a p × 1 dimensional vector of fixed parameters, bi is a q × 1 dimensional vector for plot-level random effect parameters and is assumed to have a multivariate normal distribution with a zero mean and respective variance–covariance matrix D. Ri is the variance–covariance matrix within the plot where σ2 is the residual variance common to all plots, Γi is a matrix accounting for within-plot autocorrelations of the residuals, and Gi is a diagonal matrix accounting for the variance in within-plot heteroscedasticity. The “nlme” function of R software version 4.2.1 and the “nlme” package were used to estimate the random effect parameters [52].

2.5. Quantile Regression

Quantile regression offers a more complete picture of the relationship between variables by estimating multiple rates of change from the minimum to the maximum response. The parameters of quantile regression (QR) are obtained by minimizing the asymmetric loss function of the absolute values of the residuals as follows [54]:
S = y i j y ^ i j τ τ y i j y ^ i j τ + y i j < y ^ i j τ 1 τ y i j y ^ i j τ
where y ^ i j τ is the predicted value of CW in the τth quantile model, and yij is the measured value of CW. A series of QRs (τ = 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9) were constructed to fit the entire dataset, by using the “nlqr” function in the “quantreg” package of R software version 4.2.1 [52].

2.6. Predictive Ability of Different Models

2.6.1. Calibration Technique of GM

If the response variable of a subsample of nij trees from the ith plot is known, this prior information could improve the prediction accuracy of the response variable for the remaining trees in the same plot. The correction factor k i * could be calibrated using the following formula [33]:
k i * = j = 1 n i j CW ^ i j CW i j / j = 1 n i CW ^ i j 2
where C W i j ^ is the predicted CW from GM for the jth subsample tree in the ith plot. C W i j is the observed CW value of the jth subsample tree in the ith plot. nij is the number of sample trees used for calibration in the ith plot. Then, multiplying the correction factor k i * with the GM gives the calibration model (GMC). The function form of GMC is as follows:
CW i j = k i × GM

2.6.2. Calibration Technique of MIXED

The plot-specific random effects parameter of the ith plot could be predicted by using the first-order Taylor series expansion based on the subsample of trees in the same plot [53,55]. The random effects parameter b ^ i k is shown as follows:
b ^ i k = D ^ Z i T Z i D ^ Z i T + R ^ i 1 y i f x i , β ^ , b ^ i k 1 + Z i b ^ i k 1
where β ^ is a vector of fixed-effect parameters β , D ^ is the variance–covariance matrix at the sample site level, R ^ i is the error matrix, Z i is the partial derivative of the function f with respect to b i , shaped as f ( x i , β , b i ) b i β ^ , b i , and yi is a vector of measured values of CW.
Since both ends of Equation (5) contain random effect parameters b ^ i , we used an iterative process to solve it [56]. In this process, b ^ i k represents the estimated value of the random effect parameter for plot i in the k-th iteration. When the k-th iteration is completed and | b ^ i k b ^ i k 1 | < 1 × 10 6 is reached, b ^ i k represents the final estimate of the random effect parameter. It is added to the specified fixed-effect parameters to calculate the predicted CW:
y ^ i = f x i , β ^ , b ^ i k

2.6.3. Calibration Technique of QR

The goal of QR calibration is to improve prediction accuracy by identifying the quantile curve or the two closest quantile curves that the subsample of trees in each plot pass through [30,38,39]. The calibration formula for QR is shown as follows:
CW ^ i j = α y ^ k x i j + 1 α y ^ k + 1 x i j
where C W i j ^ is the predicted CW of calibration, y ^ k x i j and y ^ k + 1 x i j are the predicted CW of the two closest curves of a group of quantile curves, and α is the interpolation ratio in the form α = y ^ k + 1 x i j C W i j y ^ k + 1 x i j y ^ k x i j .
The calibration process was divided into two cases: measuring the CW of either a single tree or multiple trees in each plot. When measuring CW for a single tree, two quantile curves were selected to encompass CWij (i.e., y ^ k x i j     C W i j   y ^ k + 1 x i j ), and a calibrated CW curve passing through that point was obtained by interpolation using Equation (7). If the CW is higher than the maximum (qth) quantile curve, y ^ k is defined as y ^ q 1 , y ^ k + 1 is defined as y ^ q . Similarly, if the CW is lower than the minimum (1st) quantile curve, y ^ k and y ^ k + 1 are defined as y ^ 1 and y ^ 2 , respectively.
When measuring the CW of multiple trees, the calibration process involves finding the two quantile curves with sign changes that correspond to the mean bias error (MBE) between the observed CWs (CWobv) and predicted CWs (CWpre) of the sampled trees for each plot (i.e., MBE = 1 n j = 1 n ( C W o b v C W p r e ) at τ = k, MBE > 0; at τ = k + 1, MBE < 0), the interpolation ratio α is minimized by computing j = 1 n ( C W o b v C W p r e ) 2 , and α is substituted into Equation (7) to complete the calibration. In cases where most of the CWobv values are lower than the CWpre values of the minimum quantile (k = 1) curve (i.e., MBE < 0 in all quantile curves), y ^ k and y ^ k + 1 are defined as y ^ 1 and y ^ 2 , respectively. Similarly, if most of the CWobv values are larger than the CWpre values of the maximum quantile (k = q) curve (i.e., MBE > 0 in all quantile curves), then y ^ k is defined as y ^ q 1 , and y ^ k + 1 is defined as y ^ q .
Four combinations of quantile regression for calibration (QRC), containing three-quantiles (QRC3, τ = 0.1, 0.5, and 0.9), five-quantiles (QRC5, τ = 0.1, 0.3, 0.5, 0.7, and 0.9), seven-quantiles (QRC7, τ = 0.1, 0.3, 0.4, 0.5, 0.6, 0.7, and 0.9), and nine-quantiles (QRC9, τ = 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, and 0.9) were compared. The optimal QRC was then compared with GMC and MIXED.

2.7. Sample Alternatives in Prediction

Different sampling designs and sample sizes can change the correction factor k i * , random effects parameters b ^ i k , and interpolation ratio α in the above models, leading to variations in the accuracy of CW predictions. Therefore, several strategies of tree selection were compared for the models used in this study, including:
  • Selecting the 1 to 10 thickest sample trees from each plot for calibration;
  • Selecting the 1 to 10 sample trees that have DBH closest to the arithmetic average of all the sample trees from each plot for calibration;
  • Selecting the 1 to 10 thinnest sample trees from each plot for calibration;
  • Randomly selecting 1 to 10 sample trees from each plot for calibration, repeating the process 50 times, and calculating the average to reduce the extreme calibration results due to randomness.
Since the uncalibrated models were not subject to sampling, the model parameters remained unchanged. Previous research had demonstrated that selecting fewer than 10 sample trees was the most sensible option, taking into account the fluctuations in prediction accuracy and practical application [57]. The selection of sample size can be achieved by assessing the RMSE variation rates (ΔRMSE).

2.8. Model Estimation and Evaluation

All the fitting and validation statistics of the models were obtained using R software in version 4.2.1 [52]. A self-written function was used for model calibration, and the graphical results of the models were obtained using various functions from the “ggplot2” package [52]. The models were validated using leave-one-out cross-validation (LOOCV), with each plot being used as the validation data and the remaining plots as the fitting data. The fitting statistics of the models used R-squared (R2) and root-mean-square error (RMSE), while the validation statistics used RMSE, mean absolute error (MAE), and mean absolute percentage error (MAPE). The calculated evaluation statistics for fitting and validating are as follows:
R 2 = 1 i = 1 m j = 1 n ( CW i j CW i j ^ ) 2 i = 1 m j = 1 n ( CW i j CW i ¯ ) 2
RMSE = 1 i = 1 m n i = 1 m j = 1 n ( CW i j CW i j ^ ) 2
MAE = i = 1 m j = 1 n | CW i j CW i j ^ | i = 1 m n
MAPE = 1 i = 1 m n i = 1 m j = 1 n CW i j CW i j ^ CW i j × 100
Δ RMSE = R M S E ( S a m p l e = 1 ) R M S E ( S a m p l e = i ) R M S E ( S a m p l e = 1 ) × 100
where C W i j is the observed value of the jth tree CW in the ith plot, C W i j ^ is the model predicted value of C W i j , C W i ¯ is the average value of CW in the ith plot, m is the number of plots, and n is the number of trees per plot, RMSE ( Sample = 1 ) is the RMSE value of sample size = 1, and RMSE ( Sample = i ) is the RMSE value of sample size = i.

3. Results

3.1. Generalized Model

The forms and fitting statistics of the nine candidate base models are presented in Table 2. The parameter estimation values of all models were statistically significant (p < 0.05). The logistic model had the highest R2 and the lowest RMSE and AIC values, making it the optimal choice as the basic model for predicting CW. The basic model is shown as follows:
CW = β 0 1 + e x p ( β 1 D + β 2 )
where CW and D are the respective crown width and the diameter at breast height of each tree; β 0 , β 1 and β 2 are the model parameters.
The likelihood ratio tests for different random-effects parameters for the mixed-effects model form of the basic model are shown in Table 3. Adding the random-effects parameters to β1 and β2 of Equation (13) yielded the smallest AIC and BIC values, indicating that they were the optimal positions for incorporating the variables. The Pearson correlation coefficients (r) between the random-effect parameters added to β1 and β2 and the variables in Table 1 are depicted in Figure 2. The parameter β1 of the basic model had a strong correlation with Dq, HDOM, and DDOM (r > 0.6). The parameter β2 had a moderate correlation with H, N, Dq, HDOM, and DDOM (0.4 < r < 0.6). Different combinations of variables with moderate and strong correlations with parameters were added to the model for fitting. The best combination of variables added to the model was selected based on the fitting statistics RMSE and R2. The GM is shown as follows:
CW = β 0 1 + e x p ( ( β 1 + β 3 Dq ) D + ( β 2 + β 4 HDOM ) )
where Dq is the quadratic mean diameter of each plot; HDOM is the dominant tree height of each plot; β3 and β4 are the model parameters to be estimated; and other abbreviations have been defined in Equation (13).
The CW prediction curves of the GM are shown in Figure 3 for the influence of Dq and HDOM on CW. CW increased as HDOM increased and decreased as Dq increased. As the D increased, the differences in curves within the Dq became more pronounced, while those within the HDOM became less noticeable.

3.2. Mixed-Effects Model

Mixed-effects models were established according to Equation (14). There were 31 different combinations of random-effect parameters, of which 18 converged. A comparison of the optimal mixed-effects models for choosing different numbers of random-effects parameters is shown in Table 4. The random-effects parameters b0 and b1 had the lowest AIC and BIC values and the highest log-likelihood, making them the most optimal form of MIXED:
CW = β 0 + b 0 1 + e x p ( ( ( β 1 + β 3 Dq ) D + ( β 2 + ( β 4 + b 1 ) HDOM ) ) )
where b0 and b1 are the random-effects parameters; other abbreviations have been defined in Equation (14).

3.3. Quantile Regression

The same function form in Equation (14) was used to predict the τth CW quantile as follows:
CW τ = β 0 τ 1 + e x p ( ( ( β 1 τ + β 3 τ Dq ) D + ( β 2 τ + β 4 τ HDOM ) ) )
where C W τ is the predicted value of the τth quantile crown width, β 0 τ ~ β 4 τ is the parameter of the τth QR of CW, and the other abbreviations have been defined in Equation (14).
Figure 4 shows the various CW simulation curves based on quantiles (τ = 0.1, 0.2, 0.3, 0.4,0.5, 0.6, 0.7, 0.8, and 0.9). The QR curves did not show a significant simulation curve crossover at the points of maximum and minimum D, which suggested that they effectively reflected the distribution of CW. In addition, the simulation curve of GM has been included in Figure 4, and its shape closely resembles that of the median curve (τ = 0.5). The two curves almost completely overlap, except for the extreme values of D. These findings indicated that the distribution of CW in this study was uniform, and the mean value of the CW was similar to the median value.

3.4. Model Fitting

The model parameters and fitting statistics are presented in Table 5. The estimated values of the model parameters were all significant (p-values < 0.05) and possessed statistical significance. The highest fitting accuracy was achieved when τ = 0.5 (median quantile regression) among all quantile regressions, which corresponded to the optimal QR. The farther the magnitude away from the median, the worse the fitting accuracy. GM and QR (τ = 0.5) showed similar R2 and RMSE values, with GM slightly edging out QR (τ = 0.5). MIXED had the smallest RMSE and the largest R2, and was markedly superior to GM and QR (τ = 0.5), thereby making it the best-fitting model.

3.5. Model Validation

3.5.1. Comparison of Multiple QRCs

LOOCV was used to validate the calibration of the QRCs in the four sampling strategies. The results of the calibrated validation statistics are shown in Figure 5. The overall trend of MAE, MAPE, and RMSE decreased as the number of sample trees increased. Except for the thinnest tree sampling strategy, the difference in prediction accuracy was small for the same number of sample trees, especially when sufficient prior information was available (sample size > 5). The overall trend was that the more QR curves used, the higher the prediction accuracy of CW (with the thickest tree sampling design best fitting this pattern). When the prior information was limited (1 < sample size < 5), the prediction accuracy of CW using the intermediate tree and random tree sampling design was better for the QRC5 than for the QRC7 and QRC9. In short, the five-quartile (QRC5) calibration was slightly better than other QRCs for most comparisons with the same number of sample trees. Therefore, the QRC5 was chosen as the optimal QRC and compared with the calibration strategies of other models.

3.5.2. Comparison of GMC, MIXED, and QRC5

To compare the differences between GMC, MIXED, and QRC5 among four sampling designs with 1–10 sampling trees, LOOCV was used to obtain the validation statistics in Figure 6 (See Appendix A Table A1 for more details). The MAE, MAPE, and RMSE of GMC and QRC5 were lower than those of GM under a certain sample size for the four sampling designs, while the MAE, MAPE, and RMSE of the calibrated MIXED were consistently smaller than those of GM under any sampling strategy. The MAE, MAPE, and RMSE of each calibration decreased gradually with an increase in the number of sampling trees across the four sampling strategies. The thinnest tree strategy of GMC showed the maximum MAE, MAPE, and RMSE, indicating the poorest performance. Except for the thickest tree strategy, the ranking of model validation statistics for the other strategies was as follows: MIXED, QRC5, and GMC. The ranking for the thickest tree strategy was MIXED, GMC, and QRC5.
Figure 7 shows the comparison of the same calibration model under different sampling strategies in terms of RMSE. For MIXED, the RMSE of random tree and intermediate tree sampling strategies was similar and smaller than the two extreme sampling strategies (the thickest and thinnest tree) in all sampling trees. For GMC, the RMSE of the thinnest tree sampling strategy was significantly greater than the other three strategies, and it never exceeded the RMSE of GM. The RMSE of the intermediate tree and random tree sampling strategies were the smallest and closest to each other. For QRC5, the intermediate tree and random tree sampling strategies still had the smallest RMSE, while the differences between the two extreme sampling strategies and the intermediate tree and random tree sampling strategies were more significant. Moreover, only when the number of sampled trees was five did the two extreme strategies perform better than GM.
In summary, the validation statistics of the sampling designs for the intermediate trees and random trees were consistently smaller than those of the two extreme value sampling designs. The ΔRMSE for MIXED, QRC5, and GMC were recorded under the intermediate and random tree sampling designs to determine the minimum sample size (Table 6). Among the three models, the decline rates for intermediate trees and random tree sampling designs decreased from 12.8% and 18.6% to a mere 0.3% and 0.4%, respectively. Using a benchmark of ΔRMSE above 1%, the minimum sample sizes for MIXED, QRC5, and GMC under the intermediate tree sampling design were determined to be five, five, and six trees, respectively. Similarly, under the random tree sampling design, the minimum sample sizes were found to be five, six, and seven trees for MIXED, QRC5, and GMC, respectively. In both sampling designs, the optimal sample sizes for QRC5 and GMC were larger than MIXED. Although the RMSE for QRC5 and GMC under the random tree sampling design was lower than that of the intermediate tree sampling design, the optimal sample size for the intermediate tree sampling design consistently required one fewer tree.

4. Discussion

4.1. Model Selecting and Variables Adding

In this study, the logistic model was selected as the basic model for the CW-D model. This choice was made not only because the logistic model showed good fitting performance but also because it is commonly used to describe a prudent growth of population size [58]. The S-shaped growth curve characteristic of the logistic model can effectively simulate the pattern of rapid early growth followed by stabilized growth in forest trees.
By adding additional variables to the CW basic model, not only can the accuracy of CW predictions be improved, but the relationship between CW and D allometric growth can also be better explained. In this study, two stand variables, Dq and HDOM, were selected using a two-stage approach. The inclusion of Dq as a significant stand variable in the CW model is consistent with the choices made by Paulo and Tomé [59] and Sanchez-Gonzalez et al. [51]. HDOM is a fertility index reflecting the quality of the forest floor in terms of growth and yield capacity [60,61]. Overall, the higher the HDOM, the greater the soil fertility and the greater the growing resources, and therefore the greater the CW is expected to be [28,62,63]. Tree size variables besides D were not included, which might be attributed to the similarity in phenological and growth characteristics within pure stands, leading to crown growth being more concentrated within a specific layer of the canopy, making the impact of tree size variables such as H and HCB on CW weaker than that of stand variables [64,65].

4.2. Comparison between Different QRCs

QR can flexibly predict data at different quantiles (as observed in Figure 4 for different QR curves of CW). For studies focusing on extreme values, such as the characteristics of the outer contour or the outermost contour of the crown [23,66], using QR allows for effective predictions by the maximum or minimum QR curves. Therefore, selecting appropriate QR curves for calibration is crucial. In this study, to explore the impact of the number of QR curves on the accuracy of predictions, we divided the QR curves into four groups (QRC3, QRC5, QRC7, and QRC9). However, some studies only compared QRC3 and QRC5 [30,38,39,67]; others extended the comparison by including QRC9 [41]; and QRC7 was not included in these studies. This could be because QRC7 has uneven spacing for τ values. In the studies related to QRCs, the differences in validation statistics for various QRCs were not significant, which was consistent with the results of this study [30,38,39,41]. However, there have been diverging opinions regarding the selection of QRC. Cao and Wang [38], Özçelik et al. [67], Xie et al. [41], and others suggest using QRC5 or QRC9 in studies related to taper equations, branch growth, or tree height, while some researchers choose the QRC3 for calibration [30,39]. The possible reason could be overfitting caused by additional curves, where the improvement in predictive ability is not significant and could even result in reduced performance [39]. In conclusion, too many or too few QR curves in QRC may not be conducive to improving predictive ability. Therefore, considering the need to achieve a certain level of prediction accuracy in model calibration, this study selected a moderately sized QRC5 as the final QRC.

4.3. Comparison of GMC, MIXED and QRC

According to the validation statistics after calibration, the predictive accuracy of CW significantly improved in all three models (Figure 6 and Figure 7 and Appendix A Table A1). Furthermore, as the sample size increased, the differences in predictive accuracy among the models gradually decreased. The observed variation trend in predictive accuracy for these three models in this study aligns with the findings of Özçelik et al. [30] and Wang et al. [31], who also compared GMC, QRC, and MIXED models for predicted tree height and height to crown base, respectively. Additionally, several studies comparing QRC and MIXED have reported similar results [41,68]. The choice of MIXED as the final model is not only because the validation statistics consistently outperformed QRC and GMC, but also because MIXED demonstrates extremely high robustness. Even when the four sampling designs caused significant variations in CW prediction accuracy for QRC and GMC (Figure 7), MIXED was able to maintain a high level of CW prediction accuracy [30,31].

4.4. Selection of Calibration Strategies

In studies examining the impact of sampling strategies on calibration results, there is considerable controversy surrounding the selection of sampling designs. For example, Crecente-Campo et al. [57] and Castedo Dorado et al. [69] found that the best predictive performance for the height-diameter model in MIXED was achieved using the thinnest trees. In contrast, Temesgen et al. [33] and Fu et al. [70] found that the validation statistics performed best when the thickest trees were selected. However, it was decided to choose an intermediate or random tree sampling design rather than the thickest or thinnest tree sampling design in this study. Fu et al. [70] believed that selecting the thickest trees could provide information on the stand structure and tree development through the HDOM. However, the model developed in this study included HDOM, which may potentially limit the capacity of the thickest tree sampling to provide additional information for calibration [69,71]. Based on this reason, Crecente-Campo et al. [57] and Castedo Dorado et al. [69] also chose the thinnest trees as their final sampling strategy. At the same time, Crecente-Campo et al. [57] also pointed out that the thinnest tree sampling design provided biased samples. Without underlying sample probabilities, there is no statistical basis to estimate modeling error [72,73]. However, the highest predictive accuracy ultimately led them to choose the thinnest tree sampling. Therefore, the magnitude of predictive accuracy is the primary influencing factor in the selection of the sampling design. For this very reason, we made the decision to choose between random sampling and intermediate tree sampling designs as our final selection (Figure 6).
This study ultimately decided to choose an intermediate tree sampling design as the final sampling design because its RMSE was consistently smaller than that of the random tree sampling design (Table 6). This finding is consistent with that of Crecente-Campo et al. [74] and Yan et al. [75] in predicting tree height and crown width. A possible reason may be that intermediate trees provide more effective information by occupying the largest proportion in each plantation [75]. Although the random tree sampling design can generate more reliable and accurate predictions by averaging multiple simulation results [57,68], the repeated random sampling consumed a significant amount of time and effort in the forest survey. In conclusion, MIXED was ultimately chosen as the CW prediction model, and the most reasonable sampling strategy was chosen as selecting five intermediate sample trees. Of course, QRC5 and GMC can also be considered as an alternative to MIXED if the time and resource consumption of obtaining the sample tree information is ignored.
This study emphasizes the search for modeling methods and sampling schemes that can improve the accuracy of CW prediction, which is a significant reference for the overall design of CW prediction studies. However, since this study only focused on the Daurian larch plantation forest, more exploration is needed to explore the feasibility of the conclusions when applied to other tree species.

5. Conclusions

This study analyzed and compared the prediction accuracy of the crown width (CW) of Dahurian larch plantations in the Xiaoxing’an Mountains by different modeling approaches including the generalized model (GM), mixed-effects model (MIXED), quantile regression (QR), and their corresponding correction techniques. Firstly, we found that the prediction accuracy of the CW model was improved by incorporating stand variables of quadratic mean diameter (Dq) and dominant height (HDOM). Secondly, we discovered that the prediction accuracies of GM (GMC), MIXED, and QR (QRC3, QRC5, QRC7, and QRC9) were significantly improved when calibrated using different sampling designs. QRC5 exhibited superior prediction accuracy and stability compared to other QRC methods. MIXED consistently outperformed QRC5 and GMC in terms of CW prediction accuracy and robustness, regardless of the sampling design. Finally, intermediate tree sampling was chosen as the best sampling design because it provided the best CW prediction accuracy. The optimal sample size for MIXED (five trees) was smaller than that of QRC5 (five trees) and GMC (six trees). Therefore, it was decided to select MIXED with an intermediate tree sampling design and a sample size of six trees as the final model for predicting CW. The models and design options presented in this study will inspire other researchers to sample and calibrate CW prediction models when developing Daurian larch plantations or other coniferous forests. Future work will focus on conducting additional explorations to explore the feasibility of applying these findings to other tree species.

Author Contributions

Conceptualization, S.L. and L.J.; methodology, S.L. and L.J.; software, J.W. and S.L.; validation, S.L.; formal analysis, S.L.; investigation, S.L. and J.W.; resources, L.J.; data curation, S.L.; writing—original draft preparation, S.L.; writing—review and editing, J.W. and L.J.; visualization, S.L. and J.W.; supervision, L.J.; project administration, L.J.; funding acquisition, L.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Applied Technology Research and Development Plan Project of Heilongjiang Province (GA19C006) and Fundamental Research Funds for Central Universities (2572019CP15).

Data Availability Statement

The data presented in this study are available upon request from the corresponding author. The data are not publicly available due to confidentiality.

Acknowledgments

The authors would like to thank the faculty and students of the Department of Forest Management, Northeast Forestry University (NEFU), China, who collected and provided the data for this study.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Validating statistics of CW values by GMC, MIXED, and QRC5, which varied with different sampling designs and sample sizes.
Table A1. Validating statistics of CW values by GMC, MIXED, and QRC5, which varied with different sampling designs and sample sizes.
Sample DesignSample SizeMAERMSEMAPE
GMCMIXEDQRC5GMCMIXEDQRC5GMCMIXEDQRC5
None00.51550.51580.51430.66010.66310.661821.549121.889121.2178
The thickest trees10.60020.50310.68700.78830.64160.890323.622620.948427.6314
20.51740.48490.56860.65580.61690.721221.096520.271322.8772
30.48930.47740.52800.62240.61230.668020.178320.077821.7450
40.48320.47940.52630.61300.61410.663120.090019.863621.9423
50.47420.47010.51080.60250.61460.645819.898519.953421.5468
60.46520.46930.49630.59190.60260.632419.446119.514720.7956
70.45540.45370.48020.58050.58510.614519.045218.846720.0873
80.44850.44900.46720.57350.58080.600618.811118.623419.5990
90.44610.44850.46480.57080.58120.597918.703218.528719.4749
100.44460.44430.46150.56860.57510.594218.664418.393119.3447
The
intermediate trees
10.55590.46930.54390.72600.60600.709421.919119.787621.6668
20.48740.44970.47820.63280.58230.624519.532918.817919.1047
30.46300.43650.45380.60110.56660.594218.727818.232518.2701
40.44720.42950.43970.58120.55870.575718.256117.784617.8897
50.43890.42330.43120.57120.55190.565717.946617.477917.6264
60.43270.41970.42640.56390.54790.559617.728817.277117.4380
70.42930.41710.42190.55930.54500.554017.615217.106917.3167
80.42660.41440.41860.55640.54200.549617.513916.980917.2003
90.42410.41250.41520.55290.54000.546117.428416.873117.0913
100.42190.41070.41400.55030.53790.544417.376816.833617.0088
The thinnest trees11.20970.50470.70201.62310.64670.916044.733421.868528.6787
20.84090.49430.67601.16520.63650.956832.041621.186227.2681
30.69720.46360.58020.91020.59810.776626.404119.789823.4378
40.64770.45320.55810.83890.58350.726224.507019.175122.3591
50.59890.44960.50620.77870.58170.667922.711518.871120.4601
60.56030.44770.48550.73090.57980.633021.428418.761919.7522
70.53230.44230.46690.69300.57350.607120.395918.412619.0288
80.52580.43670.45260.68100.56880.587820.132618.113618.4996
90.51420.43610.44530.66960.56920.582519.783818.061818.2338
100.51720.43640.44430.67790.57230.584019.867018.038118.1712
Random trees10.60330.47390.57420.8049 0.61080.750623.428119.879422.7129
20.50500.45410.48660.6554 0.58800.634020.093118.965919.5145
30.47140.44030.46200.6098 0.57130.600619.001218.270618.6089
40.45510.43280.44850.5903 0.56260.584818.452717.920918.1518
50.44410.42650.43770.5760 0.55570.571018.145717.579717.7598
60.43870.42350.43050.5694 0.55240.562217.902617.484017.4378
70.43160.41810.42550.5606 0.54640.556717.728317.219417.2718
80.42780.41660.42120.5557 0.54430.551217.596117.142817.1185
90.42690.41420.41850.5547 0.54090.547617.560817.094517.0259
100.42350.41120.41590.5508 0.53840.544517.443116.839916.9377

References

  1. Lei, Y.; Fu, L.; Affleck, D.L.R.; Nelson, A.S.; Shen, C.; Wang, M.; Zheng, J.; Ye, Q.; Yang, G. Additivity of nonlinear tree crown width models: Aggregated and disaggregated model structures using nonlinear simultaneous equations. For. Ecol. Manag. 2018, 427, 372–382. [Google Scholar] [CrossRef]
  2. Lowman, M.D.; Schowalter, T.D. Plant science in forest canopies—The first 30 years of advances and challenges (1980–2010). New Phytol. 2012, 194, 12–27. [Google Scholar] [CrossRef] [PubMed]
  3. Jucker, T.; Bouriaud, O.; Coomes, D.A. Crown plasticity enables trees to optimize canopy packing in mixed-species forests. Funct. Ecol. 2015, 29, 1078–1086. [Google Scholar] [CrossRef]
  4. Hardiman, B.S.; Bohrer, G.; Gough, C.M.; Vogel, C.S.; Curtisi, P.S. The role of canopy structural complexity in wood net primary production of a maturing northern deciduous forest. Ecology 2011, 92, 1818–1827. [Google Scholar] [CrossRef]
  5. Barbeito, I.; Collet, C.; Ningre, F. Crown responses to neighbor density and species identity in a young mixed deciduous stand. Trees 2014, 28, 1751–1765. [Google Scholar] [CrossRef]
  6. Fischer, F.J.; Maréchaux, I.; Chave, J. Improving plant allometry by fusing forest models and remote sensing. New Phytol. 2019, 223, 1159–1165. [Google Scholar] [CrossRef]
  7. Hoffmann, C.W.; Usoltsev, V.A. Tree-crown biomass estimation in forest species of the Ural and of Kazakhstan. For. Ecol. Manag. 2002, 158, 59–69. [Google Scholar] [CrossRef]
  8. Gülci, S.; Akay, A.E.; Gülci, N.; Taş, İ. An assessment of conventional and drone-based measurements for tree attributes in timber volume estimation: A case study on stone pine plantation. Ecol. Inform. 2021, 63, 101303. [Google Scholar] [CrossRef]
  9. Gonzalez-Benecke, C.A.; Gezan, S.A.; Samuelson, L.J.; Cropper, W.P.; Leduc, D.J.; Martin, T.A. Estimating Pinus palustris tree diameter and stem volume from tree height, crown area and stand-level parameters. J. For. Res. 2014, 25, 43–52. [Google Scholar] [CrossRef]
  10. Bonnor, G.M. Stem diameter estimates from crown width and tree height. Commonw. For. Rev. 1968, 47, 8–13. [Google Scholar]
  11. Kalliovirta, J.; Tokola, T. Functions for estimating stem diameter and tree age using tree height, crown width and existing stand database information. Silva Fenn. 2005, 39, 227–248. [Google Scholar] [CrossRef]
  12. Lacerda, T.H.S.; Miranda, E.N.; Lopes, I.L.e.; Fonseca, G.R.; França, L.C.d.J.; Gomide, L.R. Feature selection by genetic algorithm in nonlinear taper model. Can. J. For. Res. 2022, 52, 769–779. [Google Scholar] [CrossRef]
  13. Monserud, R.A.; Sterba, H. A basal area increment model for individual trees growing in even- and uneven-aged forest stands in Austria. For. Ecol. Manag. 1996, 80, 57–80. [Google Scholar] [CrossRef]
  14. Zarnoch, S.J.; Bechtold, W.A.; Stolte, K.W. Using crown condition variables as indicators of forest health. Can. J. For. Res. 2004, 34, 1057–1070. [Google Scholar] [CrossRef]
  15. Krajicek, J.E.; Brinkman, K.A.; Gingrich, S.F. Crown competition—A measure of density. For. Sci. 1961, 7, 35–42. [Google Scholar] [CrossRef]
  16. Roy, B.; Hart, L.; Weih, R.; Smith, K. Crown radius and diameter at breast height relationships for six bottomland hardwood species. J. Ark. Acad. Sci. 2005, 59, 110–115. [Google Scholar]
  17. Slavík, M.; Kuželka, K.; Modlinger, R.; Tomášková, I.; Surový, P. UAV laser scans allow detection of morphological changes in tree canopy. Remote Sens. 2020, 12, 3829. [Google Scholar] [CrossRef]
  18. Wagner, F.H.; Ferreira, M.P.; Sanchez, A.; Hirye, M.C.M.; Zortea, M.; Gloor, E.; Phillips, O.L.; de Souza Filho, C.R.; Shimabukuro, Y.E.; Aragão, L.E.O.C. Individual tree crown delineation in a highly diverse tropical forest using very high resolution satellite images. ISPRS J. Photogramm. Remote Sens. 2018, 145, 362–377. [Google Scholar] [CrossRef]
  19. Deluzet, M.; Erudel, T.; Briottet, X.; Sheeren, D.; Fabre, S. Individual tree crown delineation method based on multi-criteria graph using geometric and spectral information: Application to several temperate forest sites. Remote Sens. 2022, 14, 1083. [Google Scholar] [CrossRef]
  20. Sharma, R.P.; Bílek, L.; Vacek, Z.; Vacek, S. Modelling crown width–diameter relationship for Scots pine in the central Europe. Trees 2017, 31, 1875–1889. [Google Scholar] [CrossRef]
  21. Wang, J.; Jiang, L.; Yan, Y. The impacts of climate, competition, and their interactions on crown width for three major species in Chinese boreal forests. For. Ecol. Manag. 2022, 526, 120597. [Google Scholar] [CrossRef]
  22. Westfall, J.A.; Nowak, D.J.; Henning, J.G.; Lister, T.W.; Edgar, C.B.; Majewsky, M.A.; Sonti, N.F. Crown width models for woody plant species growing in urban areas of the U.S. Urban Ecosyst. 2020, 23, 905–917. [Google Scholar] [CrossRef]
  23. Russell, M.B.; Weiskittel, A.R. Maximum and largest crown width equations for 15 tree species in Maine. North. J. Appl. For. 2011, 28, 84–91. [Google Scholar] [CrossRef]
  24. Mensah, S.; Pienaar, O.L.; Kunneke, A.; Du Toit, B.; Seydack, A.; Uhl, E.; Pretzsch, H.; Seifert, T. Height—Diameter allometry in South Africa’s indigenous high forests: Assessing generic models performance and function forms. For. Ecol. Manag. 2018, 410, 1–11. [Google Scholar] [CrossRef]
  25. Uzoh, F.C.C.; Oliver, W.W. Individual tree diameter increment model for managed even-aged stands of ponderosa pine throughout the western United States using a multilevel linear mixed effects model. For. Ecol. Manag. 2008, 256, 438–445. [Google Scholar] [CrossRef]
  26. Sönmez, T. Diameter at breast height-crown diameter prediction models for Picea orientalis. Afr. J. Agric. Res. 2009, 4, 215–219. [Google Scholar]
  27. Chen, Q.; Duan, G.; Liu, Q.; Ye, Q.; Sharma, R.P.; Chen, Y.; Liu, H.; Fu, L. Estimating crown width in degraded forest: A two-level nonlinear mixed-effects crown width model for Dacrydium pierrei and Podocarpus imbricatus in tropical China. For. Ecol. Manag. 2021, 497, 119486. [Google Scholar] [CrossRef]
  28. Sharma, R.P.; Vacek, Z.; Vacek, S. Individual tree crown width models for Norway spruce and European beech in Czech Republic. For. Ecol. Manag. 2016, 366, 208–220. [Google Scholar] [CrossRef]
  29. Calama, R.; Montero, G. Interregional nonlinear height-diameter model with random coefficients for stone pine in Spain. Can. J. For. Res. 2004, 34, 150–163. [Google Scholar] [CrossRef]
  30. Özçelik, R.; Cao, Q.V.; Trincado, G.; Göçer, N. Predicting tree height from tree diameter and dominant height using mixed-effects and quantile regression models for two species in Turkey. For. Ecol. Manag. 2018, 419–420, 240–248. [Google Scholar] [CrossRef]
  31. Wang, J.; Jiang, L.; Gaire, D.; He, P.; Yan, Y.; Xin, S. Predicting and calibrating height to crown base: A case for Dahurian larch (Larix gmelinii Rupr.) in Northeastern China. Can. J. For. Res. 2022, 52, 1303–1319. [Google Scholar] [CrossRef]
  32. Hanus, M.; Hann, D.; Marshall, D. Predicting Height for Undamaged and Damaged Trees in Southwest Oregon; Oregon State University, Forest Research Laboratory: Corvallis, OR, USA, 1999. [Google Scholar]
  33. Temesgen, H.; Monleon, V.J.; Hann, D.W. Analysis and comparison of nonlinear tree height prediction strategies for Douglas-fir forests. Can. J. For. Res. 2008, 38, 553–565. [Google Scholar] [CrossRef]
  34. Cade, B.S.; Noon, B.R. A gentle introduction to quantile regression for ecologists. Front. Ecol. Environ. 2003, 1, 412–420. [Google Scholar] [CrossRef]
  35. Koenker, R.; Bassett, G. Regression quantiles. Econometrica 1978, 46, 33. [Google Scholar] [CrossRef]
  36. Lanssanova, L.R.; Silva, F.A.d.; Machado, S.d.A.; Pelissari, A.L.; Figueiredo Filho, A.; Fiorentin, L.; Cerqueira, C.L. Hypsometric relationship in Tectona grandis L. F. stands using quantile regression. Sci. For. 2021, 49, e3559. [Google Scholar] [CrossRef]
  37. Paulo, J.A.; Firmino, P.N.; Faias, S.P.; Tomé, M. Quantile regression for modelling the impact of climate in cork growth quantiles in Portugal. Eur. J. For. Res. 2021, 140, 991–1004. [Google Scholar] [CrossRef]
  38. Cao, Q.V.; Wang, J. Evaluation of methods for calibrating a tree taper equation. For. Sci. 2015, 61, 213–219. [Google Scholar] [CrossRef]
  39. Bohora, S.B.; Cao, Q.V. Prediction of tree diameter growth using quantile regression and mixed-effects models. For. Ecol. Manag. 2014, 319, 62–66. [Google Scholar] [CrossRef]
  40. Yang, Z.; Liu, Q.; Luo, P.; Ye, Q.; Duan, G.; Sharma, R.P.; Zhang, H.; Wang, G.; Fu, L. Prediction of individual tree diameter and height to crown base using nonlinear simultaneous regression and airborne LiDAR data. Remote Sens. 2020, 12, 2238. [Google Scholar] [CrossRef]
  41. Xie, L.; Widagdo, F.R.A.; Miao, Z.; Dong, L.; Li, F. Evaluation of the mixed-effects model and quantile regression approaches for predicting tree height in larch (Larix olgensis) plantations in northeastern China. Can. J. For. Res. 2022, 52, 309–319. [Google Scholar] [CrossRef]
  42. Abaimov, A.P.; Zyryanova, O.A.; Prokushkin, S.G.; Koike, T.; Matsuura, Y. Forest ecosystems of the cryolithic zone of Siberia: Regional features, mechanisms of stability and pyrogenic changes. Eurasian J. For. Res. 2000, 1, 1–10. [Google Scholar]
  43. Qu, L.; Wang, X.; Mao, Q.; Agathokleous, E.; Choi, D.; Tamai, Y.; Watanabe, T.; Koike, T. Responses of ectomycorrhizal diversity of larch and its hybrid seedlings and saplings to elevated CO2, O3, and high nitrogen loading. Eurasian J. For. Res. 2022, 22, 23–27. [Google Scholar]
  44. Lukkarinen, A.J.; Ruotsalainen, S.; Peltola, H.; Nikkanen, T. Annual growth rhythm of Larix sibirica and Larix gmelinii provenances in a field trial in southern Finland. Scand. J. For. Res. 2013, 28, 518–532. [Google Scholar] [CrossRef]
  45. Jia, B.; Sun, H.; Shugart, H.H.; Xu, Z.; Zhang, P.; Zhou, G. Growth variations of Dahurian larch plantations across northeast China: Understanding the effects of temperature and precipitation. J. Environ. Manag. 2021, 292, 112739. [Google Scholar] [CrossRef]
  46. Thorpe, H.C.; Astrup, R.; Trowbridge, A.; Coates, K.D. Competition and tree crowns: A neighborhood analysis of three boreal tree species. For. Ecol. Manag. 2010, 259, 1586–1596. [Google Scholar] [CrossRef]
  47. Pretzsch, H.; Biber, P.; Uhl, E.; Dahlhausen, J.; Rötzer, T.; Caldentey, J.; Koike, T.; van Con, T.; Chavanne, A.; Seifert, T.; et al. Crown size and growing space requirement of common tree species in urban centres, parks, and forests. Urban For. Urban Green. 2015, 14, 466–479. [Google Scholar] [CrossRef]
  48. Tarmu, T.; Laarmann, D.; Kiviste, A. Mean height or dominant height—What to prefer for modelling the site index of Estonian forests? For. Stud. 2020, 72, 121–138. [Google Scholar] [CrossRef]
  49. Mattioli, W.; Ferrari, B.; Giuliarelli, D.; Mancini, L.D.; Portoghesi, L.; Corona, P. Conversion of mountain beech coppices into high forest: An example for ecological intensification. Environ. Manag. 2015, 56, 1159–1169. [Google Scholar] [CrossRef]
  50. Bragg, D.C. A local basal area adjustment for crown width prediction. North. J. Appl. For. 2001, 18, 22–28. [Google Scholar] [CrossRef]
  51. Sanchez-Gonzalez, M.; Canellas, I.; Montero, G. Generalized height-diameter and crown diameter prediction models for cork oak forests in Spain. For. Syst. 2007, 16, 76–88. [Google Scholar] [CrossRef]
  52. R Core Team. R: A Language and Environment for Statistical Computing; R Core Team: Vienna, Austria, 2022; Available online: https://www.R-project.org/ (accessed on 15 October 2022).
  53. Pinheiro, J.C.; Bates, D.M. Mixed Effects Models in S and S-PLUS; Springer: New York, NY, USA, 2000; ISBN 978-0-387-98957-0. [Google Scholar]
  54. Zhang, L.; Gove, J.H. Spatial assessment of model errors from four regression techniques. For. Sci. 2005, 51, 334–346. [Google Scholar]
  55. Vonesh, E.; Chinchilli, V.M. Linear and Nonlinear Models for the Analysis of Repeated Measurements; CRC Press: Boca Raton, FL, USA, 1996; ISBN 9780429180194. [Google Scholar]
  56. Lindstrom, M.J.; Bates, D.M. Nonlinear mixed effects models for repeated measures data. Biometrics 1990, 46, 673. [Google Scholar] [CrossRef]
  57. Crecente-Campo, F.; Tomé, M.; Soares, P.; Diéguez-Aranda, U. A generalized nonlinear mixed-effects height–diameter model for Eucalyptus globulus L. in northwestern Spain. For. Ecol. Manag. 2010, 259, 943–952. [Google Scholar] [CrossRef]
  58. Bertsimas, D.; King, A. Logistic regression: From art to science. Statist. Sci. 2017, 32, 367–384. [Google Scholar] [CrossRef]
  59. Paulo, J.A.; Tomé, M. An individual tree growth model for Juvenile Cork Oak stands in southern Portugal. In Silva Lusitana; Estação Florestal Nacional: Lisboa, Portugal, 2009. [Google Scholar]
  60. Lu, L.; Chhin, S.; Zhang, X.; Zhang, J. Modelling tree height-diameter allometry of Chinese fir in relation to stand and climate variables through Bayesian model averaging approach. Silva Fenn. 2021, 55, 10415. [Google Scholar] [CrossRef]
  61. Le Moguédec, G.; Dhôte, J.-F. Fagacées: A tree-centered growth and yield model for sessile oak (Quercus petraea L.) and common beech (Fagus sylvatica L.). Ann. For. Sci. 2012, 69, 257–269. [Google Scholar] [CrossRef]
  62. Fu, L.; Sharma, R.P.; Hao, K.; Tang, S. A generalized interregional nonlinear mixed-effects crown width model for Prince Rupprecht larch in northern China. For. Ecol. Manag. 2017, 389, 364–373. [Google Scholar] [CrossRef]
  63. Eerikäinen, K. Predicting the height–diameter pattern of planted Pinus kesiya stands in Zambia and Zimbabwe. For. Ecol. Manag. 2003, 175, 355–366. [Google Scholar] [CrossRef]
  64. Forrester, D.I.; Ammer, C.; Annighöfer, P.J.; Barbeito, I.; Bielak, K.; Bravo-Oviedo, A.; Coll, L.; del Río, M.; Drössler, L.; Heym, M.; et al. Effects of crown architecture and stand structure on light absorption in mixed and monospecific Fagus sylvatica and Pinus sylvestris forests along a productivity and climate gradient through Europe. J. Ecol. 2018, 106, 746–760. [Google Scholar] [CrossRef]
  65. Gonçalves, A.C. Stand structure impacts on forest modelling. Appl. Sci. 2022, 12, 6963. [Google Scholar] [CrossRef]
  66. Gao, H.; Bi, H.; Li, F. Modelling conifer crown profiles as nonlinear conditional quantiles: An example with planted Korean pine in northeast China. For. Ecol. Manag. 2017, 398, 101–115. [Google Scholar] [CrossRef]
  67. Özçelik, R.; Diamantopoulou, M.J.; Trincado, G. Evaluation of potential modeling approaches for Scots pine stem diameter prediction in north-eastern Turkey. Comput. Electron. Agric. 2019, 162, 773–782. [Google Scholar] [CrossRef]
  68. Ma, A.; Miao, Z.; Xie, L.; Dong, L.; Li, F. Crown width prediction for Larix olgensis plantations in Northeast China based on nonlinear mixed-effects model and quantile regression. Trees 2022, 36, 1761–1776. [Google Scholar] [CrossRef]
  69. Castedo Dorado, F.; Diéguez-Aranda, U.; Barrio Anta, M.; Sánchez Rodríguez, M.; von Gadow, K. A generalized height–diameter model including random components for radiata pine plantations in northwestern Spain. For. Ecol. Manag. 2006, 229, 202–213. [Google Scholar] [CrossRef]
  70. Fu, L.; Duan, G.; Ye, Q.; Meng, X.; Luo, P.; Sharma, R.P.; Sun, H.; Wang, G.; Liu, Q. Prediction of individual tree diameter using a nonlinear mixed-effects modeling approach and airborne LiDAR data. Remote Sens. 2020, 12, 1066. [Google Scholar] [CrossRef]
  71. Bronisz, K.; Mehtätalo, L. Mixed-effects generalized height–diameter model for young silver birch stands on post-agricultural lands. For. Ecol. Manag. 2020, 460, 117901. [Google Scholar] [CrossRef]
  72. Arnab, R. Survey Sampling Theory and Applications; Academic Press: Amsterdam, The Netherlands, 2017; ISBN 9780128118481. [Google Scholar]
  73. Von Klaus, G.; Timo, P.; Margarida, T.; Kangas, A.; Maltamo, M. Forest Inventory; Springer: Dordrecht, The Netherlands, 2006; ISBN 978-1-4020-4379-6. [Google Scholar]
  74. Crecente-Campo, F.; Corral-Rivas, J.J.; Vargas-Larreta, B.; Wehenkel, C. Can random components explain differences in the height–diameter relationship in mixed uneven-aged stands? Ann. For. Sci. 2014, 71, 51–70. [Google Scholar] [CrossRef]
  75. Yan, Y.; Wang, J.; Mahardika, S.B.; Jiang, L. Effects of climate and competition on crown width: A case of Korean pine plantations. Eur. J. For. Res. 2023, 142, 231–244. [Google Scholar] [CrossRef]
Figure 1. The geographical position of the study area in Tieli and Qingan, Heilongjiang Province, China.
Figure 1. The geographical position of the study area in Tieli and Qingan, Heilongjiang Province, China.
Forests 14 02022 g001
Figure 2. Correlation plot for adding random-effect parameters to β1 and β2 of Equation (13) with the variables in Table 1.
Figure 2. Correlation plot for adding random-effect parameters to β1 and β2 of Equation (13) with the variables in Table 1.
Forests 14 02022 g002
Figure 3. Effects of HDOM (a) and Dq (b) on the CW. The curves were created by using the mean values of the variables in Table 1, the parameters of GM in Table 3, and equal intervals of the HDOM and Dq, starting from the minimum to the maximum range.
Figure 3. Effects of HDOM (a) and Dq (b) on the CW. The curves were created by using the mean values of the variables in Table 1, the parameters of GM in Table 3, and equal intervals of the HDOM and Dq, starting from the minimum to the maximum range.
Forests 14 02022 g003
Figure 4. Graphs of observed CW measurements (black hollow dots) and simulation curves generated by GM and QRs based on nine quantiles (τ = 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8 and 0.9).
Figure 4. Graphs of observed CW measurements (black hollow dots) and simulation curves generated by GM and QRs based on nine quantiles (τ = 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8 and 0.9).
Forests 14 02022 g004
Figure 5. MAE, MAPE, and RMSE for calibration techniques of QR including QRC3, QRC5, QRC7, QRC9 and GM against a number of sample sizes per plot with different sampling strategies.
Figure 5. MAE, MAPE, and RMSE for calibration techniques of QR including QRC3, QRC5, QRC7, QRC9 and GM against a number of sample sizes per plot with different sampling strategies.
Forests 14 02022 g005
Figure 6. MAE, MAPE, and RMSE for MIXED, GMC, QRC5, and GM against a number of sample sizes per plot with different sample designs.
Figure 6. MAE, MAPE, and RMSE for MIXED, GMC, QRC5, and GM against a number of sample sizes per plot with different sample designs.
Forests 14 02022 g006
Figure 7. The RMSE of MIXED, GMC, QRC5, and GM against a number of sample sizes per plot with different sample designs.
Figure 7. The RMSE of MIXED, GMC, QRC5, and GM against a number of sample sizes per plot with different sample designs.
Forests 14 02022 g007
Table 1. Summary statistics for Dahurian larch (Larix gmelinii Rupr.) sample trees of the whole data.
Table 1. Summary statistics for Dahurian larch (Larix gmelinii Rupr.) sample trees of the whole data.
VariableDescriptionMinMaxAverageSD
D (cm)Diameter at breast height5.062.018.16.2
H (m)Total tree height3.829.717.03.6
HCB (m)Height to crown base1.721.011.52.6
CW (m)Crown width0.158.302.971.00
HDOM (m)Dominant tree height18.3628.5421.162.18
DDOM (cm)Dominant diameter21.9251.0027.435.03
Dq (cm)Quadratic mean diameter15.1235.8717.663.70
BA (m2·ha−1)Basal area per hectare23.5639.6230.432.93
BAL (m2·ha−1)Basal area of trees larger than the subject tree0.0038.3618.228.58
N (trees·ha−1)Stem numbers33317781324293
HDRH-to-D ratio0.441.830.980.18
DQMDD-to-Dq ratio0.202.091.020.25
RSIRelative spacing index0.110.200.130.01
SDIReineke’s stand density index514.61874.53697.4063.79
Table 2. Candidate basic models and fitting statistics.
Table 2. Candidate basic models and fitting statistics.
ModelFunction FormR2RMSEAICSD
Linear CW = β 0 + β 1 D 0.51910.693310,647.810.5191
Power CW = β 0 D β 1 0.53430.682110,493.820.5343
Compund CW = β 0 β 1 D 0.45640.737511,269.720.4564
Quadratic CW = β 0 + β 1 D + β 2 D 2 0.55120.669310,300.610.5512
Hossfeld I CW = ( D / ( β 0 + β 1 D ) ) 2 0.55040.670410,315.130.5504
Growth CW = e x p ( β 0 + β 1 D ) 0.45650.737611,269.720.4565
Exponential CW = β 0 e x p ( β 1 D ) 0.45650.737211,269.740.4565
Monomolecular CW = β 0 ( 1 e x p ( β 1 D ) ) 0.54610.673410,354.430.5461
Logistic CW = β 0 / ( 1 + e x p ( β 1 D + β 2 ) ) 0.55830.664510,225.330.5583
Table 3. Likelihood ratio tests (LRT) for basic model with different random-effects parameters.
Table 3. Likelihood ratio tests (LRT) for basic model with different random-effects parameters.
Random ParameterdfAICBICLog-LikLRTp-Value
β068242.2068281.380−4115.103
β168179.4818218.654−4083.740
β268125.9108165.084−4056.955
β0, β188119.1168171.347−4051.55810.79450.0045
β1, β288005.9918058.222−3994.995
Table 4. Likelihood ratio tests (LRT) for MIXED with different random-effects parameters.
Table 4. Likelihood ratio tests (LRT) for MIXED with different random-effects parameters.
Random ParameterdfAICBICLog-LikLRTp-Value
None69984.15610,023.329−4986.078
β088018.3398070.570−4001.1701969.8165<0.0001
β0, β4107913.6637978.952−3946.831108.6763<0.0001
β1, β2, β3137935.8928020.768−3954.94616.22970.001
Table 5. Parameter estimates (Par) and fit statistics (Stat) calculated using the whole dataset for the GM, MIXED, and QR at nine quantiles (τ = 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, and 0.9).
Table 5. Parameter estimates (Par) and fit statistics (Stat) calculated using the whole dataset for the GM, MIXED, and QR at nine quantiles (τ = 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, and 0.9).
Par/StatGMMIXEDQR
τ = 0.1τ = 0.2τ = 0.3τ = 0.4τ = 0.5τ = 0.6τ = 0.7τ = 0.8τ = 0.9
β06.09085.97244.37504.73824.92055.28495.41805.76446.23227.12339.3974
β1−0.1389−0.1591−0.1722−0.1701−0.1734−0.1684−0.1709−0.1565−0.1438−0.1229−0.0934
β23.20663.90542.65342.84953.10273.38803.66163.69373.71683.48673.1071
β30.00220.00290.00160.00190.00220.00250.00270.00270.00260.00230.0017
β4−0.0658−0.0966−0.0047−0.0235−0.0406−0.0620−0.0805−0.0919−0.1006−0.0950−0.0753
σ 2 0.2364
σ b 0 2 0.8562
σ b 1 2 0.0004
σ b 0 b 1 0.0129
R20.57890.74360.28480.45030.54570.57600.55570.47000.3033
RMSE0.64840.50591.02110.84500.74080.67340.65060.66600.72740.83401.0393
Note: “—” means that the model is fitted poorly and the R2 calculated to be negative.
Table 6. The RMSE variation rates (ΔRMSE) for MIXED, QRC5, and GMC with sample number in random tree sampling design and intermediate tree sampling design.
Table 6. The RMSE variation rates (ΔRMSE) for MIXED, QRC5, and GMC with sample number in random tree sampling design and intermediate tree sampling design.
ModelSample SizeIntermediate TreesΔRMSERandom TreesΔRMSE
MIXED10.6060 0.6108
20.58233.9%0.58803.7%
30.56666.5%0.57136.5%
40.55877.8%0.56267.9%
50.55198.9%0.55579.0%
60.54799.6%0.55249.6%
70.545010.1%0.546410.5%
80.542010.6%0.544311.0%
90.540010.9%0.540911.4%
100.537911.2%0.538411.8%
QRC510.7094 0.7506
20.624512.0%0.634015.5%
30.594216.2%0.600620.0%
40.575718.8%0.584822.1%
50.565720.3%0.571023.9%
60.559621.1%0.562225.1%
70.554021.9%0.556725.8%
80.549622.5%0.551226.5%
90.546123.0%0.547627.0%
100.544423.3%0.544527.5%
GMC10.7260 0.8049
20.632812.8%0.6554 18.6%
30.601117.2%0.6098 24.2%
40.581219.9%0.5903 26.7%
50.571221.3%0.5760 28.4%
60.563922.3%0.5694 29.4%
70.559323.0%0.5606 30.4%
80.556423.4%0.5557 31.0%
90.552923.8%0.5547 31.1%
100.550324.2%0.550831.6%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, S.; Wang, J.; Jiang, L. Assessment of Potential Prediction and Calibration Methods of Crown Width for Dahurian Larch (Larix gmelinii Rupr.) in Northeastern China. Forests 2023, 14, 2022. https://doi.org/10.3390/f14102022

AMA Style

Liu S, Wang J, Jiang L. Assessment of Potential Prediction and Calibration Methods of Crown Width for Dahurian Larch (Larix gmelinii Rupr.) in Northeastern China. Forests. 2023; 14(10):2022. https://doi.org/10.3390/f14102022

Chicago/Turabian Style

Liu, Suoming, Junjie Wang, and Lichun Jiang. 2023. "Assessment of Potential Prediction and Calibration Methods of Crown Width for Dahurian Larch (Larix gmelinii Rupr.) in Northeastern China" Forests 14, no. 10: 2022. https://doi.org/10.3390/f14102022

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop