1. Introduction
As the major forest type at mid-elevation (1000–2300 m) in the Qinling Mountains, pine-oak mixed forests dominated by Pinus tabulaeformis, Pinus armandii and Quercus aliena var. acuteserrata have been considered as the cornerstone of the local ecosystem, which supports a huge variety of plants and wildlife. The pine-oak forests in the Qinling Mountains were naturally regenerated after harvesting during the 1960s and 1970s. Therefore, the pine-oak forests are mostly in the middle of a succession process. In other words, the dynamics of the stand structure is crucial in the prediction of growth and yield for such forests.
As the most direct and measurable factor, diameter is generally related to other important variables including basal area, density and volume. This makes the diameter distribution model a useful tool to provide more detailed information about the stand without additional inventory costs [
1]. As a linkage between the whole stand model and the individual tree model, the diameter distribution model has been extensively explored by researchers and foresters [
2,
3,
4].
A variety of probability density functions (PDFs) including exponential [
5], log-normal [
6], gamma [
7], beta [
8], Weibull [
9] and Johnson’s S
B [
10] have been applied to characterize tree diameter distribution. Among these functions, the Weibull function might be the most popular because of its flexibility in shape and simplicity of mathematical operations. It has been widely used to describe the diameter distribution of loblolly pine [
11,
12,
13,
14,
15], slash pine [
16,
17], black spruce [
18], Scots pine [
19], European beech [
1], longleaf pine [
20], Austrian black pine [
21] and cork oak [
22], as well as mixed species [
23,
24,
25,
26].
The PDF parameters of the Weibull function are usually predicted by the parameter prediction or recovery approach [
27]. In the parameter prediction approach, parameters are predicted from stand variables by use of regression equations. The parameter recovery approach involves predicting diameter moments or percentiles from stand variables, and then using these values to recover the PDF parameters.
Coefficients of the regression equations were generally estimated by ordinary least squares (OLS) or seemingly unrelated regression (SUR). Cao [
28] introduced the maximum likelihood estimator regression (MLER) and also the cumulative distribution function regression (CDFR), both of which turned out to perform better than the previous methods. The Modified CDFR method, obtained by computing CDF with diameter class information, was found by Poudel and Cao [
15] to produce better results than the CDFR method.
The diameter distribution of pine-oak forests varies depending on succession stages, site conditions, forest types, and silvicultural treatments. This makes the prediction of stand structure for pine-oak forests difficult. Even though diameter distributions play an important role in effective forest management and planning, diameter distribution models for pine-oak mixed forests in the Qinling Mountains are still lacking. The objectives of this study were to: (1) develop diameter distribution models for uneven-aged pine-oak mixed forests in the Qinling Mountains of China; (2) evaluate two approaches to predict the parameters of the Weibull function for pine-oak forest diameter distributions; and (3) evaluate three methods of model fitting for each of the above prediction approaches.
2. Data
The data used in this study were collected between 2013 and 2014 from three forest regions in the Qinling Mountains of China: Xinjiashan (E 106°26′–106°38′, N 34°10′–34°20′), Huoditang (E 108°21′–108°29′, N 33°18′–33°28′) and Xunyangba (E 103°58′–109°48′, N 32°29′–33°13′). Plots of size 0.04 ha (20 m × 20 m) were randomly established where all trees with a minimum diameter at breast height (dbh) of 5 cm were tallied. A total of 8490 trees were measured in 152 plots.
Figure 1 presents the locations of the sampling plots.
The stands consisted of three primary tree species,
Pinus tabulaeformis Carr.,
Pinus armandii Franch., and
Quercus aliena var.
acuteserrata Maxim., as well as other species including
Toxicodendron vernicifluum (Stokes) F.A.Barkl.,
Acer davidii Franch. and
Betula albo-sinensis Burk. Even though the total amount of trees of all the miscellaneous species formed a considerable proportion in the stands, the number for each of them was small (usually less than 10% in a stand). Hence, all the miscellaneous species were pooled into one species group in this study. Twenty-two of the plots were dominated by
P. tabulaeformis, 26 by
P. armandii, and 55 by
Q. aliena var.
acuteserrata. The remaining 49 plots were dominated by the miscellaneous species. For the sake of simplification,
P. tabulaeformis,
P. armandii,
Q. aliena var.
acuteserrata and the miscellaneous species were specified as species group 1, 2, 3 and 4, respectively. The plots were categorized based on the proportion of stand basal area for each species group.
Figure 2 shows the diameter distribution of each species group for all plots.
The two-fold cross-validation scheme [
29] was applied in this study by dividing the data into two groups of 76 plots each (
Table 1). Model parameters estimated from data of one group were used to predict for the other group. Evaluation statistics were then calculated based on predictions from both groups.
3. Methods
The Weibull PDF, used to characterize diameter distributions, has the following form:
where
x is tree dbh;
a,
b, and
c are, respectively, the location, scale, and shape parameters of the Weibull distribution.
In this study, parameter a was set at 5 cm, which was the minimum diameter of all measured trees.
Because the pine-oak forests are uneven-aged, commonly used stand variables such as age and site index were not used in this study. The general model for regression equations to predict the Weibull parameters and moments was as follows:
where
ys is either a specific Weibull parameter or diameter variance for species
s;
Dq,
BA,
N are stand quadratic mean diameter, basal area, and number of trees per ha, respectively;
Hd is stand dominant height; and
RS is relative spacing,
, which is the ratio of average distance between trees and dominant height. Stand variables with subscript
s denotes those variables computed separately for species group
s. The ratios in Equation (2) act as composition indices for each species. The backward elimination approach with a 5% level of significance was employed to develop the final regression equations for each tree species.
3.1. Prediction of Weibull Parameters
We used two approaches to predict the Weibull parameters.
3.1.1. Moment Estimation
In the Moment Estimation approach, parameter
b and
c were recovered from two moments (quadratic mean diameter and diameter variance). Whereas quadratic mean diameter was observed, diameter variance was predicted from Equation (2). Similar to method three in Poudel and Cao [
15],
b and
c were obtained by solving the following system of two equations:
where
Gi = Γ (1 +
i/
c); Γ (•) is the complete gamma function and
is the predicted diameter variance.
3.1.2. Hybrid
The Hybrid approach is a combination of parameter prediction and recovery. This approach involved prediction of the shape parameter (c) from Equation (2). The Weibull scale parameter b was computed from c and the quadratic mean diameter (Dq) by use of Equation (3).
3.2. Model Fitting
We employed the three fitting methods described below to obtain estimates for the regression coefficients in Equation (2) with the SAS NLIN (nonlinear regression) procedure [
30]. Sample SAS programs for the MLER and CDFR methods can be found in Cao [
28], and Poudel and Cao [
15] presents a sample SAS program for the modified CDFR method.
3.2.1. MLER Method
The Maximum Likelihood Estimator Regression (MLER) method, developed by Cao [
28], was applied to obtain regression coefficients by maximizing the sum of the log-likelihood values from all plots:
where ln(
Li) is log-likelihood value for the
ith plot;
ni is number of trees in the
ith plot; and
p is number of plots.
3.2.2. CDFR Method
The Cumulative Distribution Function Regression (CDFR) method was proposed by Cao [
28]. The regression coefficients in this method were iteratively searched to minimize the sum of square differences between observed and predicted cumulative probabilities:
where
Fij = (
j − 0.5)/
ni is the observed cumulative probability of tree
j in the
ith plot;
= 1 − exp{- [(
xij −
a)/
b]
c} is the value of the Weibull CDF evaluated at
xij; and
xij is the dbh of tree
j in the
ith plot.
3.2.3. Modified CDFR Method
In the modified CDFR method [
15], the CDF was based on information derived from 2-cm diameter classes rather than from individual trees. The regression coefficients were acquired by minimizing the following function:
where
is the observed cumulative probability of the
kth diameter class in the
ith plot,
nik is the number of trees in the
kth diameter class in the
ith plot;
is the value of the Weibull CDF evaluated at the upper bound of the
kth diameter class; and
mi is the number of diameter classes in the
ith plot.
3.3. Model Evaluation
A two-fold evaluation strategy was carried out with three steps: (1) parameters of the regression equations, developed from Equation (2) for each tree species group, were estimated using data from group 1 (considered the fit data), and then used to predict for group 2 (considered the validation data); (2) repeat the procedure, with group 2 being the fit data and group 1 the validation data; (3) pool the predictions from both groups together to compute evaluation statistics.
3.3.1. Evaluation Statistics
We computed four goodness-of-fit evaluation statistics for each combination of parameter prediction approaches and model fitting methods. For each statistic, the best method produced the smallest value.
The Anderson–Darling (AD) statistic [
31]:
where
uj =
F(
x)
j = 1 − exp{− [(
xj −
a)/
b]
c},
ni is the number of trees in the
ith plot, and the
xj’s are diameter, sorted in ascending order for each plot (
x1≤
x2…≤
xni).
The one-sample Kolmogorov–Smirnov (KS) statistic [
32]:
Negative log-likelihood (-lnL) statistic:
where −lnL is the negative value of the log-likelihood function of the Weibull distribution.
Error Index (EI) [
33]:
where
nik and
are, respectively, the observed and predicted number of trees per ha in the
kth diameter class.
3.3.2. Ranking of Methods
The relative rank, proposed by Poudel and Cao [
15], was used to show the relative position of each fitting method. The relative rank is defined as:
where
Ri is the relative rank of method
i (
i = 1, 2, …,
m),
m is the number of methods evaluated,
Si is the evaluation statistic value of method
i, and
Smin and
Smax are respectively the minimum and maximum value of
Si.
Ri is a real number between 1 (best) and
m (worst).
4. Results and Discussion
4.1. Regression Equations for Each Species
The final models for diameter variance and Weibull parameter
c varied depending on species group (
Table 2). The fit index (
R2) values for regression equations to predict parameter
c were consistently lower than those to predict
Dvar for all species groups. Values of R
2 ranged from 0.47 to 0.58 for
Dvar, but only from 0.11 to 0.40 for parameter
c. This difficulty in predicting Weibull parameters has been documented in previous studies [
11,
12]. Diameter variance in uneven-aged forests could vary a great deal depending on various stand conditions, which makes it more useful in characterizing stand structure than that in even-aged counterparts.
Stand structures in uneven-aged mixed forests are diverse because of the interactions among tree species. For a specific species, the diameter distribution depends on its forest story, size position and composition proportion. The general model used in this study (Equation (2)) includes a majority of common stand-level variables for each species in uneven-aged pine-oak mixed forests.
Meanwhile, stand variables often perform well in groups. The backward elimination approach applied in this study has the advantage of keeping these sets of variables intact, in contrast to the forward and stepwise approaches.
Table 3 shows the coefficients of equations listed in
Table 2, obtained from the three fitting methods for each species group. These coefficients were estimated from the entire data set.
4.2. Moment Estimation vs. Hybrid
The evaluation statistics were computed separately for each species group. For each statistic and each species group, a relative rank (between 1–6) was computed for each of the six combinations (two prediction approaches × three fitting methods).
Table 4 displays the evaluation statistics and their relative ranks by species code, prediction approach, and fitting method.
Summing the relative ranks from the four evaluation statistics and three fitting methods allowed the evaluation of the Moment Estimation approach against the Hybrid approach (
Table 5). The Moment Estimation approach produced a better rank sum for species groups 2 and 4, and also for all species combined. As a result, the total sum of the ranks for each approach shows that the Moment Estimation (with a total of 165.62) was overall better than the Hybrid approach (with a total of 227.61). This is in line with Weiskittel et al. [
34] that the Moment approach is preferred in growth and yield modeling.
4.3. Evaluation of Three Fitting Methods
For each species group and each fitting method, the relative ranks were summed over four evaluation statistics and two prediction methods (
Table 6). The MLER method ranked best for species 1 and 3, but last for the remaining species groups. The CDFR method was best for species 2, whereas the Modified CDFR method ranked best for species group 4 and also for the combined all species group. The total sum of the ranks for each fitting method (
Table 6) reveals that the CDFR method was the best performer, followed closely by the Modified CDFR method with an overall relative rank of 1.51. The MLER method was a distant third.
The superior performance of the CDFR method as compared to the MLER method was supported by Cao [
28]. On the other hand, Poudel and Cao [
15] reported that the Modified CDFR method was better than the CDFR method, contrary to the findings from this study.
Overall, the CDFR method is recommended as the appropriate fitting method because it consistently ranked near the top for all species groups.
4.4. Model Performance
Figure 3 depicts the error percent of the predicted number of trees in the 4-cm diameter class for each tree species group, using the overall best approach (Moment Estimation using the CDFR method).
For all tree species groups, error percent decreased, and numbers of trees were overestimated in larger diameter classes, i.e., diameter classes larger than 39-cm for species 1, and 35-cm for other species groups. This is because few large trees were observed, which led to over-prediction by the diameter distribution models. Prediction errors are also related to sample area [
35]. Increasing sample area and plot size (0.04 ha in this study) may improve the accuracy for modeling diameter distributions of uneven-aged mixed forests.
In addition, overestimation was found for species 1 (
Pinus tabulaeformis Carr.) at the 15-cm diameter class. The reason may be that thinning operations for
Pinus tabulaeformis were carried out in a few stands, resulting in a bimodal diameter distribution (
Figure 2).
4.5. Case Study
Figure 4 shows the predicted diameter distributions for each species group in four sample stands, which were dominated by species 1, 2, 3 and 4, respectively. The overall best approach, i.e., Moment Estimation using the CDFR method, was also applied at this point.
The diameter distributions of the entire stand for all four sample stands appeared to be positively skewed (
Figure 4), meaning that the average age of the pine-oak mixed forests was quite young. Even though
Pinus tabulaeformis (species 1) is the dominant species in
Figure 4a, it also has fewer small trees than other tree species. The reason might be that
Pinus tabulaeformis is a shade-intolerant species, which makes its regeneration in mixed forests relatively difficult. In contrast, small trees of
Quercus aliena (species 3) frequently occur, especially in stands that are dominated by other tree species (
Figure 4a,b,d). This is because
Quercus aliena is shade-tolerant, and it can regenerate by both seedling and sprouting. On the other hand,
Pinus armandii (species 2), which is also shade-tolerant, can only regenerate by seedling because it is a pine species, resulting in diameter distributions in between those of
Pinus tabulaeformis and
Quercus aliena.
Bias could be found between the predicted diameter distribution of all species and that of the summation of each species group (
Figure 4). In other words, additivity was not satisfied. The average prediction error of each 4-cm diameter class using the diameter distribution model of all species were 32 trees/ha (
Figure 4a), 31 trees/ha (
Figure 4b), 41 trees/ha (
Figure 4c) and 42 trees/ha (
Figure 4d), respectively. The corresponding values by summing the diameter distribution of each species group were 25 trees/ha, 28 trees/ha, 37 trees/ha and 37 trees/ha. For the four sample stands, the prediction error was a little higher from modeling the entire stand as compared to summing distributions from individual species groups. For practical applications, Maltamo [
23] suggested that diameter distribution models fitted separately for different tree species would give better predictions.
5. Conclusions
Diameter distribution models for uneven-aged pine-oak mixed forests in the Qinling Mountains of China were developed and evaluated in this study. Both Moment and Hybrid estimation approaches were used to predict the Weibull parameters. For each approach, three fitting methods (maximum likelihood estimator regression (MLER), cumulative distribution function regression (CDFR) and modified CDFR) were employed to obtain estimates for coefficients of regression equations to predict Weibull parameters. The overall results indicated that the Moment Estimation approach was better than the Hybrid approach, and that the CDFR method was superior to the MLER and modified CDFR methods. The combination of Moment Estimation and CDFR is therefore recommended for this data set.
The proposed diameter distribution models enable one to predict the diameter distribution for a given pine-oak stand in the Qinling Mountains, using limited stand information. This makes the set of models a useful tool for the inventory and management of pine-oak forests. However, different results might be obtained for other forest types. The methodology could be extended to other distributions as well.
Author Contributions
Conceptualization, S.S., T.C. and Q.V.C.; data collection, S.S and T.C.; methodology and software, Q.V.C. and S.S.; drafting of the manuscript, S.S.; revisions and suggestions, Q.V.C. and T.C.; funding acquisition, T.C. and Q.V.C.
Funding
Support for this research was provided by the National Natural Science Foundation of China projects (Nos. 31170586 and 31670646). Support was also received from the National Institute of Food and Agriculture, U.S. Department of Agriculture, Mclntire-Stennis project LAB94223.
Acknowledgments
The authors are grateful to Ziyan Liao for mapping the plot locations.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Nord-Larsen, T.; Cao, Q.V. A diameter distribution model for even-aged beech in Denmark. For. Ecol. Manag. 2006, 231, 218–225. [Google Scholar] [CrossRef]
- Diamantopoulou, M.J.; Ozcelik, R.; Crecente-Campo, F.; Eler, U. Estimation of Weibull function parameters for modelling tree diameter distribution using least squares and artificial neural networks methods. Biosyst. Eng. 2015, 133, 33–45. [Google Scholar] [CrossRef]
- De Lima, R.A.F.; Batista, J.L.F.; Prado, P.I. Modeling tree diameter distributions in natural forests: An evaluation of 10 statistical models. For. Sci. 2015, 61, 320–327. [Google Scholar] [CrossRef]
- Ozcelik, R.; Fonseca, T.J.F.; Parresol, B.R.; Eler, U. Modeling the diameter distributions of Brutian pine stands using Johnson’s S-B distribution. For. Sci. 2016, 62, 587–593. [Google Scholar]
- Meyer, H. Eine matematisch-statistiche Untersuchung über den Aufbau des Plenterwaldes. Schweiz. Z. für Forstwes. 1933, 84, 33–46, 88–103, 124–133. [Google Scholar]
- Bliss, C.I.; Reinker, K.A. A lognormal approach to diameter distributions in even-aged stands. For. Sci. 1964, 10, 350–360. [Google Scholar]
- Nelson, T.C. Diameter distribution and growth of loblolly pine. For. Sci. 1964, 10, 105–114. [Google Scholar]
- Clutter, J.L.; Bennett, F.A. Diameter distribution in old-field slash pine plantations. GA. For. Res. Counc. Rep 1965, 13, 9. [Google Scholar]
- Bailey, R.L.; Dell, T.R. Quantifying diameter distributions with the Weibull function. For. Sci. 1973, 19, 97–104. [Google Scholar]
- Tham, Å. Structure of mixed Picea abies (L.) Karst. and Betula pendula Roth. and Betula pubescens Ehrh. stands in south and middle Sweden. Scand. J. Res. 1988, 3, 355–370. [Google Scholar] [CrossRef]
- Smalley, G.W.; Bailey, R.L. Yield tables and stand structure for loblolly pine plantations in Tennessee, Alabama, and Georgia highlands. USDA For. Serv. Res. Paper SO-96 1974, 96, 81. [Google Scholar]
- Feduccia, D.P.; Dell, T.R.; Mann, W.F., Jr.; Campbell, T.E.; Polmer, B.H. Yields of Unthinned Loblolly Pine Plantations on Cutover Sites in the West. Gulf Region; Res. Pap. SO-148; US Department of Agriculture, Forest Service, Southern Forest Experiment Station: New Orleans, LA, USA, 1979; Volume 148.
- Matney, T.G.; Sullivan, A.D. Compatible stand and stock tables for thinned and unthinned loblolly pine stands. For. Sci. 1982, 28, 161–171. [Google Scholar]
- Baldwin, V.C., Jr.; Feduccia, D.P. Loblolly pine growth and yield prediction for managed West. Gulf plantations. USDA For. Serv. Res. Paper SO-236 1987, 236, 27. [Google Scholar]
- Poudel, K.P.; Cao, Q.V. Evaluation of methods to predict Weibull parameters for characterizing diameter distributions. For. Sci. 2013, 59, 243–252. [Google Scholar] [CrossRef]
- Schreuder, H.T.; Hafley, W.L.; Bennet, F.A. Yield prediction for unthinned natural slash pine stands. For. Sci. 1979, 25, 25–30. [Google Scholar]
- Brooks, J.R.; Borders, B.E.; Bailey, R.L. Predicting diameter distributions for site-prepared loblolly and slash pine plantations. South. J. Appl. 1992, 16, 130–133. [Google Scholar]
- Newton, P.F.; Lei, Y.; Zhang, S.Y. Stand-level diameter distribution yield model for black spruce plantations. For. Ecol. Manag. 2005, 209, 181–192. [Google Scholar] [CrossRef]
- Sarkkola, S.; Hökkä, H.; Laiho, R.; Päivänen, J.; Penttilä, T. Stand structural dynamics on drained peatlands dominanted by Scots pine. For. Ecol. Manag. 2005, 206, 135–152. [Google Scholar] [CrossRef]
- Jiang, L.C.; Brooks, J.R. Predicting diameter distributions for young longleaf pine plantations in southwest Georgia. South. J. Appl. For. 2009, 33, 25–28. [Google Scholar]
- Stankova, T.V.; Zlatanov, T.M. Modeling diameter distribution of Austrian black pine (Pinus nigra Arn.) plantations: A comparison of the Weibull frequency distribution function and percentile-based projection methods. Eur. J. For. Res. 2010, 129, 1169–1179. [Google Scholar] [CrossRef]
- Carretero, A.C.; Alvarez, E.T. Modelling diameter distributions of Quercus suber L. stands in “Los Alcornocales” Natural Park (Cádiz-Málaga, Spain) by using the two-parameter Weibull functions. For. Syst. 2013, 22, 15–24. [Google Scholar]
- Maltamo, M. Comparing basal area diameter distributions estimated by tree species and for the entire growing stock in a mixed stand. Silva. Fennica. 1997, 31, 53–65. [Google Scholar] [CrossRef]
- Siipilehto, J. Improving the accuracy of predicted basal-area diameter distribution in advanced stands by determining stem number. Silva. Fenn. 1999, 33, 281–301. [Google Scholar] [CrossRef]
- Chen, W.J. Tree size distribution functions of four boreal forest types for biomass mapping. For. Sci. 2004, 50, 436–449. [Google Scholar]
- Liu, F.X.; Li, F.R.; Zhang, L.J.; Jin, X.J. Modeling diameter distributions of mixed-species forest stands. Scand. J. For. Res. 2014, 29, 653–663. [Google Scholar] [CrossRef]
- Burkhart, H.E.; Tomé, M. Modeling Forest Trees and Stands; Springer Science & Business Media: Dordrecht, The Netherlands, 2012; pp. 261–272. [Google Scholar]
- Cao, Q.V. Predicting parameters of a Weibull function for modelling diameter distribution. For. Sci. 2004, 50, 682–685. [Google Scholar]
- Cao, Q.V. An integrated system for modeling tree and stand survival. Can. J. For. Res. 2017, 47, 1405–1409. [Google Scholar] [Green Version]
- SAS Institute, Inc. Base SAS® 9.4 Procedures Guide; SAS Institute Inc.: Cary, NC, USA, 2014. [Google Scholar]
- Anderson, T.W.; Darling, D.A. A test of goodness of fit. J. Am. Stat. Assoc. 1954, 49, 765–769. [Google Scholar] [CrossRef]
- Massey, F.J., Jr. The Kolmogorov-Smirnov test for goodness of fit. J. Am. Stat. Assoc. 1951, 46, 68–78. [Google Scholar] [CrossRef]
- Reynolds, M.R.; Burk, T.E.; Huang, W.C. Goodness-of-fit tests and model selection procedures for diameter distribution models. For. Sci. 1988, 34, 373–399. [Google Scholar]
- Weiskittel, A.R.; Hann, D.W.; Kershaw, J.A., Jr.; Vanclay, J.K. Forest Growth and Yield Modeling; John Wiley & Sons: New York, NY, USA, 2011; pp. 170–173. [Google Scholar]
- Janowiak, M.K.; Nagel, L.M.; Webster, C.R. Spatial scale and stand structure in northern hardwood forests: Implications for quantifying diameter distributions. For. Sci. 2008, 54, 497–506. [Google Scholar]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).