1. Introduction
The tree stem shape, which, in many cases, is characterized by analyzing the variability of tree parameters (mainly diameter) relative to the height change along the tree [
1], as well as the volume resulting from it, is one of the most important types of theoretical and practical information used in the forestry and timber industry. One of the ways to define the longitudinal shapes of trees is to develop special models known as taper functions. These models allow us to determine the diameter of a tree at any height, and as a consequence, the desired volume and biomass [
2], as well as the amounts of various products that can be obtained from a tree after cutting [
3].
One group of taper models are linear models that describe the longitudinal shape of a tree using a certain number of diameters at different relative tree heights [
4,
5]. The linear model, developed on the basis of 15 relative tree heights, determines the longitudinal shape of the tree on the basis of the share of 15 sections of the tree volume was the first Polish taper model elaborated for Scots pine [
6]. Notwithstanding, nonlinear models represent the largest group of solutions found in the literature. In the simplest nonlinear cases, the shape of the stem can be mapped using a single mathematical function [
7]. This may be a polynomial [
8], power [
9], or exponential function [
10]. It is also possible to use a variable shape equation [
11], assuming that the longitudinal shape varies along the tree [
12,
13,
14]. Nonlinear taper models composed of two or more segments created using various mathematical functions are also available [
15,
16], as the shapes of various tree stem segments can be approximated with different geometric solids [
17].
Forest inventory data, including diameter measurements along the stem used as a basis for taper models, are often characterized by a hierarchical structure, which contains information about individual diameters, trees, or plots. The mixed-effects models, where the fixed and random effects are distinguished, are possible solutions for these types of data [
18]. The fixed part of these models allows data for a typical basic group (e.g., tree or plot) to be modeled, whereas the random part describes the difference between each group and the typical one, thus enabling descriptions of the specific relationships among all groups of the dataset [
19]. Random effects can be applied for modeling within repeated measurements along the stem and between-tree correlations [
20]. In addition, during random effects prediction, it is possible to include different levels of data grouping [
11,
21,
22]. In this context, the Bayesian estimator or best linear unbiased predictor (BLUP) are possible solutions for the prediction of random effects in taper models [
20,
23]. Considering the possibilities of using mixed-effects models, it is possible to select various strategies. One possibility is to define random-effects on individual tree level (without autocorrelation modeling). This approach allows us to explain as large a proportion of the between-tree variation as possible [
24]. Other elements worth considering are the practical use of the created model [
11] and the ability to analyze large data sets [
25]. In addition, convergence problems during model fitting should also be considered [
26,
27]. Notwithstanding, in practical applications of the taper model, where additional measurements of the tree diameter along the stem are necessary for random effects prediction, fixed effects prediction can be applied [
26].
Scots pine is the main forest-forming species in Poland. It covers 5367 thousand hectares, which translates into a 58.2% share of this tree species in Polish forests. It mainly occupies dry, oligotrophic habitats (dry, fresh, and mixed fresh coniferous forest site types according to Polish site classification [
28]). The average age of Scots pine stands in Poland is 60 years. Growing stock equals 1598 million cubic meters (61.2%), while the average gross timber volume per hectare is 298 cubic meters, and the 10-year timber removal rate for this tree species equals 17.76 cubic meters per hectare [
29].
Currently, in Poland, in order to determine the tree volume or merchantable volume of an analyzed tree species, one option is to apply empirical equations [
30,
31,
32]. Another possible solution is to use an indirect taper model; however, this utilizes a function involving the determination of the volume share of 15 stem sections [
33,
34,
35]. No taper model allows us to directly reach information about the longitudinal shape of the tree based on easily measurable tree features such as the height and diameter at breast height. Taking into account the above issues, the objectives of this study were to (i) to fill the gap in the existing models by providing the first direct and nonlinear Polish taper model for Scots pine using data from typical oligotrophic sites in west Poland, (ii) to fit the example of a taper model (namely, Kozak 1997) to the diameter along the stem of pine trees as a fixed-effects model and mixed-effects model, (iii) to compare both fitting strategies for measuring the diameter along the stem, and (iv) to compare both strategies for determining the tree volume, taking into account similar modeling data sets, as well as independent (validation) data, and considering the available Polish empirical volume equation [
31].
2. Materials and Methods
The data used for modeling consisted of information and measurements collected from 18 Scots pine stands located in west Poland (Lubsko and Gubin forest districts, Regional Directorate of State Forests in Zielona Góra,
Figure 1). Sample plots were located on oligotrophic sites typical for pine trees (i.e., dry, fresh, and fresh mixed coniferous forest, according to the Polish forest site classification). Borders of the chosen rectangular sample plots were permanently marked in the field. The size of each plot was set in such a way that each plot comprised at least 200 trees. The diameter at breast height (DBH) of all live trees (accuracy 0.01 cm) was measured on all plots, as well as at 25 heights (H) (accuracy 0.01 m) to build a local height–diameter model. From each stand, five model trees representing the entire diameter range were chosen proportionally to diameter classes and felled (90 trees in total). The heights of the analyzed trees varied from 6.95 to 25.65 m, and the DBH varied from 6.7 to 31.5 cm. The mean values of these parameters were 15.51 m and 15.7 cm, respectively (
Table 1).
Each model tree was measured section-wise (1 m section length, measurements at 0.5 m, 1.5 m, 2.5 m, etc.) to determine its diameter outside the bark (
Figure 2). The volumes of these trees were calculated using the sectional Huber’s equation.
Apart from the plots used for building taper equations, independent data were used to validate the created models. The validation dataset was collected from eight Scots pine stands in the western part of Poland (Sława Śląska and Sulechów forest districts, Regional Directorate of State Forests in Zielona Góra,
Figure 1). Sample plots of varying size (comprising at least 100 live standing pine trees) were established on the most prevalent site conditions for pine. The DBH of all trees was measured on all plots, as well as 25 tree heights, and height–diameter curves were created, as in the case of the main dataset. At each plot, three model trees representing the whole range of diameter structures were chosen and felled. In total, 24 model trees were used for further analyses (
Table 1). The height of the analyzed trees varied from 9.18 to 26.95 m, and the DBH varied from 6.85 to 34.25 cm. The average values of these parameters were 19.84 m and 20.87 cm, respectively (
Table 1). Felled trees were measured section-wise (1 m section length, measurements at 0 m, 1 m, 2 m, etc.) to determine their diameters outside the bark. The volumes of these trees were calculated using the sectional Smalian’s equation.
Despite the fact that modeling and validation volume data were obtained using different equations (sectional Huber’s and Smalian’s one), the resulting volumes are very close to the true ones, being practically accurate. A short length of sections, the top and bottom diameter ratio close to 1 for each section, and errors of various sections having different signs (as they actually cancel out), cause the resulting differences in the total tree volume obtained using different equations to differ from the true values by a maximum of ±1%. Such research dealing with empirical accuracy of sectional equations (applied to real trees, stems or bolts, not theoretical, applied to solids of revolutions, such as paraboloid, neiloid or cone) are rather scarce [
36], but available, mostly in classical textbooks on forest mensuration [
12,
37,
38,
39]. This is because of difficulties in the determination of the true (real) volume of trees, which requires using fluid (water) displacement methods (xylometry) to get reference values.
2.1. Taper Function
Considering the practical application of the created model, we chose a solution that used DBH and H to define the longitudinal tree shape, which, as a consequence, translated into the possibility of determining its volume. We applied a solution based on a previously tested function [
26], the 8-parameter taper model developed by Kozak [
7]:
where
D as dependent variable is the diameter along the stem (cm),
DBH is the diameter at breast height (cm),
H is the tree height (m),
q is the ratio between the diameter along the stem position and the tree height (
h/
H),
t is the relative breast height (1.30/
H),
DBH,
H,
q,
t are independent variables, and
b1,…,
b8 are the estimated model parameters.
This model (Equation (1)) was fitted by a fixed-effects (nonlinear least-squares method) and mixed-effects modeling approach. Considering the impacts of individual model parameters on different parts of the modeled tree shape, in the case of mixed-effects model solution, we defined three random parameters (β
1, β
2, β
3). On one hand, the definition of parameters in this way, in connection with adopting the tree as a grouping level, allowed us to avoid highly correlated random effects and convergence problems while, on the other hand, allowed us to consider the between-tree variation in the lower, top, and middle parts of the stem [
26]. As a consequence, we obtained the following model:
where
k refers to individual trees.
Models fitted using both strategies were compared based on goodness-of-fit measures such as the coefficient of determination (R
2):
where
is a measured diameter along the stem,
is a modeled diameter along the stem, and
is a mean value of measured diameter along the stem, root-mean-square error (RMSE):
where
is the number of diameters along the stem, and
is the number of models’ parameters, and mean error (ME):
In addition, we evaluated the obtained residuals graphically based on their relationships with the diameter along the stem and the height along the stem. Furthermore, the obtained taper models were graphically compared according to the relative height (tree height/diameter position along stem) and relative diameter (diameter along stem/diameter at the breast height) of the stem.
2.2. Volume Prediction
The volumes of model trees were calculated based on the 1 m section-wise Huber’s equation (modeling data):
where
is the reference volume (m
3),
is the length of the section (m),
,
,
,
is a basal area in the middle of the individual sections (m
2), and
is the volume of last section (m
3), and Smalian’s equation (validation data):
where
,
,
,
,
,
is a basal area at the beginning and end of individual sections (m
2).
During volume calculations based on the taper models obtained using both strategies, we adopted the assumption that additional measurements of tree diameter along the stem, which are necessary for random effects prediction, were not available. In the case of the fixed-effects model (first strategy), we used all fitted model parameters, whereas for the mixed-effects model, we applied fixed parameters only. For both fitting strategies, the predicted stem volume was computed by numerical integration [
11,
19]. The volumes for both strategies in combination with the volumes calculated using the empirical volume equation currently used in Poland [
31]:
where
is a basal area at the breast height (m
2), were compared based on the absolute error (AE, Equation (9)) and relative error (RE, Equation (10)) for independent data:
where
is the predicted volume using both strategies and the empirical volume equation (m
3), and
is the reference volume (m
3).
All analyses were performed using R 3.6.1 software [
40] and RStudio 1.2.1335 [
41]. The integrate function was used for volume calculation, the nlme package [
42] was used for mixed-effects model fitting, and the ggplot2 package [
43] was used to prepare figures.
4. Discussion
The presented study compared the fitting method (mixed and fixed-effects model approach) of a taper model for determining the outside bark diameter along the stem and predicting the tree volume for Scots pine in west Poland in the context of the practical application of the obtained results. During the analysis, we assumed that no additional measured data (diameter along stem for trees in question) was available; therefore, for the mixed-effects model approach, fixed effects prediction without random effects was applied. During the analyses, we fitted taper models based on an empirical modeling dataset (90 trees) and validated them using an independent empirical dataset (24 trees) and an independent empirical volume equation for Scots Pine [
31].
Taper functions are useful tools in forest inventory, management systems, and growth models, as they give information on diameter at any point of a stem. Various ideas and assumption have been used in the course of their development. According to Kublin et. al. [
16], the first taper model, described as a logarithmic function, was published more than 100 years ago [
44]. A single power taper function was presented as an idea for tree shape description in British Columbia by Demaerschalk [
9]. The author indicated that this model is characterized by greater accuracy for 16 species of trees growing in British Columbia than the model proposed by Kozak [
8]. An approach, in which one continuous variable-exponent taper equation based on the assumption that the tree stem shape changes longitudinally and presents a neiloidal form close to the ground, a conical form near the treetop, and a paraboloidal or cylindrical form in the middle of the stem, was further developed by Kozak [
45,
46]. Moreover, the parsimonious variable-form taper function was also proposed by Riemer et al. [
47].
The variable-exponent taper model analyzed in our paper was developed by Kozak based on data of western hemlock trees collected from Coastal British Columbia [
7]. This model utilizes the constant height from the ground (t), and its exponent was created in such a way that the intercorrelations between the independent variables are low. Advantages of this approach have been proven, among others, by de–Miguel [
26], who recognized Kozak’s model as being the best method among 33 analyzed taper functions to predict the volume of Brutia pine (
Pinus brutia). In addition, the authors indicated that the model well defines the complex shape of the tree in its different places, thanks to which it is possible to accurately map the entire longitudinal shape of the tree using just one function.
Apart from its functional and conceptual form, the model developed in this study follows a frequently used methodology: the mixed-effects approach, which allows for flexible modeling of hierarchical data structures. Such an approach has already been successfully applied, among others, by Arias-Rodil, et al. [
48]. The authors fitted the taper model proposed by Riemel, et al. [
47] using a nonlinear mixed-effects modeling approach for radiata pine to predict the diameter along the stem and the total stem volume. Similar to our case, they also selected the mixed-effects model with three random effects as the best model for tree-specific estimations and mentioned that the measurement of an additional stem diameter at a relative tree height of approximately 0.5 m provided the best calibrations for both stem diameters along the stem and total stem volume predictions. Notwithstanding, authors claim that from a prediction point of view, the use of the mixed-effects models is not recommended when additional stem diameter measurements are not available. Instead, obtained in our research median and mean absolute errors, and median relative errors of volume prediction indicate that the mixed-effects taper model is the best solution for tree volume prediction.
A systematic evaluation of nonlinear fixed- and mixed-effects taper models in volume prediction was also conducted by de-Miguel, et. al. [
26]. During this research, three alternative prediction strategies were compared using the variable-exponent taper, which was also applied in our study. Strategy 1 used a fixed parameter model, strategy 2 utilized the fixed part of a mixed-effects model, and strategy 3 calculated a prediction based on the mixed-effects model by averaging the predictions over the estimated distribution of random effects. Strategy 1 had the lowest bias when the volume predictions were compared with the modeling data, and the highest in the case of the independent one. On the other hand, a different parameter (the nonunity slope, NU) reached the lowest value for strategy 1, which indicates the possibility of using the fixed-effects approach for taper model prediction purposes. Therefore, as opposed to our research, the results obtained were not conclusive. The mixed-effects model approach was also used for the creation of a taper for 19 commercial tree species in the Northeastern United States [
49]. During this study, random effects parameters were included to account for within-tree correlations and to allow for customized calibration to each individual tree. The achieved results suggested that the prediction of random effects parameters provides for a better performance as compared to implementation of the fixed-effects model alone, but the authors also claimed that the efficacy of the random-effects is largely dependent on the amount of new information acquired (location along stem and number of measurements). An interesting voice in the discussion on the possibility of using the mixed-effects modeling approach for taper models was provided by Cao and Wang [
50]. The objective of their study was to evaluate the use of additional diameter measurements taken by laser dendrometers to calibrate the fixed-effects and mixed-effects of the segmented taper model developed by Max and Burkhart [
15]. The results achieved by those authors show that for uncalibrated (without additional measurements) models, the mean difference is lower for the fixed-effects approach (0.5817), while the coefficient of determination, as in our case, is higher for the mixed-effects model approach (0.9392).
According to Zheng, et al. [
51], compatible taper-volume models are flexible tools for total and merchantable volume prediction. Those models contain a taper equation and a total volume equation. Compatible models allow the volume computed by integration of the taper model from the ground to the top of the tree to be equal to that calculated by a total volume equation. The authors also claimed that to define compatible models, a linear mixed-effects model approach can be applied. An example of a similar approach was provided by Brooks, et al. [
2]. The authors used seemingly unrelated nonlinear regression for compatible stem taper, volume, and weight equations for longleaf pine plantations in Southwest Georgia. The above fitting procedure was also applied to integrate a variable-form taper model to a system of models involving a taper equation, merchantable volume equation, and total stem volume equation for loblolly and slash pine in the United States [
52]. Those three components were also included in a compatible taper function for Scots pine plantations in northwestern Spain, where a modified second-order continuous autoregressive error structure was used [
24]. During our analysis, we calculated the total volume of a tree using both fitted taper functions (function integrate in R). However, considering the above research, it is worth considering the possibility of developing a system of models involving not only the diameter along the tree and its volume, but also its biomass.
The taper model developed in this research is based on the tree height and diameter at breast height, which allows information about the longitudinal shape of the tree and its volume to be reached directly (as a result of taper model integration). On the one hand, it is a continuation of the already published research conducted in different countries [
7,
26], but on the other hand, it fills the gap in this type of solution used in Poland. Currently, to obtain the volume of Scots pine, it is possible to apply empirical equations [
31]. These equations were created based on empirical material containing measurements of 44,929 Scots pine trees from more than 2000 stands located in various parts of Poland. As a result of this research, various form factor equations were developed, including the use of diameter at breast height as an independent variable [
31]. In this context, it is worth highlighting that Bruchwald’s model was developed using a large empirical dataset utilizing methodology close to fixed-effects modeling. Thus, the similar accuracy of both solutions is not a surprise. Nonetheless, the use of mixed-effects models as a solution enabling the description of group-specific relationships for all groups of the dataset [
19] is definitely worth considering, especially when there is a possibility of non-invasively obtaining additional data for random effects prediction [
50].
Another possible solution to determine the volume of any part of the stem is to use an indirect taper model. However, this utilizes a function involving the volume share of 15 stem sections and also needs the empirical volume equation as its integral part [
33,
34,
35]. For Polish conditions, a trigonometric taper model for the determination of the longitudinal shape of Norway spruce (
Picea abies (L.) Karst.) stems was developed [
53], and its accuracy has been evaluated and compared with the accuracy of other possible solutions [
54].