1. Introduction
Functional diversity of forests is essential for preserving the stability of terrestrial biodiversity. However, biodiversity is increasingly being threatened by climate change caused by large-scale, manmade changes to the biosphere [
1,
2,
3]. Therefore, there is a need for regular monitoring of forest functional diversity, to effectively preserve and protect it. For this purpose, the Group of Earth Observation Biodiversity Observation Network (GEO BON), in collaboration with the broader biodiversity research community, developed a list of essential biodiversity variables (EBVs). The aim is to bridge the gap between ecological in situ observations and remotely sensed equivalents. EBVs represent an open framework of variables that are scalable, temporally sensitive, and ecologically meaningful [
4,
5]. Improving the understanding between EBV ecosystem structure and ecosystem functional diversity may help in achieving global biodiversity targets [
6].
One of the major descriptors of ecosystem functions in terrestrial habitats is vegetation structure [
7]. Vegetation structure in forests is quantified by measuring three-dimensional (3D) morphological traits such as tree height, canopy cover, and structural complexity [
6]. These morphological traits have a mechanistic relationship to ecosystem properties (e.g., leaf area index, biomass, and carbon storage), indicating the status of ecosystem functions and services [
6,
8]. Structural EBVs aim to assess vertical and horizontal structural variability of vegetated habitats [
4]. Several studies have suggested that while species richness is linked to ecosystem services, other diversity measurements, such as functional and structural diversity, are better predictors of key ecosystem functions and can capture changes in ecosystem functions that species richness does not reflect [
9,
10]. Information on species diversity and richness can be difficult to acquire, as all species need to be first located and then correctly identified, which is often a challenging task in forest environments [
9,
11,
12]. Calders et al. [
8], Schweiger et al. [
12] showed that functional diversity measurements helped improve quantitative monitoring of changes in ecosystem functions. The habitat heterogeneity hypothesis, which predicts that species biodiversity increases with a greater habitat heterogeneity, does not necessarily apply to all species groups. Heidrich et al. [
13] found that horizontal and vertical forest structure measures are, in most cases, good descriptors of habitat heterogeneity. However, a single universal mechanism determining a clear pattern and a unique relationship between habitat heterogeneity and biodiversity is still missing.
Recent studies have highlighted the need for high-spatial-resolution data capable of capturing individual plants [
8,
14]. Such fine detail enables new applications, for instance, delineating foliage of individual plants and detecting intraspecies trait variations [
15,
16,
17], which subsequently improves the prediction of ecosystem properties [
17]. Moreover, detailed spatial information about the habitat helps with modelling species distribution at local to regional scales [
18] and improves our understanding of animal interactions with their habitats [
19]. Other applications benefiting from high-spatial-resolution observations are mapping of fire fuel load and smaller ecosystem units such as canopy elements [
20,
21,
22,
23] or detection of habitat disturbances through size estimation of woody species populations that are more resilient to environmental fluctuations [
24]. Therefore, quantifying morphological traits of individual plants can improve functional biodiversity mapping of local biotopes and regional ecosystems.
Several studies have demonstrated the capability of remote sensing techniques for measuring morphological traits across landscapes and natural ecosystems [
6,
25,
26]. In particular, active light detection and ranging (lidar) technology has been proven to be highly effective in capturing 3D forest structure [
6,
27]. Lidar is effective in modelling forest understorey [
28]. Other methods for capturing 3D forest structure, such as photogrammetric structure from motion (SfM) techniques, were found to be less effective in capturing the full canopy profile than lidar because lidar can penetrate the canopy deeper [
29,
30]. Although a variety of platforms has been used, lidar collections of high-density point clouds (e.g., 100
/
−2), detailing leaves of individual trees, are often performed with on-ground sensors, i.e., by terrestrial laser scanning (TLS) or mobile laser scanning (MLS). Typically, such collections are limited to local plots with time requirements of 3–6 days per 1
[
31]. In contrast to TLS and MLS, airborne laser scanning (ALS) and satellite platforms can cover large areas from landscapes to continents, but their point cloud density is significantly lower. Maintaining a relatively high point density, unoccupied aerial systems (UAS, popularly known as drones) mounted laser scanners (hereafter referred to as ULS) have the potential to (i) capture detailed morphological traits across local landscape scales, and (ii) link and upscale in situ TLS observations to ALS or spaceborne lidar observations [
32,
33].
The main advantage of UAS remote sensing is its ability to collect data at a variety of spectral, spatial, and temporal scales, tailored to the application requirements. UASs are capable of collecting data from which similar information can be extracted compared to in situ field observations but over larger areas, advancing the understanding of ecosystem functions through remote sensing [
8,
34,
35]. Compared to other remote sensing platforms (e.g., full-size aircraft and satellites), UAS can be deployed on demand, regularly, and more frequently (under overcast conditions), allowing ecologists to observe phenomena, such as forest growth dynamics, in space and through time [
8,
36].
Schneider et al. [
26] created a new framework for mapping functional diversity of forests with six remotely sensed morphological and physiological traits derived from airborne lidar and imaging spectroscopy data.These traits were combined in a multidimensional trait space, quantifying the hypervolume that samples/pixels occupy as a measure of functional diversity. This novel approach can be applied to map functional diversity across a range of scales, from an individual tree to landscapes [
37,
38]. It was later refined by adopting trait probability density (TPD) [
37], developed as a measure of functional diversity by Carmona et al. [
39]. TPD applies a kernel density estimation (KDE) that is more robust towards potential outliers in functional traits, but it requires input traits that are linked to specific ecological processes to be ecologically meaningful. A key question for any algorithm that is applied to fine-scale data, for which it was not originally designed for, is if it is capable of exploiting the full potential of these in situ data [
8].
KDE is limited by the number of traits (dimensions) it can be applied to, with the recommended number of traits being 3–5 [
40,
41]. Selecting the appropriate traits requires careful consideration, as functional diversity indices derived from the trait hypervolume are sensitive to the number and type of traits [
42]. Recent ecology studies have shown that only a small number of plant traits are sufficient to describe ecosystem functions [
7,
17]. High-point-density lidar can measure a large number of traits and, therefore, traits explaining the largest variance in the data and capturing key elements of 3D forest structure [
6] must be selected first. However, certain applications require more plant traits (e.g., capturing intraspecies variations and gaining new functional information), in which case KDE might not be suitable. Reducing the number of dimensions through techniques such as the principal component analysis (PCA) or alternatives to KDE such as one-class support vector machine (SVM) might be more suitable when dealing with higher-dimensional hypervolumes [
43,
44]. These approaches have not been compared in remote sensing studies of functional diversity, and their practical implementations are unknown.
Schneider et al. [
26] successfully demonstrated that ALS remote sensing can map the functional diversity of a European forest. Nevertheless, its scalability to higher-dimensional hypervolumes derived from fine-resolution morphological traits and high-density 3D point cloud data is yet to be tested. Our work aims to contribute to the assessment of functional diversity by providing new insights in assessing detailed ecologically meaningful traits at the spatial scale suitable for capturing individual trees. The original operational scientific contribution of our research is a workflow capable of assessing functional diversity of an Australian forest ecosystem in a high-dimensional space, at the local scale, suitable for long-term monitoring. In addition, we address questions about the applicability and limitations of the workflow. Specifically, we compare the variability in ULS-derived morphological traits to TLS-derived traits and evaluate the applicability of the TPD approach on high-dimensional ULS remote sensing data.
4. Discussion
4.1. Selection of Traits
The main trait selection factor was the suitability of published variables based on the five criteria outlined at the beginning of
Section 2.3. The second selection factor was the recommendation by Valbuena et al. [
6] to use a standardised framework of traits describing canopy height, cover, and structural complexity, which allows for comparison with results of previous studies, e.g., Schneider et al. [
37]. The selection process resulted in nine morphological traits that met the above-outlined criteria, from which we used four core traits selected based on statistical analysis. In our experience, the available size of computer RAM is one of the potential limiting factors when employing a high number of traits for mapping FD without the capability of employing out-of-memory techniques. Although scaling solutions such as Dask for Python [
86] have the potential to reduce the memory requirements, we decided to limit our approach to only four core traits to lower TPD computational demands and hardware requirements when constructing the trait space in the hypervolume. The selection of core traits makes our method faster, more operational, less complex, and, therefore, usable by a broader scientific community.
The four core traits are
,
,
, and
.
was selected for the ecological reasons outlined in
Section 2.3. We selected
over
because of a higher correlation between TLS and ULS results; however, the regression statistical indicators did not point out a clear interpretation advantage of either one. The technical advantages of
are its ability to produce fewer empty cells in the grid layer and its intuitive use, as high values indicate greater vertical structural variability. This relationship in the case of
is negative [
59]. Given the similarity in calculation and the fact that
has been proven to be ecologically significant, we assume that
has equal ability to predict ecosystem services [
26,
60,
67].
was found to have a higher PCA loading compared to
and
, which is why it was selected as a core trait. However, the implementation of
depends on the number of lidar return points. Therefore, the lidar sensor type and platform have an influence on the
performance. Our implementation was based on the number of classified ground points instead of returns because the number of return points between ULS and TLS was inconsistent due to the different viewing angles. Another recent study Jiang et al. [
77] used Beer’s law to estimate leaf area index from aerial full-waveform lidar data, which offers future avenues for our follow-up research.
is a relatively new variable that we found to be consistent between ULS and TLS platforms, as well as across different grid sizes. is based on space voxelisation and, therefore, captures three-dimensional structural complexity of an investigated canopy. , selected because of its statistical performance, is the only trait that captures the whole three-dimensional space, making it a novel addition.
Although
had a consistently high PCA loading, independent of other variables, and was capable of explaining a large portion of the variation in the data that was not captured by other traits, it was not selected due to the lack of clear ecological meaning and poor relative performance in the regression and PCA. A similar performance was found in other studies, e.g., Schneider et al. [
37], but no causal factor for the ecological significance of
is evidenced in the published literature. Yet, it might be a suitable candidate, especially in future studies encompassing structurally different forest environments.
4.2. Comparison between TLS- and ULS-Derived Traits
TLS and ULS have inherent differences that we attempted to minimise through voxelisation of the TLS point clouds. However, the different viewing directions and levels of occlusion of both systems (TLS from the ground up and ULS from the top down) cannot be fully compensated and result in an uneven spread of point density through the vertical column of the forest canopy. Object occlusion in a TLS point cloud results in lack of details on the upper canopy, while an ULS point cloud lacks detailed characterisation of the understorey vegetation.
Since our selection was focused on traits that were consistent for both techniques, the traits that were not selected (i.e.,
,
R,
, and
) might give us some additional insight into the differences between TLS and ULS approaches. As discussed in
Section 4.1,
and
might be more suitable for products of aerial full-waveform lidar systems. Our implementation of
had to be adjusted to use ground-classified points.
relies on the relative height ratio, which is based on the 98th and 25th percentile. These are likely to be different between TLS and ULS due to differences in point density distribution within the vertical column and, therefore, are inconsistent between the two techniques. Finally,
R, based on the standard deviation of the canopy height, is likely to differ since TLS acquisitions are impacted by more frequent object occlusion in the upper parts of the scanned canopy than its ULS acquisitions. The consistency and accuracy between traits-derived ULS and TLS can be improved by using TLS as a local calibration tool for ULS [
87].
4.3. Trait Probability Density Approach Comparison
Comparison of KDE, PCA, and SVM approaches revealed their advantages and disadvantages. In all approaches, the values declined with an increasing number of traits. This is expected because increasing number of traits is also increasing the space within the hypervolume, while the number of data points remains constant. However, based on SVM started at much higher values and dropped noticeably with an increasing number of traits, while both KDE and PCA showed only a slight decline. Hence, the hypervolume derived from SVM is seemingly more sensitive with an increasing number of traits. In our tests, adjusting and parameters of SVM had the strongest impact on values.
was gaining values close to five for all approaches and all tested traits. The artificial dataset is most likely regularly distributed, resulting in a low evenness, and an increase in the hypervolume decreases the evenness even further.
The value of stabilises at a very small value with four or more traits for any of the three approaches. KDE- and PCA-based are almost identical and started higher than SVM, indicating the initial divergence of the dataset. SVM, on the contrary, failed to capture this divergence.
Computation time of , , and for each approach increases significantly for an increasing number of traits. While SVM was found to be unsuitable for computing TPD in high-dimensional hypervolumes, PCA was able to calculate FD output for one more additional trait in comparison to KDE. However, this computation depends on the correlation between traits in the dataset, and PCA might encompass more additional traits when applied to real-world data. PCA has the potential to reduce memory requirements by reducing the number of dimensions. Comparing the computation time for different kernel sizes only, SVM speed was similar to PCA, but it was faster than KDE. This indicates that, except for the FD calculation, SVM might have some benefits over KDE when working with a high-dimensional hypervolume.
4.4. Kernel Size
In the context of TPD computation, the kernel size was determined based on two considerations: (i) its mathematical validity and (ii) a spatial extent of the investigated ecological unit. For KDE, Blonder et al. [
84] recommends that the ratio between kernel size and number of traits should not be below 10. We adopted these recommendations and calculated TPD for kernel sizes of
pixels for 3–5 traits. These kernel sizes, in combination with the number of traits, provided sufficient data points for KDE to produce a mathematically valid result, although some previous studies ignored these recommendations and used a
kernel [
26,
37].
Table A22 in the
Appendix B provides an overview of the minimum recommended kernel sizes in relation to the number of traits.
The second consideration relates to the ecological unit under the functional diversity assessment. Once the size of the kernel is set, the pixel spatial resolution must be appropriately sized to capture sufficient details of the desired ecological unit to which the TPD approach is applied. In our study, focusing on forest canopy elements of dominant trees, high-point-density data enabled us to cover the appropriate spatial extent with a kernel size of at a pixel size of .
4.5. Functional Diversity Maps
It has been shown that other environmental factors, such as soil type, have a strong influence on
[
26,
37]. Therefore, interpretation of FD maps might benefit from ancillary information provided by environmental, topographic, and soil maps or from other existing FD maps. Although a single FD map has a limited explanatory power, one can relate some of its informational content to the actual forest structure. For instance, areas with high
in red colour are related to a fully occupied vertical column in the forest. Towards the edges of these areas, one can find high
in blue colour that might indicate a change or transition in distribution of forest elements occupying space within the vertical column. Finally, the cyan-coloured areas indicate presence of only little vegetation, such as a low-stature understorey. As stated in
Section 2.1, Australian forests are occasionally impacted by wildfires that modify their forest structure. After a fire reduction of the under- and mid-storey layers, the cyan-coloured areas are likely to expand because high
values represent a canopy dominated by stems of large trees. During the post-burning recovery phase, one expects that new areas with high
start to emerge, and they transition over time to high
(red colours). FD maps have the potential to evaluate differences in functional diversity among larger geographical areas. Nevertheless, intercomparison of FD maps is only possible when all maps are based on the same traits, TPD approach, and kernel size. Such a standardised approach can facilitate regular monitoring of functional diversity over time, such as in the example of bushfire recovery monitoring suggested in this section.