Considerations for Assessing Functional Forest Diversity in High-Dimensional Trait Space Derived from Drone-Based Lidar

Leonard Hambrecht; Arko Lucieer; Zbyněk Malenovský; Bethany Melville; Ana Patricia Ruiz-Beltran; Stuart Phinn

doi:10.3390/rs14174287

¹ School of Geography, Planning, and Spatial Sciences, University of Tasmania, Hobart, TAS 7001, Australia

² Department of Geography, University Bonn, 53113 Bonn, Germany

³ Remote Sensing Research Centre, School of Earth and Environmental Sciences, The University of Queensland, Brisbane, QLD 4072, Australia

* Author to whom correspondence should be addressed.

Remote Sens. 2022, 14(17), 4287; https://doi.org/10.3390/rs14174287

This article belongs to the Section Ecological Remote Sensing

Abstract

Remotely sensed morphological traits have been used to assess functional diversity of forests. This approach is potentially spatial-scale-independent. Lidar data collected from the ground or by drone at a high point density provide an opportunity to consider multiple ecologically meaningful traits at fine-scale ecological units such as individual trees. However, high-spatial-resolution and multi-trait datasets used to calculate functional diversity can produce large volumes of data that can be computationally resource demanding. Functional diversity can be derived through a trait probability density (TPD) approach. Computing TPD in a high-dimensional trait space is computationally intensive. Reductions of the number of dimensions through trait selection and principal component analysis (PCA) may reduce the computational load. Trait selection can facilitate identification of ecologically meaningful traits and reduce inter-trait correlation. This study investigates whether kernel density estimator (KDE) or one-class support vector machine (SVM) may be computationally more efficient in calculating TPD. Four traits were selected for input into the TPD: canopy height, effective number of layers, plant to ground ratio, and box dimensions. When simulating a high-dimensional trait space, we found that TPD derived from KDE was more efficient than using SVM when the number of input traits was high. For five or more traits, applying dimension reduction techniques (e.g., PCA) is recommended. Furthermore, the kernel size for TPD needs to be appropriate for both the ecological target unit and the number of traits. The kernel size determines the required number of data points within the trait space. Therefore, 3–5 traits require a kernel size of at least 7 × 7 pixels. This study contributes to improving the quality of TPD calculations based on traits derived from remote sensing data. We provide a set of recommendations based on our findings. This has the potential to improve reliability in identifying biodiversity hotspots.

Keywords:

functional traits; KDE; morphological traits; UAV; ULS; remote sensing; TLS; TPD

1. Introduction

Functional diversity of forests is essential for preserving the stability of terrestrial biodiversity. However, biodiversity is increasingly being threatened by climate change caused by large-scale, manmade changes to the biosphere [1,2,3]. Therefore, there is a need for regular monitoring of forest functional diversity, to effectively preserve and protect it. For this purpose, the Group on Earth Observations Biodiversity Observation Network (GEO BON), in collaboration with the broader biodiversity research community, developed a list of essential biodiversity variables (EBVs). The aim is to bridge the gap between ecological in situ observations and remotely sensed equivalents. EBVs represent an open framework of variables that are scalable, temporally sensitive, and ecologically meaningful [4,5]. Improving the understanding of the relationship between the ecosystem structure EBVs and ecosystem functional diversity may help in achieving global biodiversity targets [6].

One of the major descriptors of ecosystem functions in terrestrial habitats is vegetation structure [7]. Vegetation structure in forests is quantified by measuring three-dimensional (3D) morphological traits such as tree height, canopy cover, and structural complexity [6]. These morphological traits have a mechanistic relationship to ecosystem properties (e.g., leaf area index, biomass, and carbon storage), indicating the status of ecosystem functions and services [6,8]. Structural EBVs aim to assess vertical and horizontal structural variability of vegetated habitats [4]. Several studies have suggested that while species richness is linked to ecosystem services, other diversity measurements, such as functional and structural diversity, are better predictors of key ecosystem functions and can capture changes in ecosystem functions that species richness does not reflect [9,10]. Information on species diversity and richness can be difficult to acquire, as all species need to be first located and then correctly identified, which is often a challenging task in forest environments [9,11,12]. Calders et al. [8] and Schweiger et al. [12] showed that functional diversity measurements helped improve quantitative monitoring of changes in ecosystem functions. The habitat heterogeneity hypothesis, which predicts that species biodiversity increases with a greater habitat heterogeneity, does not necessarily apply to all species groups. Heidrich et al. [13] found that horizontal and vertical forest structure measures are, in most cases, good descriptors of habitat heterogeneity. However, a single universal mechanism determining a clear pattern and a unique relationship between habitat heterogeneity and biodiversity is still missing.

Recent studies have highlighted the need for high-spatial-resolution data capable of capturing individual plants [8,14]. Such fine detail enables new applications, for instance, delineating foliage of individual plants and detecting intraspecies trait variations [15,16,17], which subsequently improves the prediction of ecosystem properties [17]. Moreover, detailed spatial information about the habitat helps with modelling species distribution at local to regional scales [18] and improves our understanding of animal interactions with their habitats [19]. Other applications benefiting from high-spatial-resolution observations are mapping of fire fuel load and smaller ecosystem units such as canopy elements [20,21,22,23] or detection of habitat disturbances through size estimation of woody species populations that are more resilient to environmental fluctuations [24]. Therefore, quantifying morphological traits of individual plants can improve functional biodiversity mapping of local biotopes and regional ecosystems.

Several studies have demonstrated the capability of remote sensing techniques for measuring morphological traits across landscapes and natural ecosystems [6,25,26]. In particular, active light detection and ranging (lidar) technology has been proven to be highly effective in capturing 3D forest structure [6,27]. Lidar is also effective in modelling forest understorey [28]. Other methods for capturing 3D forest structure, such as photogrammetric structure from motion (SfM) techniques, were found to be less effective in capturing the full canopy profile than lidar because lidar can penetrate the canopy deeper [29,30]. Although a variety of platforms has been used, lidar collections of high-density point clouds (e.g., 100 pts m⁻²), detailing leaves of individual trees, are often performed with on-ground sensors, i.e., by terrestrial laser scanning (TLS) or mobile laser scanning (MLS). Typically, such collections are limited to local plots with time requirements of 3–6 days per 1 ha [31]. In contrast to TLS and MLS, airborne laser scanning (ALS) and satellite platforms can cover large areas from landscapes to continents, but their point cloud density is significantly lower. While maintaining a relatively high point density, laser scanners mounted on unoccupied aerial systems (UAS, popularly known as drones), hereafter referred to as ULS, have the potential to (i) capture detailed morphological traits across local landscape scales, and (ii) link and upscale in situ TLS observations to ALS or spaceborne lidar observations [32,33].

The main advantage of UAS remote sensing is its ability to collect data at a variety of spectral, spatial, and temporal scales, tailored to the application requirements. UASs are capable of collecting data from which similar information can be extracted compared to in situ field observations but over larger areas, advancing the understanding of ecosystem functions through remote sensing [8,34,35]. Compared to other remote sensing platforms (e.g., full-size aircraft and satellites), UAS can be deployed on demand, regularly, and more frequently (under overcast conditions), allowing ecologists to observe phenomena, such as forest growth dynamics, in space and through time [8,36].

Schneider et al. [26] created a new framework for mapping functional diversity of forests with six remotely sensed morphological and physiological traits derived from airborne lidar and imaging spectroscopy data. These traits were combined in a multidimensional trait space, quantifying the hypervolume that samples/pixels occupy as a measure of functional diversity. This novel approach can be applied to map functional diversity across a range of scales, from an individual tree to landscapes [37,38]. It was later refined by adopting trait probability density (TPD) [37], developed as a measure of functional diversity by Carmona et al. [39]. TPD applies a kernel density estimation (KDE) that is more robust towards potential outliers in functional traits, but it requires input traits that are linked to specific ecological processes to be ecologically meaningful. A key question for any algorithm applied to fine-scale data for which it was not originally designed is whether it is capable of exploiting the full potential of these in situ data [8].

KDE is limited by the number of traits (dimensions) it can be applied to, with the recommended number of traits being 3–5 [40,41]. Selecting the appropriate traits requires careful consideration, as functional diversity indices derived from the trait hypervolume are sensitive to the number and type of traits [42]. Recent ecology studies have shown that only a small number of plant traits are sufficient to describe ecosystem functions [7,17]. High-point-density lidar can measure a large number of traits and, therefore, traits explaining the largest variance in the data and capturing key elements of 3D forest structure [6] must be selected first. However, certain applications require more plant traits (e.g., capturing intraspecies variations and gaining new functional information), in which case KDE might not be suitable. Reducing the number of dimensions through techniques such as the principal component analysis (PCA) or alternatives to KDE such as one-class support vector machine (SVM) might be more suitable when dealing with higher-dimensional hypervolumes [43,44]. These approaches have not been compared in remote sensing studies of functional diversity, and their practical implementations are unknown.

Schneider et al. [26] successfully demonstrated that ALS remote sensing can map the functional diversity of a European forest. Nevertheless, its scalability to higher-dimensional hypervolumes derived from fine-resolution morphological traits and high-density 3D point cloud data is yet to be tested. Our work aims to contribute to the assessment of functional diversity by providing new insights in assessing detailed ecologically meaningful traits at the spatial scale suitable for capturing individual trees. The original operational scientific contribution of our research is a workflow capable of assessing functional diversity of an Australian forest ecosystem in a high-dimensional space, at the local scale, suitable for long-term monitoring. In addition, we address questions about the applicability and limitations of the workflow. Specifically, we compare the variability in ULS-derived morphological traits to TLS-derived traits and evaluate the applicability of the TPD approach on high-dimensional ULS remote sensing data.

2. Methods

2.1. Study Site and Data Collection

The experimental ULS and TLS data were collected at the Terrestrial Ecosystem Research Network (TERN) supersite in Tumbarumba, located in the Bago State Forest in New South Wales, Australia (latitude: 35.6566°S, longitude: 148.1517°E, elevation: 1260 m AMSL), between 17 and 19 September 2019. The Tumbarumba site was selected for its past and prospective contributions to the calibration and validation of satellite-derived Earth observation products. With an annual average rainfall of 1417 mm and a mean temperature of 9.8 °C [45], the site’s climate is considered temperate. The soil type is classified as redoxic hydrosol and the vegetation type is an open wet sclerophyll Eucalyptus forest with two dominant tree species, mountain gum (Eucalyptus dalrympleana) and alpine ash (Eucalyptus delegatensis), reaching a maximum canopy height of 47 m [45]. Figure 1 shows a canopy height model of the study site, which provides an indication of the distribution of tall trees. The study area is approximately 108 × 106 m in size. After the data collection was completed, the study site was impacted by a natural bushfire event in the summer of 2019/2020.

Figure 1. Photo of the vegetation type on the left and canopy height map with 0.5 m grid cell resolution of Tumbarumba on the right.

2.1.1. Terrestrial Laser Scanning

TLS data were collected with a Leica P30 ScanStation at a scan rate of 1 M pts s⁻¹ and a maximum range of 270 m [46]. The instrument has a survey-grade dual-axis tilt compensation and its laser point spacing was set to 3.1 mm at 10 m. Additionally, it can capture RGB and high-dynamic-range (HDR) digital images of scanned objects. At the Tumbarumba study site, three plots of 30 × 30 m were established, and a scan was performed every 10 m, resulting in 16 scans per plot [31]. In every plot, four Leica black and white targets were distributed in addition to 12 styrofoam spheres used for co-registration of the scans. This sampling pattern was based on the sampling pattern for dense vegetation recommended by Wilkes et al. [31]. Three ground control points (GCP) were placed in each plot for georeferencing purposes. The coordinates for the GCPs were recorded with a dual-frequency real-time kinematic (RTK) global navigation satellite system (GNSS) receiver.

2.1.2. UAS Laser Scanning

A DJI Matrice 600 UAS airframe equipped with a Velodyne VLP-16 Puck lidar scanner was employed for the UAS data collection [47,48,49,50]. The UAS lidar acquired laser returns at a height of 10–15 m above the top of the canopy or around 60 m above ground (depending on the local topography) at a flying speed of 3 m s⁻¹, using a “lawnmower” (overlapping parallel flight lines) pattern with a side overlap of 50%. The lidar sensor was set to a scanning frequency of 66.667 pts s⁻¹ with an across-track field of view of 80°. The Velodyne scanner has an along-track field of view of 30° with 16 parallel scan layers separated by 2°. This configuration allows laser beams to penetrate the canopy in nadir and oblique angles with a resulting point density of 1491.33 pts m⁻². The flight speed was matched to the scan frequency with the aim of achieving an even point density along and across the track.

2.2. Lidar Data Processing

A standardisation preprocessing workflow was applied before extraction of morphological traits of interest. Since TLS and ALS datasets have fundamental differences in point distribution throughout the canopy [51], caused by different sensor positioning, we attempted to reduce these differences through voxelisation. The TLS data were spatially co-registered and georeferenced using the collected GCPs, while the ULS point cloud was directly georeferenced based on onboard GNSS and IMU observations. ULS GNSS data were post-processed against GNSS base-station data acquired at 10 Hz frequency. An in-house Python script for direct georeferencing was applied to produce ULS LAS files from raw scans and post-processed GNSS and IMU data. A graphical overview outlining the workflow for TLS and ULS data processing is presented in Figure 2.

Figure 2. Flowchart showing processing workflows designed for TLS and ULS data interpretations into morphological functional traits.

The TLS data were processed per-plot with the Leica Cyclone Register 360 software [52]. Scans of each plot were spatially co-registered and georeferenced by marking the GCPs as targets in the point clouds and assigning them the reference coordinates collected in the field through an RTK GNSS survey. The co-registration in Register 360 was, however, inaccurate and showed signs of point cloud “ghosting” (point clouds were not correctly aligned and branches appeared twice). Therefore, the TLS point clouds were imported into the RiSCAN PRO 2.10 software [53] followed by manual co-registration based on the Multi Station Adjustment Plugin (MSA). The MSA was applied to improve the accuracy of the co-registration, with an error > 10 cm. The MSA optimises all scan positions, matching the point clouds from neighbouring scans.

The scans were merged and then tiled to be compatible with our airborne laser scanning workflow. Based on the scanner locations, a polygon shapefile including a 5 m buffer was created to select the core area of the scans. The point clouds were ground-classified and height-normalised in the LASTools software [54]. A voxelisation of 20 × 20 × 20 cm was applied to achieve a similar point spacing to the ULS point cloud at the top of the canopy.
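The voxelisation step can be illustrated with a minimal sketch. It assumes that each occupied 20 cm voxel is represented by its centre point; the source does not state how voxels are represented, so the function name `voxelise` and this choice are illustrative only.

```python
import numpy as np

def voxelise(points, voxel_size=0.2):
    """Thin a point cloud to one point (the voxel centre) per occupied voxel.

    points: (n, 3) array of x, y, z coordinates in metres.
    voxel_size: voxel edge length in metres (0.2 m = 20 cm, as in the text).
    Representing a voxel by its centre is an assumption of this sketch.
    """
    points = np.asarray(points, dtype=float)
    # Integer voxel index of every point along each axis
    idx = np.floor(points / voxel_size).astype(np.int64)
    # Keep each occupied voxel once
    occupied = np.unique(idx, axis=0)
    # Convert voxel indices back to centre coordinates
    return (occupied + 0.5) * voxel_size
```

Because every occupied voxel contributes exactly one point, dense TLS regions near the scanner and sparser regions higher in the canopy end up with a comparable point spacing.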

Raw data acquired with the ULS payload Velodyne Puck were converted into the LAS file format with a custom-built script. Noise in ULS data was reduced by applying the noise filtering algorithm in LASTools and by manually removing outliers above known canopy height and below ground in the CloudCompare software [55]. To classify ground and to normalise the ULS point cloud, the same processing steps as for the TLS point clouds were applied in LASTools. To match the spatial extent, both point clouds were clipped based on the earlier-created plot polygon layer, thereby removing the points at cloud edges and avoiding edge artefacts. The final computation of morphological traits is discussed in the following section.

2.3. Morphological Traits

Selection of morphological traits from the literature applied the following criteria: (i) ecological meaningfulness, (ii) relation to at least one ecosystem function (e.g., primary productivity), (iii) relevance to functional diversity assessment, (iv) wide acceptance in the peer-reviewed literature, and (v) need for minimum prior knowledge about the studied vegetation. Additionally, traits needed to be scalable across operational scales, specifically from TLS to ALS, applicable to a wide range of lidar platforms, and capable of being a part of an area-based workflow producing raster maps. Seven traits that met the above-listed criteria were selected and are presented in Table 1.

Table 1. Overview of traits and axis captured.

Our literature review revealed some variations in definitions of morphological forest variables. Therefore, the traits of interest, in the context of this study, are defined as follows: canopy height (CH) describes the maximum vertical distance between the top of canopy and ground. It is a critical variable for quantifying forest biomass, carbon stocks, growth, and primary productivity, and it has been related to species richness [58,65]. Effective number of layers (ENL) is based on several measures of diversity and further advances the concept of the foliage height diversity (FHD) [59,66]. ENL is described by the three variables ${}^{0}\mathrm{DENL}$, ${}^{1}\mathrm{DENL}$, and ${}^{2}\mathrm{DENL}$, defined as:

$${}^{0}\mathrm{DENL} = \sum_{i=1}^{N_{top}} p_{i}^{0} \qquad (1)$$

$${}^{1}\mathrm{DENL} = \exp\left(-\sum_{i=1}^{N_{top}} p_{i} \ln p_{i}\right) \qquad (2)$$

$${}^{2}\mathrm{DENL} = \frac{1}{\sum_{i=1}^{N_{top}} p_{i}^{2}} \qquad (3)$$

where $p_{i}$ is the proportion of points in the ith vertical layer relative to the sum of all points in a point cloud [59,60]. The thickness of the vertical layers was set to 1 m. ${}^{0}\mathrm{DENL}$ computes the total number of layers in the point cloud, which is equivalent to canopy height. Similar to FHD, ${}^{1}\mathrm{DENL}$ is based on the Shannon diversity index [60]. It was used in several studies and is positively related, for instance, to bird diversity [26,37,60,61,67,68,69]. ${}^{2}\mathrm{DENL}$ is based on the Simpson diversity index; however, it does not utilise the natural logarithm of $p_{i}$. This is advantageous for high-point-density clouds, as occlusions and small gaps in the vertical layer profile can lead to empty layers to which the logarithm cannot be applied. Therefore, ${}^{2}\mathrm{DENL}$ uses the square of $p_{i}$, which remedies this issue. Use of ${}^{2}\mathrm{DENL}$ is more intuitive because, unlike FHD, its high values indicate a high vertical structural variability [59].
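Equations (1)–(3) can be sketched in a few lines of Python. The sketch assumes height-normalised return heights and counts only occupied layers for the zeroth-order index (empty layers are also skipped for the logarithm); the function name `enl` and these handling choices are assumptions, not the authors' implementation, which used R and lidR.

```python
import numpy as np

def enl(z, layer_thickness=1.0):
    """Effective number of layers of order 0, 1 and 2 (Equations (1)-(3)).

    z: 1D array of height-normalised return heights in metres.
    layer_thickness: vertical layer thickness in metres (1 m in the text).
    Empty layers are dropped before computing the indices (assumption).
    """
    z = np.asarray(z, dtype=float)
    z = z[z >= 0]
    if z.size == 0:
        raise ValueError("no returns at or above ground level")
    n_layers = max(int(np.ceil(z.max() / layer_thickness)), 1)
    counts, _ = np.histogram(z, bins=n_layers,
                             range=(0.0, n_layers * layer_thickness))
    # Proportion of points per occupied layer
    p = counts[counts > 0] / counts.sum()
    enl0 = float(p.size)                          # Eq. (1): sum of p_i^0
    enl1 = float(np.exp(-np.sum(p * np.log(p))))  # Eq. (2): Shannon-based
    enl2 = float(1.0 / np.sum(p ** 2))            # Eq. (3): inverse Simpson
    return enl0, enl1, enl2
```

For a profile with returns spread evenly over four 1 m layers, all three indices equal four; the more the returns concentrate in a few layers, the further the first- and second-order indices fall below the layer count.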

The canopy ratio (CR) is calculated as the percentage of canopy depth to canopy height and is defined as:

$$CR = \frac{RH_{98} - RH_{25}}{RH_{98}} \qquad (4)$$

where $RH_{25}$ and $RH_{98}$ are the relative heights at the 25th and 98th percentiles of the vertical distribution of points, respectively.
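As a brief illustration of Equation (4), the following sketch computes CR from height-normalised return heights. It assumes linear-interpolation percentiles as provided by NumPy; the function name `canopy_ratio` is hypothetical.

```python
import numpy as np

def canopy_ratio(z):
    """Canopy ratio, Equation (4), from height-normalised return heights.

    Uses NumPy's default (linear interpolation) percentile definition,
    which is an assumption of this sketch.
    """
    rh25, rh98 = np.percentile(np.asarray(z, dtype=float), [25, 98])
    if rh98 <= 0:
        raise ValueError("RH98 must be positive to compute the canopy ratio")
    return float((rh98 - rh25) / rh98)
```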

Rugosity (R) is a measure of structural heterogeneity and a frequently used indicator of forest structural changes and habitat complexity [6,8,16,70]. It is defined as the standard deviation of canopy height [71,72].

Gap fraction (GF) is based on Beer’s law regarding gap fraction theory [73]; it measures the vertical openness of the canopy and is, therefore, related to total forest biomass [74]. GF summarises the fraction of lidar points that reach a defined forest canopy layer and those that pass through the layer as:

$$P_{i} = \frac{N_{[0;z]}}{N_{total} - N_{[0;z+dz]}} \qquad (5)$$

where $N_{[0;z]}$ is the number of returns below a given height $z$, and $N_{total}$ is the total number of returns. $N_{[0;z+dz]}$ is the number of returns below $z + dz$, with $dz$ being the thickness of the layer set to 1 m [62].

Leaf area density (LAD) was designed to provide information about the vegetation understorey and is based on GF.

$$LAD = -\frac{\ln(P_{i})}{k} \qquad (6)$$

with $k$ being the extinction coefficient and set to 0.25, which is suitable for a canopy with an erectophile distribution of leaf inclination angles of eucalypts [75,76]. GF and LAD are based on Beer’s law, which is used to estimate the leaf area index for large-footprint full-waveform lidar [77]. In this case, GF and LAD are categorised as vertical traits along the x, y axis (see Table 1).
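Equation (6) can be applied to a gap fraction profile as sketched below. The gap fractions are taken as given inputs, and layers with a gap fraction of zero (where the logarithm is undefined) are returned as NaN; this handling and the function name `leaf_area_density` are assumptions of the sketch.

```python
import numpy as np

def leaf_area_density(gap_fraction, k=0.25):
    """Leaf area density per layer, Equation (6).

    gap_fraction: array of per-layer gap fractions P_i in (0, 1].
    k: extinction coefficient (0.25 in the text).
    Layers with P_i <= 0 are returned as NaN (assumption).
    """
    p = np.asarray(gap_fraction, dtype=float)
    lad = np.full(p.shape, np.nan)
    valid = p > 0
    lad[valid] = -np.log(p[valid]) / k
    return lad
```

A layer that lets every pulse through (gap fraction of 1) yields a LAD of zero, and LAD grows as the gap fraction shrinks.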

Plant–ground ratio (PGR) is equal to the plant area index, as described in Schneider et al. [63], but applied solely to the first and the last lidar returns. As a simplified lidar proxy of leaf area index, it describes the ratio between the number of ground points and other points in the canopy point cloud.

Box dimensions ($D_{b}$) is a fractal analysis of the 3D shape of a point cloud designed to gain better insight into the shape of individual trees and the architectural cost–benefit ratio [64]. It is sensitive to the outer shape and internal structure of trees. $D_{b}$ describes how many voxels are required to enclose all elements of a 3D point cloud and how this number changes with the ratio of the size of the voxel. Seidel et al. [64] found a positive relationship between $D_{b}$ and growth performances of individual trees. We assume that $D_{b}$ can describe a similar relationship when applied with an area-based approach instead of on individual trees.
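The box-counting idea behind $D_{b}$ can be sketched as follows. This is a generic box-counting estimate, assuming the bounding cube is halved repeatedly and the slope of log(count) against log(1/size) is taken as the dimension; the number of scales and the function name `box_dimension` are illustrative and do not reproduce the exact procedure of Seidel et al. [64].

```python
import numpy as np

def box_dimension(points, n_scales=5):
    """Generic box-counting dimension of a 3D point cloud (sketch).

    The bounding cube is split into 2, 4, ..., 2**n_scales boxes per axis;
    the slope of log(occupied boxes) vs. log(boxes per axis) is returned.
    """
    pts = np.asarray(points, dtype=float)
    pts = pts - pts.min(axis=0)          # shift so all coordinates start at 0
    extent = pts.max()                   # edge length of the bounding cube
    if extent == 0:
        raise ValueError("point cloud has no spatial extent")
    log_divisions, log_counts = [], []
    for k in range(1, n_scales + 1):
        n_div = 2 ** k
        size = extent / n_div
        # Box index per point; points on the far edge go in the last box
        idx = np.minimum(np.floor(pts / size).astype(np.int64), n_div - 1)
        n_occupied = len(np.unique(idx, axis=0))
        log_divisions.append(np.log(n_div))
        log_counts.append(np.log(n_occupied))
    slope, _ = np.polyfit(log_divisions, log_counts, 1)
    return float(slope)
```

Points along a straight line give a value close to 1 and a densely sampled plane gives a value close to 2, so values between 2 and 3 indicate increasingly space-filling structures.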

Extraction of the above-mentioned morphological variables from the ULS and TLS point clouds was performed in R [78] using the lidR package [79]. Each variable can be derived per point cloud (e.g., a whole study site or its plots) but also as a grid metric, where the trait is calculated per individual grid cell, and can be displayed as a 2D grid layer. The two-dimensional grid layers, capturing spatial variation, are suitable to assess the spatial variability of morphological variables within a site or its sub-plots.

2.4. Applicability across Spatial Scales

The above-described morphological traits are expected to be retrievable not only from high-density TLS point clouds but also from lower-point-density ULS and, ultimately, ALS data. To assess the impact of spatial scale and grid resolution on trait retrieval, we tested a range of grid cell sizes from 0.5 up to 5 m. For reference, Table 2 provides an overview of the different lidar platforms and their suitability to deliver products with different grid cell sizes, ranging from typical TLS (around 5 × 5 mm) through ULS and ALS (above 10 × 10 m), up to the spaceborne lidar GEDI (Global Ecosystem Dynamics Investigation, the first spaceborne lidar designed to measure vegetation structure; 25 m ø) [25,80].

Table 2. Overview of grid sizes for different lidar platforms. + indicate the suitability of the grid size for the platform.

2.5. Statistical Analysis

To assess the variance in morphological traits derived from TLS and ULS scans of the three study plots, statistical analyses were performed per trait at a given grid size between the three plots. All traits were scaled between 0 and 1, with a 0.5% cutoff at the top and bottom range to remove outliers [37]. We applied Wilcoxon rank sum tests, calculated correlation coefficients, and derived linear, power-law, and logarithmic regressions. Variance among the plots was representative of the whole study site. For further analysis, the data of the three study plots were merged into a single dataset. In addition to the tests mentioned above, a PCA was performed to assess the variability between TLS and ULS-derived traits [37]. To simulate different spatial scales, these tests were performed at multiple grid sizes listed in Section 2.4, aiming to assess suitability of traits obtained from different lidar platforms. Selection of traits for the next analytical step was performed based on the obtained results.
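The trait scaling described above can be sketched as follows. The sketch assumes that values beyond the 0.5% and 99.5% percentiles are clipped to those percentiles before min–max scaling; the source does not say whether outliers were clipped or discarded, and the function name `scale_trait` is hypothetical.

```python
import numpy as np

def scale_trait(values, cutoff=0.5):
    """Scale a trait to [0, 1] after a percentile cutoff at both ends.

    cutoff: percentage trimmed at the bottom and top (0.5 in the text).
    Outliers are clipped to the cutoff percentiles (assumption).
    """
    v = np.asarray(values, dtype=float)
    lo, hi = np.nanpercentile(v, [cutoff, 100.0 - cutoff])
    if hi == lo:
        raise ValueError("trait has no variation within the cutoff range")
    clipped = np.clip(v, lo, hi)
    return (clipped - lo) / (hi - lo)
```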

2.6. Trait Probability Density

Trait probability density (TPD) is based on an n-dimensional hypervolume called trait space, using traits as the axes. TPD within the trait space was calculated by modelling the data through KDE and estimating the density along the n-dimensional sampling grid with 0.1 trait intervals (equivalent to 10 bins along each trait axis) [37,39]. For a trait space based on three traits, the sampling grid becomes a 3D object of 10 × 10 × 10 = 1000 voxels. Computational resources required to apply KDE to an n-dimensional hypervolume increase exponentially with the number of dimensions (traits) of the hypervolume. Therefore, applying KDE to large datasets of high dimensionality is computationally demanding [81]. It is, therefore, recommended to use this approach only for 3–5 traits [40,41]. Different techniques can be applied to reduce the number of traits. For example, traits can be selected based on (i) ecological meaningfulness (see Section 2.3), (ii) reduced reciprocal correlation and high PCA loading (see Section 2.5), and (iii) core trait functionality capturing at least the canopy height, cover, and structural complexity [6] (see Table 1). If trait selection reduces the number of traits insufficiently or the case requires use of more traits, methods such as dimensionality reduction (e.g., PCA) or alternative approaches such as SVM might be required [43,44]. The number of dimensions might be reduced through PCA, striking a balance between the number of PCA components retained and the percentage of variance explained. Optionally, a machine learning approach such as an SVM classifier can group data into one or two classes. SVM replaces KDE in the trait space and assesses if points lie inside or outside the hypervolume along the sampling grid [81]. In comparison to KDE, the SVM cannot calculate the density of data as it only models the decision boundary between samples inside and outside the hypervolume. SVM could be more suitable for highly dimensional hypervolumes [44].
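The exponential growth of the sampling grid can be made concrete with a short calculation. The sketch below counts grid cells for 10 bins per trait axis and gives a lower bound on memory by assuming a single 8-byte floating-point value per cell; actual implementations need more, for example for the grid coordinates and intermediate arrays. The function names are hypothetical.

```python
def grid_cells(n_traits, bins=10):
    """Number of cells in an n-dimensional sampling grid."""
    return bins ** n_traits

def grid_memory_gb(n_traits, bins=10, bytes_per_value=8):
    """Lower-bound memory (GB) for one value per cell (assumption)."""
    return grid_cells(n_traits, bins) * bytes_per_value / 1e9

for n in (3, 5, 8, 11):
    print(f"{n} traits: {grid_cells(n):,} cells, "
          f">= {grid_memory_gb(n):g} GB")
```

Each additional trait multiplies the number of cells by ten, which is why the approach quickly becomes impractical beyond a handful of traits.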

Independent of the method used, the TPD approach, originally proposed by Carmona et al. [39] and Villéger et al. [43], uses three functional diversity indices to describe the shape of the hypervolume in trait space: functional richness (FRic), functional evenness (FEve), and functional divergence (FDiv). FRic describes the volume of function space occupied by traits [37,43]. FEve is based on the O index proposed by Bulla [82], describing the evenness in the distribution of occupied trait space [83]. FDiv is another measure for the distribution of TPD in the trait space. It indicates if the occupied trait space is distributed more towards the edges of the space or centred in the middle [83]. Before calculating these functional diversity indices, a threshold of >2 data points per cell in the sampling grid was applied to the density estimated by the KDE-based TPD to reduce the influence of outliers. Therefore, the indices were based on at least two estimated data points. No threshold was applied to SVM-based TPD because it does not provide a density estimate.
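Two of the indices can be sketched on a gridded density. The sketch below assumes that FRic is expressed as the fraction of sampling-grid cells that remain occupied after thresholding, and that FEve is the overlap between the normalised TPD and a uniform distribution over the occupied cells, which is our reading of the overlap-based evenness index; both definitions, the threshold handling, and the function name are assumptions rather than the authors' code. FDiv is omitted because its definition is not given in enough detail here.

```python
import numpy as np

def richness_and_evenness(density, threshold=0.0):
    """Sketch of FRic and FEve from a gridded density estimate.

    density: array of density values on the sampling grid (any shape).
    threshold: cells with density <= threshold count as unoccupied.
    FRic = occupied cells / all cells (assumption).
    FEve = sum(min(p_i, 1 / n_occupied)) over occupied cells (assumption).
    """
    d = np.asarray(density, dtype=float).ravel()
    occupied = d > threshold
    n_occupied = int(occupied.sum())
    if n_occupied == 0:
        return 0.0, 0.0
    fric = n_occupied / d.size
    p = d[occupied] / d[occupied].sum()       # normalise to sum to 1
    feve = float(np.minimum(p, 1.0 / n_occupied).sum())
    return fric, feve
```

Under these definitions, a density spread evenly over the occupied cells gives an FEve of 1, and the value falls as density concentrates in fewer cells.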

We applied KDE based on a Gaussian kernel with a bandwidth of 0.1 [37]. PCA components were selected to retain 90% explanatory power; however, due to the trait selection (see Section 2.5), this often resulted in only 2–3 components being used. Our SVM classifier used a radial basis function kernel with ν set to 0.1 and γ at 0.9 to achieve results that were similar to KDE [44]. TPD was applied to the raster-based traits through a moving-window approach with a kernel size of 9 × 9 pixels. Based on the recommendation by Blonder et al. [84], the kernel size of 9 × 9 pixels was chosen to ensure that the ratio of the number of data points to the number of traits exceeds 10. Furthermore, this kernel size was sufficient to capture larger canopy elements at a grid cell size of 0.5 m as our target ecological unit.
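The two estimators can be sketched for a single moving-window position using scikit-learn. The sketch uses the parameter values stated above (Gaussian kernel with bandwidth 0.1; radial basis function kernel with ν = 0.1 and γ = 0.9), evaluates both on a grid of cell centres with 10 bins per axis, and draws random trait values in place of real data; the helper names, the use of cell centres, and the synthetic window are assumptions.

```python
import numpy as np
from itertools import product
from sklearn.neighbors import KernelDensity
from sklearn.svm import OneClassSVM

def sampling_grid(n_traits, bins=10):
    """Cell centres of an n-dimensional grid over [0, 1] (assumption)."""
    centres = (np.arange(bins) + 0.5) / bins
    return np.array(list(product(centres, repeat=n_traits)))

def tpd_kde(samples, grid, bandwidth=0.1):
    """Gaussian KDE evaluated on the grid, normalised to sum to 1."""
    kde = KernelDensity(kernel="gaussian", bandwidth=bandwidth).fit(samples)
    density = np.exp(kde.score_samples(grid))  # score_samples is log-density
    return density / density.sum()

def occupancy_svm(samples, grid, nu=0.1, gamma=0.9):
    """One-class SVM: True where a grid cell lies inside the hypervolume."""
    svm = OneClassSVM(kernel="rbf", nu=nu, gamma=gamma).fit(samples)
    return svm.predict(grid) == 1              # predict returns +1 or -1

# Synthetic stand-in for one 9 x 9 window with three scaled traits
rng = np.random.default_rng(0)
window = rng.uniform(0.3, 0.7, size=(9 * 9, 3))
assert window.shape[0] / window.shape[1] > 10  # data points per trait

grid = sampling_grid(n_traits=3)
tpd = tpd_kde(window, grid)
inside = occupancy_svm(window, grid)
```

The KDE returns a density per grid cell, whereas the one-class SVM only returns a boolean inside/outside mask, which is why no density threshold can be applied to the SVM output.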

Computation of TPD was implemented in Python using the scikit-learn Density Estimation module [85]. The code was executed on a Windows server with an Intel Xeon E5-2637 v3 @ 3.5 GHz CPU and 256 GB RAM. A sensitivity analysis was performed on an artificial simulated dataset to assess the influence of number of dimensions and kernel size on the functional diversity output and computation requirements. All analyses were performed with multiple iterations (n = 10) and the mean and standard deviation are presented where relevant. The first dataset contained a varying number of dimensions, where each dimension comprised 80 data points, simulating a 9 × 9 kernel. The second dataset had a fixed number of four dimensions, but a varying number of data points per dimension. We measured the time that each of the three approaches needed to process these datasets. Next, we applied the three approaches with the moving window, with a kernel size of 7 × 7 pixels, to traits derived from a ULS point cloud of the whole Tumbarumba study site to assess their performance and outputs. The trait grid layers had dimensions of 256 × 261 pixels and consisted of four traits: CH, ${}^{2}\mathrm{DENL}$, PGR, and $D_{b}$ (see Section 3.1). We limited the number of traits to four due to computer resource limitations for computation of the FD maps. While out-of-memory processing is possible, we assumed that typical end-users do not have the required high-performance computing infrastructure or the technical capabilities to implement it. Furthermore, an increase in the number of dimensions results in an exponential increase in required memory, with 11 traits resulting in 2 TB of random access memory (RAM) requirements.

3. Results

3.1. Statistical Analysis

The data points from the three TLS plots were combined to capture the variation in traits across the study site. ULS and TLS traits have similar characteristics, as can be seen by comparing Figure 3a,b. These two figures only show a subset of the traits; the full plots can be seen in Figure A1 and Figure A2 in Appendix A.1.1. Appendix A contains the outputs for ULS and TLS traits, organised by grid size (0.5 m, 1 m, and 5 m). The output of the correlation analysis can be found in Appendix A.1.2, Appendix A.2.2 and Appendix A.3.2, and Wilcoxon tests are presented in Appendix A.1.3, Appendix A.2.3 and Appendix A.3.3. The outputs of the linear regression analysis and PCA explained variation and loadings are in Appendix A.1.4, Appendix A.2.4 and Appendix A.3.4 and Appendix A.1.5, Appendix A.2.5 and Appendix A.3.5, respectively. The outputs of the logarithmic and power-law regression are not included because they did not contain any significant findings.

Figure 3. Pair plot comparing $CH$, $^{2}DENL$, and $D_{b}$ with each other for ULS (blue) and TLS (orange). (a) Selection of ULS traits. (b) Selection of TLS traits.

For area-based traits, $PGR$ had higher PCA loadings for components 1–3 for both TLS and ULS compared to $GF$ and $LAD$ (see Figure A14, Figure A15, Figure A16, Figure A17, Figure A18 and Figure A19 in Appendix A.1.5). $LAD$ PCA loadings were generally higher compared to $GF$. $D_{b}$ showed the highest correlation and significant linear relationship between ULS and TLS of all variables across the different grid sizes. Another potential trait was $CR$, as it consistently had the highest PCA loading of all traits, which is consistent with other studies [37]; however, we limited the number of traits to four due to computational memory constraints.

3.2. Trait Probability Density

The three approaches for computing TPD are presented in two parts. First, we present a sensitivity analysis based on artificial datasets to assess the impact of the number of dimensions and number of data points on the TPD output and computation time. In the second part, we present a remote sensing example of applying the three methods to real-world data for the Tumbarumba study site.

Comparing the TPD outputs of KDE, KDE+PCA, and SVM in relation to the number of dimensions provides insight into how sensitive each approach is to a varying number of traits and how well they can be compared with each other.
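For orientation, the KDE route to a TPD can be sketched as follows. This is a minimal illustration and not the authors' implementation: the random input data, the bandwidth, the grid resolution, and the 5% density threshold are hypothetical, and `occupied_fraction` is only a simple proxy for the occupied share of trait space, not the $FRic$ definition used in this study.

```python
import numpy as np
from sklearn.neighbors import KernelDensity

rng = np.random.default_rng(0)
points = rng.random((49, 3))  # one 7 x 7 window, three traits scaled to [0, 1]

# Regular evaluation grid over the unit trait space (hypothetical resolution).
axis = np.linspace(0.0, 1.0, 15)
grid = np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), axis=-1).reshape(-1, 3)

# Fit a Gaussian KDE to the window and evaluate it on the grid.
kde = KernelDensity(kernel="gaussian", bandwidth=0.1).fit(points)
density = np.exp(kde.score_samples(grid))  # score_samples returns log-density

# Normalise so that the grid cells sum to one, giving a discrete TPD.
tpd = density / density.sum()

# Illustrative proxy: share of grid cells whose density exceeds 5% of the maximum.
occupied_fraction = float(np.mean(density > 0.05 * density.max()))
print(f"TPD sums to {tpd.sum():.3f}; occupied fraction = {occupied_fraction:.3f}")
```

The grid has $15^{3}$ cells here; adding a trait multiplies that number by 15, which is why the number of dimensions dominates the cost of this step.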

TPD outputs for KDE, KDE+PCA, and SVM in relation to the number of dimensions are presented in Figure 4. These results show that TPD outputs for the SVM and KDE approaches can only be produced for up to five and six dimensions, respectively, due to memory constraints. The general trends of the TPD outputs are consistent in relation to the number of traits; however, some outliers exist. In Figure 4a, $FRic$ of SVM is significantly higher compared to $FRic$ of KDE, and it shows a clear decline with an increase in dimensions. $FRic$ of KDE decreases slightly with an increase in the number of dimensions, while PCA-based $FRic$ stays consistent and similar to KDE-based $FRic$ for three and four dimensions. All $FEve$ values (Figure 4b) are identical across the three methods and did not change with an increase in the number of dimensions. Figure 4c shows that $FDiv$ values for KDE and PCA are identical and show an increase with an increase in the number of dimensions from three to four, but then drop to around 0 at five dimensions. $FDiv$ for SVM is at 0.

Figure 4. Chart comparing TPD outputs between KDE, KDE+PCA, and SVM in relation to the number of dimensions. (a) Functional richness. (b) Functional evenness. (c) Functional divergence.

Computation time is an important consideration when choosing an approach. Here, we compare the computation time for calculating $FRic$, $FEve$, and $FDiv$ with the three approaches against the time for only applying KDE, KDE+PCA, and SVM without calculating the TPD outputs. The line represents the mean time of multiple iterations (n = 10) and the shaded area represents the standard deviation.

Comparing Figure 5a,b, strong differences in performance between the three approaches can be observed, indicating that the calculation of the TPD output itself has an impact on performance in terms of computation time and computer resources. As stated above, SVM only managed to calculate TPD outputs for up to five dimensions, due to memory constraints, and it took longer than KDE. However, Figure 5b shows that SVM is faster than KDE for six and seven dimensions. The computation time for KDE and SVM to compute TPD outputs is similar for three and four dimensions. PCA performed the fastest, since the number of PCA components did not change.

Figure 5. Comparison of computation time in relation to number of dimensions. (a) Computation time for TPD outputs. (b) Computation time for kernel only.

Additionally, we compared the computation time to produce KDE and SVM TPD outputs when increasing the number of data points, from $3 \times 3$, $5 \times 5$, $7 \times 7$, …, $15 \times 15$, to simulate common kernel sizes while keeping the number of dimensions at four. Figure 6 shows the mean computation time based on multiple iterations (n = 10) with the shaded area representing the standard deviation. KDE shows a higher increase in computation time with an increase in data points compared to SVM.

Figure 6. Chart comparing TPD computation time between KDE and SVM in relation to the number of data points.
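A timing experiment of this kind can be sketched as below. It is a simplified harness under stated assumptions, not the benchmark used for Figure 6: it times only fitting and evaluating each estimator on a fixed set of 1000 random query points, and the helper `time_model`, the bandwidth, and the $\nu$ and $\gamma$ settings are hypothetical.

```python
import time

import numpy as np
from sklearn.neighbors import KernelDensity
from sklearn.svm import OneClassSVM


def time_model(make_model, score_name, data, query, n_iter=10):
    """Return mean and standard deviation (seconds) of fit plus evaluation time."""
    times = []
    for _ in range(n_iter):
        start = time.perf_counter()
        model = make_model().fit(data)
        getattr(model, score_name)(query)  # evaluate the fitted model on the query points
        times.append(time.perf_counter() - start)
    return float(np.mean(times)), float(np.std(times))


rng = np.random.default_rng(0)
query = rng.random((1000, 4))  # fixed evaluation points in a four-trait space
results = {}
for k in range(3, 16, 2):  # kernels of 3 x 3, 5 x 5, ..., 15 x 15 pixels
    data = rng.random((k * k, 4))
    results[k] = {
        "kde": time_model(lambda: KernelDensity(bandwidth=0.1), "score_samples", data, query),
        "svm": time_model(lambda: OneClassSVM(nu=0.1, gamma="scale"), "decision_function", data, query),
    }

for k, timing in results.items():
    print(k, {name: f"{mean:.4f} +/- {std:.4f} s" for name, (mean, std) in timing.items()})
```

Absolute timings from such a harness depend on hardware and library versions, so only the trend with the number of data points is informative.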

Next, the three approaches were implemented in a moving-window approach to be applied to the ULS dataset. The TPD outputs are presented per approach in Figure 7, Figure 8, Figure 9 and Figure 10.
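The moving-window step can be sketched with NumPy. The sketch assumes the trait layers are stacked into one array of shape (rows, columns, traits) and that only windows lying fully inside the grid are processed; edge handling in the actual implementation is not described here, and the generator name `window_points` is hypothetical.

```python
import numpy as np


def window_points(traits, kernel=7):
    """Yield (row, col, points) for every full kernel x kernel window.

    traits has shape (rows, cols, n_traits); points has shape
    (kernel * kernel, n_traits) and row/col index the window centre.
    """
    windows = np.lib.stride_tricks.sliding_window_view(
        traits, (kernel, kernel), axis=(0, 1)
    )  # shape: (rows - kernel + 1, cols - kernel + 1, n_traits, kernel, kernel)
    half = kernel // 2
    n_traits = traits.shape[2]
    for i in range(windows.shape[0]):
        for j in range(windows.shape[1]):
            points = windows[i, j].reshape(n_traits, -1).T
            yield i + half, j + half, points


# Placeholder grid with the dimensions of the trait layers used in this study.
traits = np.zeros((256, 261, 4))
row, col, points = next(window_points(traits, kernel=7))
print(row, col, points.shape)
```

Each yielded `points` array is what a density estimator would be fitted to for the pixel at the window centre.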

Figure 7. KDE functional diversity maps with $FRic$, $FEve$, and $FDiv$ in their own figure.

Figure 8. SVM functional diversity maps with $FRic$, $FEve$, and $FDiv$ in their own figure.

Figure 9. PCA, from four traits, functional diversity maps with $FRic$, $FEve$, and $FDiv$ in their own figure.

Figure 10. PCA, from nine traits, functional diversity maps with $FRic$, $FEve$, and $FDiv$ in their own figure.

Figure 7, Figure 8, Figure 9 and Figure 10 show functional diversity maps computed for the three different approaches. Computation time of the KDE images was 19 min, and SVM took 18 min to compute. Figure 9 and Figure 10 were produced with the PCA approach, one with the four selected traits and the other one with all nine original traits. Both PCA maps used three components. The first calculation took 9 min and explained 92% of the variance, while the second took 14 min and explained 87% of the variance.

The $FRic$ maps have proportionally more bright areas compared to the $FEve$ and $FDiv$ maps. Comparing the close-up images, it is clear that areas with high $FRic$ have low $FEve$ and $FDiv$, while high $FEve$ and $FDiv$ values seem to occur together. Some of these observations might correlate with areas where the presence or absence of vegetation changes rapidly. Comparing Figure 7a to the study site in Figure 1, high $FRic$ can be observed on the edge of the canopy of large trees, while $FRic$ is low in the centre of these trees. $FDiv$ maps show geometric artefacts, which is consistent with the patterns from other studies [26]. These artefacts occur along the edges of canopies where $FRic$ is high.

The SVM map (Figure 8) shows clearly different patterns in the distribution of the traits when compared to the KDE map, and is not directly comparable. Areas of high $FRic$ are continuous with a clear edge, separating them from the other areas. While similar patterns can be clearly observed in the $FDiv$ image, these artefacts are particularly noticeable in the $FDiv$ maps from the SVM results.

Comparing the effect of PCA in the FD maps, Figure 9 looks similar to the KDE images in Figure 7 with only slight changes in brightness. However, comparing the close-up of Figure 9, showing PCA results based on four traits, to Figure 10, showing PCA results based on nine traits, reveals subtle differences in FD patterns. PCA from nine traits reduces the proportion of high $FEve$ and $FDiv$ values compared to PCA from four traits.

4. Discussion

4.1. Selection of Traits

The main trait selection factor was the suitability of published variables based on the five criteria outlined at the beginning of Section 2.3. The second selection factor was the recommendation by Valbuena et al. [6] to use a standardised framework of traits describing canopy height, cover, and structural complexity, which allows for comparison with results of previous studies, e.g., Schneider et al. [37]. The selection process resulted in nine morphological traits that met the above-outlined criteria, from which we used four core traits selected based on statistical analysis. In our experience, the available size of computer RAM is one of the potential limiting factors when employing a high number of traits for mapping FD without the capability of employing out-of-memory techniques. Although scaling solutions such as Dask for Python [86] have the potential to reduce the memory requirements, we decided to limit our approach to only four core traits to lower TPD computational demands and hardware requirements when constructing the trait space in the hypervolume. The selection of core traits makes our method faster, more operational, less complex, and, therefore, usable by a broader scientific community.

The four core traits are $CH$, $^{2}DENL$, $PGR$, and $D_{b}$. $CH$ was selected for the ecological reasons outlined in Section 2.3. We selected $^{2}DENL$ over $^{1}DENL$ because of a higher correlation between TLS and ULS results; however, the regression statistics did not indicate a clear interpretive advantage for either one. The technical advantages of $^{2}DENL$ are its ability to produce fewer empty cells in the grid layer and its intuitive use, as high values indicate greater vertical structural variability. This relationship in the case of $^{1}DENL$ is negative [59]. Given the similarity in calculation and the fact that $^{1}DENL$ has been proven to be ecologically significant, we assume that $^{2}DENL$ has equal ability to predict ecosystem services [26,60,67].

$PGR$ was found to have a higher PCA loading compared to $GF$ and $LAD$, which is why it was selected as a core trait. However, the implementation of $PGR$ depends on the number of lidar return points. Therefore, the lidar sensor type and platform have an influence on the $PGR$ performance. Our implementation was based on the number of classified ground points instead of returns because the number of return points between ULS and TLS was inconsistent due to the different viewing angles. Another recent study by Jiang et al. [77] used Beer’s law to estimate leaf area index from aerial full-waveform lidar data, which offers future avenues for our follow-up research.

$D_{b}$ is a relatively new variable that we found to be consistent between ULS and TLS platforms, as well as across different grid sizes. $D_{b}$ is based on space voxelisation and, therefore, captures three-dimensional structural complexity of an investigated canopy. $D_{b}$, selected because of its statistical performance, is the only trait that captures the whole three-dimensional space, making it a novel addition.
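To illustrate how a voxel-based box dimension responds to structure, the following minimal sketch uses a generic box-counting estimator: the slope of the log number of occupied voxels against the log of the inverse voxel size. This is an assumption about the general technique, not the exact $D_{b}$ formulation applied in this study, and the function name `box_dimension` and the chosen voxel sizes are hypothetical.

```python
import numpy as np


def box_dimension(points, box_sizes):
    """Estimate the box-counting dimension of an (n, 3) point array."""
    points = np.asarray(points, dtype=float)
    origin = points.min(axis=0)
    counts = []
    for size in box_sizes:
        # Assign every point to a voxel of edge length `size` and count occupied voxels.
        voxels = np.floor((points - origin) / size).astype(int)
        counts.append(len(np.unique(voxels, axis=0)))
    inverse_sizes = 1.0 / np.asarray(box_sizes, dtype=float)
    slope, _ = np.polyfit(np.log(inverse_sizes), np.log(counts), 1)
    return float(slope)


# A flat, fully sampled plane should give a dimension close to 2.
xs, ys = np.meshgrid(np.arange(64), np.arange(64))
flat_plane = np.column_stack([xs.ravel(), ys.ravel(), np.zeros(xs.size)])
plane_dimension = box_dimension(flat_plane, box_sizes=[1, 2, 4, 8])
print(round(plane_dimension, 3))
```

A point cloud that fills the vertical column more completely would yield a value closer to 3, which is the sense in which such a measure captures three-dimensional structural complexity.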

Although $CR$ had a consistently high PCA loading, independent of other variables, and was capable of explaining a large portion of the variation in the data that was not captured by other traits, it was not selected due to the lack of clear ecological meaning and poor relative performance in the regression and PCA. A similar performance was found in other studies, e.g., Schneider et al. [37], but no causal factor for the ecological significance of $CR$ is evidenced in the published literature. Yet, it might be a suitable candidate, especially in future studies encompassing structurally different forest environments.

4.2. Comparison between TLS- and ULS-Derived Traits

TLS and ULS have inherent differences that we attempted to minimise through voxelisation of the TLS point clouds. However, the different viewing directions and levels of occlusion of both systems (TLS from the ground up and ULS from the top down) cannot be fully compensated and result in an uneven spread of point density through the vertical column of the forest canopy. Object occlusion in a TLS point cloud results in a lack of detail in the upper canopy, while a ULS point cloud lacks detailed characterisation of the understorey vegetation.

Since our selection was focused on traits that were consistent for both techniques, the traits that were not selected (i.e., $CR$, R, $GF$, and $LAD$) might give us some additional insight into the differences between TLS and ULS approaches. As discussed in Section 4.1, $GF$ and $LAD$ might be more suitable for products of aerial full-waveform lidar systems. Our implementation of $PGR$ had to be adjusted to use ground-classified points. $CR$ relies on the relative height ratio, which is based on the 98th and 25th percentiles. These are likely to differ between TLS and ULS due to differences in point density distribution within the vertical column and are, therefore, inconsistent between the two techniques. Finally, R, based on the standard deviation of the canopy height, is likely to differ since TLS acquisitions are impacted by more frequent object occlusion in the upper parts of the scanned canopy than ULS acquisitions. The consistency and accuracy of ULS- and TLS-derived traits can be improved by using TLS as a local calibration tool for ULS [87].

4.3. Trait Probability Density Approach Comparison

Comparison of KDE, PCA, and SVM approaches revealed their advantages and disadvantages. In all approaches, the $FRic$ values declined with an increasing number of traits. This is expected because increasing the number of traits also increases the space within the hypervolume, while the number of data points remains constant. However, $FRic$ based on SVM started at much higher values and dropped noticeably with an increasing number of traits, while both KDE and PCA showed only a slight decline. Hence, the hypervolume derived from SVM is seemingly more sensitive to an increasing number of traits. In our tests, adjusting the $\nu$ and $\gamma$ parameters of SVM had the strongest impact on $FRic$ values. $FEve$ took values close to five for all approaches and all tested traits. The artificial dataset is most likely regularly distributed, resulting in a low evenness, and an increase in the hypervolume decreases the evenness even further.

The value of $FDiv$ stabilises at a very small value with four or more traits for any of the three approaches. KDE- and PCA-based $FDiv$ are almost identical and started higher than SVM, indicating the initial divergence of the dataset. SVM, on the contrary, failed to capture this divergence.

Computation time of $FRic$, $FEve$, and $FDiv$ for each approach increases significantly with an increasing number of traits. While SVM was found to be unsuitable for computing TPD in high-dimensional hypervolumes, PCA was able to calculate FD output for one additional trait in comparison to KDE. However, this computation depends on the correlation between traits in the dataset, and PCA might encompass more additional traits when applied to real-world data. PCA has the potential to reduce memory requirements by reducing the number of dimensions. Comparing the computation time for different kernel sizes only, SVM speed was similar to PCA and faster than KDE. This indicates that, except for the FD calculation, SVM might have some benefits over KDE when working with a high-dimensional hypervolume.

4.4. Kernel Size

In the context of TPD computation, the kernel size was determined based on two considerations: (i) its mathematical validity and (ii) the spatial extent of the investigated ecological unit. For KDE, Blonder et al. [84] recommend that the ratio between the number of data points in the kernel and the number of traits should not be below 10. We adopted this recommendation and calculated TPD for kernel sizes of $7 \times 7$ pixels for 3–5 traits. These kernel sizes, in combination with the number of traits, provided sufficient data points for KDE to produce a mathematically valid result, although some previous studies ignored these recommendations and used a $3 \times 3$ pixels kernel [26,37]. Table A22 in Appendix B provides an overview of the minimum recommended kernel sizes in relation to the number of traits.
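The ratio behind this recommendation is simple arithmetic and can be tabulated with a short sketch. It assumes that every pixel in a square kernel contributes one data point; the function name `points_per_trait` is hypothetical, and the printed values are not a reproduction of Table A22.

```python
def points_per_trait(kernel_size, n_traits):
    """Number of data points in a square kernel divided by the number of traits."""
    return kernel_size ** 2 / n_traits


for n_traits in (3, 4, 5):
    for kernel_size in (3, 5, 7, 9):
        ratio = points_per_trait(kernel_size, n_traits)
        print(f"{n_traits} traits, {kernel_size} x {kernel_size} kernel: ratio = {ratio:.2f}")
```

With four traits, a $3 \times 3$ kernel gives a ratio of 2.25 and a $7 \times 7$ kernel gives 12.25. With five traits, a $7 \times 7$ kernel gives 9.8, which sits marginally under a strict threshold of 10.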

The second consideration relates to the ecological unit under the functional diversity assessment. Once the size of the kernel is set, the pixel spatial resolution must be appropriately sized to capture sufficient details of the desired ecological unit to which the TPD approach is applied. In our study, focusing on forest canopy elements of dominant trees, high-point-density data enabled us to cover the appropriate spatial extent with a kernel size of $9 \times 9$ pixels at a pixel size of $0.5 \times 0.5$ m.

4.5. Functional Diversity Maps

It has been shown that other environmental factors, such as soil type, have a strong influence on $FRic$ [26,37]. Therefore, interpretation of FD maps might benefit from ancillary information provided by environmental, topographic, and soil maps or from other existing FD maps. Although a single FD map has limited explanatory power, one can relate some of its informational content to the actual forest structure. For instance, areas with high $FRic$ in red colour are related to a fully occupied vertical column in the forest. Towards the edges of these areas, one can find high $FDiv$ in blue colour that might indicate a change or transition in the distribution of forest elements occupying space within the vertical column. Finally, the cyan-coloured areas indicate the presence of only little vegetation, such as a low-stature understorey. As stated in Section 2.1, Australian forests are occasionally impacted by wildfires that modify their forest structure. After a fire-induced reduction of the under- and mid-storey layers, the cyan-coloured areas are likely to expand because high $FRic$ values represent a canopy dominated by stems of large trees. During the post-burning recovery phase, one expects that new areas with high $FDiv$ start to emerge, and they transition over time to high $FRic$ (red colours). FD maps have the potential to evaluate differences in functional diversity among larger geographical areas. Nevertheless, intercomparison of FD maps is only possible when all maps are based on the same traits, TPD approach, and kernel size. Such a standardised approach can facilitate regular monitoring of functional diversity over time, such as in the example of bushfire recovery monitoring suggested in this section.

5. Conclusions

In this paper, we investigated the role of trait dimensionality in the TPD approach by comparing different techniques for quantifying hypervolumes and methods to reduce the data dimensionality. These approaches were tested on morphological traits derived from high-point-density lidar data of Australian forest canopies collected with UAS and TLS systems. Our study expanded the knowledge on the existing workflow established by Schneider et al. [26] by assessing functional diversity from multi-trait TPD in the context of lidar remote sensing observations. We identified the following functional traits to be ecologically meaningful and statistically sound, while maintaining relative consistency in traits derived from ULS and TLS data: canopy height, effective number of canopy layers, plant–ground ratio, and box dimensions. We compared three different methods for calculating TPD in a high-dimensional hypervolume from the morphological forest traits. The outcomes revealed that an optimal selection of morphological canopy traits combined with principal component analysis and an appropriate kernel density calculation resulted in the most efficient functional diversity estimation. This approach provided consistent results for a varying number of traits and tested spatial resolutions in a remote sensing context. Yet, we would like to highlight the importance of the relationship between the kernel size and the number of traits for producing statistically robust and scientifically sound functional diversity maps. In our research, we learned that trait selection plays an important role when assessing functional diversity at different spatial scales from data acquired by different lidar systems, and we formulated our most important findings in the following set of recommendations.

Only ecologically and statistically sound traits should be included.
If five or more traits are selected, PCA transformation should be applied. No more than four PCA components retaining 80–95% of variance explained should be used.
KDE is computationally more efficient than SVM and is, therefore, recommended for computing TPD.
The kernel must have a size suitable to capture the ecological target unit; however, it should be at least $7 \times 7$ pixels when using 3–5 traits.
Depending on the kernel size, collected data should have a pixel spatial resolution capable of capturing the required spatial details of the targeted ecological unit.
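The PCA recommendation can be illustrated with a minimal sketch that picks the smallest number of components reaching a variance target, capped at four. It assumes standardised traits and uses a plain singular value decomposition; the function name `choose_components`, the 80% target as the selection rule, and the synthetic nine-trait data are hypothetical and do not reproduce the analysis of this study.

```python
import numpy as np


def choose_components(traits, min_variance=0.80, max_components=4):
    """Return (k, cumulative explained variance) for an (n_samples, n_traits) array."""
    # Standardise each trait so that no single trait dominates the variance.
    z = (traits - traits.mean(axis=0)) / traits.std(axis=0)
    singular_values = np.linalg.svd(z, compute_uv=False)
    explained = singular_values ** 2 / np.sum(singular_values ** 2)
    cumulative = np.cumsum(explained)
    # Smallest number of components whose cumulative variance reaches the target.
    k = int(np.searchsorted(cumulative, min_variance) + 1)
    return min(k, max_components), cumulative


# Synthetic example: nine correlated traits driven by two latent factors.
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 2))
mixing = rng.normal(size=(2, 9))
demo_traits = latent @ mixing + 0.05 * rng.normal(size=(500, 9))

k, cumulative = choose_components(demo_traits)
print(k, np.round(cumulative[:4], 3))
```

If the cap of four components is reached before the variance target is met, the returned cumulative variance makes that shortfall visible, which would suggest revisiting the trait selection.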

The results and recommendations concluded from this work are expected to support the research effort for global monitoring of functional vegetation diversity through remote sensing by improving the quality of functional biodiversity spatial assessments.

Author Contributions

A.L. and S.P. secured funding for the project; A.L. conceived the ideas; L.H. and A.L. designed the methodology; A.L. and L.H. collected the data; L.H., A.L. and A.P.R.-B. prepared the data; B.M., L.H. and A.L. wrote the code; L.H. and A.L. analysed the data; L.H., A.L. and Z.M. led the writing of the manuscript. All authors contributed critically to the manuscript drafts and gave final approval for publication. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially funded by the Australian Government through the Australian Research Council’s Discovery Projects funding scheme (project DP180103460).

Data Availability Statement

Data will be made openly available through TERN Data Discovery Portal in accordance with the TERN terms of use. A DOI for the data will be available.

Acknowledgments

This work is supported by the use of Terrestrial Ecosystem Research Network (TERN) infrastructure and funding, which is enabled by the Australian Government’s National Collaborative Research Infrastructure Strategy (NCRIS). The authors would like to thank Will Woodgate, Emiliano Cimoli, and Juliane Bendig for their support.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Statistical Analysis

Appendix A.1. Grid Size 0.5 m

Appendix A.1.1. Plot Data at 0.5 m Grid Size

Figure A1. Scatterplot matrix of ULS traits at 0.5 m grid size.

Figure A2. Scatterplot matrix of TLS traits at 0.5 m grid size.

Appendix A.1.2. Correlation at 0.5 m Grid Size

Figure A3. ULS correlation matrix at 0.5 m grid size.

Figure A4. TLS correlation matrix at 0.5 m grid size.

Table A1. Overview of correlation between ULS and TLS traits at 0.5 m grid size.

Trait	Correlation
$^{1} D E N L$	0.600701
$^{2} D E N L$	0.347168
$D_{b}$	0.157310
$C H$	0.446589
$C R$	0.043564
$G F$	0.166342
$L A D$	0.165330
$P G R$	0.226489
R	0.330439

Appendix A.1.3. Wilcoxon Test at 0.5 m Grid Size

Table A2. Overview of Wilcoxon test results at 0.5 m grid size.

Trait	W	p-Value
$^{1} D E N L$	$1.18227 \times 10^{8}$	$1.67179 \times 10^{- 29}$
$^{2} D E N L$	$1.0753 \times 10^{8}$	$7.12492 \times 10^{- 110}$
$D_{b}$	$3.92161 \times 10^{7}$	0
$C H$	$9.5478 \times 10^{7}$	0
$C R$	$7.13027 \times 10^{7}$	0
$G F$	$1.33403 \times 10^{8}$	0.0241798
$L A D$	$1.34264 \times 10^{8}$	0.163594
$P G R$	$4.65149 \times 10^{7}$	0
R	$1.03943 \times 10^{8}$	$1.14384 \times 10^{- 210}$

Appendix A.1.4. Linear Regression at 0.5 m Grid Size

Table A3. Overview of ULS linear regression at 0.5 m grid size.

X	Y	A	B	R²	RMSE
$^{1} D E N L$	$^{2} D E N L$	0.215091	0.127912	0.279958	0.131094
$^{1} D E N L$	$D_{b}$	0.00752914	0.576217	0.000390069	0.144851
$^{1} D E N L$	$C H$	0.429321	0.522204	0.399033	0.200232
$^{1} D E N L$	$C R$	−0.436834	0.745498	0.263619	0.277466
$^{1} D E N L$	$G F$	0.065624	0.20423	0.0380166	0.125456
$^{1} D E N L$	$L A D$	−0.0654678	0.398805	0.0220413	0.16573
$^{1} D E N L$	$P G R$	0.345233	0.410379	0.419777	0.154252
$^{1} D E N L$	R	0.301955	0.448956	0.243971	0.20201
$^{2} D E N L$	$D_{b}$	−0.300517	0.641774	0.102692	0.137239
$^{2} D E N L$	$C H$	0.780059	0.520697	0.217695	0.228453
$^{2} D E N L$	$C R$	−0.385628	0.661853	0.0339492	0.317803
$^{2} D E N L$	$G F$	0.0544472	0.217523	0.00432463	0.127634
$^{2} D E N L$	$L A D$	−0.374048	0.452281	0.118901	0.157309
$^{2} D E N L$	$P G R$	0.626913	0.409242	0.228749	0.177841
$^{2} D E N L$	R	0.507538	0.456474	0.113905	0.218697
$D_{b}$	$C H$	0.023127	0.670128	0.000168282	0.258269
$D_{b}$	$C R$	−0.714061	0.994834	0.102368	0.306343
$D_{b}$	$G F$	−0.0953114	0.284078	0.0116544	0.127163
$D_{b}$	$L A D$	0.205753	0.255065	0.031639	0.164915
$D_{b}$	$P G R$	0.424685	0.294186	0.0923166	0.19293
$D_{b}$	R	−0.0225023	0.575443	0.000196907	0.232306
$C H$	$C R$	−0.536924	0.948358	0.183961	0.292088
$C H$	$G F$	0.0802974	0.174003	0.026291	0.126218
$C H$	$L A D$	0.237826	0.211647	0.134355	0.155923
$C H$	$P G R$	0.547531	0.16585	0.487717	0.14494
$C H$	R	0.786997	0.0244859	0.76552	0.112501
$C R$	$G F$	−0.161387	0.322712	0.166434	0.116782
$C R$	$L A D$	0.039433	0.351281	0.00578838	0.167101
$C R$	$P G R$	−0.427951	0.788892	0.466916	0.147853
$C R$	R	−0.113494	0.628395	0.0249493	0.229412
$G F$	$L A D$	0.0394476	0.365176	0.000906514	0.167511
$G F$	$P G R$	0.508362	0.42374	0.103108	0.19178
$G F$	R	0.187975	0.519388	0.0107104	0.231081
$L A D$	$P G R$	−0.0732528	0.56751	0.00367505	0.202131
$L A D$	R	0.640295	0.322811	0.213322	0.206064
$P G R$	R	0.569222	0.254978	0.246163	0.201717

Table A4. Overview of TLS linear regression at 0.5 m grid size.

X	Y	A	B	R²	RMSE
$^{1} D E N L$	$^{2} D E N L$	0.259072	0.125031	0.330936	0.133259
$^{1} D E N L$	$D_{b}$	−0.00767013	0.70368	0.000547971	0.1185
$^{1} D E N L$	$C H$	0.430284	0.546475	0.379723	0.198943
$^{1} D E N L$	$C R$	−0.0686811	0.753639	0.00926953	0.256863
$^{1} D E N L$	$G F$	−0.031123	0.245778	0.00772719	0.127585
$^{1} D E N L$	$L A D$	−0.0271697	0.385896	0.00353271	0.165074
$^{1} D E N L$	$P G R$	0.238619	0.296743	0.233342	0.156468
$^{1} D E N L$	R	0.337549	0.407016	0.284584	0.193609
$^{2} D E N L$	$D_{b}$	−0.0323939	0.708001	0.00198233	0.118415
$^{2} D E N L$	$C H$	0.760602	0.538919	0.24064	0.22012
$^{2} D E N L$	$C R$	−0.177999	0.767424	0.0126274	0.256427
$^{2} D E N L$	$G F$	−0.0113033	0.236609	0.00020671	0.128068
$^{2} D E N L$	$L A D$	−0.285755	0.439214	0.0792541	0.158678
$^{2} D E N L$	$P G R$	0.644431	0.243069	0.345168	0.144607
$^{2} D E N L$	R	0.430607	0.438001	0.0939279	0.217885
$D_{b}$	$C H$	−0.1963	0.845548	0.00848485	0.251528
$D_{b}$	$C R$	−0.250691	0.903544	0.0132589	0.256345
$D_{b}$	$G F$	0.136055	0.138749	0.0158538	0.127062
$D_{b}$	$L A D$	−0.0500356	0.410763	0.0012863	0.16526
$D_{b}$	$P G R$	0.213243	0.236868	0.0200068	0.176903
$D_{b}$	R	−0.204117	0.676759	0.0111723	0.227618
$C H$	$C R$	−0.159323	0.840657	0.0243211	0.254904
$C H$	$G F$	−0.0899681	0.297792	0.0314831	0.126049
$C H$	$L A D$	0.278758	0.178343	0.181315	0.149625
$C H$	$P G R$	0.34654	0.140965	0.239956	0.155791
$C H$	R	0.760947	−0.0050223	0.705162	0.124291
$C R$	$G F$	−0.184106	0.3681	0.137598	0.118943
$C R$	$L A D$	0.0479266	0.340814	0.0055938	0.164903
$C R$	$P G R$	−0.345989	0.63814	0.249645	0.154795
$C R$	R	0.0390485	0.505292	0.00193804	0.228678
$G F$	$L A D$	−0.14807	0.410361	0.0131526	0.164275
$G F$	$P G R$	0.662696	0.231174	0.225607	0.157255
$G F$	R	−0.396077	0.626434	0.0491176	0.223208
$L A D$	$P G R$	−0.0765121	0.415054	0.00501308	0.178251
$L A D$	R	0.530108	0.334553	0.146666	0.211449
$P G R$	R	0.305275	0.415783	0.0567985	0.222305

Table A5. Overview linear regression between TLS and ULS at 0.5 m grid size.

Trait	A	B	R²	RMSE
$^{1} D E N L$	0.631066	0.138877	0.360842	0.303833
$^{2} D E N L$	0.329217	0.135556	0.120525	0.144883
$D_{b}$	0.192276	0.444299	0.0247464	0.143075
$C H$	0.456647	0.360222	0.199442	0.231103
$C R$	0.0545837	0.541631	0.00189782	0.323032
$G F$	0.16612	0.19	0.0276696	0.126129
$L A D$	0.16755	0.311257	0.0273339	0.165281
$P G R$	0.256659	0.440949	0.0512973	0.197241
R	0.335388	0.383412	0.10919	0.219278
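The columns of the regression tables can be reproduced in outline with a short sketch. It assumes that A is the slope and B the intercept of an ordinary least-squares line fitted to co-registered TLS and ULS grid cells, with TLS as the predictor; these readings of the column headers, the function name `compare_platforms`, and the synthetic data are assumptions, since the fitting details are not stated here.

```python
import numpy as np


def compare_platforms(tls, uls):
    """Fit uls = A * tls + B over cells valid in both grids; return A, B, R2, RMSE."""
    tls = np.asarray(tls, dtype=float).ravel()
    uls = np.asarray(uls, dtype=float).ravel()
    valid = np.isfinite(tls) & np.isfinite(uls)  # skip empty (NaN) cells
    x, y = tls[valid], uls[valid]
    a, b = np.polyfit(x, y, 1)
    residuals = y - (a * x + b)
    r2 = 1.0 - np.sum(residuals ** 2) / np.sum((y - y.mean()) ** 2)
    rmse = np.sqrt(np.mean(residuals ** 2))
    return float(a), float(b), float(r2), float(rmse)


# Synthetic demonstration data, not values from this study.
rng = np.random.default_rng(0)
tls_demo = rng.random(1000)
uls_demo = 0.5 * tls_demo + 0.2 + 0.1 * rng.normal(size=1000)

a, b, r2, rmse = compare_platforms(tls_demo, uls_demo)
print(f"A = {a:.3f}, B = {b:.3f}, R2 = {r2:.3f}, RMSE = {rmse:.3f}")
```

For a simple linear regression, R² equals the squared Pearson correlation. The tabulated values are consistent with this: for $^{1}DENL$, the correlation of 0.600701 in Table A1 squares to approximately 0.360842, the R² reported in Table A5.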