6.1. Site Characteristics and Differences
The results revealed considerable differences between coniferous and deciduous forests, as well as variations within the individual sites. In the Huss site, the analysis resulted in generally higher mean Δ
h100 under leaf-off conditions (7–8 m) compared to leaf-on conditions (5–6 m). Similar findings were observed for the standard deviation, with higher values during winter (3.5–4.3 m) compared to summer (2.0–2.5 m). In deciduous forests, these differences are surprisingly high but could potentially be attributed to the coarse resolution processing of the data. However, similar results were reported by Schlund et al. [
32] in Hainich, which may be primarily influenced by the absence of leaves in March [
32]. Since the SAR signal in winter interacts solely with the canopy structure, the leaves in summer hinder the SAR signal to some extent from penetrating deeper into the canopy [
75]. Additionally, frost potentially reduces the dielectric constant in the vegetation, leading to higher mean Δ
h100 [
31,
76] and consequently, lower positions of the SPC within the canopy. The results indicate that the mean Δ
h100 depends on various factors, including environmental, topographical, and acquisition parameters, and are in line with the findings of other studies [
31,
32,
77].
The analysis comparing the two polarization types revealed generally lower mean Δ
h100 and standard deviations for VV polarization compared to HH. This difference could be attributed to a slightly higher ground contribution to HH polarization [
31] since leaf-off conditions in March allow the SAR signal to interact more with the ground surface than under leaf-on conditions. This effect may be further increased by frozen vegetation conditions (decreased vegetation dielectric constant) as frost can occur even in March in Hainich. Consequently, both factors decrease volume attenuation, resulting in slightly higher mean Δ
h100 in HH polarization [
31]. However, the penetration for both polarizations remained stable in summer (5–6 m), as the ground contribution in the HH polarization is minimized due to the presence of leaves and the absence of frost.
The comparison between the two sites revealed a smaller mean Δ
h100 in the coniferous forest (3–4 m) compared to the deciduous forest, even under leaf-on conditions (5–6 m). These results are in line with the findings of Praks et al. [
50], who found a mean Δ
h100 of 4 m. However, the mean Δ
h100 in the deciduous forest was consistently lower than in the coniferous forest [
50], which is inconsistent with the findings in this study. Nevertheless, when considering the mean Δ
h100 as a percentage relative to the average tree height, the coniferous forest exhibited a slightly smaller value (15%) compared to the deciduous forest (16–18%) (see
Figure 6). Izzawati et al. [
49] found that the height underestimation of TanDEM-X data is higher for cone-shaped than for ellipse-shaped crowns and strongly depends on the tree density within a forest patch. Hence, it could be argued that the higher mean Δ
h100 in the Huss site may be attributed to the lower tree density, as the estimated densities were approximately 350 trees/ha in Roda and 130 trees/ha in the Huss site. Additionally, only averages were used within a 12 × 12 grid cell, which could be influenced by clear-cut areas with high Δ
h100. However, it has to be mentioned that the number of trees per hectare was estimated only for trees taller than 5 m. Other studies accounted for undergrowth and found a tree density of approximately 330 trees/ha (stem diameter > 7 cm) in Hainich [
78,
79]. As the influence of undergrowth on the behavior of the SAR signal at this spatial resolution is expected to be minimal, the undergrowth was generally considered negligible in the analysis.
Another potential factor that could contribute to the observed differences is the tree height itself. Soja et al. [
25] found a general increase of Δ
h100 with increasing tree height. Considering that the average tree height at the Huss site (34 m) is considerably higher than at the Roda site (23 m), this height difference may explain the higher penetration observed in the deciduous forest compared to the coniferous forest. As a result, it can be concluded that the InSAR Height varies across different forest types, specifically between coniferous and deciduous forests.
The layerwise analysis conducted between InSAR Height and
h showed a significant linear relationship (
p-values < 0.001) starting from medium tree heights in both sites. This finding supports the first objective of the study, indicating that the SPC coincides with the center of
h in these layers. At the Huss site, the layer at 27.5 m showed the strongest agreement with the InSAR Height, which was found to be the layer with the most ALS points (
Appendix A.1). In the lowest densest layer, considerable height differences of 20 m or more appeared in both study areas. Regarding the second objective of this study, this may indicate that the canopy density of the overstory layers is sufficiently high to hinder the SAR signal from reaching the densest layer. On the contrary, at the Huss site, the majority of these pixels are located on the forest road at the southern limit or in small clearings within the forest, and consequently, this could partly be the result of geometric distortions caused by the SAR side-looking geometry. Additionally, the uneven distribution of
h samples (as seen in
Figure 7), with a concentration of densest layer pixels in the upper parts of the canopy (i.e., the tree crown), could increase the influence of outliers in layers with fewer samples. In contrast to the deciduous forest, the SAR signal does not penetrate through the densest coniferous layers. This may be attributed to the different shapes of the crowns, as cone-shaped crowns are generally expected to interact differently with the SAR signal compared to ellipse-shaped crowns [
49].
The analysis of mean InSAR Heights revealed that, on average, the SAR signal does not fully penetrate through
h in any of the scenes (see
Figure 8). However, there are noticeable differences between the polarization types. The lower mean InSAR Height in HH polarization suggests a slightly higher ground contribution, as mentioned earlier. Additionally, in HH polarization, the mean InSAR Height is more than two meters deeper in March compared to the summer acquisitions. Similar results are observed in VV polarization, but with smaller differences between March (−0.4 m) and May (+0.7 m). These findings could be attributed to the leaf-off conditions in the broadleaf forest during winter. When compared to the deciduous forest, the higher mean InSAR Height in the coniferous patch (on average 1.9 m above the center of
h) suggests that the densest part of coniferous trees is located slightly higher than that of deciduous trees. Consequently, the SAR signal “gets stuck” at upper heights.
6.2. Data and Methodology
One of the challenges of comparing the InSAR Heights with
was the comparatively low spatial resolution of the ALS data (12 m). This approach is used by many other studies [
31,
32,
71,
72,
73], but one may argue that generating a CHM at this resolution could systematically overestimate the forest heights. On the contrary, high-resolution processing allows the laser beams to penetrate deeper into the forest, which may underestimate the forest top height [
30,
31,
80]. In our study, we tried both approaches and found that the majority of ALS heights in the second approach were lower than the InSAR Height. Thus, we decided to consider only the first approach, which allows compensating for the underestimation of ALS-derived forest heights [
31,
81,
82]. However, at a resolution of 12 m, the CHM loses more information compared to ALS data processed at 1 m, as multiple trees (i.e., crowns) could fall within a grid cell [
83]. In terms of SAR, this mixed-pixel problem results in a combination of scattering phase centers from individual trees. As a consequence, the InSAR Height of a pixel can be influenced by different tree crowns. In this context, another limitation to consider is the relatively small size of the study areas, which may call into question the broad applicability of the results, especially in areas with heterogeneous forest stands. However, since both study sites are characterized by dense and homogeneous forest stands, they are generally assumed to provide a good representation of the characteristics of both forest types.
A further potential source of error lies in the definition and processing of canopy densities. Due to the different point densities of the ALS campaigns (as shown in
Table 3), it was not possible to analyze and compare absolute canopy densities. Therefore, canopy density was defined as the ratio of the number of ALS points per height layer and grid cell to the total number of points in all layers within that grid cell (ORD). As described in
Section 4.1.2, this methodology does not fully account for the attenuation of the SAR signal when penetrating through the canopy layers. Conversely, NRD ignores overstory returns to a certain amount, which would result in much higher densities in the understory [
45]. According to Campbell et al. [
45], it is not yet clear which of the two methods performs better or which should be preferred for estimating canopy density using discrete return lidar.
Further, it has to be mentioned that the densest layer processed according to the ORD method may not necessarily represent the actual densest part of the canopy. The ALS data were collected during winter, and consequently, the laser interacts primarily with the branches and stems of the trees. In the deciduous forest, this could lead to different densities compared to leaf-on conditions, as the presence of leaves is generally expected to lead to more ALS points within the point cloud. Additionally, the side-looking geometry of the laser beam can cause some pulses to penetrate through the canopy, hitting lower branches or the ground before producing the first return [
84]. These so-called “pits” result in an underestimation of ALS points in the upper layers of the canopy. However, since the ground points are masked out in this study, the influence of these pits is expected to be minimal.
The vertical and horizontal rasterization of the forest patch allows for some potential error. The number of lidar points per square meter is specified with >4 points, and thus, the number of density values within a 1 m voxel would be quite small (e.g., 0%, 33%, 66% or 100% for 3 points within a voxel). The horizontal resolution was resampled to 12 m to match the resolution of the TanDEM-X pixels, while the vertical resolution was set to 5 m, resulting in eight height layers in Hainich and five layers in the Roda site (see
Figure 7). One may argue that higher horizontal resolutions and smaller layer intervals would yield more accurate results. However, increasing the spatial resolution leads to a substantial decrease in the number of points within a voxel, leading to no-data voxels at some point. Several layer intervals between 1 m and 10 m were tested, and a 5 m layer interval was found to be the most suitable in this study. Having more lidar points per voxel would allow for smaller layer intervals, potentially improving the accuracy of the analysis. To overcome this limitation, the use of full-waveform lidar data should be considered, which can provide more detailed information about the canopy structure by utilizing statistical metrics such as the height of median energy (HOME) [
85]. However, this study is based on discrete return lidar data, and therefore, the mentioned methodology using full-waveform lidar data could not be employed.
The interpretation of InSAR Heights relative to h was based on considering the mean height (e.g., 12.5 m for the layer between 10 m and 15 m) as the center of this layer. However, it should be noted that the densest part of the layer may not always be located exactly at the center. This results in a maximum potential error of 2.4 m within each densest layer. Smaller layer intervals could decrease this error but require higher point densities than those used in this study.
Another important factor to consider is the relative height accuracy of the DEM, which is specified as 2 m for flat terrain (slopes < 20°) and 4 m for mountainous regions (slopes > 20°) [
62,
63,
86,
87]. Regarding the interpretation of the data, this allows for some margin of error, as the accuracy of the InSAR Height measurements is strongly influenced by the relative height accuracy of the TanDEM-X data. Abdullahi et al. [
39] took into consideration a height offset to mitigate systematic height errors by using time-staggered TanDEM-X mosaics for all acquisitions. However, as the two sites are characterized by predominantly flat terrain, it is expected that the relative height error is generally low.
In addition to tree height and canopy density, other topographic or radar-specific variables may influence the penetration behavior of X-band SAR data into the canopy. Izzawati et al. [
49] investigated factors such as crown shape, incidence angle, and slope. Results from model simulations indicated that variations in viewing angle and small slopes (<30°) have minimal effects. Regarding tree density, dense canopy forests were found to have the smallest errors and the least additional errors due to slope. Thus, these effects were generally assumed as negligible in this study. Further, the incidence angle could potentially affect Δ
h100, but since it remained relatively constant for all acquisitions in the study sites, its influence was not further investigated (see
Table 2). Additionally, longer perpendicular baselines between the acquisitions are generally expected to be more sensitive to phase noise [
32]. However, the Huss site showed almost similar effective baselines, indicating comparable conditions for all acquisitions (see
Table 2). In contrast, the Roda 2012 acquisition had a considerably higher baseline, which could lead to slightly more phase noise. Moreover, the oblique side-looking geometry of SAR data causes a tree height-dependent ground range offset. In this regard, Soja et al. [
25] found that the difference between ascending and descending is greater for tall trees than for smaller trees.
Due to cost- and time-expensive campaigns in Thuringia, the acquisition of ALS data is conducted in a five-year cycle. In this study, the InSAR Height could not be analyzed in its phenological stages as it was carried out by Praks et al. [
50]. Therefore, no conclusion can be drawn regarding seasonal differences in the SAR signal, except for spring and summer. Another factor to be considered is the time offset between the data, which ranges from 12 to 17 months. Particularly in the Huss site, the summer acquisitions may not accurately represent the true mean Δ
h100 since they are compared with ALS campaigns conducted in winter. These variances are generally expected to be lower in the coniferous forest. However, the long time span between the acquisitions introduces a margin of error, as small parts of the forest could have changed due to wind throws or selective logging. Since all SAR scenes were acquired after the ALS campaign, this would result in extremely high mean Δ
h100 in those pixels. Given that only a few trees were affected per year, the influence of this effect was generally considered low. The same applies to the Roda site. Nevertheless, the comparison between different seasons could be important at some point, as weather conditions (e.g., precipitation, soil moisture, and frost) may influence the penetration behavior of the SAR signal [
31,
76,
88]. Moreover, forest growth could decrease the height difference between TanDEM-X and ALS data or even lead to negative Δ
h100, but since most trees in the study sites are mature, their growth in height was considered marginal [
32].