1. Introduction
Forest height is one of the most important vertical structure parameters, which is not only an essential element for studies on species composition, stand quality and climate effects [
1], but also a key parameter for simulating ecosystem service functions, forest fires and forest biomass monitoring [
2,
3,
4]. Accurate estimation of forest vertical structure parameters is of great significance for global carbon cycle research and forest resource monitoring and management [
5]. In the process of estimating forest height, the traditional method based on the ground sample standard survey is not only time-consuming and laborious, but it is also difficult to obtain spatially continuous measurements of forest parameters, which cannot satisfy the requirements of modern forest resource management and the ecological environment scientific research. Remote sensing technology provides an effective approach for estimating forest structure parameters at regional and global scales. Interferometric synthetic aperture radar (InSAR) has a series of characteristics such as high temporal resolution, all-weather, no cloud interference and strong penetration ability. It is one of the effective remote sensing tools for estimating forest parameters over large areas. Forest height estimation using InSAR is based on InSAR’s acquisition of the scattering phase center height of vegetation, which is affected by vegetation structure, scattering mechanisms and sensor characteristics (wavelength, polarization, viewing angle, etc.) [
6,
7,
8,
9,
10]. InSAR cannot be easily restricted by various climatic conditions and hence, can interact with the interior of vegetation canopy (e.g., X-band: frequency range 8–12.5 GHz, wavelength range 2.4–3.75 cm, C-band: frequency range 4–8 GHz, wavelength range 3.75–7.5 cm), and provide new information that is not available from visible and infrared remote sensing. Moreover, forest height estimation using InSAR only requires processing image information and does not require the establishment of relationships with other forest parameters. Now InSAR has been widely used in forest height studies [
6,
11,
12].
There are two main ideas for estimating forest height using InSAR techniques: (1) based on phase information, forest height is estimated by interferometric phase difference representing two scattering mechanisms, top of canopy and understory structure, also known as the DEM differencing method; (2) based on coherence magnitude information, forest height is inferred from coherence information by establishing the relationship between body scattering decoherence and forest height (random body scattering model) [
6]. In vegetation-covered areas, the information obtained from InSAR includes the height of the vegetation scattering phase center, which is the basis for forest height estimation. In the study of forest height estimation using the DEM difference method, Cloude et al. [
13] first used Digital Elevation Model (DEM) differential inversion to obtain vegetation height from polarimetric interferometric SAR data in 1998, then various forest tree height inversion algorithms started to emerge. Izzawati et al. [
14] used the DSM obtained by X-band InSAR to distinguish the height of plantation from Ordnance Survey DEM(OSDEM) data. Kenyi et al. [
15] obtained forest height by differentiated processing of the C-band Space Shuttle Radar Terrain Mission (SRTM), DEM, and National Elevation Dataset (NED). For satellite-based SAR, Soja and Ulander [
16] used TanDEM-X data combined with LiDAR-derived high-precision DEM to obtain forest height estimation results. Garestier et al. [
17] found that the height difference between the phase centers of HV (cross polarization mode in which radar transmits electromagnetic waves horizontally and receives them vertically) and HH polarization (copolarization mode in which radar transmits and receives electromagnetic waves horizontally) channels reflected the canopy height of pine forests. The multi-frequency interferometry technology uses the difference of penetration ability of forest wavelength and the difference of high and low frequency SAR to obtain the height of scattering phase center of vegetation body and woodland. Subsequently, it utilizes the phase difference to estimate the forest height. For example, the dual-frequency InSAR method uses the long-wavelength (e.g., L-band: frequency range 1–2 GHz, wavelength range 15–30 cm, P-band: frequency range 280–390 MHz, wavelength range 77–107 cm) InSAR data to penetrate the tree canopy, and then subtracts DTM from the interference surface height of single-channel short-wavelength band (such as X-band) to obtain the estimation of forest canopy height [
18]. Pang et al. [
19] used the C-band and L-band of SIR-SAR for interferometry to estimate the forest stand height. Santos et al. [
20] used X-band and P-band InSAR phase center to obtain forest height. Balzter et al. [
8] estimated forest canopy height while utilizing airborne X-band VV-polarized single-pass and L-band HH-polarized repeat-pass SAR interferometry. Sexton et al. [
21] obtained forest height by differencing the heights obtained from X-band and P-band InSAR data. Moreover, in the study of forest height estimation the coherent magnitude method was used. Cloude proposed the coherent phase-amplitude inversion method in 2005, also known as SINC model, which demonstrated high stability and can ignore the influence of topographic factors [
22]. The feasibility of SINC model was verified using TanDEM-X bipolarization data in high latitudes and temperate rainforests regions [
23]. Qi et al. [
24] used CAMSAR’s X-band to verify the SINC model and proved its higher practical value. Olesk et al. [
25] utilized the SINC model to retrieve forest height from Tandem-X InSAR data and found that the estimation accuracy of deciduous season in broadleaf forest was higher than that of non-deciduous season.
However, when using the above two methods for forest height estimation, it is necessary to collect field data or obtain high-precision Light detection and ranging (LiDAR) data as the real tree height value for verification. These data can also be costly in terms of labor and materials. In practical applications, we can establish the relationship between phase center height and true height depending on the vegetation type and wavelength, and then derive the true vegetation height [
26]. Both classical statistical regression methods and machine learning methods can be used to establish relationships between variables. Compared with traditional statistical methods, machine learning methods have higher accuracy and are suitable for forest areas affected by multiple variables. At present, machine learning is widely used for the inversion of forest height. KS Džeroski [
27] modelled vegetation height and canopy cover in Slovenia by combining LiDAR, Landsat satellite data and machine learning. WANG et al. [
28] used spatially weighted geographic random forest (GRF) and traditional random forest (TRF) methods to predict the canopy height of mixed dry woodland in complex mountainous areas. Maryam et al. [
29] used the machine learning method to combine multi-baseline polarization interferometric SAR (PolInSAR) data with LiDAR to improve the forest canopy height estimation.
However, there are few studies in the past that have used machine learning to establish the relationship between the phase center height acquired by InSAR and the real tree height acquired by UAV. Additionally, no matter which method was used in the previous study, both the image data used for tree height estimation and the measured data used for validation have high precision and large acquisition cost. The research objectives of this study are: 1) Establish the relationship between InSAR phase center height and real tree height using machine learning methods; 2) Verify the applicability of open-source InSAR data and UAV data in forest tree height estimation, then estimate the forest height efficiently and at a low cost. In this study, open source and easily accessible Sentinel-1A C-band data was selected, and two common machine learning methods, random forest and multi-layer perceptron (also known as Deep Learning network), were selected. They are also typical algorithms of integrated learning and single learner, respectively. In this study, the UAV images and InSAR data were utilized to obtain tree height data. The tree height extracted by the UAV demonstrated a small deviation from the actual tree height [
30,
31]. So, the tree height obtained by the UAV was considered the real tree height in this study. The height obtained by InSAR was considered as phase center height [
26]. This study establishes the relationship between the phase center height which calculated by DEM differential interference method and coherent magnitude method with the real tree height, respectively. Finally, two machine learning methods were utilized to evaluate the accuracy of tree height estimation.
2. Materials and Methods
2.1. Study Area Overview
The Wolong Nature Reserve in Wenchuan County and Sichuan Province, with latitude 30°45′37″ N~31°19′23″ N and longitude 102°51′44″ E~103°29′11″ E, is the research object of this study while covering an area of 1736.81 km
2. Wolong Nature Reserve is located in the junction of the mountainous area and plain area, with the highest elevation of 6132 m in the west adjacent to Mountain Siguniang and the lowest elevation of 898 m in the east to Yingxiu Town of Wenchuan City, which is the end part of Zipingpu Reservoir, with an overall average elevation of 3544 m a.s.l. (
Figure 1). The DEM data was obtained from ESA’s PALSAR 12.5 m resolution. The topography in the area was undulating, with an average slope of 31.4°. The slope of the area above 25° reached 74.7%, among which the slope of the Zhenghe River and the north, as well as the south sides of the Yuzi River, demonstrated the most obvious undulation (
Figure 2).
The reserve belongs to the eastern edge of the climate zone of the Tibetan Plateau. Considering its geographical location and topography, it has formed a typical subtropical inland mountain climate. The area is dry in winter and has a lot of precipitation in summer. Three-fifths of the study area has more than 80% vegetation cover.
2.2. Sample Sites Data Processing
A total of 64 sample sites (
Figure 2) were recorded from the field in October 2020, with elevations ranging from 940 m a.s.l.to 4193 m a.s.l. According to the community species composition, the 64 sample sites included 18 shrub forests, 9 tree forests, and 36 mixed tree and shrub forests. According to the nature of trees, there were 32 broad-leaved forests, 31 coniferous forests, and 1 bamboo forest. The distribution of vegetation in the sample plots demonstrated that the study area was rich in species diversity. As a large proportion of trees and mixed trees occupy the study area, it is particularly important to estimate the height of trees.
The field measurements included geodetic values, elevation, slope, aspect, disturbance level, disturbance type, and community depression of each sample site. For trees greater than 1.5 m in height, the diameter at breast height, root diameter, tree height, crown width and LAI were measured. Considering the trees less than 1.5 m in height, the basal diameter, root diameter, tree height and crown width were measured. For shrubs, the clump diameter, root diameter, crown diameter and height were measured. Among them, the coordinates and elevation were measured using a handheld GPS locator (Zhitu H50). The locator utilized BeiDou, GPS, GLONASS multi-star high-precision positioning and coordinate systems including WGS84, Xian80, Beijing54 and CGS2000. The terrain was decided by a multifunctional slope measuring instrument (model JZC-B2), which had a high-strength level tube (double emphasis on the ring), a large dial and a slope angle comparison table. The LAI was measured using vegetation canopy analyzer LAI2000 and a portable app “Smart LAI”. Moreover, tape measurement was also utilized to determine the diameter at the breast height of trees.
2.3. UAV Image Processing and Extraction of Tree Height at the Sampling Site
The UAV equipment is DJI Phantom 4 Pro, which utilizes Flight Autonomy technology, quadrotor UAV as the flight platform with 6 vision sensors, main camera, 2 groups of IR sensors, 1 group of ultrasonic sensors, GPS/GLONASS dual-mode satellite positioning system, IMU and compass dual redundant sensors. DJIGS Pro was used as the ground station system for field image data acquisition. The effective pixels were 20 million for the single-lens visible sensor and the single-frame resolution was 5472 × 3648 pixels. The UAV images were acquired in October 2020. The flight height of the UAV was determined to be 120 m. The latitude cover was between 31°1′33″ N–31°1′40″ N, whereas the longitude cover was between 103°10′1″ E–103°10′8.4″ E. The total covered area was 0.578 km
2, with an average ground sampling distance (GSD) of 3.79 cm. The UAV flew in the horizontal direction according to “s” trajectory. The flight trajectory is shown in
Figure 3a, and the map of the photographed features is shown in
Figure 3b.
A total of five orthophoto maps corresponding to the sample areas were considered for field sampling. There are a total of 56 photos in the sample plot studied in this paper. The photogrammetric processing software Pix4D Mapper was used in the later stage to pre-process the acquired raw images for aerial photography, map formation, feature point extraction and matching, automatic aerial triangulation and digital orthophoto generation. The pre-processing process mainly included: (1) acquiring raw data, i.e., confirming the integrity of raw image data, viewing POS data files, camera parameters and control point files; (2) establishing the project and importing data, i.e., creating the project in Pix4D Mapper, including initialization settings, point cloud encryption, regional 3D reconstruction and orthophoto generation.
In this study, as the tree canopy height was estimated for the study area, the sample area with high forest depression was selected. The field sample site photo and the 3D point cloud images generated by the UAV are shown in
Figure 4. The Digital Surface Model (DSM) and Digital Terrain Model (DTM) generated by the UAV with a resolution of 3.79 cm are shown in
Figure 5. The GSD criterion was used to define the spatial resolution of DSM/DTM. The threshold segmentation used for point cloud segmentation is the most common method to determine the maximum variance between classes, namely the OTSU method [
26].
The mathematical expression of OTSU is mentioned as follows:
Assuming the threshold value as “t”, it falls between the maximum value Z
max and the minimum value Z
min. The point cloud height value “Z
i” is not bigger than t for {Z
1, Z
2, …, Z
l}, and bigger than t for {Z
l+1, Z
l+2, …, Z
n}. The threshold value “t” classifies the point clouds into two classes as ground point clouds G (ground) and tree point clouds T (tree). Therefore, the probability of one of the point clouds can be classified into two classes as:
where, P
G(t) is the probability of the point cloud assigned to the ground point and P
T(t) is the probability of the point cloud assigned to the tree point.
The mean height of tree point clouds was mentioned as “μT(t)”, whereas the mean height of ground point clouds was noted as “μG(t)”. Moreover, the mean height of all point clouds was noted as “μ”. The mean squared differences between the two types of point clouds were termed as and . The mean-variance of all point cloud heights was “”. For the point cloud of the same tree, the mean height “μ” and the mean square height “” were fixed. In addition, there was no relationship with the threshold “t”. The intra-class variance of the two classes of point cloud heights segmented by the threshold “t” was noted as “”. The inter-class variance of the two classes of point cloud heights segmented by the threshold t was termed as “”. According to the principle of selecting the optimal threshold, once “” is largest, the value of “t” will be optimal. After finding the optimal threshold t, the point cloud was divided into two parts as ground point cloud and tree point cloud, followed by the generation of DSM and DTM for sample areas.
As shown in
Figure 4a and based on the field sample survey, the canopy closure in the area was between 0.7–1. There were 60 trees in the 10 m × 10 m sample square with a density of 1, while most of the trees are in the growing stage. The tree crown width was between 1–2 m, and the average tree height was about 9–11 m. The tree forest occupied about 40% of the UAV area (
Figure 3). While using the systematic sampling method, 420 sample points were taken with a sampling spacing of 20 m. After eliminating the non-tree areas and some sample points located in the clearing, 152 sampling points were finally left (
Figure 6).
Since the resolution of UAV data and InSAR data were 0.038 m and 20 m, respectively, it was necessary to resample the UAV image to 20 m. By using the DSM generated by UAV and DTM, the Canopy Height Model (CHM) was calculated, CHM value, slope value, the aspect value generated by DTM were extracted, the height generated by InSAR was also calculated.
2.4. InSAR Data Processing
The Sentinel-1A used in this study was a Level-1 data product for SLC images in IW mode, which was imaged on 1 October 2020 and 25 October 2020, with VH and VV polarization and 5 m × 20 m ground resolution. The Sentinel-1A data was from European Space Agency (ESA)’s Copernicus program.
The process of generating InSAR was performed by SNAP software. The first step was to register the image with parameters set that included (1) setting the master and slave images; (2) setting the subswath; (3) setting the satellite orbit file; and (4) setting the auxiliary DEM data. The DEM data was obtained from ALOS PALSAR 12.5 m DEM and the orbit file was from ESA’s official track information website. The second step was to form the interferogram. The corresponding interference results were obtained based on Orb_Stack generated in the previous step. Subsequently, the topographic phase was removed, and the terrain was corrected using 12.5 m DEM. To reduce noise for multi-looking processing, Gaussian filtering was utilized, followed by Goldstein phase filtering. In the final stage, the geocoding of the results was conducted.
According to the InSAR inversion principle of forest structure parameters (
Figure 7) [
8,
10], the change in scattering characteristics of target ground objects and the elevation of corresponding points can be obtained by the interference phase difference of radar images in different periods when ignoring the influence of different factors in the electromagnetic wave transmission process. For different polarizations at the same wavelength, the ability of cross-polarization to receive echo signals was weaker than that of co-polarization. It is generally believed that the echoes of cross-polarization come from the canopy, while co-polarization comes from the ground. This causes different polarization methods to obtain different radar backscattering coefficients. In this study, cross-polarization VH and co-polarization VV of Sentinel-1A radar was used for calculation. The main image was acquired on 1 October 2020, while the second image was acquired on October 15, 2020. The VH and VV radar images were interferometrically processed based on different interferometric phases corresponding to different ground elevations. The PALSAR DEM data of 12.5 m resolution was subsequently utilized to simulate and remove the topographic phases from the interferogram of the study area. In the calculation process, the body scattering dominated the echo signal of the VH cross-polarization mode, whereas the surface scattering dominated the echo signal of the VV co-polarization mode.
A1 and A2 represent two antenna positions, the baseline B is the distance between A1 and A2, r1 and r2 are the distance of the antenna to ground point P, h is the elevation of P, P0 is the point on the reference plane equidistant from A1 to P, θ and θ0 are the point view of A to P and P0, ɚθ is the amount of change in the view of A1 from P0 to P. The angle between the baseline and the horizontal direction is α, and B⟂ is the component of B in the vertical SAR line of sight direction.
2.4.1. DEM Differential Interference Method
The basic idea of this method is to calculate the canopy phase and ground phase separately, while using the vertical effective wave number k
z and the canopy-ground phase difference to obtain the vegetation height. In the polarization system, the representative scattering becomes closer to the plane scattering once the effective ground-to-body amplitude ratio of a certain polarization channel becomes larger. In this case, the scattering center phase can be regarded as ground phase. If the VH polarization is dominated by body scattering and VV polarization is dominated by ground scattering, then the calculation formula can be presented as Equation (2).
γVV represents the phase after interference processing in the co-polarization mode; γVH represents the phase after interference processing in the cross-polarization mode; ϕv represents the scattering phase center of the vegetation canopy, and ϕs represents the scattering phase center of the ground surface.
The height of the vegetation
hv can be represented by Equation (3).
Out of which, the “
kz” can be expressed as:
where
kz is vertical effective wave number,
p denotes the InSAR data acquisition mode,
B⟂ is the vertical baseline,
λ is wavelength,
θ is the radar view, and
R is the slope distance [
10]. If the vegetation canopy is high and sparsely distributed, then the extinction coefficient will be small and the phase center will be high due to the structure. However, if the vegetation canopy is densely distributed, the phase center may be located anywhere between half of the tree height and the canopy, depending on the vertical structural variation of the vegetation and the average extinction coefficient.
2.4.2. Coherent Magnitude Method
The coherence magnitude method is a model based on the theory of interference decoherence. Due to the short time interval of the C-band InSAR data used in this study, the problem of temporal decoherence was not considered. The C-band InSAR data is dominated by body scattering. Using body scattering decoherence model, also known as SINC model, can estimate the forest height.
If the scattering model is considered as an independent vegetation body, the amplitude ratio of the ground to body is considered to be reduced to 0. If the extinction coefficient was 0, the coherence coefficient is only related to the body scattering coherence. If the ground reflection and extinction coefficients were ignored, the coherence coefficient amplitude can only relate to the tree height. Once the backscattered energy of the vegetation layer is evenly distributed, the coherent tree height z will be consistent with the SINC model. The body scattering decoherence model is expressed as:
Body scattering decoherence γ and forest height
hv are satisfied as a function of SINC [
10] and the simple inversion model between the coherence coefficient amplitude and the tree height can be established using the SINC function as follows:
This method only considers the amplitude information of the complex coherence coefficients and does not consider its phase information. Therefore, this method is also called the coherent amplitude method.
2.5. Machine Learning Methods Used for Tree Height Estimation
Two machine learning methods-random forest (RF) and multilayer perceptron (MLP) were used to combine the phase center height generated by InSAR technology with canopy height generated by UAV. Both of them were run in anaconda using python. Each method included three steps. Firstly, the input variables of two methods were phase center height, slope, aspect, backscattering coefficient and coherence coefficient, in which the topography factor is removed from the input variables when using the coherent magnitude method, and the output variables was canopy height. Secondly, the dataset was divided into two sets: 70% of dataset was taken as the training set, and 30% of them was considered as the testing set. Finally, the testing set was used to evaluate the performance of each method in estimating tree height.
The main idea of modeling using random forests is as follows.
- (1)
The number of training cases (samples) N was 152 and the number of features M was 5;
- (2)
The number of input features, m, was used to determine the decision outcome of a node in the decision tree, where m was much smaller than M;
- (3)
From the 152 training cases (samples), a training set was formed by put-back sampling (i.e., bootstrap sampling) 106 times, and the unsampled cases (samples) were used as predictions to evaluate their errors;
- (4)
For each node, “m” features were randomly selected and the decision of each node in the decision tree was determined based on these features. According on these m features, its optimal split was calculated;
- (5)
Each tree grew intact without pruning, which was likely to be adopted after building a normal tree classifier.
For the multilayer perceptron approach, we complied with the main principle of multilayer perceptron. This experiment fixed the number of hidden layers to 1, which can fit any function “containing a continuous mapping from one finite space to another finite space” [
32]. The number of neurons in the input layer was 5 and the number of neurons in the output layer was 1.
4. Discussion
This study successfully established the model relationship between the phase center height and the real tree height and proved that the forest tree height can be effectively estimated by the combination of InSAR technique and machine learning methods. The method proposed in this paper allows us to obtain the forest height over a large area at low cost and effectively, where the combination of DEM difference method and random forest can achieve higher estimation accuracy, while the combination of coherent magnitude method and random forest can ignore the influence of topographic factors.
4.1. Analysis of Tree Height Estimation Results
The determination of forest tree height is a key element in studies on the estimating forest parameters such as forest biomass. In previous studies, two main methods using UAV airborne LiDAR and photogrammetry were used to extract tree height. The airborne LiDAR can provide detailed estimation of forest vertical scale information [
33] and LiDAR-generated 3D point clouds can segment tree structures to improve the estimation accuracy. However, using airborne LiDAR for multi-time and large-scale forest monitoring is usually difficult in practical applications. While using UAV photogrammetry for tree height extraction can also achieve higher accuracy [
34,
35]. Therefore, the UAV tree height in this study can be regarded as the real tree height. However, while estimating forest tree height over a large area, InSAR data is still considered the mainstream. The difficulties in estimating forest tree height using InSAR data currently lie in the acquisition of high-precision data and mechanistically complex parameter models. While using InSAR data to invert forest tree heights, Feng et al. [
36] used an improved three-stage algorithm that considered topographic factors to extract tree heights in sample areas along transmission line corridors based on ALOS PALSAR dual-polarization satellite data. They verified the accuracy of extracted tree heights using field-collected airborne LiDAR data. Liao [
12] used P-band SAR data, PolInSAR and TomoSAR techniques to conduct an in-depth study on the influence of complex subsurface, topographic relief, and saturation in the estimation of forest above-ground forest biomass. It is closely related to the forest structure parameters. In addition, the use of InSAR data for canopy height inversion can also be affected by surface diagonality. Although L-band and P-band data have strong penetration, they can also have large acquisition costs, in comparison to C-band data, which are easier to obtain and demonstrate better results in forests with low biomass. The analysis of C-band interferometric height discontinuity at forest edges reported by Hagberg et al. [
37] and the effective interferometric C-band height from ERS-1 reported by Askne et al. [
38] proved its validity.
This study proved that combining C-band InSAR data and machine learning methods can establish the relationship between phase center height and real tree height. It can efficiently and cost-effectively estimate the forest height values over a large area, and also solve the problems of high cost of data acquisition and complicated inversion models in previous studies. Nasiri et al. [
39] proved the suitability of integrating UAV-obtained RGB images, Sentinel-2 data, and ML models for the estimation of FCC, Yu et al. [
40] found that the forest structure classification achieved from two-seasonal UAV optic images and LiDAR-derived DSMs can reach 0.9 with the application of an optimal machine learning approach. A combination of InSAR techniques, aerial photography images, and machine learning has been shown to be useful in the study of forest structure parameters. For this study, our results showed that the use of DEM difference method combined random forest was influenced by topography with the highest accuracy of 0.95 and a root-mean-square error of 1.76 m, the accuracy is significantly improved compared to the root-mean-square error of 3.5 m and 7 m in the height extraction studies of Balzter et al. [
8] and Kenyi et al. [
41] while using SRTM C-band and USGS DTM data. The estimation accuracy of the coherent amplitude method combined random forest regression model can reach 0.94 and the root-mean-square error was 1.86 m without considering phase information, it also indicated that using coherent amplitude method to estimate tree height is less affected by topographic factors. However, the performance of multilayer perceptron method was poor when compared with RF. The accuracy of two InSAR methods combined with multilayer perceptron was 0.25 and 0.2, and the root-mean-square error was 3.96 m and 4.13 m. However, the root mean square error of InSAR combined MLP is still better than the results of the previous study. The two machine learning methods used in this study have their own advantages. For example, RF has characteristics such as fast training, high accuracy and resistance to overfitting [
42], MLP has strong nonlinear fitting ability with multiple layers to learn the representation in data [
43]. However, this experiment showed that, despite the advantages of each of the two machine learning methods, RF performs better than MLP in estimating the forest height, the stability and accuracy of MLP which is a single learner is slightly inferior compared RF.
4.2. Analysis of Uncertainty of This Study
In this study, the tree height obtained by UAV photogrammetry was used as the real tree height. Although it has been proved that the tree height value obtained by UAV can be used as the real tree height value in the actual study, there are still errors using this method, which come from the influence of terrain in field measurement, human operation error, sensor error, etc.
The InSAR data used in the study is the open-source Sentinel-1A data, which is one of the main features of this study. The choice of data source does not require an excessive cost, but it also brings some uncertainties. The radar scattering mechanism in forest areas is complex and difficult to control, radar data with longer wavelengths may have a better performance. In addition, InSAR and UAV have different resolutions and have some computational complexity in scale conversion.
In the selection of machine learning methods, we chose two machine learning algorithms that are more commonly used and have certain representativeness, then compared the estimation accuracy of both. However, the current machine learning models are diverse, exploring machine learning tree-height estimation models with higher accuracy is a development direction for future research.
Besides, there are some limitations in this study; the object is broadleaf forest in the growing stage, and the applicability of this approach to coniferous forest and broadleaf forest in the mature stage is unclear. The complexity of the method lies in the inconsistent resolution of the InSAR and UAV images, the selection of the scale conversion method needs to be further explored. In future studies, the research object can be extended to other natural forests or plantations, the InSAR data used could be converted from repeat-pass InSAR to single-pass InSAR, and machine learning methods with higher accuracy can be developed.