Multispectral Image Determination of Water Content in Aquilaria sinensis Based on Machine Learning

Wang, Peng; Wu, Yi; Wang, Xuefeng; Shi, Mengmeng; Chen, Xingjing; Yuan, Ying

doi:10.3390/f14061144

Open AccessArticle

Multispectral Image Determination of Water Content in Aquilaria sinensis Based on Machine Learning

¹

Institute of Forest Resource Information Techniques, Chinese Academy of Forestry, Beijing 100091, China

²

Key Laboratory of Forest Management and Growth Modelling, National Forestry and Grassland Administration, Beijing 100091, China

³

College of Forestry, Nanjing Forestry University, Nanjing 210037, China

^*

Author to whom correspondence should be addressed.

Forests 2023, 14(6), 1144; https://doi.org/10.3390/f14061144

Submission received: 30 March 2023 / Revised: 12 May 2023 / Accepted: 29 May 2023 / Published: 1 June 2023

(This article belongs to the Special Issue Advanced Applications in Remote Sensing and GIS to Forest Management and Planning)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

The real-time nondestructive monitoring of plant water content can enable operators to understand the water demands of crops in a timely manner and provide a reliable basis for precise irrigation. In this study, a method for rapid estimation of water content in Aquilaria sinensis using multispectral imaging was proposed. First, image registration and segmentation were performed using the Fourier–Mellin transform (FFT) and the fuzzy local information c-means clustering algorithm (FLICM). Second, the spectral features (SFs), texture features (TFs), and comprehensive features (CFs) of the image were extracted. Third, using the eigenvectors of the SFs, TFs, and CFs as input, a random forest regression model for estimating the water content of A. sinensis was constructed, respectively. Finally, the monarch butterfly optimization (MBO), Harris hawks optimization (HHO), and sparrow search algorithm (SSA) were used to optimize all models to determine the best estimation model. The results showed that: (1) 60%–80% soil water content is the most suitable for A. sinensis growth. Compared with waterlogging, drought inhibited A. sinensis growth more significantly. (2) FMT + FLICM could achieve rapid segmentation of discrete A. sinensis multispectral images on the basis of guaranteed accuracy. (3) The prediction effect of TFs was basically the same as that of SFs, and the prediction effect of CFs was higher than that of SFs and TFs, but this difference would decrease with the optimization of the RFR model. (4) Among all models, SSA-RFR_CFs had the highest accuracy, with an R² of 0.8282. These results confirmed the feasibility and accuracy of applying multispectral imaging technology to estimate the water content of A. sinensis and provide a reference for the protection and cultivation of endangered precious tree species.

Keywords:

Aquilaria sinensis; water content; multispectral image; feature extraction; machine learning

1. Introduction

Aquilaria sinensis is a tree species of Aquilaria in the family Thymelaeaceae and is mainly distributed in coastal provinces such as Hainan, Guangdong, and Guangxi in China and in some Southeast Asian countries [1]. The resin secreted by A. sinensis after injury has a strong fragrance and anti-inflammatory and antioxidant functions. It is widely used in the fields of incense making, pharmaceuticals, and handicraft manufacturing [2,3]. High-quality A. sinensis is very scarce and can be worth tens of thousands of dollars per kilogram. The preciousness of A. sinensis has caused wild A. sinensis to be overcut by humans, and coupled with habitat changes, natural wild A. sinensis is on the verge of extinction. In 2004, A. sinensis was officially listed in Appendix II of the Convention on International Trade in Endangered Species of Wild Fauna and Flora, to prohibit the illegal trade of A. sinensis and make it sustainable [4]. To protect this tree species and meet market demand, the planting of A. sinensis has been vigorously promoted in southern China. However, juvenile A. sinensis is very sensitive to water, and most operators cannot determine the water demand of A. sinensis in real time, resulting in its poor growth and even death. Therefore, operators need a real-time and accurate non-destructive estimation method of the water content of A. sinensis to adjust the water conditions for the growth of A. sinensis in a timely manner, thus ensuring the quality and output of A. sinensis.

In actual production, an operator usually judges whether a plant lacks or has abundant water according to changes in its appearance and color, but this method largely depends on the actual experience and subjective judgement of the operator. In recent years, with the development of artificial intelligence and the Internet of Things, image-based plant water monitoring technology has gradually been applied in practice [5]. This approach has strong flexibility and operability. By establishing the relationship between spectral information and plant water content, real-time feedback on plant water demand can be realized in the subsequent operation process. Hyperspectral imaging technology can capture the rich spectral information of the measured plants and performs well in predicting plant water content [6,7]. For example, Yang et al. [8] used hyperspectral information to construct multiple Back Propagation Neural Network (BPNN) prediction models of winter wheat leaf water content after flooding eastern wheat, providing a theoretical basis for the prevention and control of winter wheat waterlogging disasters. Xuan et al. [9] used hyperspectral imaging technology to accurately evaluate the ripening period and water content of fresh okra fruit, providing technical support for farmers to optimize harvest dates. However, hyperspectral imaging technology still faces some problems, such as its high cost and excessive redundant information. In contrast, multispectral imaging technology does not have these problems, so it is easier to understand and apply [10,11].

At present, many scholars have applied multispectral imaging technology to plant water content monitoring. They have established the relationships between multispectral image features and plant water content by choosing different models to judge the water demands of plants. In previous studies, the most commonly used modeling methods were multivariate statistics and machine learning algorithms [12,13,14]. Among them, multivariate statistics can quantitatively describe the functional relationship between plant water content and various parameters, with stronger interpretability. However, the nonlinear mapping ability of machine learning algorithms is stronger than that of multivariate statistics. Models such as random forests, support vector machines, and neural networks are not sensitive to the absence of missing values (such as some attribute values in the sample) and they have strong anti-noise capabilities and good predictive capabilities [15].

When using multispectral images to estimate water content, the choice of explanatory variables is very important. Studies have shown that the green and near-infrared bands are ideal for identifying water in plant tissues, and related vegetation indices (VIs), such as the green normalized difference vegetation index (GNDVI), normalized difference vegetation index (NDVI), and optimized soil adjusted vegetation index (OSAVI), have been proven to be accurate in predicting plant water content [16,17]. For example, some scholars extracted multiple VIs from multispectral images of eggplant and built a linear water content prediction model one by one. The results showed that the two models with NDVI and OSAVI as independent variables had the highest prediction accuracy [12]. Torres et al. [13] used near-infrared imaging equipment to obtain the reflectance of 12 bands of olive tree multispectral images. After eliminating abnormal sample information through principal component analysis (PCA), partial least squares regression (PLSR) was used to construct the water content model of olive trees, and both the training and verification sets showed strong robustness, providing a reliable basis for operators to reasonably irrigate olive trees. Malvandi et al. [14] used the successive projection algorithm (SPA) to extract the reflectance of three characteristic bands from apple multispectral images and used them as explanatory variables to construct a PLSR prediction model for apple water content. The correlation coefficient (R²) of the model reached 0.99, realizing the accurate and nondestructive detection of apple water content. In addition, some scholars used the texture features (TFs) of multispectral images as model parameters to estimate water content. For example, Zhou et al. [18] obtained canopy images of winter wheat by using a UAV equipped with a multispectral sensor and constructed three wheat stomatal conductance estimation models, Cubist, BPNN, and Elaboration Likelihood Model (ELM), by extracting and combining the spectral features (SFs) and TFs of the image. The results showed that texture features were significantly correlated with wheat water content, and the accuracy of the comprehensive features (CFs) model based on TFs and SFs was more than 20% higher than that of the single-feature model.

The above studies have shown that multispectral images perform well in monitoring plant water content, but there are few reports on the estimation of water content in precious tree species. At the same time, most studies focus on image feature extraction and model construction, ignoring the process of image segmentation and model optimization. Based on the above considerations, the main goal of this study was to propose a multispectral image estimation method for the moisture content of A. sinensis seedlings under different moisture gradients and to analyze the effect of soil water content on the growth of A. sinensis seedlings. Based on the field water capacity, four water gradients were set up in the experiment, multispectral images of A. sinensis were obtained, and the moisture content was measured. On this basis, the water content estimation model of A. sinensis was constructed. The specific objectives were as follows: (i) Through the quantitative analysis of the growth difference of A. sinensis seedlings under different water gradients, clarify the specific influence of soil water content on its growth and determine the most suitable water conditions for the growth of A. sinensis seedlings. (ii) Combine image registration methods with segmentation algorithms to achieve the fast and accurate segmentation of A. sinensis multispectral images. (iii) Apply dimensionality reduction algorithm to eliminate multicollinearity of image features. On this basis, the water content of A. sinensis was predicted using SFs, TFs, and CFs to analyze the predictive performance of different image features. (iv) Apply the swarm intelligence optimization algorithm to adaptively optimize the model parameters in order to determine the best model for predicting the water content of A. sinensis.

2. Materials and Methods

The technical flow chart of this study is shown in Figure 1. Firstly, images of A. sinensis were collected using the Mica Sense Edge 3™ multispectral camera. Secondly, the Fourier–Mellin transform (FMT) was applied to register the images in different bands, and the fuzzy local information clustering algorithm (FLICM) was applied to segment the image, separating the foreground and background of the image. Again, spectral features (SFs) and texture features (TFs) in the foreground image were extracted, and composite features (CFs) composed of SFs and TFs were obtained. Then, the local tangent space arrangement (LTSA) was used to extract the feature vectors of the three types of image features, and the random forest regression model (RFR) for predicting the water content of A. sinensis was constructed, respectively. In the process of model construction, MBO, HHO, and SSA were used to adaptively optimize the numbers of decision trees and node features. Finally, the accuracy of the constructed model was tested to determine the optimal random forest regression model for predicting the water content of A. sinensis.

2.1. Overview of the Study Site

The study area was located in Wenchang City, Hainan Province, China (19°36′~20°3′ N, 109°12′~111°2′ E), with an average altitude of 42.55 m, belonging to the coastal plain on the northern edge of the tropics (Figure 2). There is no obvious seasonal variation in this area, the annual average temperature is 23.90 °C, the rainfall mainly occurs from May to October, the annual precipitation is 1721.60 mm, and the annual average humidity is 87%. The main soil is coastal sandy soil. The pH value of the soil is between 5.0 and 6.6, which is very suitable for the growth of tropical crops.

2.2. Experimental Design and Data Acquisition

2.2.1. Experimental Design

We used seeds to raise seedlings in this experiment. After 2 years of growth, the tree height, crown width, and ground diameter of all A. sinensis seedlings were measured (mean values were 29.8 cm, 18.9 cm, and 6.1 mm, respectively). We selected 52 A. sinensis seedlings with no pests or diseases and uniform growth (The range of tree height was 29.8 ± 2 cm, the range of crown width was 18.9 ± 2 cm, and the range of ground diameter was 6.1 ± 1 mm) and moved them into flowerpots of the same size. Before transplanting the seedlings, 5 kg of air-dried seaside sandy loam was placed in each flowerpot and the soil nutrient content was measured using a soil nutrient meter, wherein the organic matter content was 10.5 g/kg, the available nitrogen content was 98.3 mg/ kg, the available phosphorus content was 3.38 mg/kg, and the available potassium content was 69.9 mg/kg. In this study, to simulate drought, flood, and normal conditions, we set 4 water gradients based on the field water-holding capacity (Table 1) with 13 replicates at each level, and we evenly divided 52 A. sinensis seedlings into 4 groups. To ensure the normal growth of A. sinensis, we applied the same amount of nitrogen, potassium, and phosphate fertilizers to the 4 groups during the experiment and carried out weeding and pesticide spraying. The experiment lasted for a total of 6 months. After the experiment, A. sinensis was moved indoors to take pictures and measure the basic growth factors and water content. On this basis, we used one-way analysis of variance to test the significance of the differences in height, crown growth, and ground diameter of A. sinensis under different water treatments and used Duncan’s test to analyze the differences between groups. If there is a significant difference between the groups, it will be labeled with a different letter; if there is no significant difference between the groups, the same letter will be used for labeling.

2.2.2. Data Acquisition

In this study, we built a darkroom with a length of 1 m, width of 1 m, and height of 2 m using steel pipes and shading cloth (Figure 3). Except for the direction facing the camera, the other directions of the darkroom were covered by black shading cloth of aluminum foil composite film material. We installed 7 LED lights (Hangzhou SPL Photonics Co., Ltd., Hangzhou, China) on the top and front side frame of the darkroom as an active light source. A. sinensis was placed in the darkroom, and the camera was mounted using a tripod 2 m directly in front of the chamber. The camera center point and A. sinensis center point were kept at the same height. The camera used for image collection was a Mica Sense Ede 3™ equipped with 5 narrow-band spectral sensors, whose specific parameters are shown in Table 2. In actual operation, all A. sinensis were photographed according to the four orientations of due east, due west, due north, and due south. A total of 52 groups of 1040 images (1280 × 960 pixels) were obtained. After the images were taken, A. sinensis was cut at the base of the stem, and the fresh weights of the stem and leaf were determined using an electronic balance with an accuracy of 0.01 g. Then, it was dried in an oven (85 °C) and weighed. The water content was calculated using Equation (1):

W C = (F W - D W) / F W \times 100 %

(1)

where

W C

is the water content of A. sinensis,

F W

is the fresh weight, and

D W

is the dry weight.

2.3. Image Processing

The multispectral images were acquired including a foreground region (called the region of interest, ROI) with A. sinensis information and a background region carrying a lot of irrelevant information. Before extracting the features of the A. sinensis multispectral image, it was necessary to segment the ROI area from the whole image. At the same time, the spectral sensors of the Mica Sense Edge 3™ used in this study were independent of each other, and there was a relative offset between the five discrete images obtained. If the A. sinensis image was directly segmented, the segmentation efficiency would be greatly reduced and the segmentation accuracy of each band would not be uniform. On the contrary, registering the images before segmentation could make the images of each band consistent in space, thus only the image with the highest definition and the easiest segmentation as the reference image for segmentation needed to be used, and the results could be directly applied to other bands.

2.3.1. Image Registration

In this study, we applied the Fourier–Mellin Transform (FMT) to the registration of images. The FMT introduces polar coordinate transformation into the phase correlation method, which can realize fast registration of two images with rotation, zoom, and translation [19,20]. The principle of FMT is as follows.

Let two images be

f_{1} (x, y)

and

f_{2} (x, y)

. Assume that there are rotation, scaling, and translation relationships between these images that can be expressed as shown in Equation (2):

f_{2} (x, y) = f_{1} [a (x \cos θ_{0} + y \sin θ_{0}) - Δ x, a (- x \cos θ_{0} + y \sin θ_{0}) - Δ y]

(2)

where

a

is the scaling factor,

θ_{0}

is the rotation angle, and

Δ x

and

Δ y

are the displacements in the horizontal and vertical directions, respectively.

Fourier transforms are performed on

f_{1} (x, y)

and

f_{2} (x, y)

, and the relative translation between the two images is obtained through the phase spectrum. The magnitude spectrum is calculated on both sides to obtain Equation (3):

M_{2} (u, v) = \frac{1}{a^{2}} M_{1} [\frac{1}{a} (u \cos Δ θ + v \sin Δ θ), \frac{1}{a} (- u \sin Δ θ + v \cos Δ θ)]

(3)

where

M_{1} (u, v)

and

M_{2} (u, v

) are the spectrum amplitudes of

f_{1} (x, y)

and

f_{2} (x, y)

, respectively.

Since the spectrum amplitude is only related to the scaling factor and the rotation angle, the image is consistent with the scaling factor and the rotation angle of the spectrum amplitude. Therefore, the effects of rotation and scaling can be reduced by a logarithmic-polar coordinate transformation, and the polar coordinate transformation is shown in Equation (4):

M_{2} (\lg ρ, θ) = \frac{1}{a^{2}} M_{1} (\lg ρ - \lg a, θ - Δ θ)

(4)

where

M_{1} (\lg ρ, θ)

and

M_{2} (\lg ρ, θ)

are the logarithmic polar coordinates of

M_{1} (u, v)

and

M_{2} (u, v

), respectively.

In polar coordinates, the rotation and scaling of the two images are transformed into translation

(\lg ρ, θ)

. At this time, the relative displacement of the two rotated, scaled, and translated images can be calculated by using the phase correlation method again, and the registration between images can be realized.

2.3.2. Image Segmentation

Affected by factors such as the sensor material and working environment, noise will inevitably be introduced when collecting multispectral images, and it is difficult to accurately segment noise using traditional segmentation algorithms. To meet the research needs, we applied the fuzzy local information clustering algorithm (FLICM) with a strong anti-noise ability for image segmentation. To evaluate the image segmentation effect, we used the partition coefficient Vpc and partition entropy Vpe as the evaluation indices. The calculation methods of Vpc and Vpe are shown in Equations (5) and (6), respectively.

V p c = \sum_{i = 1}^{N} \sum_{k = 1}^{K} u_{k i}^{2} / N

(5)

V p e = - \sum_{i = 1}^{N} \sum_{k = 1}^{K} u_{k i} \times \log (u_{k i}) / N

(6)

where N is the total number of pixels, K is the number of clusters, and

u_{k i}

is the membership degree of the pixel belonging to the Kth class.

2.4. Feature Extraction

After the image segmentation was complete, we extracted the gray features of the image and obtained the reflectance of the B, G, R, NIR, and RE bands by calculating the gray mean value of each band image in the four directions. On this basis, 20 VIs directly or indirectly related to the water content were extracted by combining the reflectivity of each band [21,22,23], and 25 spectral signatures (SFs) composed of spectral reflectance and VIs were finally obtained, as shown in Table 3.

Texture features can effectively reflect the representational changes of A. sinensis, so we described the texture features by extracting the grayscale cooccurrence matrix (GLCM) of the image. In this study, the GLCM of the image was extracted from four directions, 0°, 45°, 90°, and 135°, and the image contrast (CON), correlation (COR), angular second-order moment (ASM), inverse differential moment (IDM), and entropy (ENT) were calculated in the four directions. To reflect the grayscale transformation of the image in different directions, we took the mean values of CON, COR, ASM, IDM, and ENT in the four directions as the texture features of the image, and the calculation methods are shown in Equations (7)–(11). There were 5 features per band, so a total of 25 texture features (TFs) were extracted. For ease of distinction, we named the extracted texture features in the form of X_Y, e.g., B_CON, which represents the contrast of B-band images.

C O N = \sum_{i} \sum_{j} {(i - j)}^{2} P (i, j)

(7)

C O R = [\sum_{i} \sum_{j} ((i j) P (i, j)) - μ_{x} μ_{y}] / σ_{x} σ_{y}

(8)

A S M = \sum_{i} \sum_{j} P {(i, j)}^{2}

(9)

I D M = \sum_{i} \sum_{j} \frac{P (i, j)}{1 + {(i - j)}^{2}}

(10)

E N T = - \sum_{i} \sum_{j} P (i, j) \log P (i, j)

(11)

where

i

and

j

are image gray levels,

P (i, j)

is the probability of

i

and adjacent

j

gray level,

μ_{x}

and

μ_{y}

are mean values, and

σ_{x}

and

σ_{y}

are standard deviations.

After extracting the SFs and TFs, they were combined to obtain the composite features (CFs) containing 25 spectral features and 25 texture features.

2.5. Data Analysis and Modelling

2.5.1. Data Dimensionality Reduction

When the feature vector has multicollinearity, it will lead to overfitting of the model. If the data are reduced at this time, this problem can be effectively solved. In this study, we used three dimensionality reduction algorithms, linear discriminant analysis (LDA), local tangent space arrangement (LTSA), and maximum variance spread (MUV), to extract the feature vectors of the SFs, TFs, and CFs and to construct the models. In this study, we used the intrinsic_dim function in drtoolbox to estimate the intrinsic dimensions of the SFs, TFs, and CFs (the minimum number of dimensions required to solve high-dimensional optimization problems), which were 2, 2, and 3, respectively, that is, from 25, 25, and 50 dimensions to 2, 2, and 3 dimensions, respectively. Among them, LDA is similar to PCA, which applies the idea of matrix decomposition in dimensionality reduction and can map the initial sample to a lower dimension sample space. However, unlike PCA, LDA is a supervised linear dimensionality reduction algorithm, which can ensure that the mean difference between various types of data is the largest and the intra-class variance is the smallest. LTSA is an unsupervised nonlinear manifold learning algorithm. It can calculate the overall low-dimensional embedding coordinates by rearranging the projection coordinates of the local space to achieve dimensionality reduction. MUV is similar to LTSA, but MUV is a global algorithm that not only considers the local information of the sample but also considers the relationship between the sample points and the non-adjacent sample points, thereby extending the high-dimensional data manifold in the low-latitude space.

The three algorithms were selected by comparing the accuracy of the models. The constructed model was named in the form of X-Y_Z, where X was the dimensionality reduction algorithm, Y was a regression model, Z was the feature type, and X-Y-Z was a subset of Y-Z. For example, LDA-RFR_SFs was a subset of RFR_SFs, which represented a random forest regression model that took the spectral features after dimensionality reduction by the LDA algorithm as input.

2.5.2. Random Forest Regression Model

Random forest (RF), a supervised ensemble learning algorithm based on decision trees that can be used for both classification and regression, is one of the most practical algorithms in bagging ensemble strategies [40]. The random forest regression (RFR) model consists of multiple decision trees. Each tree is independent of the others and does not affect the other, thus the final result of the model is jointly determined by each decision tree [41].

2.5.3. Model Parameter Optimization

When the number of decision trees is too large, it may lead to overfitting of the model. On the contrary, when the number of decision trees is too small, it might not ensure the fitting accuracy of the model. In addition, the number of randomly sampled variables (number of node features) when building a decision tree branch will also affect the accuracy of the model. This is because the number of node features controls the degree of randomness introduction. When the number of features extracted by tree nodes is large, the strength of each decision tree increases, but the overall randomness decreases. When the number of features extracted by the tree nodes is small, the overall randomness increases, but the complexity of each tree decreases. Applying the intelligent optimization algorithm to adaptively optimize the model parameters can quickly find an optimal parameter eigenvector, so that the model fitting effect can be optimal, and the RMSE of the model is used as the fitness function in the optimization process, which can also avoid overfitting of the model. Based on the above considerations, this study used three population optimization algorithms, the monarch butterfly optimization (MBO) [42], Harris hawks optimization (HHO) [43], sparrow search algorithm (SSA) [44], to adaptively optimize the numbers of decision trees and node features in the RFR model. The naming rules referred to those in Section 2.5.1 where X represented the optimization algorithm, such as MBO-RFR_SFs.

2.5.4. Model Evaluation

Due to the small number of samples (52), this study adopted the leave-one-out cross-validation method (LOOCV) to train the model, that is, one different sample was drawn from 52 samples each time as the test set, and the remaining 51 samples were used as the training set. By analogy, training 52 times so that all samples could be used as a test set, a prediction model containing 52 sub-models was obtained. The accuracy of the model was evaluated by the correlation coefficient (R²), root mean square error (RMSE), and mean absolute percentage error (MAPE), and the calculation methods were as follows:

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(12)

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

(13)

M A P E = \frac{1}{n} \sum_{i = 1}^{n} |\frac{{\hat{y}}_{i} - y_{i}}{y_{i}}| \times 100 %

(14)

where n is the number of samples,

y_{i}

is the measured value,

\bar{y}

is the average of all, and

{\hat{y}}_{i}

is the predicted value.

3. Results

3.1. Effect of Soil Water Content on the Growth of A. sinensis

A shown in Figure 4a,b, there were significant differences in tree height and canopy growth under different water treatments (p < 0.05). The tree height and crown width of the T1 group increased by 17.0 and 11.2 mm, respectively, and the tree height and crown width of the T2 group increased by 11.9 and 5.8 mm, respectively, which were significantly higher than those of the CK group. However, the tree height and crown width of the T3 group increased by only 3.1 and 2.3 mm, respectively, which were significantly less than the T1, and T2 groups. This showed that both drought and waterlogging conditions could inhibit the growth of A. sinensis, but the inhibitory effect of the drought condition was more obvious. As shown in Figure 4c, the difference in ground diameter growth under different water treatments was not significant (p < 0.05). This showed that the effect of water treatment on ground diameter was relatively small compared with tree height and crown width. Compared with the other treatment groups, the tree height, crown width, and ground diameter growth of the T3 group were the largest, and the differences with the CK group were the most significant, indicating that a soil water content of 60%–80% was most suitable for the growth of A. sinensis.

3.2. Image Segmentation Effect

In the initial visual interpretation of the raw multispectral images, the B-band image with less noise and obvious foreground background difference was selected as the reference image, and the remaining band images were registered. The results are shown in Figure 5a. After the B-band image was segmented, a binary image was obtained, which was used as a mask to perform point multiplication with the five-band images at the same time to obtain the final segmentation result, as shown in Figure 5b. Among them, B, G, R, RE, and NIR represent the images of each band, and RGB represents the fused R, G, and B band images. Figure 5a shows that the RGB image synthesized after registration had no band shift and no band information was missing. Figure 5b shows that the synthesized RGB image was consistent with the foreground in Figure 5a, and the information was effectively retained. This study evaluated the effect of image segmentation more accurately by calculating Vpc and Vpe. The larger the Vpc value and the smaller the Vpe value, the smaller the fuzziness of the segmentation matrix, the more accurate the pixel classification, and the better the segmentation effect. Table 4 shows the values of Vpc and Vpe under the optimal and worst segmentation effects. The results showed that after using FLICM to segment all B-band images, the Vpc value of the segmentation matrix was be-tween 0.9640 and 0.9771 and the value of Vpe was between 0.0233 and 0.0344, indicating that the model degree of the segmentation matrix was lower than 3.60%, the classification accuracy of the image pixels was higher than 97.67%, and the blurring degree of the segmentation matrix was very small. Only a few pixels were misclassified and the segmentation worked well. Figure 5 and Table 4 show that the application of FMT + FLICM could achieve fast and accurate segmentation of multispectral images.

3.3. A. sinensis Water Content Prediction Model

3.3.1. Correlation Analysis

The SFs and TFs together form the CFs. As long as one of the SFs or TFs has multicollinearity, the CFs need dimensionality reduction. Therefore, we performed correlation analysis on SFs and TFs, respectively. Figure 6a,b shows the results of SF correlation analysis and TF correlation analysis, respectively. Figure 6a shows that in the 99% confidence interval, nearly 50% of the feature correlations were higher than 0.7, indicating that there was certain multicollinearity between the SFs. Figure 6b shows that within the 99% confidence interval, more than 80% of the feature correlations were above 0.7 or below −0.7, indicating that there was strong multicollinearity between the TFs. Therefore, it was necessary to reduce the dimensionality of the extracted image features.

3.3.2. Selection of the Dimensionality Reduction Algorithm

Figure 7 shows the R² and RMSE values of the RFR_SFs, RFR_TFs, and RFR_CFs models constructed with the low-latitude feature vectors extracted by the three dimensionality reduction algorithms as explanatory variables. Figure 7a,d shows the prediction accuracy of the RFR_SF model. The results showed that LTSA-RFR_SFs had the best fitting effect, the largest R² of 0.4168, and the smallest RMSE of 3.4208%. Figure 7b,e shows the prediction accuracy of the RFR_TF model, in which the R² of LTSA-RFR_TFs was the largest at 0.4065, and the RMSE of LDA-RFR_TFs was the smallest at 3.3646%. However, the R² of LDA-RFR_TFs was the smallest of the three types of models, and it could be seen that the fitting effect of LTSA-RFR_TFs was the best. Figure 7c,f also shows that the prediction accuracy of the model constructed using CFs through LTSA dimensionality reduction was the highest, the accuracies of LTSA-RFR_SFs and LTSA-RFR_TFs were improved, and the R² values increased by 15% and 17%, respectively. The RMSE values decreased by 5% and 6%, respectively. Figure 7 shows that the model accuracy based on the three dimensionality reduction algorithms was ranked as follows: LTSA-RFR > LDA-RFR > MUV-RFR. Therefore, we used the feature vectors after dimensionality reduction of the LTSA algorithm for model construction.

3.3.3. Model Optimization and Verification

Table 5 shows the model evaluation results after applying different algorithms to optimize the numbers of decision trees and node features. The model was evaluated according to the R², RMSE, and MAPE values. The goodness of fit of the RFR_SF model was ranked as follows: SSA-RFR_SFs > MBO-RFR_SFs > HHO-RFR_SFs; the goodness of fit of the model constructed by RFR_TFs was ranked as follows: SSA-RFR_TFs > HHO-RFR_TFs > MBO-RFR_TFs; the goodness of fit of the RFR_CF model was consistent with that of the RFR_SF model. The results of the model evaluation showed that the three optimization algorithms could effectively improve the model accuracy, and the optimization effect of SSA was the best. After the optimization of the SSA algorithm, the accuracy of the model constructed based on the three features was significantly improved. Among them, the R² of RFR_SFs increased by 85%, and the RMSE and MAPE decreased by 60% and 53%, respectively; the R² of RFR_TFs increased by 80%, and the RMSE and MAPE decreased by 50% and 42%, respectively; the R² of RFR_CFs increased by 73%, and the RMSE and MAPE decreased by 74% and 62%, respectively.

There were also differences in the model accuracies of RFR_SFs, RFR_CFs, and RFR_SFs. Taking the SSA-RFR with the best goodness of fit as an example, the R² of SSA-RFR_CFs was 7% and 13% higher than that of SSA-RFR_SFs and SSA-RFR_TFs, respectively, and the RMSE was reduced by 15% and 24%, respectively. The R² of SSA-RFR_SFs was 5% higher than that of SSA-RFR_TFs, and the RMSE was reduced by 8%. The comparison of the three types of models showed that the estimation effect of the comprehensive feature was better than that of the single feature, and the accuracy of the model constructed from spectral features was slightly higher than that of the model constructed from texture features, which was consistent with the results in Figure 7c,f.

3.3.4. Comparison between SSA-RFR_CFs and CNN_CFs Models

Taking the CFs as an independent variable, a Convolutional Neural Networks (CNN) model for estimating the water content of A. sinensis was established and compared with the SSA-RFR_CFs model constructed in this study. Figure 8a,b shows the estimation results of the SSA-RFR_CFs and CNN_CFs models, respectively, where the R² and RMSE of the SSA-RFR_CFs model were 0.8282 and 0.0186, respectively, and the R² and RMSE of the CNN_CFs model were 0.4733 and 0.0325, respectively. The R² of SSA-RFR_CFS was 75% higher than that of CNN_CFs, while the RMSE was 47% lower. At the same time, within the 95% prediction band, the estimated value of SSA-RFR_CFs was more evenly distributed on both sides of the 1:1 line, and the difference between the estimated and measured values was smaller. It could be seen from the above analysis that compared with CNN, the SSA-RFR model proposed in this study had a better fitting effect and higher prediction accuracy.

4. Discussion

4.1. Segmentation of Multispectral Images

Separating the research area from the background is the basis of applying image information to retrieve the plant water content, but some bands of multispectral images are difficult to separate. The multicamera multispectral imager divides the light from the target into several beams and records the information of each band separately. Although the image quality is improved compared with the beam-splitting multispectrometer, it also makes the images of different bands shift, which makes segmentation work difficult [45].

This study provided a way to segment discrete multispectral images. In terms of image registration, in addition to the frequency domain transformation method of FMT, registration methods in the spatial domain, such as image registration methods based on cross-correlation information and feature-based methods, were included [46,47]. In terms of image segmentation, in addition to clustering algorithms such as FLICM, threshold- and vegetation index-based algorithms were also included [48,49]. Therefore, in follow-up research work, image registration and segmentation algorithms can be combined according to specific situations. It should be noted that we used the B-band image as the reference image in this study because of its large foreground–background difference and low noise. However, changing the light source will change the imaging effect of each band. Thus, we must select the reference image according to the quality of the image in practical applications.

4.2. Model Construction and Optimization

4.2.1. Extraction of Feature Vectors

In previous studies, some scholars used a single VI to invert the water content of potatoes, and the RVI, which was highly correlated with water content, was considered to be the best predictor [50]. Other scholars used multiple combinations of a single VI to build models separately and chose the best prediction scheme by comparing the performance of different models [13]. The above studies all reduced the dimensionality of explanatory variables by selecting eigenvectors. Although the key factors are retained, this also results in a serious lack of original information. In contrast, this problem does not occur in feature extraction because a kernel function is applied to transform the original information, removing a considerable amount of redundant information on the basis of less information loss [51,52,53]. Therefore, this study used the LTSA algorithm to extract the eigenvectors of the SFs, TFs, and CFs, using them as explanatory variables to construct an estimation model for the water content of A. sinensis. The results showed that the three types of models all showed strong robustness. For example, the MBO-RFR_SFs, HHO-RFR_SFs, and SSA-RFR_SFs models constructed based on the eigenvalues of the SFs in Table 5 all achieved satisfactory prediction results.

4.2.2. Selection of Explanatory Variables

In previous studies, TFs have often been used for classification and biomass prediction [9,54], and there have been relatively few reports on TFs for plant water content prediction. In this study, water conditions had a significant impact on tree height and canopy width, and increases in tree height and canopy width caused the surface texture of A. sinensis to constantly change. Therefore, we used TFs as parallel variables of SFs and constructed an A. sinensis model based on the TF water content prediction model. The results in Table 5 showed that the prediction model built with TFs as explanatory variables also had strong robustness. However, when we compared the performance of the RFR_TF and RFR_SF models, we found that when the conditions were consistent, the accuracy of the TF-based prediction model was lower than that of the SF-based prediction model. Although this was consistent with the research results of Zhou et al. [18], in our study, the difference was much smaller (5%). This may have been related to the settings of the texture parameters because studies have shown that changing texture parameters can affect the prediction accuracy of texture features [54], and in some studies estimating plant physiological parameters, TFs showed higher prediction ability than SFs [38,55]. The research of Zhang et al. [56] also showed that compared with texture information, spectral information is more susceptible to the influence of instruments and environments. Therefore, TFs, which are complementary to SFs, could be used as another important indicator to predict the water content of A. sinensis.

A study on predicting winter rapeseed biomass showed that the fusion of SFs and TFs significantly improved the accuracy of the prediction model [57]. Zhou et al. [18] constructed a wheat stomatal conductance model using comprehensive features and single features, and the results showed that the prediction accuracy of the comprehensive feature model was more than 20% higher than that of the single-feature model. In this study, we constructed a single-feature model and a comprehensive feature model, and the results showed that the predictive ability of the comprehensive feature model was higher than that of the single-feature model, which was consistent with previous research results. Different from previous studies, the accuracy improvement of the comprehensive feature model was not significant compared with the single-feature model in our study. This was because RFR is an integrated algorithm composed of a large number of sub-models, which has the ability to overcome overfitting. When we used an optimization algorithm to optimize the model parameters, we could maximize the accuracy of the model on the basis of avoiding overfitting of the two types of models.

4.2.3. RFR Hyperparameter Optimization

For machine learning models, kernel-based learning methods directly determine the ability of the model to estimate specific variables [58]. When some parameters in the model need to be set manually, it is difficult to ensure the fitting effect of the model. With the development of bionic technology, heuristic algorithms have been applied to the field of model parameter optimization. This is because the heuristic optimization algorithm has strong convergence and generalization capabilities, can take the fitness function as the optimization target, and obtains the optimal solution of the parameters through continuous iteration, thereby improving the fitting effect of the model [59]. However, the update rules of each optimization algorithm are different, and the predicted effects will also be different. For example, in a study on estimating spring corn evapotranspiration, bionic algorithms such as SSA were used to optimize an extreme learning machine (ELM) model, and the results showed that the estimation accuracy of SSA-ELM was significantly higher than that of other models [60]. In this study, MBO, HHO, and SSA were used to optimize the numbers of decision trees and node features of the RFR model, and the results showed that the prediction accuracy of the model was significantly improved by the optimization of the MBO, HHO, and SSA algorithms. Among the three types of algorithms, SSA obtained the best optimization effect, but it was not much different from that of MBO and HHO because the heuristic optimization algorithm is part of a class of probability-based evolutionary algorithms and each algorithm has great similarity in terms of structure and other aspects.

4.3. Future Outlook

Water stress is one of the forms of plant adversity and has become the main factor limiting the development of agriculture and forestry. Real-time monitoring of plant water content is of great significance for precision irrigation and is an effective means to address water stress [61]. The image-based estimation method of plant water content has the characteristics of flexibility and strong operability. Extracting the correlation information in an image can indirectly reflect the change in plant water content. In this study, multispectral imaging technology and related machine learning methods were used to construct A. sinensis water content prediction models under different water gradients. The R² of the RFR_CFs model optimized by SSA reached 0.8282, which proved the feasibility of this method in A. sinensis water content estimation. Due to the preciousness of A. sinensis, we took a small number of samples, but we chose the RFR model as the basic model and trained it using LOOCV, which effectively improved the fitting effect of the model. In future work, we will continue to explore related algorithms in machine learning and deep learning, build a model with higher accuracy and stability, and further improve the accuracy of prediction. In addition to estimating plant water content, multispectral imaging techniques are also being used in other aspects of agroforestry. For example, a rubber blade nitrogen content prediction model was built by obtaining multiple band reflectance data of rubber blades, providing technical support for the rapid detection of rubber blade nitrogen content [62]. There are also studies that have used remote sensing multispectral imaging to classify normal and green bug-infested wheat fields, providing a nondestructive and inexpensive method for pest reconnaissance in the field [63]. In future research, we will continue to explore the application of multispectral imaging technology in nutritional diagnosis and pest control to provide a real-time, nondestructive, and accurate solution for the cultivation and protection of precious tree species.

5. Conclusions

In this study, the multi-spectral image of A. sinensis was segmented by FFT + FLICM, which provided a new idea for the rapid and accurate segmentation of discrete multi-spectral images. On this basis, we constructed models of A. sinensis water content based on image SFs, TFs, and CFs, and analyzed the prediction ability of different image features. In the process of model construction, we used the dimensionality reduction algorithm to extract the feature vectors of the SFs, TFs, and CFs, using them as explanatory variables to effectively avoid overfitting the model. At the same time, we applied the swarm intelligence optimization algorithm to optimize the model parameters and finally determined the best model for predicting the water content of A. sinensis. The main conclusions of this study are as follows:

(1) Water treatment has a significant effect on the A. sinensis tree height and crown width but little effect on the ground diameter. Compared with waterlogging conditions, drought conditions inhibited the growth of A. sinensis more significantly, and a soil water content of 60%–80% was most suitable.

(2) FMT registration can be used to realize the fusion of discrete A. sinensis multispectral images, FLICM segmentation effectively suppressed image noise, and the FMT + FLICM scheme realized the fast and accurate segmentation of A. sinensis multispectral images.

(3) The effect of TFs on predicting the water content of A. sinensis was basically the same as that of SFs, which can be used as another important index to predict the water content of A. sinensis. The predictive power of CFs was higher than that of SFs and TFs, but this difference decreased with the optimization of the RFR model.

(4) The model accuracy was greatly improved by optimizing the hyperparameters of the RFR model, and the optimization effect of the SSA algorithm was the best. Compared with the original model, the R² of SSA-RFR_SFs was improved by 85%, the R² of SSA-RFR_TFs was improved by 80%, and the R² of SSA-RFR_CFs was improved by 73%. Among all models, SSA-RFR_CFs had the highest accuracy, and its R² was 7% and 13% higher than that of SSA-RFR_SFs and SSA-RFR_TFs, respectively.

Author Contributions

P.W. performed the experiments, analyzed the data, and wrote the manuscript. X.W. designed the research and conducted the field measurements and collected the samples. Y.W., M.S., X.C. and Y.Y. analyzed the data. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Special Funds for Fundamental Research Business Expenses of the Central Public Welfare Research Institution’s “Precise Image Judgment Technology for Health Status of Precious Tree Species”, grant number CAFYBB2021ZB002.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to the confidentiality of the project.

Acknowledgments

We acknowledge the support from the IFRIT of CAF.

Conflicts of Interest

The authors declare no conflict of interest.

References

Li, W.; Chen, H.Q.; Wang, H.; Mei, W.L.; Dai, H.F. Natural products in agarwood and Aquilaria plants: Chemistry, biological activities and biosynthesis. Nat. Prod. Rep. 2021, 38, 528–565. [Google Scholar] [CrossRef]
Feng, J.; Yang, X.W.; Wang, R.F. Bio-assay guided isolation and identification of α-glucosidase inhibitors from the leaves of Aquilaria sinensis. Phytochemistry 2011, 72, 242–247. [Google Scholar] [CrossRef]
Wongwad, E.; Pingyod, C.; Saesong, T.; Waranuch, N.; Wisuitiprot, W.; Sritularak, B.; Ingkaninan, K. Assessment of the bioactive components, antioxidant, antiglycation and anti-inflammatory properties of Aquilaria crassna Pierre ex Lecomte leaves. Ind. Crop. Prod. 2019, 138, 111448. [Google Scholar] [CrossRef]
Ma, S.; Fu, Y.; Li, Y.; Wei, P.; Liu, Z. The formation and quality evaluation of agarwood induced by the fungi in Aquilaria sinensis. Ind. Crop. Prod. 2021, 173, 114129. [Google Scholar] [CrossRef]
Yuan, Y.; Wang, X.; Shi, M.; Wang, P. Performance comparison of RGB and multispectral vegetation indices based on machine learning for estimating Hopea hainanensis SPAD values under different shade conditions. Front. Plant Sci. 2022, 13, 28953. [Google Scholar] [CrossRef]
Yi, Q.; Bao, A.; Wang, Q.; Zhao, J. Estimation of leaf water content in cotton by means of hyperspectral indices. Comput. Electron. Agric. 2013, 90, 144–151. [Google Scholar] [CrossRef]
Dong, C.; An, T.; Yang, M.; Yang, C.; Liu, Z.; Li, Y.; Fan, S. Quantitative prediction and visual detection of the moisture content of withering leaves in black tea (Camellia sinensis) with hyperspectral image. Infrared Phys. Technol. 2022, 123, 104118. [Google Scholar] [CrossRef]
Yang, F.F.; Liu, T.; Wang, Q.Y.; Du, M.Z.; Yang, T.L.; Liu, D.Z.; Liu, S.P. Rapid determination of leaf water content for monitoring waterlogging in winter wheat based on hyperspectral parameters. J. Integr. Agric. 2021, 20, 2613–2626. [Google Scholar] [CrossRef]
Xuan, G.; Gao, C.; Shao, Y.; Wang, X.; Wang, Y.; Wang, K. Maturity determination at harvest and spatial assessment of moisture content in okra using Vis-NIR hyperspectral imaging. Postharvest Biol. Technol. 2021, 180, 111597. [Google Scholar] [CrossRef]
Tran, C.D.; Grishko, V.I. Determination of water contents in leaves by a near-infrared multispectral imaging technique. Microchem. J. 2004, 76, 91–94. [Google Scholar] [CrossRef]
Huang, Y.; Sui, R.; Thomson, S.J.; Fisher, D.K. Estimation of cotton yield with varied irrigation and nitrogen treatments using aerial multispectral imagery. Int. J. Agric. Biol. Eng. 2013, 6, 37–41. [Google Scholar]
Mwinuka, P.R.; Mbilinyi, B.P.; Mbungu, W.B.; Mourice, S.K.; Mahoo, H.F.; Schmitter, P. The feasibility of hand-held thermal and UAV-based multispectral imaging for canopy water status assessment and yield prediction of irrigated African eggplant (Solanum aethopicum L). Agric. Water Manag. 2021, 245, 106584. [Google Scholar] [CrossRef]
Torres, I.; Sánchez, M.T.; Benlloch-González, M.; Pérez-Marín, D. Irrigation decision support based on leaf relative water content determination in olive grove using near infrared spectroscopy. Biosyst. Eng. 2019, 180, 50–58. [Google Scholar] [CrossRef]
Malvandi, A.; Feng, H.; Kamruzzaman, M. Application of NIR spectroscopy and multivariate analysis for Non-destructive evaluation of apple moisture content during ultrasonic drying. Spectrochim. Acta A Mol. Biomol. Spectrosc. 2022, 269, 120733. [Google Scholar] [CrossRef] [PubMed]
Chen, Z.; Wang, X. Model for estimation of total nitrogen content in sandalwood leaves based on nonlinear mixed effects and dummy variables using multispectral images. Chemom. Intell. Lab. Syst. 2019, 195, 103874. [Google Scholar] [CrossRef]
Kyratzis, A.C.; Skarlatos, D.P.; Menexes, G.C.; Vamvakousis, V.F.; Katsiotis, A. Assessment of Vegetation Indices Derived by UAV Imagery for Durum Wheat Phenotyping under a Water Limited and Heat Stressed Mediterranean Environment. Front. Plant Sci. 2017, 8, 1114. [Google Scholar] [CrossRef]
Mwinuka, P.R.; Mourice, S.K.; Mbungu, W.B.; Mbilinyi, B.P.; Tumbo, S.D.; Schmitter, P. UAV-based multispectral vegetation indices for assessing the interactive effects of water and nitrogen in irrigated horticultural crops production under tropical sub-humid conditions: A case of African eggplant. Agric. Water Manag. 2022, 266, 107516. [Google Scholar] [CrossRef]
Zhou, Y.; Lao, C.; Yang, Y.; Zhang, Z.; Chen, H.; Chen, Y.; Yang, N. Diagnosis of winter-wheat water stress based on UAV-borne multispectral image texture and vegetation indices. Agric. Water Manag. 2021, 256, 107076. [Google Scholar] [CrossRef]
Ge, P.; Lan, C.; Wang, H. An improvement of image registration based on phase correlation. Optik 2014, 125, 6709–6712. [Google Scholar] [CrossRef]
Chelbi, S.; Mekhmoukh, A. Features based image registration using cross correlation and Radon transform. Alex. Eng. J. 2018, 57, 2313–2318. [Google Scholar] [CrossRef]
Sims, D.A.; Gamon, J.A. Relationships between leaf pigment content and spectral reflectance across a wide range of species, leaf structures and developmental stages. Remote Sens. Environ. 2002, 81, 337–354. [Google Scholar] [CrossRef]
Schumacher, P.; Mislimshoeva, B.; Brenning, A.; Zandler, H.; Brandt, M.; Samimi, C.; Koellner, T. Do Red Edge and Texture Attributes from High-Resolution Satellite Data Improve Wood Volume Estimation in a Semi-Arid Mountainous Region? Remote Sens. 2016, 8, 540. [Google Scholar] [CrossRef]
Yang, G.; Li, C.; Wang, Y.; Yuan, H.; Feng, H.; Xu, B.; Yang, X. The DOM Generation and Precise Radiometric Calibration of a UAV-Mounted Miniature Snapshot Hyperspectral Imager. Remote Sens. 2017, 9, 642. [Google Scholar] [CrossRef]
Tucker, C.J. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens. Environ. 1979, 8, 127–150. [Google Scholar] [CrossRef]
Buschmann, C.; Nagel, E. In vivo spectroscopy and internal optics of leaves as basis for remote sensing of vegetation. Int. J. Remote Sens. 1993, 14, 711–722. [Google Scholar] [CrossRef]
Woebbecke, D.M.; Meyer, G.E.; Von Bargen, K.; Mortensen, D.A. Color indices for weed identification under various soil, residue, and lighting conditions. Trans. ASAE. 1995, 38, 259–269. [Google Scholar] [CrossRef]
Roujean, J.L.; Breon, F.M. Estimating PAR absorbed by vegetation from bidirectional reflectance measurements. Remote Sens. Environ. 1995, 51, 375–384. [Google Scholar] [CrossRef]
Gitelson, A.A.; Kaufman, Y.J.; Merzlyak, M.N. Use of a green channel in remote sensing of global vegetation from EOS-MODIS. Remote Sens. Environ. 1996, 58, 289–298. [Google Scholar] [CrossRef]
Chen, J.M. Evaluation of vegetation indices and a modified simple ratio for boreal applications. Can. J. Remote Sens. 1996, 22, 229–242. [Google Scholar] [CrossRef]
McFeeters, S.K. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 1996, 17, 1425–1432. [Google Scholar] [CrossRef]
Huete, A.R.; Liu, H.Q.; Batchily, K.V.; Van Leeuwen, W.J.D.A. A comparison of vegetation indices over a global set of TM images for EOS-MODIS. Remote Sens. Environ. 1997, 59, 440–451. [Google Scholar] [CrossRef]
Kawashima, S.; Nakatani, M. An algorithm for estimating chlorophyll content in leaves using a video camera. Ann. Bot. 1998, 81, 49–54. [Google Scholar] [CrossRef]
Datt, B. Visible/near infrared reflectance and chlorophyll content in Eucalyptus leaves. Int. J. Remote Sens. 1999, 20, 2741–2759. [Google Scholar] [CrossRef]
Gitelson, A.A.; Gritz, Y.; Merzlyak, M.N. Relationships between leaf chlorophyll content and spectral reflectance and algorithms for non-destructive chlorophyll assessment in higher plant leaves. J. Plant Physiol. 2003, 160, 271–282. [Google Scholar] [CrossRef] [PubMed]
Gitelson, A.A.; Viña, A.; Ciganda, V.; Rundquist, D.C.; Arkebauer, T.J. Remote estimation of canopy chlorophyll content in crops. Geophys. Res. Lett. 2005, 32, 8403. [Google Scholar] [CrossRef]
Meyer, G.E.; Neto, J.C. Verification of color vegetation indices for automated crop imaging applications. Comput. Electron. Agric. 2008, 63, 282–293. [Google Scholar] [CrossRef]
Siegmann, B.; Jarmer, T.; Lilienthal, H.; Richter, N.; Selige, T.; Höfle, B. Comparison of narrow band vegetation indices and empirical models from hyperspectral remote sensing data for the assessment of wheat nitrogen concentration. In Proceedings of the 8th EARSeL Workshop on Imaging Spectroscopy, Nantes, France, 8–10 April 2013; pp. 8–10. [Google Scholar]
Possoch, M.; Bieker, S.; Hoffmeister, D.; Bolten, A.; Schellberg, J.; Bareth, G. Multi-temporal crop surface models combined with the RGB vegetation index from UAV-ed images for forage monitoring in grassland. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2016, 41, 991–998. [Google Scholar] [CrossRef]
Vala, M.; Baxi, A. A review on Otsu image segmentation algorithm. Comput. Sci. 2013, 2, 387–389. [Google Scholar]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Cutler, D.R.; Edwards, T.C., Jr.; Beard, K.H.; Cutler, A.; Hess, K.T.; Gibson, J.; Lawler, J.J. Random forests for classification in ecology. Ecology 2017, 88, 2783–2792. [Google Scholar] [CrossRef]
Heidari, A.A.; Mirjalili, S.; Faris, H.; Aljarah, I.; Mafarja, M.; Chen, H. Harris hawks optimization: Algorithm and applications. Futur. Gener. Comp. Syst. 2019, 97, 849–872. [Google Scholar] [CrossRef]
Wang, G.G.; Deb, S.; Cui, Z. Monarch butterfly optimization. Neural Comput. Appl. 2019, 31, 1995–2014. [Google Scholar] [CrossRef]
Xue, J.; Shen, B. A novel swarm intelligence optimization approach: Sparrow search algorithm. Syst. Sci. Control Eng. 2020, 8, 22–34. [Google Scholar]
Li, Y.; Wang, J.; Yao, K. Modified phase correlation algorithm for image registration based on pyramid. Alex. Eng. J. 2022, 61, 709–718. [Google Scholar] [CrossRef]
Li, J. An image feature point matching algorithm based on fixed scale feature transformation. Optik 2013, 124, 1620–1623. [Google Scholar] [CrossRef]
Cun, X.; Pun, C.M.; Gao, H. Applying stochastic second-order entropy images to multi-modal image registration. Signal Process. Image Commun. 2018, 65, 201–209. [Google Scholar] [CrossRef]
Hamuda, E.; Glavin, M.; Jones, E. A survey of image processing techniques for plant extraction and segmentation in the field. Comput. Electron. Agric. 2016, 125, 184–199. [Google Scholar]
Suh, H.K.; Hofstee, J.W.; van Henten, E.J. Investigation on combinations of colour indices and threshold techniques in vegetation segmentation for volunteer potato control in sugar beet. Comput. Electron. Agric. 2020, 179, 105819. [Google Scholar] [CrossRef]
Kim, Y.; Jackson, T.; Bindlish, R.; Lee, H.; Hong, S. Radar vegetation index for estimating the vegetation water content of rice and soybean. IEEE Geosci. Remote Sens. Lett. 2011, 9, 564–568. [Google Scholar]
Zabalza, J.; Ren, J.; Wang, Z.; Marshall, S.; Wang, J. Singular spectrum analysis for effective feature extraction in hyperspectral imaging. IEEE Geosci. Remote Sens. Lett. 2014, 11, 1886–1890. [Google Scholar] [CrossRef]
Patra, S.; Modi, P.; Bruzzone, L. Hyperspectral Band Selection Based on Rough Set. IEEE Trans. Geosci. Remote Sens. 2015, 53, 5495–5503. [Google Scholar] [CrossRef]
Qu, Y.; Liu, Z. Dimensionality reduction and derivative spectral feature optimization for hyperspectral target recognition. Optik 2017, 130, 1349–1357. [Google Scholar] [CrossRef]
Liang, Y.; Kou, W.; Lai, H.; Wang, J.; Wang, Q.; Xu, W.; Lu, N. Improved estimation of aboveground biomass in rubber plantations by fusing spectral and textural information from UAV-based RGB imagery. Ecol. Indic. 2022, 142, 109286. [Google Scholar] [CrossRef]
Dube, T.; Mutanga, O. Investigating the robustness of the new Landsat-8 Operational Land Imager derived texture metrics in estimating plantation forest aboveground biomass in resource constrained areas. ISPRS-J. Photogramm. Remote Sens. 2015, 108, 12–32. [Google Scholar] [CrossRef]
Zhang, C.; Huang, C.; Li, H.; Liu, Q.; Li, J.; Bridhikitti, A.; Liu, G. Effect of Textural Features in Remote Sensed Data on Rubber Plantation Extraction at Different Levels of Spatial Resolution. Forests 2020, 11, 399. [Google Scholar] [CrossRef]
Liu, Y.; Liu, S.; Li, J.; Guo, X.; Wang, S.; Lu, J. Estimating biomass of winter oilseed rape using vegetation indices and texture metrics derived from UAV multispectral images. Comput. Electron. Agric. 2019, 166, 105026. [Google Scholar] [CrossRef]
Mohammadi, B. Letter to the editor “Estimation of sodium adsorption ratio indicator using data mining methods: A case study in Urmia Lake basin, Iran” by Mohammad Taghi Sattari, Arya Farkhondeh, and John Patrick Abraham. Environ. Sci. Pollut. Res. 2019, 26, 10439–10440. [Google Scholar] [CrossRef]
Yin, Z.; Wen, X.; Feng, Q.; He, Z.; Zou, S.; Yang, L. Integrating genetic algorithm and support vector machine for modeling daily reference evapotranspiration in a semi-arid mountain area. Hydrol. Res. 2017, 48, 1177–1191. [Google Scholar] [CrossRef]
Jia, Y.; Su, Y.; Zhang, R.; Zhang, Z.; Lu, Y.; Shi, D.; Huang, D. Optimization of an extreme learning machine model with the sparrow search algorithm to estimate spring maize evapotranspiration with film mulching in the semiarid regions of China. Comput. Electron. Agric. 2022, 201, 107298. [Google Scholar] [CrossRef]
Cai, F.; Zhang, Y.; Mi, N.; Ming, H.; Zhang, S.; Zhang, H.; Zhao, X. Maize (Zea mays L.) physiological responses to drought and rewatering, and the associations with water stress degree. Agric. Water Manag. 2020, 241, 106379. [Google Scholar] [CrossRef]
Tang, R.; Luo, X.; Li, C.; Zhong, S. A study on nitrogen concentration detection model of rubber leaf based on spatial-spectral information with NIR hyperspectral data. Infrared Phys. Technol. 2022, 122, 104094. [Google Scholar] [CrossRef]
Backoulou, G.F.; Elliott, N.C.; Giles, K.L.; Mirik, M. Processed multispectral imagery differentiates wheat crop stress caused by greenbug from other causes. Comput. Electron. Agric. 2015, 115, 34–39. [Google Scholar] [CrossRef]

Figure 1. Technique flow chart.

Figure 2. Location of the study area.

Figure 3. Schematic diagram of the image acquisition device: (a) darkroom; (b) A. sinensis; (c) tripod; (d) Mica Sense Edge 3™.

Figure 4. The growth differences of height (a), ground diameter (b), and crown width (c) of A. sinensis under different water gradients. The letter is used to mark whether the difference between the groups is significant.

Figure 5. Image registration (a) and segmentation (b) results.

Figure 6. (a) Correlation analysis of SFs; (b) correlation analysis of CFs. * stands for 95% confidence level, ** stands for 99% confidence level, and *** stands for 99.9% confidence level.

Figure 7. RFR-SFs, RFR-TFs, and RFR-CFs model evaluation indicators based on three dimensional reduction algorithms: (a,d) R² and RMSE of RFR-SFs model; (b,e) RFR-TFs R² and RMSE of the model; (c,f) R² and RMSE of the RFR-CFs model.

Figure 8. SSA-RFR_CFs model (a) and CNN_CFs model (b) estimation results.

Table 1. Soil water content under different water gradients.

Category	Group	Soil Water Content
Control group	CK	30%–40%
Treatment group 1 Treatment group 2 Treatment group 3	T1	40%–60%
	T2	60%–80%
	T3	80%–90%

Table 2. Band information of Mica Sense Edge 3™.

Band Name	Abbreviation	Center Wavelength (nm)	Bandwidth FWHM (nm)
Blue	B	475	20
Green	G	560	20
Red	R	668	10
Near IR	NIR	840	40
Red Edge	RE	717	10

Table 3. Vegetation indices related to water content.

Vegetation Index	Abbreviation	Formula *	Reference
Difference Vegetation Index	DVI	NIR − R	[24]
Normalized Difference Vegetation Index	NDVI	(NIR − R)/(NIR + R)	[25]
Excess Green Index	EXG	2 $\times$ G − R − B	[26]
Normalized Green-Bule Difference Index	NGBDI	(G − B)/(G + B)	[26]
Normalized Green-Red Difference Index	NGRDI	(G − R)/(G + R)	[26]
Water Index	WI	(G − B)/(R − G)	[26]
Renormalized Difference Vegetation Index	RDVI	NIR − R/(NIR + R)1/2	[27]
Green Normalized Difference Vegetation Index	GNDVI	(NIR − G)/(NIR + G)	[28]
Modified Simple Ratio	MSR	(NIR/R − 1)/(NIR/R + 1)1/2	[29]
Normalized Difference Water Index	NDWI	G − NIR/G + NIR	[30]
Enhanced Vegetation Index	EVI	2.5 $\times$ (NIR − R)/(NIR + 6 $\times$ R − 7.5 $\times$ B + 1)	[31]
Kawasaki Index	IKAW	(R + B)/(R − B)	[32]
Simple Ratio Vegetation Index	SR	R/G × NIR	[33]
Chlorophyll Index	CI	(R + G)/2	[34]
Red-Edge Chlorophyll Vegetation Index	RECI	(NIR/RED) − 1	[35]
Excess Red Index	EXR	1.4 $\times$ R − G	[36]
Excess Green Minus Excess Red	EXGR	3 $\times$ G − 2.4 $\times$ R − B	[36]
Normalized Difference Red Edge Vegetation Index	NDRE	(NIR – RE)/(NIR + RE)	[37]
Red Green Blue Vegetation Indices	RGBVI	(G² – B $\times$ R²)/(G² + B * R²)	[38]
Green Index	GLI	(2 $\times$ G − R − B)/(R + G + B)	[39]

* Among them, B, G, R, NIR, and RE are the pixel gray mean of the foreground image.

Table 4. Image segmentation effect evaluation.

Evaluation Index	Best Effect	Worst Effect	Average
Vpc	0.9771	0.9640	0.9722
Vpe	0.0233	0.0344	0.0262

Table 5. Comparison of prediction model results for water content of A. sinensis.

Model	R²	RMSE/%	MAPE/%	Rank
MBO-RFR_SFs	0.7581	2.2030	2.5662	⑤
HHO-RFR_SFs	0.7528	2.2274	2.5388	⑥
SSA-RFR_SFs	0.7731	2.1336	2.5388	④
MBO-RFR_TFs	0.7265	2.3425	2.8680	⑨
HHO-RFR_TFs	0.7286	2.3338	2.8709	⑧
SSA-RFR_TFs	0.7348	2.3069	2.8412	⑦
MBO-RFR_CFs	0.8204	1.8986	2.3085	②
HHO-RFR_CFs	0.8182	1.9101	2.3147	③
SSA-RFR_CFs	0.8282	1.8566	2.2864	①

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, P.; Wu, Y.; Wang, X.; Shi, M.; Chen, X.; Yuan, Y. Multispectral Image Determination of Water Content in Aquilaria sinensis Based on Machine Learning. Forests 2023, 14, 1144. https://doi.org/10.3390/f14061144

AMA Style

Wang P, Wu Y, Wang X, Shi M, Chen X, Yuan Y. Multispectral Image Determination of Water Content in Aquilaria sinensis Based on Machine Learning. Forests. 2023; 14(6):1144. https://doi.org/10.3390/f14061144

Chicago/Turabian Style

Wang, Peng, Yi Wu, Xuefeng Wang, Mengmeng Shi, Xingjing Chen, and Ying Yuan. 2023. "Multispectral Image Determination of Water Content in Aquilaria sinensis Based on Machine Learning" Forests 14, no. 6: 1144. https://doi.org/10.3390/f14061144

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Multispectral Image Determination of Water Content in Aquilaria sinensis Based on Machine Learning

Abstract

1. Introduction

2. Materials and Methods

2.1. Overview of the Study Site

2.2. Experimental Design and Data Acquisition

2.2.1. Experimental Design

2.2.2. Data Acquisition

2.3. Image Processing

2.3.1. Image Registration

2.3.2. Image Segmentation

2.4. Feature Extraction

2.5. Data Analysis and Modelling

2.5.1. Data Dimensionality Reduction

2.5.2. Random Forest Regression Model

2.5.3. Model Parameter Optimization

2.5.4. Model Evaluation

3. Results

3.1. Effect of Soil Water Content on the Growth of A. sinensis

3.2. Image Segmentation Effect

3.3. A. sinensis Water Content Prediction Model

3.3.1. Correlation Analysis

3.3.2. Selection of the Dimensionality Reduction Algorithm

3.3.3. Model Optimization and Verification

3.3.4. Comparison between SSA-RFR_CFs and CNN_CFs Models

4. Discussion

4.1. Segmentation of Multispectral Images

4.2. Model Construction and Optimization

4.2.1. Extraction of Feature Vectors

4.2.2. Selection of Explanatory Variables

4.2.3. RFR Hyperparameter Optimization

4.3. Future Outlook

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI