Next Article in Journal
Industry Perspectives on Water Pollution Management in a Fast Developing Megacity: Evidence from Dhaka, Bangladesh
Previous Article in Journal
The Effect of the Dwelling Environment on Rural Elderly Cognition: Empirical Evidence from China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Influence of Different Meteorological Factors on the Accuracy of Back Propagation Neural Network Simulation of Soil Moisture in China

1
School of Remote Sensing and Information Engineering, North China Institute of Aerospace Engineering, Langfang 065000, China
2
Aerospace Remote Sensing Information Processing and Application Collaborative Innovation Center of Hebei Province, Langfang 065000, China
3
National Joint Engineering Research Center of Space Remote Sensing Information Application Technology, Langfang 065000, China
4
School of Liberal Arts and Sciences, North China Institute of Aerospace Engineering, Langfang 065000, China
5
Space Information Innovation Institute, Chinese Academy of Sciences, Beijing 100190, China
*
Author to whom correspondence should be addressed.
Sustainability 2022, 14(24), 16381; https://doi.org/10.3390/su142416381
Submission received: 10 October 2022 / Revised: 2 December 2022 / Accepted: 4 December 2022 / Published: 7 December 2022
(This article belongs to the Section Air, Climate Change and Sustainability)

Abstract

:
Soil moisture is one of the most critical elements of the Earth system and is essential for the study of the terrestrial water cycle, ecological processes, climate change, and disaster warnings. In this study, the training sample was selected to divide the dataset according to months from 2000 to 2018 after the advantages of three training samples were compared using a backpropagation (BP) neural network model. Furthermore, the monthly surface soil moisture in China in 2019 and 2020 was simulated based on various meteorological elements. The results demonstrate that evapotranspiration has the greatest influence on soil moisture among the various meteorological factors, followed by precipitation on a national scale throughout the year. Additionally, the accuracy of the training and simulation results with BP neural networks in the national winter months is slightly worse. In the future, the training samples of the BP neural network can be optimized following the differences in the dominant influence of various meteorological factors on soil moisture in different areas at different times to improve the simulation prediction accuracy.

1. Introduction

Soil moisture (SM) is a vital component of the earth system [1,2]. It plays a vital role in various exchange processes on the earth’s surface [3] and influences the exchange of heat, water, momentum, and chemical substances between the earth’s surface and the atmosphere. Through the upward movement of evapotranspiration (EV) and the transpiration of soil and vegetation, a large amount of SM becomes atmospheric moisture [4]. Thus, SM is the main source of atmospheric moisture [5] and the basis of plant growth [6,7,8]. It affects plant growth by affecting ventilation, temperature, nutrient transport, uptake, and transformation [9]. In terrestrial ecosystems, SM controls plant growth as the link between the biosphere and the soil zone [10]. SM is closely associated with different meteorological factors. It is a storage component of precipitation (PR) and radiation anomalies [5], influencing soil surface albedo and soil heat capacity [11], as well as climate change through latent and sensible heat transfer. Therefore, the study of SM is essential for water resources management, ecological security evaluation, the judgment of climate change trends, and meteorological disaster prediction in the context of global climate change [12,13,14]. Understanding the multiple impacts of SM and climate and its relevance to climate change projections is critical. Reducing uncertainty in future climate scenarios will be substantially aided by a better comprehension and quantification of SM-climate-related processes [5]. Using estimates of SM based on water balance models and temperature observations, Whan, K., J. et al. investigated the effect of SM on summer monthly maximum temperatures from 1984 to 2013 in Europe and revealed a negative relationship between SM and summer monthly temperatures [15]. Over the past 40 years, the total explanatory power of PR, temperature, and vegetation in the permafrost region of the Qinghai-Tibet Plateau was 51%, and the contribution rate of PR was the highest, accounting for 47.2%. Concurrently, the contribution rate of PR was over 41%, and the combined explanatory power of the three components in the seasonal permafrost area was 69.1% [14]. Surface SM in northeastern and central China and eastern Inner Mongolia exhibited a consistent trend of dryness from 1979 to 2010, while soils on the Qinghai-Tibet Plateau presented a trend of wetness. PR is the main factor responsible for these trends, controlling the direction of SM change while increasing temperature also exacerbates soil drying during periods of reduced PR [16]. Zeng et al. discovered that PR variability dominated soil wetting in typical semi-arid and typical humid areas of China after 2005, and surface-atmosphere temperature contributed more to drought in drylands [2].
The decreasing trend of SM over East Asia in the 21st century is mainly induced by decreasing PR while increasing temperature has almost tripled soil drying [17]. Cheng et al.’s analysis of the global SM trend from 1948 to 2010 demonstrated primarily negative trends in SM. Although the temperature was the primary factor in the long-term trend of SM overall, PR had a dominant influence on the variability of SM at interannual to decadal time scales for the world average [18]. Deng et al. unveiled that the global average SM in 1979–2017 decreased markedly, 65.1% of the global soil drying trend was attributed to temperature rising, and the combined effects of PR, temperature, and NDVI were responsible for 82% of the wetting trend [19].
Generally, methods for estimating SM commonly include field measurements, remote sensing measurements, and model simulation [2]. While measuring SM in the field is reliable, retrieving samples and performing laboratory measurements both demand a sizable workforce [5]. Therefore, it is difficult to obtain large-scale measured SM in a relatively short time. Remote sensing measurements can provide a large range of near-surface estimation [20]. However, both the high temporal resolution and high spatial resolution of remote sensing measurement data can hardly be satisfied simultaneously [21]. Model simulations can obtain spatially continuous and temporally complete SM estimates under certain accuracy conditions while acquiring exact associations with SM-related factors. BP neural networks are widely adopted for data fitting and prediction with their good learning ability and self-adaptive capability. BP neural networks can respond to system changes by modifying the link values of the network. It is a multi-input and multi-output structural model with the ability to adapt to multivariate control systems.
The effects of various meteorological factors on SM have been investigated. Nevertheless, there is little research on seasonal differences in meteorological factors affecting SM at a large regional scale. In this study, a BP neural network model is employed to simulate and predict SM for each month of 2019 and 2020 based on PR, EV, air temperature (AT), surface net solar radiation (SR), surface temperature (ST), and air pressure (AP) in China. First, an optimal training sample is selected for the BP neural network model to obtain the best simulation results. Then, the influence of different meteorological factors on the SM is revealed using the BP neural network. Finally, the seasonal differences in the simulation accuracy of SM across China are investigated. This study contributes to the influence of meteorological factors on SM in China and lays a scientific foundation for decision-making on disaster warning and ecological security evaluation in the context of global change.

2. Materials and Methods

2.1. Data Sources

The monthly average data of SM (0–7 cm) (m3/m3) and PR, EV, AT, SR, ST, and atmospheric pressure in China from 2000 to 2020 were obtained from ERA5-Land (https://doi.org/10.24381/cds.68d2bb30 (accessed on 15 July 2021)) of the European Center for Medium-Range Weather Forecasts (https://cds.climate.copernicus.eu/ (accessed on 15 July 2021)). ERA5-Land is a reanalysis dataset that provides a globally uniform format for multiple meteorological factors and multi-layer SM with a spatial resolution of 0.1°.

2.2. Data Processing

The initial data of SM, PR, EV, AT, SR, ST, and AP from 2000 to 2020 provided by ERA5-Land reanalysis data are all in NetCDF format, which needs to be processed and finally converted into a CSV format dataset for BP neural network training and simulation prediction. The processing flow is illustrated in Figure 1. Firstly, a MATLAB program is applied to convert the global NetCDF format files into global GeoTiff format files. Then, the mask is adopted in the ArcGIS tool to extract the GeoTiff format file of the China region. Afterward, GeoTiff format files are converted into the ACSII format file. Finally, another MATLAB program is utilized to read the data in the ACSII file and write it into a CSV file. Since the China region contains 96,028 rasters, there are 96,028 monthly data in the CSV file.

2.3. Methods

The BP neural network is a multi-layer forward neural network based on an error backpropagation algorithm (BP algorithm). It is by far the most famous multi-layer network learning algorithm and the most widely used algorithm model in an artificial neural network. A typical BP network is a feed-forward network with three layers: The input layer, hidden layer, and output layer. Its structure is exhibited in Figure 2. Some relevant variables are detailed as follows:
(1)
Suppose there are d input neurons, l output neurons, and q hidden layer neurons.
(2)
Set the threshold of the jth neuron in the output layer as θj.
(3)
Set the threshold of the hth neuron in the hidden layer as γh.
(4)
The connection weight between the ith neuron in the input layer and the hth neuron in the hidden layer is vih.
(5)
The connection weight between the hth neuron in the hidden layer and the jth neuron in the output layer is whj.
(6)
The hth neuron in the hidden layer receives input αh from the input layer.
α h = i = 1 d v i h x i
Note that the input received by the jth neuron in the output layer from the hidden layer is βj:
β j = h = 1 q w h j b i
where bh denotes the output of the hth neuron in the hidden layer.
The weight from the input layer to the hidden layer is set as vih, and the threshold of the hth neuron in the hidden layer is set as γh. The weight from the hidden layer to the output layer is set as whj, and the threshold of the jth neuron in the output layer is set as θj. In Figure 2, there are d input neurons and q hidden neurons. The hidden layer has q hidden neuron thresholds and l output neurons. Thus, there are l output neuron thresholds.
b h = f ( α h γ h )
The activation functions of the hidden layer and output layer are all Sigmod functions.
In a training example (xk, yk), the training output of the neural network is assumed to be
g k = ( g 1 k , g 2 k , , g l k )
The output is an l-dimensional vector, where
g k = f ( β i θ i )
In this study, the number of nodes in the hidden layer is set as 9. The monthly average data of six meteorological elements in China, such as PR, EV, AT, SR, ST, and AP, were used as the input, and the monthly average SM was used as the output. Epochs were uniformly set as 1000 iterations, the expected error goal was 0.001, and the learning rate lr was 0.01. The BP neural network sample training set, the validation set, and the test set are divided in a ratio of 7:2:1.

2.4. Data Sample

Deep learning models require a considerable number of data for training, provided that data integration and matching are required and datasets with a long-time span are used to ensure complete data features. In this paper, the SM of each month in 2019 and 2020 in China was simulated and predicted based on the relationship between the monthly average value of six meteorological elements, such as PR, EV, AT, SR, ST, AP, and SM at the corresponding time in China from 2000 to 2018.
In this paper, three sets of strategies are proposed to evaluate and select the most appropriate training samples:
  • Group 1 used all the data of 12 months from 2000 to 2018 and simulated and predicted the SM of each month in 2019 and 2020 after overall training.
  • Group 2 divided the data of 19 years and 12 months from 2000 to 2018 into four parts according to spring, summer, autumn, and winter. The data in March, April, and May of 19 years were used as spring data. After training, the SM in March, April, and May of 2019 and 2020 was simulated and predicted. The data of June, July, and August of 19 years were adopted as summer data. After training, the SM in June, July, and August 2019 and 2020 was simulated and predicted. The data of September, October, and November of 19 years were utilized as the autumn data. After training, the SM in September, October, and November 2019 and 2020 was simulated and predicted. The data of December, January, and February of 19 years were employed as winter data. After training, the SM in December, January, and February 2019 and 2020 was simulated and predicted.
  • Group 3 divided the period from 2000 to 2018 into 12 datasets following 12 months. The January data of these 19 years were trained to simulate the prediction of SM in January 2019 and 2020, and 12 datasets were trained to simulate the prediction of SM in the 12 months of 2019 and 2020, respectively.

2.5. Performance Evaluation Measures

In this paper, the accuracy of the training set, validation set, and test set of training samples was evaluated by the Pearson correlation coefficient R, expressed as:
R = i = 1 n ( x i x ¯ i ) ( y i y ¯ i ) i = 1 n ( x i x ¯ i ) 2 ( y i y ¯ i ) 2
where n denotes the total number of samples, xi indicates the simulated value, yi represents the verified value, x ¯ i signifies the average value of the simulated sample, and y ¯ i refers to the average value of the verified sample.
The accuracy of the simulated samples was evaluated by the Mean Squared Error (MSE) and R Squared (R2).
MSE is:
M S E = 1 n i = 1 n ( y i y ^ i ) 2
R2 is:
R 2 = 1 i = 1 n ( y i y ^ i ) 2 i = 1 n ( y i y ¯ i ) 2
where n represents the total number of samples, ŷi refers to the simulated value, yi indicates the verified value, and y ¯ i stands for the average value.
Partial correlation coefficients between SM and PR, EV, SR, AT, and ST are calculated as:
T = r 11 r 12 r 1 m r 12 r 22 r 2 m r m 1 r m 2 r m m
In this paper, the sixth-order matrix T of the linear correlation coefficient between the two variables was obtained first to calculate the partial correlation coefficients between SM and PR, EV, SR, AT, and ST. With SM as an example, the linear correlation coefficients between SM and itself, PR, EV, net surface radiation, AT, and ST are r11, r12, r13, r1i4, r15, and r16, respectively. The inverse T−1 of matrix T is expressed as:
T 1 = ( c i j ) = c 11 c 12 c 1 m c 12 c 22 c 2 m c m 1 c m 2 c m m
According to the inverse matrix T−1, the partial correlation coefficient can be calculated by:
r i j = c i j c i i c j j
where rij· indicates the partial correlation coefficient between two specific variables xi and xj when two or more variables are fixed.
The test statistic of partial correlation analysis is the t-statistic, which is mathematically defined as:
t = r n q 2 1 r 2
where r denotes the partial correlation coefficient, n refers to the number of samples, and q is the order. If the probability p-value of the test statistic is less than a given significance level α, the original hypothesis should be rejected.

3. Results

3.1. Selection of Optimal Training Samples

The characteristics and quantity of the BP neural network training data are the most imperative factors to determine the performance of the model. The larger the sample size, the more accurate the results. The selection of training samples of the BP neural network has a great influence on the generalization ability of the network. Particularly, selecting appropriate training samples from an enormous dataset is the key to the good or bad training effect of the model [22]. The best training samples of the model are those comprising frequently occurring features in the test data while avoiding over-selection of very frequently occurring features in the training data [23]. The correlation coefficients of the training set of BP neural network samples of the three modes are close to their corresponding validation set and test set (Table 1). The time span of the three groups of training sample data is the same, and they all use the data from 2000 to 2020. The number of training data is one of the most crucial factors to determine whether the developed model is effective, which is essential for the performance of deep learning models [24]. Group1 has the largest amount of data. Nonetheless, the accuracy after BP neural network training is not the highest. Compared to Group1, Group2 and Group3 have a much smaller number of data for training samples. The training sets for spring, summer, and autumn in Group2 and Group3 have higher accuracy after training with the BP neural network. Meanwhile, only the training sets for winter months have slightly lower accuracy. The lowest value of correlation coefficient R between Group2 and Group3 training sets and their corresponding verification and test sets is also above 0.85. The summer training set in Group2 and the training set from May to September in Group3 have a correlation coefficient R of more than 0.95 with their corresponding validation and test sets. Apart from sufficient data, the success of the deep learning process and the performance of the model are controlled by the diversity of high-quality data [25]. The data performance of Group2 is better compared to Group1, though the data volume of training samples for spring, summer, and autumn in Group2 and training samples from April to October in Group3 is much smaller than that of Group1. Similarly, the training samples in January, February, and December in Group3 are better than the winter training samples in Group2. Among the training samples of the three modes, Group3 has the best performance with a large number of data in the single monthly training sample.
Next, the SM in China in 2019 and 2020 for 12 months each year is predicted based on the three groups of training sample data. Generally, the annual variation trend of MSE and R2 of the predicted and verified SM are similar. The MSE of predicted SM and verified SM obtained from the three groups of training sample data is high in January and February and then decreases gradually. It is low in June, July, and August, and then continues to increase, reaching the highest value in December (Figure 3a). In contrast to MSE, R2 is low in January and February and gradually increases in spring, reaching a peak level in June, July, and August. Afterward, it gradually decreases, exhibiting a low level in December (Figure 3b).
The predicted SM in 2019 and 2020 was consistent with the annual variation trend of MSE and R2 of verified SM (Figure 3a,b). The smaller the MSE, the larger the R2. The training sample data of Group 1 were used for prediction, and the MSE of predicted SM and verified SM was the largest and R2 was the smallest. The training sample data of Group 3 were used for prediction. The MSE of predicted SM and verified SM was the smallest, and R2 was the largest. Using the training sample data of Group 2 for prediction, MSE and R2 for predicting and verifying SM were in the middle. Therefore, the Group 3 training sample data constitute the best method for SM prediction.

3.2. Simulation of SM in China Based on Optimal Training Samples

In Group 3, the training samples in February had the lowest accuracy and the training samples in June had the highest accuracy. The training results for a single month of the 12 months in Group 3 were used to predict SM across the country for 12 months in 2019 and 2020. In these two years, the accuracy of the prediction results was the lowest in January and the highest in July. The R2 of the prediction results in January and July 2019 was 0.702 and 0.919 (Figure 3b and Figure 4), and the MSE was 0.00638 and 0.00184, respectively (Figure 3a). The R2 of the prediction results in January and July 2020 was 0.725 and 0.927 (Figure 3b and Figure 5), and the MSE was 0.00495 and 0.00173 (Figure 3a), respectively.
Subsequently, the SM in each month of 2019 and 2020 in China was simulated with the Group 3 training sample data. The simulation results suggested that the slope of the unary linear regression equation between the simulated SM value and the verified SM value in most months was greater than 1 (Figure 4 and Figure 5). Hence, most simulated SM values were greater than the verified SM value. However, the simulated values of SM at sites with high actual SM (>0.5 m3/m3) are almost less than the verified SM values throughout the year. SM has significant spatial variability [26], consistent with the national monthly SM in 2019 and 2020 (Figure 6 and Figure 7).
Furthermore, the national spatial distribution pattern of verified and simulated SM values in 2019 was compared month by month (Figure 6 and Figure 8). The simulated and verified SM values in 12 months of 2020 demonstrated a similar spatial distribution pattern (Figure 7 and Figure 9). In the middle summer of 2019 and 2020, R2 was the largest, and the spatial distribution pattern similarity between simulated and verified SM values was the highest. In winter, R2 was small, and the spatial distribution pattern similarity between simulated and verified SM values was slightly lower (Figure 6, Figure 7, Figure 8 and Figure 9). From the national perspective, a small part of the northern and northeastern parts of the Qinghai-Tibet Plateau has perennial SM higher than 0.5 m3/m3, ranking among the top in China. Since all of these regions have simulated SM values less than 0.5 m3/m3, the spatial distribution of simulated SM cannot reflect this local characteristic. Compared with the actual spatial distribution pattern of SM, the fragmentation degree of the simulated spatial distribution pattern of SM was considerably reduced, while the national spatial distribution pattern of the simulated and verified SM values was similar.

3.3. Effect of Different Meteorological Factors on the Accuracy of BP Neural Network Training Samples

The simulated SM in China has seasonal variations, as revealed by the relationship between SM and six meteorological factors, such as PR, EV, AT, net surface radiation, ST, and AP. Then, the accuracy of simulated SM training samples of a single meteorological factor was compared over 12 months from 2000 to 2018 to understand the differences in the effects of different meteorological factors on SM in different months. According to Table 1, the correlation coefficients of the training set and its corresponding validation set and test set are close, and only the correlation coefficients of the test set are compared (Table 2). After training, the results of the 12-month data in this study implied that the correlation coefficient of SM simulated by EV was the largest, and the annual correlation coefficient ranged from 0.6490 to 0.9301. Furthermore, the correlation coefficient of PR-simulated SM was the largest, and the annual correlation coefficient ranged from 0.6490 to 0.9032. The correlation coefficient of SM simulated by SR in spring was also large. For example, the correlation coefficient of April was 0.5849, which was second only to the correlation coefficient of SM simulated by EV (0.8429) and SM simulated by PR (0.8162). The correlation coefficients of SM simulated by ST in summer and temperature in winter were also high.
Six meteorological factors including PR, EV, AT, SR, ST, and AP were combined pairwise. The two meteorological factors from 2000 to 2018 were employed to simulate the comparison of the accuracy of SM training samples in 2019 and 2020. Moreover, the correlation coefficient of the test set was also compared (Table 3). From May to October, the correlation coefficient of SM simulated by PR-EV was the largest. In March, April, and November, the correlation coefficient of SM simulated by PR-AT or PR-ST was the largest. In January and December, the correlation coefficient of SM simulated by AT-ST was the largest. From the perspective of the whole country, only second to PR and EV, SR has a relatively large impact on SM change in summer, and the temperature has a relatively large impact on SM change in winter (Table 2 and Table 3). Furthermore, the effect of AP on SM change is modest and can be ignored compared with PR, EV, AT, SR, and ST.

4. Discussion

Multi-factor uncertainty co-influences the accuracy of SM estimates [27,28,29]. SM has a complex dependence on various meteorological factors [26,30]. There is a close relationship between SM and PR [16]. The correlation coefficients between simulated and validated SM test sets of a single meteorological factor in 12 months of the year revealed that the correlation coefficient of simulated SM with evaporation was the largest, followed by the correlation coefficient of simulated SM with PR. Overall, nationwide PR and EV are the main controlling factors of SM in different regions, consistent with the research results of Meng et al. [31]. According to the correlation coefficient between the simulated and validated SM test set of combined meteorological factors, the maximum correlation coefficient of simulated SM of combined meteorological factors varied in different seasons. In other words, the main controlling factors of SM varied with the shift of seasons. The effects of summer PR on SM and winter temperature on SM were significant.
The AP factor, which has almost no influence on SM, was neglected to further elaborate on the influence of different meteorological factors on SM and to further analyze the reasons for the low accuracy of SM training and simulation in winter. The average annual changes in five meteorological factors (PR, EV, AT, SR, and ST) from 2000 to 2018 were compared with those in 2019 and 2020. SM was low in January and flat or slightly increased in February, March, and April. It fell to the lowest level in the whole year in April, increased in the following months, reached the highest level in July and August, and decreased steadily after September (Figure 10a). The annual changes of EV, net surface radiation, AT, and ST were similar. All of them were lowest in January and December in winter, increased from January to June, reached the peak of the year in June, July, and August, and declined steadily after August to winter. EV, AT, and ST lagged behind the rise of net surface radiation in spring and the fall of net surface radiation in autumn (Figure 10c–f). Compared with meteorological factors such as EV, AT, and land ST, PR was low in the winter months and high in the summer while increasing sharply after May and decreasing sharply after September. This change was significant from the average level of 2000–2018 (Figure 10b).
The BP neural network was applied to train the relationship between six meteorological elements and SM in the corresponding month over 19 years from 2000 to 2018. The training set accuracy was the lowest in January and the highest in June, and in July it was close to June (Table 1). PR, EV, AT, and ST were fundamentally the lowest in January and the highest in July (Figure 10). There was a degree of correlation between the factors affecting SM, and they were scale-dependent [28]. For example, the rising AT increased surface EV and decreased SM [18]. A partial correlation analysis of SM and five influencing factors such as PR, EV, SR, AT, and land ST in January and July from 2000 to 2020 was conducted at the pixel scale to clarify the significant differences in the influence of PR, EV, AT, SR, and ST on SM across the country in winter (represented by January) and summer (represented by January) (Figure 11). Then, the respective partial correlation coefficient (Rp) was obtained.
In January, SM was mainly positively correlated with PR, with Rp values ranging from −0.87 to 0.96. The ratio of areas with positive correlations to areas with negative correlations was 9:2. The Sichuan Basin and the middle and lower reaches of the Yangtze River were significantly positively correlated regions (Figure 11a). There was a positive correlation between SM and PR in July, with Rp values ranging from −0.89 to 0.99. The area of positive correlation area accounted for one-third of the total area of China, mainly located in the North and Qinghai-Tibet Plateau (Figure 11b). In January, SM and EV were mainly negatively correlated, with Rp values ranging from −0.99 to 0.86. The ratio of areas with positive correlations to areas with negative correlations was 1:70. North China, Northwest China, southeast China, and Southwest China were significantly negatively correlated regions (Figure 11c). In July, SM and EV were mainly negatively correlated, with Rp values ranging from −0.99 to 0.87. The ratio of areas with positive correlations to areas with negative correlations was close to 1:43. The positive and negative correlation areas were significantly compared. The areas with significant negative correlation were mainly located in the Northeast, North China, and southeast and scattered in the Northwest (Figure 11d). There was a negative correlation between SM and SR. The Rp value of partial correlation between SM and SR in January was −0.92–0.88, and the area ratio of positive correlation to negative correlation was 1:9. The regions with a significant negative correlation were primarily in the Pearl River Basin, Southwest China, and the Sichuan Basin (Figure 11e). The Rp value of SM and SR in July ranged from −0.97 to 0.94, and the area with a negative correlation was one-fifth of the total national area. The area with a significant negative correlation was primarily in the eastern region (Figure 11f). The partial correlation between SM and AT in January was mainly positive, with Rp values ranging from −0.9 to 0.92. The area ratio of positive correlation to negative correlation was 3:1. The significantly positive correlation dominated in the central South and Qinghai-Tibet Plateau (Figure 11g). In July, SM and AT were mainly negatively correlated, with Rp values ranging from −0.96 to 0.92. The ratio of areas with positive correlations to areas with negative correlations was 1:3, and significant negative correlations were mainly observed in North China, Southwest China, and part of the Qinghai-Tibet Plateau (Figure 11h). The partial correlation between SM and land ST in January was mainly negative. The Rp value of partial correlation was −0.97–0.93. The area ratio of positive correlation to negative correlation was 1:4, and a significant negative correlation appeared in the southern part and the Qinghai-Tibet Plateau (Figure 11i). The Rp value was −0.94−0.94. The ratio of areas with positive correlations to areas with negative correlations was close to 1:1, and the areas with significant negative correlations were in the Northeast and Northwest (Figure 11j). On the whole, PR was mainly positively correlated with SM, while EV, SR, and temperature were negatively correlated with SM, consistent with the research results of Cho et al. [32].
The strongest partial correlation coefficients in January suggest that EV plays a dominant role in SM changes nationwide at the pixel scale, EV plays a dominant role in north, northwest, southeast, and southwest China, SR plays a dominant role in south-central China, and PR plays a dominant role in the Sichuan basin and parts of the middle reaches of the Yangtze River. The area ratio of PR, EV, SR, AT, and ST for the dominant role of SM is approximately 10:56:17:7:16 nationwide. The area of the region with a significant bias correlation between SM and the five meteorological factors accounts for approximately 55% of the total area of the country, and the area of the region with a significant bias correlation with PR and EV accounts for approximately 34% of the total area of the country (Figure 12a). PR in July plays a dominant role in SM changes nationwide at the pixel scale, a mixture of PR and EV plays a dominant role in the northern, western, and southwestern regions, a mixture of EV and SR plays a dominant role in the northeastern region, and the dominant role of SR is evident in most of the southern regions except the southwest. The area ratio of PR, EV, SR, AT, and ST to SM was approximately 10:9:6:1:1, the area with a significant bias correlation between SM and the five meteorological factors occupied approximately 84% of the total area of the country, and the area with significant bias correlation between PR and EV accounted for approximately 58% of the total area of the country (Figure 12b).
Among the five meteorological factors (PR, EV, SR, AT, and ST), PR and EV had a mixed dominance on the effect of SM (Table 2 and Table 3). However, the results of the partial correlation analysis revealed that EV dominated the effect of SM in winter. Commonly, high EV led to a significant reduction in surface SM [33]. In winter, the ST was higher than the AT, and the soil water EV was impacted by the ST [34]. Simultaneously, the effective amount of PR was minor, resulting in a low SM. Compared with summer, the area of SM nationwide with a significant bias correlation either with PR and EV or with five meteorological factors was considerably lower than that of summer. As a result, the accuracy of training data and the prediction effect in winter were lower compared to summer when the national SM was simulated by the BP neural network model.

5. Conclusions

In this paper, the SM for each month of 2019 and 2020 was simulated and predicted using the BP neural network trained on the relationship between monthly averages of six meteorological elements (including PR, EV, AT, net radiation, ST, and barometric pressure) and SM at the corresponding time for the whole country from 2000 to 2018. Generally, the spatial distribution pattern of simulated and verified SM values was similar. However, the degree of fragmentation of the simulated spatial distribution pattern of SM was considerably reduced compared with the verified spatial distribution pattern of SM.
The results suggested that the SM was affected by numerous meteorological factors, among which EV was most closely correlated. The atmospheric pressure was scarcely correlated with the SM changes. The SM was positively correlated with PR and negatively correlated with EV, SR, and temperature. When the BP neural network was employed to simulate SM for 12 months, the accuracy of training and simulation results in summer was high, while it was slightly worse for winter. The area with a significant partial correlation between SM and five meteorological factors such as PR, EV, SR, AT, and land ST in summer accounted for approximately 84% of the total area of China, while it reached 55% in winter. As a result, the accuracy of the BP neural network simulation of SM in winter was lower than that in other seasons.
To summarize, there was spatial variability not only in SM but also in the dominant meteorological factors influencing SM. Although the accuracy of the BP neural network simulating SM across the country in winter was slightly lower, the results of the BP neural network simulating SM variation in China throughout the year were acceptable. In the future, the training samples of the BP neural network should be optimized according to the difference in the influence of various meteorological factors on soil moisture in different regions at different times, so as to improve the simulation effect.

Author Contributions

Conceptualization, Y.L.; methodology, Y.L., F.S. and Z.Z.; software, Y.L. and X.L. (Xuan Liu); validation, Y.L., X.L. (Xuan Liu), and Y.S.; formal analysis, Y.L. and F.S.; resources, Y.L., X.L. (Xuan Liu), and X.Z.; writing—original draft preparation, Y.L.; writing—review and editing, Y.L., F.S. and W.L.; visualization, Y.L., X.L. (Xuqing Li), and Y.W.; supervision, Y.Z.; project administration, Y.J. and W.Z.; funding acquisition, Y.J. and Y.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key R&D Program Project: Asia Oceania Regional Integrated Earth Observation Knowledge Hub and Application Demonstration (2019YFE0126600); Research on Large Scale Collaborative Design and Digital Constellation Simulation Technology of Space Engineering for New Strategy of Remote Sensing Development (JCKY2020908B001); Major Special Project: The China High-Resolution Earth Observation System (30-Y30F06-9003-20/22); Project of Hebei Higher Education Scientific and Technological Research Young Talent Program: Research on Screening and Monitoring Method of Heavy Metal Stress in Agricultural Land Based on Spatial and Temporal Spectrum Analysis of Multi-source Remote Sensing Data (BJ2020056); Doctoral Research Startup Fund Project (BKY-2021-35); and Hebei Provincial Education Department Project: Evaluation of Ecological Environment Quality in Hebei Province Based on GIS Spatial Analysis Research (ZC2022014).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The study did not report any data.

Acknowledgments

Special thanks are given to the European Center for Medium-Range Weather Forecasts for providing ERA5-Land monthly averaged data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Legates, D.R.; Mahmood, R.; Levia, D.F.; DeLiberty, T.L.; Quiring, S.M.; Houser, C.; Nelson, F.E. SM: A Central and Unifying Theme in Physical Geography. Prog. Phys. Geogr. Earth Environ. 2011, 35, 65–86. [Google Scholar] [CrossRef]
  2. Zeng, J. Effects of Meteorological Forcings and Land Surface Model on SM Simulation over China. J. Hydrol. 2021, 603, 126978. [Google Scholar] [CrossRef]
  3. Schwingshackl, C.; Hirschi, M.; Seneviratne, S.I. Quantifying Spatiotemporal Variations of SM Control on Surface Energy Balance and Near-Surface AT. J. Clim. 2017, 30, 20. [Google Scholar] [CrossRef]
  4. Fu, B.; Zhu, Y.; Wang, S. Earth Surface Processes and Environmental Sustainability in China. Earth Environ. Sci. Trans. R. Soc. Edinb. 2018, 109, 373–374. [Google Scholar] [CrossRef] [Green Version]
  5. Seneviratne, S.I.; Corti, T.; Davin, E.L.; Hirschi, M.; Jaeger, E.B.; Lehner, I.; Orlowsky, B.; Teuling, A.J. Investigating SM–Climate Interactions in a Changing Climate: A Review. Earth-Sci. Rev. 2010, 99, 125–161. [Google Scholar] [CrossRef]
  6. Padilla, F.M.; Pugnaire, F.I. Rooting Depth and SM Control Mediterranean Woody Seedling Survival during Drought. Funct. Ecol. 2007, 21, 489–495. [Google Scholar] [CrossRef]
  7. Li, C.; Peng, F.; Xue, X.; You, Q.; Lai, C.; Zhang, W.; Cheng, Y. Productivity and Quality of Alpine Grassland Vary With Soil Water Availability Under Experimental Warming. Front. Plant Sci. 2018, 9, 1790. [Google Scholar] [CrossRef] [Green Version]
  8. Jiang, K. Influence Patterns of SM Change on Surface-AT Difference under Different Climatic Background. Sci. Total Environ. 2022, 9, 153607. [Google Scholar] [CrossRef]
  9. Pragna, G.; Jyothi, M.; Kunari, N.; Rao, I.B. The Effect Of Different Soil Amendment On Irrigation Frequency, Crop Yield, Water Use Efficiency Of Spinach. Int. J. Appl. Biol. Pharm. Technol. 2021, 8, 12–18. [Google Scholar]
  10. Holsten, A.; Vetter, T.; Vohland, K.; Krysanova, V. Impact of Climate Change on SM Dynamics in Brandenburg with a Focus on Nature Conservation Areas. Ecol. Model. 2009, 220, 2076–2087. [Google Scholar] [CrossRef]
  11. Haubrock, S.-N.; Chabrillat, S.; Lemmnitz, C.; Kaufmann, H. Surface SM Quantification Models from Reflectance Data under Field Conditions. Int. J. Remote Sens. 2008, 29, 3–29. [Google Scholar] [CrossRef]
  12. Cai, W.; Cowan, T.; Briggs, P.; Raupach, M. Rising Temperature Depletes SM and Exacerbates Severe Drought Conditions across Southeast Australia. Geophys. Res. Lett. 2009, 36, L21709. [Google Scholar] [CrossRef]
  13. Deng, Y.; Wang, S.; Bai, X.; Luo, G.; Wu, L.; Chen, F.; Wang, J.; Li, Q.; Li, C.; Yang, Y.; et al. Characteristics of SM Storage from 1979 to 2017 in the Karst Area of China. Geocarto Int. 2021, 36, 903–917. [Google Scholar] [CrossRef]
  14. Li, H.; Liu, F.; Zhang, S.; Zhang, C.; Zhang, C.; Ma, W.; Luo, J. Drying–Wetting Changes of Surface SM and the Influencing Factors in Permafrost Regions of the Qinghai-Tibet Plateau, China. Remote Sens. 2022, 14, 2915. [Google Scholar] [CrossRef]
  15. Whan, K.; Zscheischler, J.; Orth, R.; Shongwe, M.; Rahimi, M.; Asare, E.O.; Seneviratne, S.I. Impact of SM on Extreme Maximum Temperatures in Europe. Weather Clim. Extrem. 2015, 9, 57–67. [Google Scholar] [CrossRef] [Green Version]
  16. Jia, B.; Liu, J.; Xie, Z.; Shi, C. Interannual Variations and Trends in Remotely Sensed and Modeled SM in China. J. Hydrometeorol. 2018, 19, 17. [Google Scholar] [CrossRef]
  17. Cheng, S.; Guan, X.; Huang, J.; Ji, F.; Guo, R. Long-Term Trend and Variability of SM over East Asia. J. Geophys. Res. Atmos. 2015, 120, 8658–8670. [Google Scholar] [CrossRef] [Green Version]
  18. Cheng, S.; Huang, J. Enhanced SM Drying in Transitional Regions under a Warming Climate: ENHANCED SM DRYING. J. Geophys. Res. Atmos. 2016, 121, 2542–2555. [Google Scholar] [CrossRef]
  19. Deng, Y.; Wang, S.; Bai, X.; Luo, G.; Wu, L.; Cao, Y.; Li, H.; Li, C.; Yang, Y.; Hu, Z.; et al. Variation Trend of Global SM and Its Cause Analysis. Ecol. Indic. 2020, 110, 105939. [Google Scholar] [CrossRef]
  20. Cao, Y.; Khudanpur, S. Sample Selection for Large-Scale MT Discriminative Training. In Proceedings of the 10th Conference of the Association for Machine Translation in the Americas, San Diego, CA, USA, 28 October–1 November 2012. [Google Scholar]
  21. Köhli, M.; Schrön, M.; Zreda, M.; Schmidt, U.; Dietrich, P.; Zacharias, S. Footprint Characteristics Revised for Field-scale SM Monitoring with Cosmic-ray Neutrons. Water Resour. Res. 2015, 51, 5772–5790. [Google Scholar] [CrossRef] [Green Version]
  22. Radhika, K.S.R.; Rao, C.V.; Kamakshi Prasad, V. Improving the Spatial Resolution of AWiFS Sensor Data Using LISS III and AWiFS DataPair with Contourlet Transform Learning. In Advances in Intelligent Systems and Computing, Proceedings of the Second International Conference on Computational Intelligence and Informatics, Coimbatore, India, 23–24 November 2017; Bhateja, V., Tavares, J.M.R.S., Rani, B.P., Prasad, V.K., Raju, K.S., Eds.; Springer: Singapore, 2018; Volume 712, pp. 105–119. [Google Scholar] [CrossRef]
  23. Kirchhoff, K.; Bilmes, J. Submodularity for Data Selection in Statistical Machine Translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 25–29 October 2014. [Google Scholar]
  24. Georgievski, B. Image Augmentation with Neural Style Transfer. In ICT Innovations 2019. Big Data Processing and Mining; Gievska, S., Madjarov, G., Eds.; Communications in Computer and Information Science; Springer International Publishing: Cham, Switzerland, 2019; Volume 1110, pp. 212–224. [Google Scholar] [CrossRef]
  25. Ozcan, S.; Mustacoglu, A.F. Transfer Learning Effects on Image Steganalysis with Pre-Trained Deep Residual Neural Network Model. In Proceedings of the 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 10–13 December 2018. [Google Scholar]
  26. Li, Q.; Wang, Z.; Shangguan, W.; Li, L.; Yao, Y.; Yu, F. Improved Daily SMAP Satellite SM Prediction over China Using Deep Learning Model with Transfer Learning. J. Hydrol. 2021, 600, 126698. [Google Scholar] [CrossRef]
  27. Famiglietti, J.S.; Ryu, D.; Berg, A.A.; Rodell, M.; Jackson, T.J. Reply to Comment by H. Vereecken et al. on Field Observations of SM Variability across Scales. Water Resour. Res. 2008, 44, W12602. [Google Scholar] [CrossRef]
  28. Zhao, W.; Fang, X.; Daryanto, S.; Zhang, X.; Wang, Y. Factors Influencing SM in the Loess Plateau, China: A Review. Earth Environ. Sci. Trans. R. Soc. Edinb. 2018, 109, 501–509. [Google Scholar] [CrossRef]
  29. Fu, X.; Jiang, X.; Yu, Z.; Ding, Y.; Lü, H.; Zheng, D. Understanding the Key Factors That Influence SM Estimation Using the Unscented Weighted Ensemble Kalman Filter. Agric. For. Meteorol. 2022, 313, 108745. [Google Scholar] [CrossRef]
  30. Cai, Y.; Zheng, W.; Zhang, X.; Zhangzhong, L.; Xue, X. Research on SM Prediction Model Based on Deep Learning. PLoS ONE 2019, 14, e0214508. [Google Scholar] [CrossRef] [PubMed]
  31. Cho, E. Regional Scale Spatio-Temporal Variability of SM and Its Relationship Climate, Vegetation, Soil and Groundwater on SM Spatiotemporal Variability in the Mongolian Plateau. Sci. Total Environ. 2022, 809, 152198. [Google Scholar] [CrossRef]
  32. Cho, E. Regional Scale Spatio-Temporal Variability of SM and Its Relationship with Meteorological Factors over the Korean Peninsula. J. Hydrol. 2014, 516, 317–329. [Google Scholar] [CrossRef]
  33. Pandey, V.; Srivastava, P.K. Integration of Satellite, Global Reanalysis Data and Macroscale Hydrological Model for Drought Assessment in Sub-tropical Region of India. In Proceedings of the ISPRS TC III Mid-Term Symposium “Developments, Technologies and Applications in Remote Sensing”, Beijing, China, 7–10 May 2018. [Google Scholar]
  34. Liu, P.; Xia, Y.; Shang, M. A Bench-Scale Assessment of the Effect of Soil Temperature on Bare Soil EV in Winter. Hydrol. Res. 2020, 51, 1349–1357. [Google Scholar] [CrossRef]
Figure 1. Data processing flow chart.
Figure 1. Data processing flow chart.
Sustainability 14 16381 g001
Figure 2. The structure of BP neural network.
Figure 2. The structure of BP neural network.
Sustainability 14 16381 g002
Figure 3. Comparison of MSE and R2 among three groups in 2019 and 2020.
Figure 3. Comparison of MSE and R2 among three groups in 2019 and 2020.
Sustainability 14 16381 g003
Figure 4. Comparison of simulated soil moisture and verified soil moisture in 2019.
Figure 4. Comparison of simulated soil moisture and verified soil moisture in 2019.
Sustainability 14 16381 g004
Figure 5. Comparison of simulated soil moisture and verified soil moisture in 2020.
Figure 5. Comparison of simulated soil moisture and verified soil moisture in 2020.
Sustainability 14 16381 g005
Figure 6. Spatial distribution of verified soil moisture in 2019.
Figure 6. Spatial distribution of verified soil moisture in 2019.
Sustainability 14 16381 g006
Figure 7. Spatial distribution of verified soil moisture in 2020.
Figure 7. Spatial distribution of verified soil moisture in 2020.
Sustainability 14 16381 g007
Figure 8. Spatial distribution of simulated soil moisture in 2019.
Figure 8. Spatial distribution of simulated soil moisture in 2019.
Sustainability 14 16381 g008
Figure 9. Spatial distribution of simulated soil moisture in 2020.
Figure 9. Spatial distribution of simulated soil moisture in 2020.
Sustainability 14 16381 g009
Figure 10. Monthly mean soil moisture (a), precipitation (b), evapotranspiration (c), Surface net solar radiation (d), air temperature (e), and surface temperature (f) in China.
Figure 10. Monthly mean soil moisture (a), precipitation (b), evapotranspiration (c), Surface net solar radiation (d), air temperature (e), and surface temperature (f) in China.
Sustainability 14 16381 g010
Figure 11. Partial correlation coefficients between monthly average precipitation, monthly average evapotranspiration, monthly average surface net solar radiation, monthly average air temperature, monthly average surface temperature, and soil moisture in January 2000–2020 (a,c,e,g,i). Partial correlation coefficients between monthly average precipitation, monthly average evapotranspiration, monthly average surface net solar radiation, monthly average air temperature, monthly average surface temperature, and soil moisture in July 2000–2020 (b,d,f,h,j).
Figure 11. Partial correlation coefficients between monthly average precipitation, monthly average evapotranspiration, monthly average surface net solar radiation, monthly average air temperature, monthly average surface temperature, and soil moisture in January 2000–2020 (a,c,e,g,i). Partial correlation coefficients between monthly average precipitation, monthly average evapotranspiration, monthly average surface net solar radiation, monthly average air temperature, monthly average surface temperature, and soil moisture in July 2000–2020 (b,d,f,h,j).
Sustainability 14 16381 g011
Figure 12. The strongest correlation coefficient (a) in January 2000–2020. The strongest correlation coefficient (b) in July 2000–2020.
Figure 12. The strongest correlation coefficient (a) in January 2000–2020. The strongest correlation coefficient (b) in July 2000–2020.
Sustainability 14 16381 g012
Table 1. Accuracy of the six-factor sample training set, validation set, and test set.
Table 1. Accuracy of the six-factor sample training set, validation set, and test set.
SampleTraining RValidation RTest R
Group 12000–20180.903720.904930.90331
Group 2Spring0.922630.921050.92074
Summer0.957060.956530.95676
Autumn0.917650.916550.91884
Winter0.856840.857470.85715
Group 3January0.868010.867970.86698
February0.857610.858690.85694
March0.901740.901850.90088
April0.937580.933590.93303
May0.952450.953280.95393
June0.961060.951650.95106
July0.958220.957830.95811
August0.957770.956390.95683
September0.954890.952250.95272
October0.930970.930030.93075
November0.903430.902120.90381
December0.895660.900460.89542
Table 2. Accuracy of the single-factor sample training set.
Table 2. Accuracy of the single-factor sample training set.
SamplePREVATSRSTSP
January0.69770.72400.61380.34250.56450.3966
February0.68800.73910.58070.44980.50230.4006
March0.75740.79630.41970.56820.38820.4142
April0.81620.84290.30780.58490.29980.4275
May0.86440.91050.18200.67260.39950.4073
June0.87450.90080.38480.65310.55830.4080
July0.89240.92030.42440.53950.64160.4717
August0.90320.93010.39430.54620.58560.4648
September0.87730.91400.32230.39000.29310.4264
October0.81640.86630.41980.41190.40990.3686
November0.75270.78940.54200.31580.47600.3932
December0.64900.73750.62440.26570.54820.3933
Table 3. Accuracy of single factor sample training set.
Table 3. Accuracy of single factor sample training set.
SamplePR-EVPR-ATPR-SRPR-STPR-SPEV-ATEV-SREV-STEV-SPAT-SRAT-STAT-SPSR-STSR-SPST-SP
January0.72380.73280.71330.72890.73540.71430.7430.68070.66500.68430.81100.70990.70540.56570.7119
February0.73600.76210.76670.73880.77540.70000.74530.64660.67200.67630.6770.64950.66320.59610.6379
March0.78540.86360.76890.82320.79380.74610.77650.78330.69880.65790.69710.55110.68220.67250.5296
April0.85030.84530.83650.85840.84820.78600.8370.82160.7490.66510.78280.56170.67820.68030.6036
May0.90630.88680.88060.89580.89360.81120.8640.85620.79680.77910.84120.59050.80160.75240.6782
June0.90970.89120.88480.90440.90050.83570.88050.86080.82450.80120.83990.69120.83000.77850.7873
July0.92210.90620.89830.90750.91670.88260.88910.88970.87550.73750.86730.78520.79870.76110.859
August0.92960.92250.92150.9260.93070.89660.89710.90150.90130.79080.86220.71730.82010.78440.8082
September0.91350.89430.88560.89590.90920.87840.88820.88550.8790.59650.78730.52830.54160.64720.6046
October0.86120.84880.84060.85360.8580.80720.84190.81620.78370.63120.50350.59880.61320.61250.5847
November0.78730.79610.79030.77680.79350.79080.79540.72420.73630.6690.7820.68960.66050.58190.6811
December0.75120.72170.6890.69370.72020.75390.75920.7140.74420.69040.82560.71640.67890.55270.7176
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Liu, Y.; Shi, F.; Liu, X.; Zhao, Z.; Jin, Y.; Zhan, Y.; Zhu, X.; Luo, W.; Zhang, W.; Sun, Y.; et al. Influence of Different Meteorological Factors on the Accuracy of Back Propagation Neural Network Simulation of Soil Moisture in China. Sustainability 2022, 14, 16381. https://doi.org/10.3390/su142416381

AMA Style

Liu Y, Shi F, Liu X, Zhao Z, Jin Y, Zhan Y, Zhu X, Luo W, Zhang W, Sun Y, et al. Influence of Different Meteorological Factors on the Accuracy of Back Propagation Neural Network Simulation of Soil Moisture in China. Sustainability. 2022; 14(24):16381. https://doi.org/10.3390/su142416381

Chicago/Turabian Style

Liu, Yuyan, Fei Shi, Xuan Liu, Zihui Zhao, Yongtao Jin, Yulin Zhan, Xia Zhu, Wei Luo, Wenhao Zhang, Yuefang Sun, and et al. 2022. "Influence of Different Meteorological Factors on the Accuracy of Back Propagation Neural Network Simulation of Soil Moisture in China" Sustainability 14, no. 24: 16381. https://doi.org/10.3390/su142416381

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop