1. Introduction
In many regions across the world, the most predominant type of land degradation is soil erosion, which has adverse environmental and socioeconomic consequences [
1,
2,
3]. Soil erosion is the process of moving soil particles by external forces, such as mass movement, wind, and water [
4,
5]. In Europe, where a humid climate dominates, water-induced soil erosion is the main form of erosion, which poses a serious environmental concern in many European countries [
6]. Furthermore, soil erosion by wind and dust storms is one of the challenges in European countries [
7,
8,
9].
Soil erosion by water has numerous environmental impacts. For instance, detaching soil particles from the upper layer of the soil causes a deterioration in agriculture productivity through the loss of organic matter, nutrients, and soil depth [
10]. Moreover, moving soil particles over vast distances affects the ecosystem service quality in downstream rivers by increasing the sedimentation and the contamination of aquatic life [
11,
12]. Since measuring soil erosion at a large scale is difficult, expensive, and time consuming, several models have been developed in recent decades to estimate soil erosion [
13,
14,
15].
In Europe, the Universal Soil Loss Equation (USLE) [
15], and its modified version, the Revisited Universal Soil Loss Equation (RUSLE) [
14], is the most widely used in quantifying soil erosion at multiple scales across Europe. At large spatial scales, RUSLE is typically the most frequently used model to estimate soil erosion [
16]. In the RUSLE model, the average annual soil erosion is calculated by multiplying six factors, including the rainfall erosivity factor (R factor). These factors are slope length (L-factor), soil erodibility (K-factor), slope steepness (S-factor), supporting conservation practices (P-factor), and crop type and management (C-factor). In this sense, rainfall erosivity is considered the most important, as rainfall has a direct impact on detaching and moving the soil particles [
15].
Rainfall erosivity is the potential force of raindrops to detach and erode soil particles [
17]. As it is one of the main causes of floods and landslides, researchers have highlighted rainfall erosivity as an important indicator to be investigated [
18]. The rainfall erosivity factor is calculated using rainfall records with 1–5 min precipitation intervals [
19]; however, these records are rarely accessible for long enough in most of the world. As a result, the kinetic energy concept has been widely employed to estimate the rainfall erosivity factor from half-hourly or hourly datasets [
20].
To accurately estimate the R factor using the kinetic energy concept, it is necessary to measure both the intensity and the kinetic energy of the rain, but it is highly challenging to achieve this directly since the equipment needed is expensive and measuring the distribution of the rainstorm’s drop sizes is a tedious process [
21]. To overcome this, researchers have developed numerous empirical equations that describe the relationship between rainfall intensity and its kinetic energy [
22]. To provide a comprehensive review of these equations, Dash et al. [
21] compared six of the most universal equations in more detail and provided a deep evaluation of their applicability in calculating the R factor. Alternative methods for calculating the R factor include index techniques, such as the Modified Fournier Index (
MFI), especially when high-resolution rainfall records (half-hourly or hourly) are not available. The
MFI is one of the methods suggested by Arnoldus [
23] for calculating the R factor based on the monthly rainfall data. However, some adjustment is required for calculating the R factor based on the
MFI result [
24]. The
MFI was used to estimate catastrophic erosion by evaluating rainfall erosivity and its association with other meteorological factors [
25]. Previously, the
MFI was implemented in many parts of the world, as can be seen in
Table 1.
Recently, the artificial neural network (ANN) and machine learning algorithms have been widely used to predict environmental processes (erosion, contamination, and drought) in many parts of the world [
29]. For instance, the multilayer perceptron neural network (MLPNN) model is one of the most widely used models for predicting hydrological data [
30]. Mishra and Desai [
31] used the ANN, RBF, and adaptive neural network-based fuzzy inference system (ANFIS) to forecast drought (SPI) at various timescales and found that ANN has better performance than RBF and ANFIS. In Iran, MLP, ANFIS, and multiple linear regression models were used for forecasting precipitation; the output showed that MLP produced better results [
32]. Jalalkamali et al. [
33] compared stochastic models with the ANN to forecast SPI-9 in Iran, and their results revealed that stochastic models performed better. The different model results depend on the drought index and its scale [
34]. More examples of the implementation of the ANN for predicting certain environmental variables are presented in
Table 2.
Based on the literature, few studies used the ANN to predict rainfall erosivity. However, limited information is available on MFI changes in Central Europe. Thus, the main goals of this research were to: (1) assess the Modified Fournier Index (MFI) as a representative for the erosivity index in three stations in Hungary between 1901 and 2020; (2) evaluate the ability of ANNs (multilayer perceptron (MLP) and the radial basis function (RBF)) to predict the MFI; and (3) rate the importance of input variables in predicting the MFI based on sensitivity analysis (∂). Overall, the implementation of ANN to predict MFI is still less common, which give this work novelty in its field, where the output will serve researchers, planners, and decision makers.
4. Discussion
In this research, the
MFI was calculated for tracking rainfall erosivity in Central Europe; then, two ANN (RBF and MLP) algorithms were tested to assess their ability in the prediction of the
MFI. At the three studied stations, the
MFI values ranged from very low to high (1901–2020) (
Table 7). Previously, De Luis et al. [
41] analyzed the erosivity trend in Western Europe (Iberian Peninsula) and detected a notable decrease in rainfall erosivity based on the
MFI (1951–2000). For the Netherlands, Lukić et al. [
10] reported that the
MFI values ranged between 77.93 and 97.27 (1957–2016). However, changes in erosivity class from low to moderate were reported in the same study. These changes in rainfall erosivity in Europe can be mainly explained by climate change (i.e., extreme events: flood and drought), which largely affects the precipitation patterns, not only in Europe but all over the world [
58,
59,
60,
61].
The output of RBF and MLP showed that the RBF outperformed the MLP. However, both algorithms were perfectly capable of predicting the
MFI values, with some differences. The differences between the output could be explained by the way that each algorithm works. The necessary step for the proper functioning of the NN is to optimize the weights, known as calibration. Different types of algorithms can be used to optimize the weight, e.g., back propagation [
62] and Levenberg–Marquardt [
63]. These algorithms can minimize the disparity between forecasted and observed values by adjusting the network weight [
46].
Generally, the ANN works on the principle of the training dataset. There are various kinds of neural network (NN) models, but usually, two models are used in prediction applications, i.e., recurrent network and feedforward network. The backpropagation algorithm is used to train both models [
49,
50,
51,
52,
53,
54,
55,
56,
57,
58,
59,
60,
61,
62,
63,
64]. When the backpropagation algorithm is used to change the weight of neurons, it works on the gradient descent method (weights change in downward direction). The signal strength between nodes is directly dependent on the weights of neurons [
49]. Feedforward NN is a basic type, and it is capable of estimating constant and integral functions.
The network architecture of MLP comprises neurons put together into layers. The MLP contains three layers of nodes, i.e., input, hidden, and output layers. The MLP can have one or more hidden layers with various numbers of neurons. In addition to the input node, the hidden and output nodes are considered neurons [
65]. When we used the MLP to study rainfall erosivity (
MFI), the input layer contained the variables (
Pd-max,
pi,
Ptotal,
Tavg,
Td-max, and
Td-min), and the output layer presented the predicted
MFI (
Figure 3), while the hidden layer included a nonlinear function and utilized weight for the input layer. Neurons in the hidden layer work in a trial and error approach [
34].
The MLP and RBF consist of three network layers; however, the main difference between the RBF and MLP is that the RBF’s hidden and output layers are different, unlike those of the MLP [
66]. The hidden layer neurons are nonlinear, while the output layer neurons are linear in the RBFs. The nonlinear hidden layer neuron plays a significant role in the nonlinear modeling task [
67]. The RBF network is simpler compared to MLP. However, the MLP is more successfully implemented in various complex problems. The RBF is a local approximation network, and its output can be estimated by hidden units in a local receptive field. The MLP network works globally, and its output is determined by all the neurons [
68]. Despite the similarity between both algorithms, the differences in the architecture process led to different output and accuracy (
Figure 7,
Figure 8 and
Figure 9).
Overall, the implementation of the ANN for predicting the
MFI or other hydrological and environmental variables was proven to be a useful tool for predicting and forecasting [
69]. However, the output of this research could be useful for local planners on a county scale for predicting the
MFI values based only on monthly and yearly rainfall.
5. Conclusions
Land degradation is a major issue all over the world due to its negative impact on the agroecosystem and environmental components. Recently, machine learning and the artificial neural network have been implemented in environmental research for predicting natural hazards. In this research, ANN (MLP and RBF) algorithms were implemented to predict the MFI as a representative of erosivity factor (soil erosion) in Central Europe. Five scenarios with different inputs (rainfall and temperature) were suggested for exploring the accuracy of ANN (MLP and RBF) algorithms. The output of this research can be summarized as follows:
- 1-
The MFI ranged between 91.97 (Budapest) and 80.25 (Pecs), with a notable decrease in MFI values (1901–2020).
- 2-
The SC2 (Pd-max + pi + Ptotal) was the best scenario for predicting the MFI using the ANN–MLP.
- 3-
The SC4 (Ptotal + Tavg + Td-max + Td-min) was the most accurate scenario for predicting the MFI by using the ANN–RBF.
- 4-
The sensitivity analysis revealed that pi followed by Ptotal are the most important input variables for predicting MFI values.
It is good to mention that this research was only focused on MFI as one of the factors that contribute to soil erosion based on the monthly rainfall data. Some other factors such as land use (agricultural areas), soil properties (i.e., texture, structure), vegetation cover, and inclination angle of rainfall streams were not considered in this research.
Local planers, environmental organizations, and decision makers will be able to use the output of this research, where the prediction of the MFI could be performed to a satisfactory level based on the total rainfall in the target regions. In the next steps, other machine learning methods will be implemented to test their accuracy in the prediction of the MFI. However, the output of this research could serve as a good result for both scientific and industrial communities.