Next Article in Journal
Automated Counting Grains on the Rice Panicle Based on Deep Learning Method
Next Article in Special Issue
Mask R-CNN and OBIA Fusion Improves the Segmentation of Scattered Vegetation in Very High-Resolution Optical Sensors
Previous Article in Journal
Hearing Loss Is Associated with Increased Variability in Double Support Period in the Elderly
Previous Article in Special Issue
Mapping and Discriminating Rural Settlements Using Gaofen-2 Images and a Fully Convolutional Network
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Flash-Flood Potential Mapping Using Deep Learning, Alternating Decision Trees and Data Provided by Remote Sensing Sensors

1
Research Institute of the University of Bucharest, 90-92 Sos. Panduri, 5th District, 050663 Bucharest, Romania
2
National Institute of Hydrology and Water Management, București-Ploiești Road, 97E, 1st District, 013686 Bucharest, Romania
3
Department of Geomorphology, Tarbiat Modares University, Tehran 36581-17994, Iranmailto
4
Department of Geoinformatics–Z_GIS, University of Salzburg, 5020 Salzburg, Austria
5
Environmental Quality, Atmospheric Science and Climate Change Research Group, Ton Duc Thang University, Ho Chi Minh City 700000, Vietnam
6
Faculty of Environment and Labour Safety, Ton Duc Thang University, Ho Chi Minh City 700000, Vietnam
7
Geotechnical Engineering Deparment, University of Transport Technology, Hanoi 100000, Vietnam
8
University Center for Research & Development (UCRD), Chandigarh University, Punjab 140413, India
9
Department of Civil Engineering, University Institute of Engineering, Chandigarh University, Punjab 140413, India
10
Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi 110025, India
11
Institute of Research and Development, Duy Tan University, Danang 550000, Vietnam
12
Faculty of Environmental and Chemical Engineering, Duy Tan University, Danang 550000, Vietnam
13
Faculty of Geography, University of Bucharest, Bd. Nicolae Bălcescu No 1, 1st District, 010041 Bucharest, Romania
*
Authors to whom correspondence should be addressed.
Sensors 2021, 21(1), 280; https://doi.org/10.3390/s21010280
Submission received: 6 November 2020 / Revised: 2 December 2020 / Accepted: 22 December 2020 / Published: 4 January 2021
(This article belongs to the Special Issue Deep Learning Methods for Remote Sensing)

Abstract

:
There is an evident increase in the importance that remote sensing sensors play in the monitoring and evaluation of natural hazards susceptibility and risk. The present study aims to assess the flash-flood potential values, in a small catchment from Romania, using information provided remote sensing sensors and Geographic Informational Systems (GIS) databases which were involved as input data into a number of four ensemble models. In a first phase, with the help of high-resolution satellite images from the Google Earth application, 481 points affected by torrential processes were acquired, another 481 points being randomly positioned in areas without torrential processes. Seventy percent of the dataset was kept as training data, while the other 30% was assigned to validating sample. Further, in order to train the machine learning models, information regarding the 10 flash-flood predictors was extracted in the training sample locations. Finally, the following four ensembles were used to calculate the Flash-Flood Potential Index across the Bâsca Chiojdului river basin: Deep Learning Neural Network–Frequency Ratio (DLNN-FR), Deep Learning Neural Network–Weights of Evidence (DLNN-WOE), Alternating Decision Trees–Frequency Ratio (ADT-FR) and Alternating Decision Trees–Weights of Evidence (ADT-WOE). The model’s performances were assessed using several statistical metrics. Thus, in terms of Sensitivity, the highest value of 0.985 was achieved by the DLNN-FR model, meanwhile the lowest one (0.866) was assigned to ADT-FR ensemble. Moreover, the specificity analysis shows that the highest value (0.991) was attributed to DLNN-WOE algorithm, while the lowest value (0.892) was achieved by ADT-FR. During the training procedure, the models achieved overall accuracies between 0.878 (ADT-FR) and 0.985 (DLNN-WOE). K-index shows again that the most performant model was DLNN-WOE (0.97). The Flash-Flood Potential Index (FFPI) values revealed that the surfaces with high and very high flash-flood susceptibility cover between 46.57% (DLNN-FR) and 59.38% (ADT-FR) of the study zone. The use of the Receiver Operating Characteristic (ROC) curve for results validation highlights the fact that FFPIDLNN-WOE is characterized by the most precise results with an Area Under Curve of 0.96.

1. Introduction

In recent decades, climate change and its related phenomena, e.g., flash floods, have had significant negative effects worldwide for both human society and environment [1]. The extreme rainfalls, extreme river discharge values, and therefore the flash-flood risk are characterized by a continuous increasing trend [2]. This trend is also validated by the huge amount of damages that flash floods generate worldwide. Therefore, an increasing number of studies in the literature approaching the subject of flash-flood susceptibility can be also observed [3,4,5,6]. Moreover, the estimation of flood risk and vulnerability became an essential and mandatory procedure which should be included in the Flood Risk Management strategy [7]. In this regard, the Geographic Informational Systems (GIS) and Remote Sensing (RS) techniques represent the necessary tools, which facilitate the spatial modelling and mapping of flash-flood susceptible areas. It is worth emphasizing the crucial role of Remote Sensing sensors in the observation’s campaigns conducted for the identification of areas already affected by flash-flood processes [8]. Thus, without the RS sensors, the correct inventory of the torrential areas, which favor the occurrence of flash flood, will be impossible. Consideration of the previously affected areas and their involvement as input data in more advanced techniques such as machine learning or bivariate statistics, is of a real help to estimate as accurate as possible the flash-flood susceptibility within a specific catchment [9].
In recent years, new techniques and models have been developed by researchers worldwide [10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35]. During the last 6 years, several studies have been individualized regarding the flash-flood susceptibility investigations, which were carried out through the integration of GIS techniques with bivariate statistical models such as: frequency ratio [36], weights of evidence [37], statistical index [38], evidential belief function [39], certainty factor [40], or index of entropy [41]. Another category of methods successfully used in this type of study are those included in Multicriteria Decision Making such as: Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) [42], Analytical Hierarchy Process (AHP) [43], Analytical Network Process (ANP) [44] or Vlse Kriterijuska Optamizacija I Komoromisno Resenje (VIKOR) [45]. Promising results in terms of flash-flood susceptibility were also provided by machine learning models such as: logistic regression [46], naïve bayes [47], artificial neural network [48], random forest [49,50], support vector machine [51], neuro-fuzzy inference system [52], k-nearest neighbor [53] or deep learning neural network [54]. The attempts of researchers to combine models from the same category or from different categories to generate ensemble algorithms that are considered much more accurate than the stand-alone ones should also be noted [55]. In this regard, the following examples can be provided: Fuzzy Unordered Rules Induction Algorithm (FURIA) [3], Bayesian-based machine learning models [9], machine learning and multicriteria decision making ensembles [7], machine learning and bivariate statistics ensembles [56].
Taking into account the previously presented aspects, the main purpose of the proposed research work is to estimate the susceptibility to flash floods in the basin of the Bâsca Chiojdului river from Romania. Estimation of flash-flood exposure will be based on the data collected using Remote Sensing sensors and the GIS database and their use in a number of four ensemble models generated by combining bivariate statistics with deep learning neural networks and alternating decision trees. Thus, on the one hand, the Frequency Ratio and Weights of Evidence bivariate statistical models will be used; these being combined with deep learning neural network and alternating decision trees. The construction of Receiver Operating Characteristic (ROC) curve and the calculation of several statistical metrics will ensure the validation of the results and the evaluation of the models’ performances. It is worthwhile to note that the present study is intended to enrich the scientific literature regarding the flash-flood susceptibility assessment by proposing, for the first time in the literature, the combination above mentioned of four machine learning ensemble models with the GIS and remote sensing techniques.

2. Study Area

The Bâsca Chiojdului river basin from Romania, on which the present research is focused, has a total area of 340 km2. The basin has an elevation which varies from 242 m to 1463 m, and a slope angle with an average value of 12.3°. It should be noted that a percentage of 79% of the total area is characterized by slope angles higher than 7° [57]. The circularity ratio, that is another important feature with a high influence on flash-flood susceptibility, has a value of 0.46, while the river basin concentration time is 7.27 h [36]. The low concentration time highlights a high predisposition of the study area to the flash-flood events. The forest vegetation covers a total percentage of 50%, while in terms of the soil component, the hydrological group B accounts for approximately 41% of the total research area.
The lithology consists mainly of the sedimentary rocks included in the Paleogene and Cretaceous flysch. The climate is characterized by a high continentalism degree, and especially in the warm season, the heavy rainfalls often lead to severe flash-flood phenomena. Due to the geographical characteristics of the Bâsca Chiojdului river basin, the socio-economic elements located across its territory suffered material losses following the flash-flood propagation. The most important flash-flood event occurred in 1975, when the maximum discharge value (300 m3/s) of the Bâsca Chiojdului river reached the historical maximum [57]. More information regarding the main flash floods occurred across the study area, as well as the damages caused by these phenomena, can be found in the research works carried out by: Costache and Zaharia [10], Prăvălie and Costache [57], Costache et al. [38], Zarea and Gheorghe [58], Prăvălie and Costache [59].

3. Data

In order to carry out the present study, data consisting of torrential areas polygons and flash-flood predictors were gathered.

3.1. Torrential Area Inventory and Sampling

The inventory of surfaces previously affected by a specific process is essential for an accurate prediction of the areas where that phenomenon can occur in the future [60]. In the present research work, we consider the torrential surfaces as the spatial indicator for the areas with a high susceptibility for flash-flood genesis. In order to identify, as accurate as possible, the areas affected by torrential phenomena, analysis of the images provided by the Remote Sensing sensors was mandatory. This fact highlights the crucial role that this type of sensor has in the analysis of natural hazard susceptibility. Thus, using the Google Earth imagery a total area of 34 km2 was delimited. These surfaces were created by the accelerated surface runoff occurring on the slopes. The manner in which these surfaces are delineated is described in the study carried out by Costache [61]. According to Costache and Zaharia [8], the torrential areas are defined as the areas characterized by the unified presence of torrential microform of relief such as ravines and gullies, which are generated by surface runoff. They are located in the upper part of the river basin, where the absence of vegetation and the high slopes favor the production of such phenomena. In order to be taken into account in the present study, a sample of 481 points representing locations where the torrential runoff took place was extracted from the entire delimited area. Moreover, another sample of 481 points was placed within the study area, representing points without torrential processes (Figure 1). Both torrential pixels and non-torrential pixels were divided into training (70%) and validating (30%) samples. This division was necessary in order to train the models and then to validate the results regarding the susceptibility to flash floods.

3.2. Flash-Flood Predictors

For the realization of this study, a number of 10 flash-flood conditioning factors were taken into account. Their main properties are described in the following lines. Slope angle was calculated using the Digital Elevation Model (DEM) taken from Shuttle Radar Topographic Mission (SRTM) 30 m database and processed in ArcGIS 10 software. A high value of slope angle will influence in a positive water runoff velocity, while the low values of the same parameter will be restrictive for the surface runoff occurrence [56]. For the study area, the map of slope angle was designed by splitting its range of values into five classes as following [12]: <3°; 3°–7°; 7.1°–15°; 15.1°–25°; >25° (Figure 2a). Another water surface runoff predictor is represented by the Topographic Wetness Index (TWI) calculated by the DEM processing in SAGA GIS 2.1.0. The algorithm used to calculate this index requires the use of the area upslope to each pixel and the tangent value of the slope value recorded in the same pixel [53]. The generation of TWI map was possible following the partition of its values into the next five classes using Natural Breaks method: 3.15–6.1, 6.11–7.78, 7.79–10.21, 10.22–14.5, 14.51–24.59 (Figure 2b). Topographic Position Index (TPI) is a mandatory flash-flood predictor which should be involved in the susceptibility related studies because its values emphasize the altitude difference between the location of a specific point and its neighboring area [62]. This important morphometric indicator was achieved at a spatial resolution of 30 m and its values ranging from −20 to 20 were divided into the next five classes using Natural Breaks method: (−20)–(−3.8), (−3.7)–(−1.1), (−1.1)–1.3, 1.4–4.5, 4.6–20 (Figure 2c). Profile curvature is mainly used to delineate the surfaces on which an accelerated surface runoff is manifested from those on which a decelerated surface runoff occurs [63]. According to the literature [38], positive profile curvature is characteristic for areas with a decelerated water runoff, while the negative values show the surfaces that increase the water runoff velocity. Across the study area, the profile curvature was classified into the following three intervals: (−3)–0, 0.1–0.9, 1–2 (Figure 2d). The ability of convergence index morphometric factor consists of the differentiation of the areas belonging the river valleys from those which are situated along the interfluvial lines. This index, achieved by DEM processing in SAGA GIS 2.1.0, was classified according to the literature: (−99)–(−3), (−2.9)–(−2), (−1.9)–(−1), (−0.9)–0, 0–99 (Figure 2e). Stream Power Index (SPI) is another morphometric factor that is generated in SAGA GIS 2.1.0 based on the values of upslope region that drains into a pixel and the tangent applied to the slope angle [64]. This predictor, which shows the capacity of the river for sediment transport, was mapped using the following classes values: <50, 50–500, 501–2000, 2001–5000, >5000 (Figure 2f). Slope aspect (Figure 3a) is the seventh morphometric index taken into account for the present research. The slope orientation has a big influence in the surface runoff process because the humidity condition will vary due to the different quantity of solar radiation [65]. The slope aspect predictor was derived from the DEM.
Land use, which is the main interface between the torrential rainfalls and the ground surface, has an important influence on the runoff velocity [66]. For the present study, the land use layer was taken from the Corine Land Cover 2018 database. According to Figure 3b, a number of eight land use categories were delineated within the study area perimeter. Hydrological soil group was considered as a flash-flood predictor in the present research due to its incontestable influence on vertical infiltration of water in the ground [67]. Within the Bâsca Chiojdului cathcment, all of the four hydrological soil groups are present (Figure 3c). A similar contribution, as soil groups, in flash-flood genesis is held by the lithological groups. In the area of the Bâsca Chiojdului catchment, a total of 10 lithological groups can be found (Figure 3d).

4. Methods

The main steps of the methodological workflow are synthetically described in Figure 4.

4.1. Linear Support Vector Machine (LSVM) for Feature Selection

In a study that aims to estimate the qualitative flash-flood susceptibility, it is imperative to analyze the predictive ability of flash-flood conditioning factors in order to see if they all manage to contribute to some extent to the genesis of flash floods. In the present research paper, the evaluation of the prediction ability of flash-flood predictors was determined using Linear Support Vector Machine (LSVM). This method is widely used because it is able to remove redundant and irrelevant information from input data [68]. The following equation is used to compute the predictive ability through LSVM algorithm [69]:
f ( x ) = s i g n ( C T i + j )
where CT is equal to the inverse of weight matrix attributed to each flash-flood predictor, i = (i1, i2,, i11) is the vector containing the ten flash-flood predictors, j is equal to the offset value calculated from the hyper-plane origin [5].
This algorithm was applied with the help of Weka 9.3 software.

4.2. Weights of Evidence (WOE)

The bivariate statistics model represented by Weights of Evidence (WOE) is a very frequently used algorithm involved in the studies focused on natural hazards predisposition evaluation [40]. In this study, the WOE model is used to calculate the weight that each factor class/category has in relation to the genesis of the flash-flood process. In order to derive the WOE coefficients, first, computing the positive (W+) and negative (W) weights is required. The positive weight highlights the association between a factor class/category and the torrential points, while the negative weight indicates the absence of this spatial association [36]. The following relations should be employed in the weights computation [70]:
W + = ln P { B | S } P { B | S ¯ }
W = ln P { B ¯ | S } P { B ¯ | S ¯ }
where: W+—positive weight, W—negative weight, P—the probability, B—the presence of flash-flood predictor, B ¯ —the absence of flash-flood predictor, S—the presence of torrential pixels, S ¯ —the absence of torrential pixels.
The final WOE coefficients can be derived using the next equation [71]:
Wf = Wplus + WmintotalWmin
where: Wplus—positive weight of a class factor, Wmin—negative weight of a class factor, Wmintotal—the total of all negative weights in a multiclass map.
The final WOE values will be used as input data into the Deep Learning and Alternating Decision Tree models through which the flash-flood susceptibility will be determined.

4.3. Frequency Ratio (FR)

Frequency Ratio (FR) is the second bivariate statistical model which will be employed in order to prepare the input data in the Deep Learning and Alternating Decision Tree algorithms. The FR model consists of the calculation of the ratio between the sum of torrential pixels within a specific category of predictor, and the sum of torrential pixels within the entire study zone. The following relation can be used to estimate the FR coefficients [72]:
F R = N p ( L X i ) i = 1 m N p ( L X i ) N p ( X j ) j = 1 n N p ( X j )
where: FR—the frequency ratio of class i of factor j; Np(LXi)—the number of pixels with torrentiality within class i of factor variable X; Np(Xj)—the number of pixels within factor variable Xj; m—the number of classes in the factor variable Xi; n—the number of factors in the study area.

4.4. Deep Learning Neural Network (DLNN)

Besides one hidden layer neural networks, the Deep Learning Neural Network (DLNN) is characterized by a feed-forward architecture which contains more than one hidden layer [73]. Due to this fact, DLNN model is considered better than the simple neural network in terms of complex classification problems [74]. In the DLNN structure, the information from the input layer will be transmitted to the hidden layers where it is processed and then forwarded to the output layer. Further, the backpropagation algorithm will be employed to send back the error from the output layer to the input layer [75]. The training procedure of DLNN, which is a type of fee-forward neural network, is ensured by the application of Rectified Linear Unit (ReLU) activation function [76]. This function, which is able to reduce the vanishing gradient, is expressed as follows:
r ( x ) = { | x   i f   x > 0 | 0   i f   x 0 = max ( 0 , x )
where x is the input signal transmitted to neuron, while r is the ReLU function.
The derivate associated to the ReLU function, which are required by the back-propagation algorithm, can be calculated using the following relation:
r ( x ) = { | 1 ,   x > 0 | 0 ,   x 0
It should be remarked that the cross-entropy function is also involved in the training procedure because it helps the DLNN to achieve a higher degree of accuracy [77]. The cross-entropy is mathematically described using the next equation:
E = 1 N n = 1 N M ln ( P ) + ( 1 M ) ln ( 1 P )  
where N is the total number of records in training sample; M is the predictor values, while P is the predicted values.
The adaptive momentum (Adam) prediction model, implied in the stochastic optimization process, is used to complete the training process of DLNN. Through the Adam model, the first and second moments could be computed via the exponential moving averages highlighted through the next relations [78]:
m t = β 1 m t 1 + ( 1 β 1 ) g t
v t = β 2 v t 1 + ( 1 β 2 ) g t 2
where m and v are the values of the moving averages, g represents current mini-batch gradient, β is new hyper-parameters computed via the algorithm.
In order to apply the DLNN-FR and DLNN-WOE ensembles, the specific lines of code were written in R programming language. More specifically, the Keras and Lime package from R Studio were used in this regard.

4.5. Alternating Decision Tree

Alternating Decision Tree (ADT) model is an ensemble of the decision tree and boosting method [79]. ADT structure has a lower complexity than decision tree models such as Rotation Forest, Classification and Regression Tree or Random Forest [80]. ADT model uses a natural extension of decision tree and voted stumps and is formed by prediction alternate layers and nodes of decision [81]. Within the ADT algorithm, the decision nodes will specify the predicate condition; meanwhile the prediction nodes will be characterized by a single number [80].
Let c1 be the value of a precondition, c2 the value of a base condition, and a and b the values of two real numbers; then a and b will be computed using the relations [82]:
a = 0.5 ln W + ( c 1 c 2 ) W ( c 1 c 2 ) ,   b = 0.5 ln W + ( c 1 c 2 ¯ ) W ( c 1 c 2 ¯ )
where W denotes the sum of the values from any prediction node, and the best c1 and c2 are estimated by minimizing the Zt (c1, c2), determined as follows:
z t ( c 1 c 2 ) = 2 W + ( c 1 c 2 ) W ( c 1 c 2 ) + W + ( c 1 c 2 ¯ ) W ( c 1 c 2 ¯ ) + W ( c 2 ¯ )
The ADT-FR and ADT-WOE ensembles were run and implemented in Weka software.

4.6. Model Performance and Results Validation

4.6.1. Statistical Measures

At the first stage, the models’ performance assessment will consist of the computation of the next statistical metrics: specificity, sensitivity, accuracy, kappa index. The aforementioned indices will be computed using the next mathematical relations:
k = p o p e 1 p e
S e n s i t i v i t y = T P T P + F N
S p e c i f i c i t y = T N F P + T N
A c c u r a c y = T P + T N T P + F P + T N + F N
where TP (True Positive) and TN (True Negative) are the sum of points that will be correctly classified, FP (False Positive) and FN (False Negative) are the sum of points erroneously classified; k is kappa coefficient, po is the sum of initially established torrential pixels, and pe is the sum of predicted torrential pixels.

4.6.2. ROC Curve

The second stage of results validation implied the application of the ROC curve and Area Under Curve (AUC) to measure the model performance. An AUC closer to 1 will highlight a performant model, while the values near to 0 will indicate a weak prediction ability of the models [83,84]. The Success Rate will represent a first form of ROC curve which will be constructed with the training samples, while the Prediction Rate is the second variant of ROC curve which will be designed with the help of validation sample. The AUC values will be determined using the next formula:
A U C = ( T P +   T N ) ( P + N )
where P is the sum of points having torrential phenomena and N is the sum of non-torrential points.

5. Results

5.1. Feature Selection Using LSVM

According to the results achieved through Weka software, the application of LSVM provided the next scores: slope (0.659), profile curvature (0.476), land use (0.429), tpi (0.394), twi (0.362), convergence index (0.338), hydrological soil group (0.283), spi (0.253), lithology (0.231) and aspect (0.162) (Figure 5).

5.2. FR and WOE Coefficients

The values of FR and WOE coefficients are inserted in Table 1. The largest value of FR coefficients (7.295) was achieved by TWI class between 14.6 and 24.6, followed by slope class between 15 and 25° (3.925), SPI values lower than 50 (3.205), built-up areas land use category (2.715) and TPI class between −1 and 1.3 (1.695) (Figure 6). In terms of WOE weights, the highest score was assigned to built-up areas land use category (3.96), followed by TWI class between 14.6 and 24.6 (2.67), slope class between 15 and 25° (2.48), SPI values lower than 50 (1.88) and TPI class between −1 and 1.3 (1.39).
In order to be used as input in ADT and DLNN models, the FR and WOE values were normalized between 0 and 1.

5.3. Models Performance Assessment

The configuration, in terms of the hardware and software environments, that was required for the computational modelling, is presented in Table 2.
It is mandatory that before the final mapping of flash-flood potential, the model’s performance must be evaluated in order to verify its reliability in the methodological process. Thus, in terms of the training dataset, the DLNN-WOE ensemble achieved the highest accuracy (0.985), followed by DLNN-FR (0.982), ADT-FR (0.923) and ADT-WOE (0.92). In terms of the validating sample, the highest accuracy was achieved by DLNN-WOE (0.92), followed by DLNN-FR (0.903), ADT-WOE (0.896) and ADT-FR (0.878) (Table 3).

5.4. Results of Machine Learning Ensembles

5.4.1. DLNN-FR and DLNN-WOE Results

The DLNN based ensembles were trained by establishing the maximum number of epochs to 100 (Figure 7).
Figure 7 highlights the variability of loss and model accuracy according to the epochs number and also for both training and validating samples. Particularly, in the case of the DLNN-FR model, the best performances were achieved with the following model parameters: a number of two hidden layers; a maximum number of 100 hidden neurons in each hidden layer; a dropout rate of 0.3; a batch size value of 5 and a validation split of 0.3. The same number of hidden layers and neurons was used also in the case of DLNN-WOE, while the other parameters have the following value: a dropout rate of 0.4; a batch size of 4 and a validation rate of 0.2. The architecture of the DLNN-based ensembles are represented in Figure 8.
The next step in the flash-flood susceptibility computation process is the derivation of the flash-flood predictor’s importance. In terms of DLNN-FR, the highest importance was assigned to slope factor (0.2). On the second-place rank, land use (0.143), followed by profile curvature (0.12), TWI (0.109), hydrological soil group (0.097), lithology (0.094), TPI (0.08), SPI (0.067), convergence index (0.061) and aspect (0.029) (Figure 9). The application of DLNN-WOE revealed that the most important factor was slope (0.235), and is followed by land use (0.149), SPI (0.089), hydrological soil group (0.086), TPI (0.086), TWI (0.082), lithology (0.074), convergence index (0.072), profile curvature (0.064) and aspect (0.063).
The weights of flash-flood predictors were used in ArcGIS map algebra in order to derive the flash-flood potential index values. All the Flash-Flood Potential Index (FFPI) results, with values between 0 and 1, were reclassified in five classes using Natural Breaks method. In terms FFPIDLNN-FR, the very low flash-flood potential values cover around 7.5% of the study area and range between 0 and 0.42 (Figure 10a). The low flash-flood potential appears on around 15.6% of Bâsca Chiojdului river catchment and has values ranging from 0.43 and 0.55. It should be remarked that these values are mainly spread on the southern half of the area. The medium flash-flood potential has a span of 30.28% of the entire territory (Figure 11) and is characterized by FFPIDLNN-FR between 0.56 and 0.66. These values are uniformly distributed across the study zone. The high and very high flash-flood potential appears on areas with FFPIDLNN-FR higher than 0.67 and covers approximately 46.57% of the research area. This potential degree is mainly present in the northern half of Bâsca Chiojdului river basin.
In terms of FFPIDLNN-WOE, the very low flash-flood potential is characteristic for a percentage of 7% from the entire study perimeter, while the low values of the same indicator cover an area of 13.41% of the total territory. Ranging from 0.59 to 0.68 (Figure 10b), the medium flash-flood potential spans accross approximately 28.76% of the Bâsca Chiojdului river catchment. High and very high flash-flood susceptibility has values of FFPIDLNN-WOE higher than 0.69 and is spread over more than 50% of the research zone. It should be noted that the areas delineated through DLNN-WOE have a lower degree of fragmentation than the areas delineated by DLNN-FR.

5.4.2. ADT-FR and ADT-WOE Results

A trial procedure was applied in order to determine the best parameter associated with the highest accuracy of ADT-FR and ADT-WOE for both training and validating samples. Thus, in terms of ADT-FR, the highest accuracies (0.923 for training and 0.878 for validating) were achieved after 23 iterations, while in terms of ADT-WOE the best accuracies (0.92 for training and 0.896 for validating) were determined after a number of 28 iterations (Table 4). Once the best parameters were determined, the optimally pruned decision trees were constructed (Figure 12a,b) and the flash-flood predictors importance were calculated.
Therefore, in terms of ADT-FR, the highest importance was assigned to slope factor (0.191). On the second-place rank land use (0.134), followed by hydrological soil group (0.131), lithology (0.125), profile curvature (0.108), convergence index (0.102), TWI (0.091), SPI (0.07), TPI (0.034) and aspect (0.013) (Figure 9). The application of ADT-WOE revealed that the most important factor was slope (0.198), and is followed by land use (0.156), hydrological soil group (0.123), lithology (0.117), profile curvature (0.096), TWI (0.086), convergence index (0.075), SPI (0.066), aspect (0.051) and TWI (0.032).
As in the case of the previous two ensembles, the FFPIADT-FR and FFPIADT-WOE were calculated. In terms FFPIADT-FR, the very low flash-flood potential spans around 3.26% of the study area and has values between 0 and 0.39 (Figure 10c). The low flash-flood potential is distributed on around 11.74% of the Bâsca Chiojdului river catchment and has values ranging from 0.4 to 0.58. The medium flash-flood potential spans 25.63% of the entire territory and has values between 0.59 and 0.7 (Figure 10c). The high and very high flash-flood potentials appear on areas with FFPIADT-FR higher than 0.71 and cover approximately 59.38% of the research area. In terms of FFPIADT-WOE, the very high flash-flood potential covers 6.64% of the entire study perimeter, while the low values are spread over 13.87% of the total territory. With values from 0.59 to 0.69 (Figure 10d), the medium flash-flood potential occurs over 30.34% of the Bâsca Chiojdului river catchment. The high and very high flash-flood potential indices have values higher than 0.7 and account for almost 50% of the study zone.

5.5. Results Validation Using ROC Curve

The validation of the FFPI results provided by each ensemble model was carried out using the ROC curve method. Thus, in the case of the Success Rate, the highest performance was achieved by FFPIDLNN-WOE with an AUC of 0.96, being followed by FFPIDLNN-FR (AUC = 0.942), FFPIADT-WOE (AUC = 0.94) and FFPIADT-FR (AUC = 0.919) (Figure 13a). If we analyze the Prediction Rate outcomes, it can be seen that the same FFPIDLNN-WOE indicator achieved the highest performance (AUC = 0.921), followed by FFPIDLNN-FR (0.92), FFPIADT-WOE (0.909) and FFPIADT-FR (AUC = 0.879).

6. Discussions

With the undeniable advancement of technology, there are more and more possibilities to monitor the dangerous phenomena that occur on the Earth’s surface. In this regard, it is worth remembering the rapid advance of observation techniques of the terrestrial surface by means of remote sensing sensors—with the help of which, the surfaces affected by natural hazards can be observed.
Thus, the present paper used images taken with the help of these sensors to identify the areas already affected by the torrential runoff from the Earth’s surface. It should be mentioned that the most accurate identification of these areas is essential in obtaining results with high accuracy and which can be further used by the competent authorities in risk assessment and in adopting the most appropriate measures to reduce future damage caused by these hazards. Thus, by analyzing the images provided by remote sensing sensors, on the river basin of the river Bâsca Chiojdului, areas affected by torrential runoff totaling a total area of 34 km2, representing about 10% of the entire study area, were identified. Furthermore, in order to capitalize on the delimited surfaces, a sample of about 481 was generated, taking a sample of points affected by torrential phenomena transposed into relief microforms such as ravines. In order to ensure the correctness of the modelling results, another sample of 481 points was generated from the areas where the torrential phenomena did not take place; the entire data set being then divided into training and validating data. The values of 10 flash-flood predictors were also used as input data. It should be noted that Remote Sensing sensors also played a crucial role in generating 8 of the 10 flash-flood predictors. Thus, all morphometric parameters were derived from the digital terrain model taken from the SRTM database, 30 m which was acquired using radar techniques. In addition, the land use, taken from the Corine Land Cover 2018 database, was generated by the supervised classification of the images provided by the Remote Sensing sensors.
Data on the presence of phenomena and the values of the main predictors of flash-flood genesis were included in two of the state-of-the-art machine learning models represented by Deep Learning Neural Networks and Alternating Decision Trees. These two models are recommended due to the very good results they provided following their application in previous studies on the estimation of susceptibility to natural hazards [79,80]. For a higher degree of results objectivity, it was decided to process the training sample by assigning some coefficients using the bivariate methods statistics, Frequency Ratio and Weights of Evidence. This method has proven to be very useful in previous studies [46,56] where the initial data were processed with bivariate statistics algorithms.
The combination of DLNN with WOE proved to be the most efficient because the accuracy achieved during the training process exceeded 98%, while ROC curve applied to the final product FFPIDLNN-WOE showed a maximum AUC of 0.96. This value of AUC exceeds the value obtained by Costache et al. [38], when, by applying the hybrid combination between Multilayer Perceptron (MLP) and Statistical Index, for the same study area and for the FFPI calculation, a maximum AUC value of 0.94 was obtained. These results confirm the findings from the literature according to which DLNN, whose architecture includes several hidden layers, is able to surpass the MLP performances whose architecture includes a single hidden layer [57]. Moreover, the MLP performance from the previous study was exceeded by the DLNN-FR ensemble model, characterized by an AUC of 0.942. Overall, in the Bâsca Chiojdului basin, the models showed a percentage of the high and very high flash-flood potential between 46.57% (DLNN-FR) and 59.38% (ADT-FR).

7. Conclusions

In light of the continuous increase in the flash-flood events’ frequency, the present research work proposed a workflow through which the areas susceptible to flash floods are identified based on remote sensing and GIS data included in Deep Learning and Alternating Decision Trees ensembles. Thus, using 418 torrential and 481 non-torrential locations along with 10 flash-flood predictors, the Flash-Flood Potential Index was determined across the Bâsca Chiojdului river basin. Using as input data the FR and WOE coefficients, the FFPI was computed using the following four ensembles: DLNN-FR, DLNN-WOE, ADT-FR and ADT-WOE. As was expected, the slope angle and land use resulted in being the most important flash-flood predictors. The highest results accuracy was achieved by the DLNN-WOE model which is characterized by an AUC–ROC curve of 0.985. The percentage (59.38%) of high and very high FFPI classes was revealed by the application of ADT-FR ensemble.
The main novelty of this study is represented by the application for the first time in the literature of the four ensemble models for determining flash-flood potential index values.
This work is of real importance for the governmental authorities which can use the results in order to improve the measures taken to mitigate the negative effects of flash-flood hazards within the study area.

Author Contributions

Conceptualization, R.C., A.A. (Alireza Arabameri) and Q.B.P.; data curation, R.C., A.A. (Alireza Arabameri), Q.B.P., B.T.P., M.P. and A.A. (Aman Arora); methodology, R.C., A.A. (Alireza Arabameri), Q.B.P., B.T.P., M.P., A.A. (Aman Arora), N.T.T.L. and I.C.; writing—original draft. R.C., A.A. (Alireza Arabameri), Q.B.P., B.T.P., M.P. and A.A. (Aman Arora); writing—review and editing, R.C., A.A. (Alireza Arabameri), Q.B.P., B.T.P., M.P., A.A. (Aman Arora), T.B., N.T.T.L. and I.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partly funded by the Austrian Science Fund (FWF) through the Doctoral College GIScience (DK W 1237-N23) at the University of Salzburg.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data available on request.

Conflicts of Interest

There is no coflict of interest.

References

  1. Halkos, G.; Skouloudis, A. Investigating resilience barriers of small and medium-sized enterprises to flash floods: A quantile regression of determining factors. Clim. Dev. 2020, 12, 57–66. [Google Scholar] [CrossRef]
  2. Bezak, N.; Mikoš, M. Investigation of Trends, Temporal Changes in Intensity-Duration-Frequency (IDF) Curves and Extreme Rainfall Events Clustering at Regional Scale Using 5 min Rainfall Data. Water 2019, 11, 2167. [Google Scholar] [CrossRef] [Green Version]
  3. Bui, D.T.; Tsangaratos, P.; Ngo, P.-T.T.; Pham, T.D.; Pham, B.T. Flash flood susceptibility modeling using an optimized fuzzy rule based feature selection technique and tree based ensemble methods. Sci. Total Environ. 2019, 668, 1038–1054. [Google Scholar] [CrossRef] [PubMed]
  4. Cao, C.; Xu, P.; Wang, Y.; Chen, J.; Zheng, L.; Niu, C. Flash flood hazard susceptibility mapping using frequency ratio and statistical index methods in coalmine subsidence areas. Sustainability 2016, 8, 948. [Google Scholar] [CrossRef] [Green Version]
  5. Costache, R. Flash-Flood Potential assessment in the upper and middle sector of Prahova river catchment (Romania). A comparative approach between four hybrid models. Sci. Total Environ. 2019, 659, 1115–1134. [Google Scholar] [CrossRef]
  6. Elkhrachy, I. Flash flood hazard mapping using satellite images and GIS tools: A case study of Najran City, Kingdom of Saudi Arabia (KSA). Egypt. J. Remote Sens. Space Sci. 2015, 18, 261–278. [Google Scholar] [CrossRef] [Green Version]
  7. Costache, R.; Bui, D.T. Identification of areas prone to flash-flood phenomena using multiple-criteria decision-making, bivariate statistics, machine learning and their ensembles. Sci. Total Environ. 2020, 712, 136492. [Google Scholar] [CrossRef]
  8. Janizadeh, S.; Avand, M.; Jaafari, A.; Phong, T.V.; Bayat, M.; Ahmadisharaf, E.; Prakash, I.; Pham, B.T.; Lee, S. Prediction Success of Machine Learning Methods for Flash Flood Susceptibility Mapping in the Tafresh Watershed, Iran. Sustainability 2019, 11, 5426. [Google Scholar] [CrossRef] [Green Version]
  9. Hosseini, F.S.; Choubin, B.; Mosavi, A.; Nabipour, N.; Shamshirband, S.; Darabi, H.; Haghighi, A.T. Flash-flood hazard assessment using ensembles and Bayesian-based machine learning models: Application of the simulated annealing feature selection method. Sci. Total Environ. 2020, 711, 135161. [Google Scholar] [CrossRef]
  10. Liu, Y.-X.; Yang, C.-N.; Sun, Q.-D.; Wu, S.-Y.; Lin, S.-S.; Chou, Y.-S. Enhanced embedding capacity for the SMSD-based data-hiding method. Signal Process. Image Commun. 2019, 78, 216–222. [Google Scholar] [CrossRef]
  11. Zhao, C.; Li, J. Equilibrium Selection under the Bayes-Based Strategy Updating Rules. Symmetry 2020, 12, 739. [Google Scholar] [CrossRef]
  12. Xiong, Q.; Zhang, X.; Wang, W.-F.; Gu, Y. A Parallel Algorithm Framework for Feature Extraction of EEG Signals on MPI. Comput. Math. Methods Med. 2020, 2020, 9812019. [Google Scholar] [CrossRef] [PubMed]
  13. Zhu, Q. Research on Road Traffic Situation Awareness System Based on Image Big Data. IEEE Intell. Syst. 2019, 35, 18–26. [Google Scholar] [CrossRef]
  14. Fu, X.; Yang, Y. Modeling and analysis of cascading node-link failures in multi-sink wireless sensor networks. Reliab. Eng. Syst. Saf. 2020, 197, 106815. [Google Scholar] [CrossRef]
  15. Fu, X.; Pace, P.; Aloi, G.; Yang, L.; Fortino, G. Topology Optimization Against Cascading Failures on Wireless Sensor Networks Using a Memetic Algorithm. Comput. Netw. 2020, 177, 107327. [Google Scholar] [CrossRef]
  16. Zenggang, X.; Zhiwen, T.; Xiaowen, C.; Xue-min, Z.; Kaibin, Z.; Conghuan, Y. Research on Image Retrieval Algorithm Based on Combination of Color and Shape Features. J. Signal Process. Syst. 2019, 1–8. [Google Scholar] [CrossRef]
  17. Zuo, C.; Sun, J.; Li, J.; Asundi, A.; Chen, Q. Wide-field high-resolution 3d microscopy with fourier ptychographic diffraction tomography. Opt. Lasers Eng. 2020, 128, 106003. [Google Scholar] [CrossRef] [Green Version]
  18. Long, Q.; Wu, C.; Wang, X. A system of nonsmooth equations solver based upon subgradient method. Appl. Math. Comput. 2015, 251, 284–299. [Google Scholar] [CrossRef]
  19. Zhu, J.; Shi, Q.; Wu, P.; Sheng, Z.; Wang, X. Complexity analysis of prefabrication contractors’ dynamic price competition in mega projects with different competition strategies. Complexity 2018, 2018, 5928235. [Google Scholar] [CrossRef]
  20. Xiong, L.; Zhang, H.; Li, Y.; Liu, Z. Improved stability and H∞ performance for neutral systems with uncertain Markovian jump. Nonlinear Anal. Hybrid Systems 2016, 19, 13–25. [Google Scholar]
  21. Wu, T.; Cao, J.; Xiong, L.; Zhang, H. New Stabilization Results for Semi-Markov Chaotic Systems with Fuzzy Sampled-Data Control. Complexity 2019, 2019, 7875305. [Google Scholar] [CrossRef]
  22. Wu, T.; Xiong, L.; Cheng, J.; Xie, X. New results on stabilization analysis for fuzzy semi-Markov jump chaotic systems with state quantized sampled-data controller. Inf. Sci. 2020, 521, 231–250. [Google Scholar] [CrossRef]
  23. Shi, K.; Wang, J.; Tang, Y.; Zhong, S. Reliable asynchronous sampled-data filtering of T–S fuzzy uncertain delayed neural networks with stochastic switched topologies. Fuzzy Sets Syst. 2020, 381, 1–25. [Google Scholar] [CrossRef]
  24. Shi, K.; Wang, J.; Zhong, S.; Tang, Y.; Cheng, J. Non-fragile memory filtering of TS fuzzy delayed neural networks based on switched fuzzy sampled-data control. Fuzzy Sets Syst. 2020, 394, 40–64. [Google Scholar] [CrossRef]
  25. Xu, M.; Li, T.; Wang, Z.; Deng, X.; Yang, R.; Guan, Z. Reducing complexity of HEVC: A deep learning approach. IEEE Trans. Image Process. 2018, 27, 5044–5059. [Google Scholar] [CrossRef] [Green Version]
  26. Lv, Z.; Qiao, L. Deep belief network and linear perceptron based cognitive computing for collaborative robots. Appl. Soft Comput. 2020, 92, 106300. [Google Scholar] [CrossRef]
  27. Lv, Z.; Xiu, W. Interaction of edge-cloud computing based on SDN and NFV for next generation IoT. IEEE Internet Things J. 2019, 7, 5706–5712. [Google Scholar] [CrossRef]
  28. Chen, H.; Chen, A.; Xu, L.; Xie, H.; Qiao, H.; Lin, Q.; Cai, K. A deep learning CNN architecture applied in smart near-infrared analysis of water pollution for agricultural irrigation resources. Agric. Water Manag. 2020, 240, 106303. [Google Scholar] [CrossRef]
  29. Chen, H.; Qiao, H.; Xu, L.; Feng, Q.; Cai, K. A Fuzzy Optimization Strategy for the Implementation of RBF LSSVR Model in Vis–NIR Analysis of Pomelo Maturity. IEEE Trans. Ind. Inform. 2019, 15, 5971–5979. [Google Scholar] [CrossRef]
  30. Qian, J.; Feng, S.; Tao, T.; Hu, Y.; Li, Y.; Chen, Q.; Zuo, C. Deep-learning-enabled geometric constraints and phase unwrapping for single-shot absolute 3D shape measurement. APL Photonics 2020, 5, 046105. [Google Scholar] [CrossRef]
  31. Qian, J.; Feng, S.; Li, Y.; Tao, T.; Han, J.; Chen, Q.; Zuo, C. Single-shot absolute 3D shape measurement with deep-learning-based color fringe projection profilometry. Opt. Lett. 2020, 45, 1842–1845. [Google Scholar] [CrossRef] [PubMed]
  32. Chao, L.; Zhang, K.; Li, Z.; Zhu, Y.; Wang, J.; Yu, Z. Geographically weighted regression based methods for merging satellite and gauge precipitation. J. Hydrol. 2018, 558, 275–289. [Google Scholar] [CrossRef]
  33. Zhang, S.; Pak, R.Y.; Zhang, J. Vertical time-harmonic coupling vibration of an impermeable, rigid, circular plate resting on a finite, poroelastic soil layer. Acta Geotech. 2020, 1–25. [Google Scholar]
  34. Yang, S.; Deng, B.; Wang, J.; Li, H.; Lu, M.; Che, Y.; Wei, X.; Loparo, K.A. Scalable Digital Neuromorphic Architecture for Large-Scale Biophysically Meaningful Neural Network with Multi-Compartment Neurons. IEEE Trans. Neural Netw. Learn. Syst. 2019, 1–15. [Google Scholar] [CrossRef] [PubMed]
  35. Tsai, Y.H.; Wang, J.; Chien, W.T.; Wei, C.Y.; Wang, X.; Hsieh, S.H. A BIM-based approach for predicting corrosion under insulation. Autom. Constr. 2019, 107, 102923. [Google Scholar] [CrossRef]
  36. Costache, R.; Zaharia, L. Flash-flood potential assessment and mapping by integrating the weights-of-evidence and frequency ratio statistical methods in GIS environment–case study: Bâsca Chiojdului River catchment (Romania). J. Earth Syst. Sci. 2017, 126, 59. [Google Scholar] [CrossRef]
  37. Khosravi, K.; Nohani, E.; Maroufinia, E.; Pourghasemi, H.R. A GIS-based flood susceptibility assessment and its mapping in Iran: A comparison between frequency ratio and weights-of-evidence bivariate statistical models with multi-criteria decision-making technique. Nat. Hazards 2016, 83, 947–987. [Google Scholar] [CrossRef]
  38. Costache, R.; Hong, H.; Pham, Q.B. Comparative assessment of the flash-flood potential within small mountain catchments using bivariate statistics and their novel hybrid integration with machine learning models. Sci. Total Environ. 2020, 711, 134514. [Google Scholar] [CrossRef]
  39. Tien Bui, D.; Khosravi, K.; Shahabi, H.; Daggupati, P.; Adamowski, J.F.; Melesse, A.M.; Thai Pham, B.; Pourghasemi, H.R.; Mahmoudi, M.; Bahrami, S. Flood spatial modeling in northern Iran using remote sensing and gis: A comparison between evidential belief functions and its ensemble with a multivariate logistic regression model. Remote Sens. 2019, 11, 1589. [Google Scholar] [CrossRef] [Green Version]
  40. Razandi, Y.; Pourghasemi, H.R.; Neisani, N.S.; Rahmati, O. Application of analytical hierarchy process, frequency ratio, and certainty factor models for groundwater potential mapping using GIS. Earth Sci. Inform. 2015, 8, 867–883. [Google Scholar] [CrossRef]
  41. Siahkamari, S.; Haghizadeh, A.; Zeinivand, H.; Tahmasebipour, N.; Rahmati, O. Spatial prediction of flood-susceptible areas using frequency ratio and maximum entropy models. Geocarto Int. 2018, 33, 927–941. [Google Scholar] [CrossRef]
  42. Yang, W.; Xu, K.; Lian, J.; Ma, C.; Bin, L. Integrated flood vulnerability assessment approach based on TOPSIS and Shannon entropy methods. Ecol. Indic. 2018, 89, 269–280. [Google Scholar] [CrossRef]
  43. Razavi Termeh, S.V.; Pourghasemi, H.R.; Alidadganfard, F. Flood Inundation Susceptibility Mapping using Analytical Hierarchy Process (AHP) and TOPSIS Decision Making Methods and Weight of Evidence Statistical Model (Case Study: Jahrom Township, Fars Province). J. Watershed Manag. Res. 2018, 9, 67–81. [Google Scholar] [CrossRef]
  44. Dano, U.L.; Balogun, A.-L.; Matori, A.-N.; Wan Yusouf, K.; Abubakar, I.R.; Said Mohamed, M.A.; Aina, Y.A.; Pradhan, B. Flood susceptibility mapping using GIS-based analytic network process: A case study of Perlis, Malaysia. Water 2019, 11, 615. [Google Scholar] [CrossRef] [Green Version]
  45. Khosravi, K.; Shahabi, H.; Pham, B.T.; Adamowski, J.; Shirzadi, A.; Pradhan, B.; Dou, J.; Ly, H.-B.; Gróf, G.; Ho, H.L. A comparative assessment of flood susceptibility modeling using Multi-Criteria Decision-Making Analysis and Machine Learning Methods. J. Hydrol. 2019, 573, 311–323. [Google Scholar] [CrossRef]
  46. Ali, S.A.; Parvin, F.; Pham, Q.B.; Vojtek, M.; Vojteková, J.; Costache, R.; Linh, N.T.T.; Nguyen, H.Q.; Ahmad, A.; Ghorbani, M.A. GIS-based comparative assessment of flood susceptibility mapping using hybrid multi-criteria decision-making approach, naïve Bayes tree, bivariate statistics and logistic regression: A case of Topľa basin, Slovakia. Ecol. Indic. 2020, 117, 106620. [Google Scholar] [CrossRef]
  47. Chen, W.; Li, Y.; Xue, W.; Shahabi, H.; Li, S.; Hong, H.; Wang, X.; Bian, H.; Zhang, S.; Pradhan, B. Modeling flood susceptibility using data-driven approaches of naïve bayes tree, alternating decision tree, and random forest methods. Sci. Total Environ. 2020, 701, 134979. [Google Scholar] [CrossRef]
  48. Chapi, K.; Singh, V.P.; Shirzadi, A.; Shahabi, H.; Bui, D.T.; Pham, B.T.; Khosravi, K. A novel hybrid artificial intelligence approach for flood susceptibility assessment. Environ. Model. Softw. 2017, 95, 229–245. [Google Scholar] [CrossRef]
  49. Avand, M.; Janizadeh, S.; Naghibi, S.A.; Pourghasemi, H.R.; Khosrobeigi Bozchaloei, S.; Blaschke, T. A Comparative Assessment of Random Forest and k-Nearest Neighbor Classifiers for Gully Erosion Susceptibility Mapping. Water 2019, 11, 2076. [Google Scholar] [CrossRef] [Green Version]
  50. Pham, B.T.; Prakash, I.; Bui, D.T. Spatial prediction of landslides using a hybrid machine learning approach based on random subspace and classification and regression trees. Geomorphology 2018, 303, 256–270. [Google Scholar] [CrossRef]
  51. Choubin, B.; Moradi, E.; Golshan, M.; Adamowski, J.; Sajedi-Hosseini, F.; Mosavi, A. An Ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. Sci. Total Environ. 2019, 651, 2087–2096. [Google Scholar] [CrossRef] [PubMed]
  52. Wang, Y.; Hong, H.; Chen, W.; Li, S.; Panahi, M.; Khosravi, K.; Shirzadi, A.; Shahabi, H.; Panahi, S.; Costache, R. Flood susceptibility mapping in Dingnan County (China) using adaptive neuro-fuzzy inference system with biogeography based optimization and imperialistic competitive algorithm. J. Environ. Manag. 2019, 247, 712–729. [Google Scholar] [CrossRef] [PubMed]
  53. Costache, R.; Pham, Q.B.; Sharifi, E.; Linh, N.T.T.; Abba, S.; Vojtek, M.; Vojteková, J.; Nhi, P.T.T.; Khoi, D.N. Flash-Flood Susceptibility Assessment Using Multi-Criteria Decision Making and Machine Learning Supported by Remote Sensing and GIS Techniques. Remote Sens. 2020, 12, 106. [Google Scholar] [CrossRef] [Green Version]
  54. Bui, D.T.; Hoang, N.-D.; Martínez-Álvarez, F.; Ngo, P.-T.T.; Hoa, P.V.; Pham, T.D.; Samui, P.; Costache, R. A novel deep learning neural network approach for predicting flash flood susceptibility: A case study at a high frequency tropical storm area. Sci. Total Environ. 2020, 701, 134413. [Google Scholar]
  55. Arabameri, A.; Saha, S.; Chen, W.; Roy, J.; Pradhan, B.; Bui, D.T. Flash flood susceptibility modelling using functional tree and hybrid ensemble techniques. J. Hydrol. 2020, 587, 125007. [Google Scholar] [CrossRef]
  56. Tehrany, M.S.; Pradhan, B.; Jebur, M.N. Flood susceptibility mapping using a novel ensemble weights-of-evidence and support vector machine models in GIS. J. Hydrol. 2014, 512, 332–343. [Google Scholar] [CrossRef]
  57. Prăvălie, R.; Costache, R. The analysis of the susceptibility of the flash-floods’ genesis in the area of the hydrographical basin of Bāsca Chiojdului river/Analiza susceptibilitatii genezei viiturilor īn aria bazinului hidrografic al rāului Bāsca Chiojdului. Forum Geogr. 2014, 13, 39–49. [Google Scholar] [CrossRef]
  58. Zarea, R.; Gheorghe, M. Dangerous hydrological phenomena on the Hydrographic Basin Bâsca Chiojdului. In Buletinul Institutului Politehnic Din Iaşi; Universitatea Tehnică “Gheorghe Asachi” din Iaşi Tomul LVI (LX), Fasc: Iași, Romania, 2010; pp. 37–48. [Google Scholar]
  59. Prăvălie, R.; Costache, R. Assessment of socioeconomic vulnerability to floods in the Bâsca Chiojdului catchment area. Rom. Rev. Reg. Stud. 2014, 10, 2. [Google Scholar]
  60. Chen, W.; Li, W.; Chai, H.; Hou, E.; Li, X.; Ding, X. GIS-based landslide susceptibility mapping using analytical hierarchy process (AHP) and certainty factor (CF) models for the Baozhong region of Baoji City, China. Environ. Earth Sci. 2016, 75, 63. [Google Scholar] [CrossRef]
  61. Costache, R. Flash-flood Potential Index mapping using weights of evidence, decision Trees models and their novel hybrid integration. Stoch. Environ. Res. Risk Assess. 2019, 33, 1375–1402. [Google Scholar] [CrossRef]
  62. Skentos, A. Topographic Position Index based landform analysis of Messaria (Ikaria Island, Greece). Acta Geobalcanica 2018, 4, 7–15. [Google Scholar] [CrossRef]
  63. Bui, D.T.; Pradhan, B.; Revhaug, I.; Tran, C.T. A comparative assessment between the application of fuzzy unordered rules induction algorithm and J48 decision tree models in spatial prediction of shallow landslides at Lang Son City, Vietnam. In Remote Sensing Applications in Environmental Research; Springer: Berlin/Heidelberg, Germany, 2014; pp. 87–111. [Google Scholar]
  64. De Rosa, P.; Fredduzzi, A.; Cencetti, C. Stream Power Determination in GIS: An Index to Evaluate the Most’Sensitive’Points of a River. Water 2019, 11, 1145. [Google Scholar] [CrossRef] [Green Version]
  65. Corrao, M.V.; Link, T.E.; Heinse, R.; Eitel, J.U. Modeling of terracette-hillslope soil moisture as a function of aspect, slope and vegetation in a semi-arid environment. Earth Surf. Process. Landf. 2017, 42, 1560–1572. [Google Scholar] [CrossRef]
  66. Zhang, K.; Ruben, G.B.; Li, X.; Li, Z.; Yu, Z.; Xia, J.; Dong, Z. A comprehensive assessment framework for quantifying climatic and anthropogenic contributions to streamflow changes: A case study in a typical semi-arid North China basin. Environ. Model. Softw. 2020, 104704. [Google Scholar] [CrossRef]
  67. Zhang, K.; Wang, Q.; Chao, L.; Ye, J.; Li, Z.; Yu, Z.; Yang, T.; Ju, Q. Ground Observation-based Analysis of Soil Moisture Spatiotemporal Variability Across A Humid to Semi-Humid Transitional Zone in China. J. Hydrol. 2019, 574, 903–914. [Google Scholar] [CrossRef]
  68. Singh, C.; Walia, E.; Kaur, K.P. Enhancing color image retrieval performance with feature fusion and non-linear support vector machine classifier. Optik 2018, 158, 127–141. [Google Scholar] [CrossRef]
  69. Lin, S.-W.; Lee, Z.-J.; Chen, S.-C.; Tseng, T.-Y. Parameter determination of support vector machine and feature selection using simulated annealing approach. Appl. Soft Comput. 2008, 8, 1505–1512. [Google Scholar] [CrossRef]
  70. Lee, S.; Kim, Y.-S.; Oh, H.-J. Application of a weights-of-evidence method and GIS to regional groundwater productivity potential mapping. J. Environ. Manag. 2012, 96, 91–105. [Google Scholar] [CrossRef]
  71. Van Westen, C. Statistical Landslide Hazards Analysis, ILWIS 2.1 for Windows Application Guide; International Institute for Aerospace Survey and Earth Sciences (ITC) Publication: Enschede, The Netherlands, 1997. [Google Scholar]
  72. Lee, S.; Pradhan, B. Landslide hazard mapping at Selangor, Malaysia using frequency ratio and logistic regression models. Landslides 2007, 4, 33–41. [Google Scholar] [CrossRef]
  73. Nielsen, M.A. Neural Networks and Deep Learning; Determination Press: San Francisco, CA, USA, 2015; Volume 25. [Google Scholar]
  74. Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw. 2015, 61, 85–117. [Google Scholar] [CrossRef] [Green Version]
  75. Agarap, A.F. Deep learning using rectified linear units (relu). arXiv 2018, arXiv:180308375. [Google Scholar]
  76. Bui, Q.-T.; Nguyen, Q.-H.; Nguyen, X.L.; Pham, V.D.; Nguyen, H.D.; Pham, V.-M. Verification of novel integrations of swarm intelligence algorithms into deep learning neural network for flood susceptibility mapping. J. Hydrol. 2020, 581, 124379. [Google Scholar] [CrossRef]
  77. Huang, Z.; Li, J.; Weng, C.; Lee, C.-H. Beyond Cross-Entropy: Towards Better Frame-Level Objective Functions for Deep Neural Network Training in Automatic Speech Recognition. In Proceedings of the INTERSPEECH—15th Annual Conference of the International Speech Communication Association, Singapore, 14–18 September 2014. [Google Scholar]
  78. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016; ISBN 0-262-33737-1. [Google Scholar]
  79. Wu, Y.; Ke, Y.; Chen, Z.; Liang, S.; Zhao, H.; Hong, H. Application of alternating decision tree with AdaBoost and bagging ensembles for landslide susceptibility mapping. Catena 2020, 187, 104396. [Google Scholar] [CrossRef]
  80. Hong, H.; Pradhan, B.; Xu, C.; Bui, D.T. Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines. Catena 2015, 133, 266–281. [Google Scholar] [CrossRef]
  81. Freund, Y.; Mason, L. The Alternating Decision Tree Learning Algorithm. ICML 1999, 99, 124–133. [Google Scholar]
  82. Khosravi, K.; Pham, B.T.; Chapi, K.; Shirzadi, A.; Shahabi, H.; Revhaug, I.; Prakash, I.; Bui, D.T. A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. Sci. Total Environ. 2018, 627, 744–755. [Google Scholar] [CrossRef] [PubMed]
  83. Sahana, M.; Pham, B.T.; Shukla, M.; Costache, R.; Thu, D.X.; Chakrabortty, R.; Satyam, N.; Nguyen, H.D.; Phong, T.V.; Le, H.V. Rainfall Induced Landslide Susceptibility Mapping Using Novel Hybrid Soft Computing Methods Based on Multi-layer Perceptron Neural Network Classifier. Geocarto Int. 2020, 1–25. [Google Scholar] [CrossRef]
  84. Wang, S.; Zhang, K.; van Beek, L.P.; Tian, X.; Bogaard, T.A. Physically-based landslide prediction over a large region: Scaling low-resolution hydrological model results for high-resolution slope stability assessment. Environ. Model. Softw. 2020, 124, 104607. [Google Scholar] [CrossRef]
Figure 1. Study area location.
Figure 1. Study area location.
Sensors 21 00280 g001
Figure 2. Flash-flood predictors: (a) Slope; (b) Topographic Wetness Index (TWI); (c) Topographic Position Index (TPI); (d) Profile curvature; (e) Convergence index; (f) Stream Power Index (SPI).
Figure 2. Flash-flood predictors: (a) Slope; (b) Topographic Wetness Index (TWI); (c) Topographic Position Index (TPI); (d) Profile curvature; (e) Convergence index; (f) Stream Power Index (SPI).
Sensors 21 00280 g002
Figure 3. Flash-flood predictors: (a) Aspect; (b) Land use; (c) Hydrological soil groups; (d) Lithology.
Figure 3. Flash-flood predictors: (a) Aspect; (b) Land use; (c) Hydrological soil groups; (d) Lithology.
Sensors 21 00280 g003
Figure 4. Flowchart of the methodological steps applied in this research.
Figure 4. Flowchart of the methodological steps applied in this research.
Sensors 21 00280 g004
Figure 5. Linear Support Vector Machine (LSVM) scores assigned to flash-flood predictors.
Figure 5. Linear Support Vector Machine (LSVM) scores assigned to flash-flood predictors.
Sensors 21 00280 g005
Figure 6. Distribution of FR and WOE coefficients within the classes of flash-flood predictors.
Figure 6. Distribution of FR and WOE coefficients within the classes of flash-flood predictors.
Sensors 21 00280 g006
Figure 7. DLNN based ensemble running outputs (a) Training and Validating loss of DLNN-FR; (b) Training and Validating accuracy of DLNN-FR; (c) Training and Validating loss of DLNN-WOE; (d) Training and Validating accuracy of DLNN-WOE).
Figure 7. DLNN based ensemble running outputs (a) Training and Validating loss of DLNN-FR; (b) Training and Validating accuracy of DLNN-FR; (c) Training and Validating loss of DLNN-WOE; (d) Training and Validating accuracy of DLNN-WOE).
Sensors 21 00280 g007
Figure 8. Deep Learning Neural Network architecture.
Figure 8. Deep Learning Neural Network architecture.
Sensors 21 00280 g008
Figure 9. Flash-flood predictors importance.
Figure 9. Flash-flood predictors importance.
Sensors 21 00280 g009
Figure 10. Flash Flood Potential Index (a) DLNN-FR; (b) DLNN-WOE; (c) ADT-FR; (d) ADT-WOE.
Figure 10. Flash Flood Potential Index (a) DLNN-FR; (b) DLNN-WOE; (c) ADT-FR; (d) ADT-WOE.
Sensors 21 00280 g010
Figure 11. Flash-Flood Potential Index (FFPI) classes weights.
Figure 11. Flash-Flood Potential Index (FFPI) classes weights.
Sensors 21 00280 g011
Figure 12. Optimally pruned Decision Tree Structure for ADT based ensembles ((a) ADT-FR and (b) ADT-WOE ensembles).
Figure 12. Optimally pruned Decision Tree Structure for ADT based ensembles ((a) ADT-FR and (b) ADT-WOE ensembles).
Sensors 21 00280 g012
Figure 13. Receiver Operating Characteristic (ROC) curve (a) Success Rate; (b) Prediction Rate.
Figure 13. Receiver Operating Characteristic (ROC) curve (a) Success Rate; (b) Prediction Rate.
Sensors 21 00280 g013
Table 1. FR and WOE coefficients.
Table 1. FR and WOE coefficients.
FactorClassFRFR Standardized CoefficientsWoE CoefficientsWoE Standardized Coefficients
Slope<3°0.0000.0000.0000.307
3.1–7°0.1520.039−1.1000.000
7.1–15°0.2450.062−1.1000.000
15.1–25°3.9251.0002.4801.000
>25°1.1250.2871.7200.788
TPI(−20)–(−3.8)0.4350.257−1.7400.116
(−3.7)–(−1.1)1.4150.8350.1600.727
(−1)–1.31.6951.0001.0101.000
1.4–4.50.0000.000−2.1000.000
4.6–200.2450.145−2.1000.000
TWI3.2–6.11.0550.0300.1600.116
6.2–7.80.9750.0170.0100.063
7.9–10.21.0250.0250.0800.088
10.3–14.50.8650.000−0.1700.000
14.6–24.67.2951.0002.6701.000
Land useBuilt-up areas2.0350.7503.9601.000
Agriculture zone2.7151.0001.6100.642
Vineyards0.3650.134−2.1900.063
Fruit trees0.2450.0901.3900.608
Pastures0.6750.249−0.3000.351
Forests0.0000.000−2.6000.000
Transitional woodland-shrub0.9650.3550.2700.438
Water bodies1.4850.5470.8400.524
Lithology10.8950.7680.0000.745
20.0000.000−2.1000.000
30.0000.000−2.1000.000
41.1651.0000.4500.904
50.6650.571−0.3500.621
60.2450.210−2.1000.000
70.4350.373−1.2300.309
80.3550.305−1.7700.117
90.6350.545−0.4900.571
100.8150.700−0.0700.720
Profile curvature−3–00.7050.237−1.3700.299
0.1–0.91.6051.0000.9801.000
1–20.4250.000−2.3700.000
SPI<503.2051.0001.8801.000
50.1–5001.6150.3290.8700.498
500.1–20001.0250.0800.2700.199
2000.1–50000.8350.000−0.0600.035
>50000.9750.059−0.1300.000
AspectFlat surfaces3.6451.0001.5601.000
North1.5350.2620.6100.503
North-East1.0950.1080.1300.251
East1.2050.1470.2800.330
South-East1.6050.2870.6900.545
South1.1150.1150.1700.272
South-West0.7850.000−0.3500.000
West1.0150.0800.0400.204
North-East1.4050.2170.4900.440
Convergence index0.1–992.5151.0001.6501.000
−0.9–01.2850.3170.3600.469
−1.9–−10.9150.111−0.1100.276
−2.9–−20.7150.000−0.7800.000
−99–−30.9850.150−0.0400.305
HSGA1.3250.6970.3600.669
B1.5951.0000.8901.000
C0.7050.000−0.7100.000
D1.1050.4490.0900.500
Table 2. Hardware and software environmental configuration used for modelling.
Table 2. Hardware and software environmental configuration used for modelling.
ConfigurationParameter
CPUIntel(R) Core(TM) i7–[email protected] GHz
RAM16.0 GB DDR4
GPUNVIDIA GeForce MX330
Hard diskSSD 512 GB M.2 PCIe
Operating systemWindows 10 Pro
Table 3. Statistical metrics used to evaluate model’s performance.
Table 3. Statistical metrics used to evaluate model’s performance.
ModelsTPTNFPFNSensitivitySpecificityAccuracyk-Index
TrainingDLNN-FR330332750.9850.9790.9820.964
DLNN-WOE334330370.9790.9910.9850.970
ADT-FR31231025270.9200.9250.9230.846
ADT-WOE30931128260.9220.9170.9200.840
ValidatingDLNN-FR13212812160.8920.9140.9030.806
DLNN-WOE1371287160.8950.9480.9200.840
ADT-FR12912415200.8660.8920.8780.757
ADT-WOE13212612180.8800.9130.8960.792
Table 4. The optimal parameters of the ADT based ensembles.
Table 4. The optimal parameters of the ADT based ensembles.
ModelsNo. of IterationsSeedTraining AccuracyValidating Accuracy
ADT-FR2360.9230.878
ADT-WOE2880.9200.896
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Costache, R.; Arabameri, A.; Blaschke, T.; Pham, Q.B.; Pham, B.T.; Pandey, M.; Arora, A.; Linh, N.T.T.; Costache, I. Flash-Flood Potential Mapping Using Deep Learning, Alternating Decision Trees and Data Provided by Remote Sensing Sensors. Sensors 2021, 21, 280. https://doi.org/10.3390/s21010280

AMA Style

Costache R, Arabameri A, Blaschke T, Pham QB, Pham BT, Pandey M, Arora A, Linh NTT, Costache I. Flash-Flood Potential Mapping Using Deep Learning, Alternating Decision Trees and Data Provided by Remote Sensing Sensors. Sensors. 2021; 21(1):280. https://doi.org/10.3390/s21010280

Chicago/Turabian Style

Costache, Romulus, Alireza Arabameri, Thomas Blaschke, Quoc Bao Pham, Binh Thai Pham, Manish Pandey, Aman Arora, Nguyen Thi Thuy Linh, and Iulia Costache. 2021. "Flash-Flood Potential Mapping Using Deep Learning, Alternating Decision Trees and Data Provided by Remote Sensing Sensors" Sensors 21, no. 1: 280. https://doi.org/10.3390/s21010280

APA Style

Costache, R., Arabameri, A., Blaschke, T., Pham, Q. B., Pham, B. T., Pandey, M., Arora, A., Linh, N. T. T., & Costache, I. (2021). Flash-Flood Potential Mapping Using Deep Learning, Alternating Decision Trees and Data Provided by Remote Sensing Sensors. Sensors, 21(1), 280. https://doi.org/10.3390/s21010280

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop