Abstract
Debris flow is one of the most frequently occurring geological disasters in Jilin province, China, and such disasters often result in the loss of human life and property. The objective of this study is to propose and verify an information fusion (IF) method in order to improve the factors controlling debris flow as well as the accuracy of the debris flow susceptibility map. Nine layers of factors controlling debris flow (i.e., topography, elevation, annual precipitation, distance to water system, slope angle, slope aspect, population density, lithology and vegetation coverage) were taken as the predictors. The controlling factors were improved by using the IF method. Based on the original controlling factors and the improved controlling factors, debris flow susceptibility maps were developed while using the statistical index (SI) model, the analytic hierarchy process (AHP) model, the random forest (RF) model, and their four integrated models. The results were compared using receiver operating characteristic (ROC) curve, and the spatial consistency of the debris flow susceptibility maps was analyzed while using Spearman’s rank correlation coefficients. The results show that the IF method that was used to improve the controlling factors can effectively enhance the performance of the debris flow susceptibility maps, with the IF-SI-RF model exhibiting the best performance in terms of debris flow susceptibility mapping.
1. Introduction
A fast-moving debris flow that has a wide influence range can be defined as a transient mass motion within the loose steep slope channel due to rainfall. Debris flow, which causes a substantial loss of lives and property [1,2], has become one of the most dangerous geological disasters in the world and it poses a serious threat to the living environment of humans [3].
Debris flow susceptibility mapping (DFSM) can predict the location of debris flow that is based on terrain, as well as geological and hydrological characteristics, to prevent and reduce the impact of debris flow disasters [4,5,6]. With the development of GIS and remote sensing technology an increasing number of methods are used for DFSM.
The models for DFSM in previous studies are mainly divided into three categories: statistical models, heuristic models, and soft computing models [7]. For statistical models, Xu et al. [8] used the information value model to analyze the debris flow susceptibility in Sichuan province, China. In addition, the logistic regression model [9], evidence belief function [10], weight of evidence [11], frequency ratio [10,12], and statistical index (SI) [13] have been used extensively. Regarding heuristic models, Addison et al. [14] used the classifier tree to analyze debris flow. Other models, such as multistandard analysis [15] and the heuristic fuzzy model [16], have also been used extensively. For soft computing models, artificial neural networks [17,18,19], maximum entropy [20], decision tree [21], and classification and regression trees [22] are equally common.
However, research on the selection of controlling factors, which are the foundation of DFSM and affect the accuracy of debris flow susceptibility maps, is still rare. Previous studies have focused on converting a group of controlling factors that may be relevant into a set of linear unrelated controlling factors while using principal component analysis [23,24,25,26,27]. However, this method will reduce the number of controlling factors that are used in the study, resulting in a reduction in the utilization of the underlying data, which will eventually have a negative impact on the accuracy of the final debris flow susceptibility map.
Therefore, an information fusion (IF) method that is based on the Minkovsky distance [28,29] and the Dempster-Shafer theory [30,31,32] was proposed in this study to improve the rationality of the controlling factors, the utilization of the underlying data, and the accuracy of the final debris flow susceptibility map. Based on the original controlling factors and the improved controlling factors, debris flow susceptibility maps were developed while using the selected models and compared using the receiver operating characteristic (ROC) curve. To comprehensively verify the applicability of the IF method in different types of models, the SI model was selected from the statistical models, the analytic hierarchy process (AHP) model was chosen from the heuristic models, and the random forest (RF) method was selected from the soft calculation models. Then, these three models were integrated to obtain four integrated models, that is, the combination of SI and AHP (SI-AHP), the combination of SI and RF (SI-RF), the combination of AHP and RF (AHP-RF), and the combination of SI, AHP, and RF (SI-AHP-RF). Finally, to prove the superiority of the IF method, the spatial consistency of the debris flow susceptibility maps was analyzed while using Spearman’s rank correlation coefficients.
2. Study Area
The research area (Jilin province) is located in the northeastern part of China (Figure 1), from approximately 40°52′N to 46°18′N and from 121°38′E to 131°19′E, extending along the northeast-southwest direction, and with a total area of 18.74 km2. The area is in the eastern monsoon climate zone of China, and it has an annual average temperature between 2 and 6 °C and annual precipitation in the range from 400–1000 mm, both of which gradually decrease from southeast to northwest. The precipitation in the summer accounts for 70%–80% of the annual precipitation.
Figure 1.
Location map of the study area.
The study area is 5.0 m to 2691 m above sea level, and it is located in two geomorphological units: the eastern Changbai Mountain and the western Songliao Plain. The terrain is high in the southeast and low in the northwest. Bounded by the latitude 42°40′–43°, the study area spans two major structural units: the Tarimand-China-north Korea para platform area and the Tianshan-Xingan trough fold area.
The exposed strata are from the Archean Eon to the Cenozoic Era. The rock mass can be divided into 13 rock groups, according to lithological characteristics. The lithology consists mainly of granite, basalt, glutenite, clay rock, pyroclastic rock, carbonate rock, and gneiss.
The geological environmental conditions in the study area are complex, and the occurrence of geological hazards is the result of multiple factors. After the study area enters the rainy season, debris flow occurs frequently, causing huge economic losses every year. We selected Jilin province as the study area, because such disasters occur frequently in this region; thus, sufficient data can be collected to verify the IF method, which has practical value for our research.
3. Data Preparation
Data collection is the basis for subsequent analysis [33,34,35]. In this study, a debris flow inventory map, including 868 debris flow events, which was compiled based on the debris flow data before 2012, as provided by the Jilin Provincial Department of Land and Resources, was combined with field investigation data (Figure 2). Then, the debris flow events were randomly divided into training and validation datasets: 70% (608 events) were used for training the models, and 30% (260 events) were used for validation.
Figure 2.
Debris flow field survey: (a) Debris flow accumulation; and, (b) The image of the debris flow ditch taken by drone.
According to the characteristics of debris flow and the results of a 1:100,000 geological disaster investigation in the study area, the relationship between debris flow and the geological environment was analyzed. Then, based on experience [36,37,38,39,40,41], nine layers of debris flow controlling factors (i.e., topography, elevation, annual precipitation, distance to water system, slope angle, slope aspect, population density, lithology, and vegetation coverage) were taken as predictors. The spatial database for the study area is shown in Table 1.
Table 1.
Spatial database for the study area.
The details of the grading standard of each controlling factor are as follows. The hierarchical diagram of the controlling factors is shown in Figure 3.

Figure 3.
The hierarchical diagram of the controlling factors in the study area: (a) topography; (b) elevation; (c) annual precipitation; (d) distance to water system; (e) slope angle; (f) slopeaspect; (g) population density; (h) lithology; and, (i) vegetation coverage.
Topography affects the formation, movement, and scale of debris flow [42]. In this study, according to the order of the density of debris flow point in each geomorphic unit from small to large, the topography was divided into group 1 (Songliao low plain, piedmont slope plain, high plain), group 2 (mountain basin, valley), group 3 (hills, low hills), and group 4 (medium and low mountains, terraces).
The relative height difference determines the gravitational potential energy inside the slope [3]. According to survey statistics, the Changbai Mountain area in the eastern part of the study area is 800–1500 m above sea level: the main peak of Changbai Mountain and its surrounding peaks are above 2000 m, and the central plateau plain is 600–800 m above sea level. According to these critical values, the elevation was divided into four classes: 0–600 m, 600–800 m, 800–1500 m, and >1500 m.
Rainfall plays an important role in slope instability [4]. Debris flow disasters are mostly caused by continuous rainfall. Therefore, the annual precipitation, which is mainly distributed between 600–1000 mm in the study area, was selected as the controlling factor [43,44]. Then, the interval from 600–1000 mm was divided into two parts on average. Thus, annual precipitation was divided into four categories: 0–600 mm, 600–800 mm, 800–1000 mm, and >1000 mm.
The steep slopes provide loose material for debris flow [45]. The slope angle in the northwest of the study area is mainly 0–5°, while the slope in the southeast is generally near 10°, and, in a few areas, it is more than 20°. According to these critical values, the slope angle was divided, as follows: 0–5°, 5–10°, 10–20°, and >20°.
The slope aspect is related to precipitation and topographical trends [1]. According to the influence of light, the slope aspect was divided into shady slope (135°–180°, 180°–225°), semi-shady slope (45°–90°, 270°–315°), semi-sunny slope (90°–135°, 225°–270°), and sunny slope (135°–225°).
The population density indirectly reflects the influence of human activities on the geological environment. Human activities can cause vegetation degradation and changes in topography and geomorphology, indirectly increasing the possibility of debris flow. According to the number of people per square kilometers, the population density was divided into four classes: very low (0–10), low (10–100), moderate (100–500), and high (>500).
The lithology controls the stability of the slope and determines the amount of material that is available for debris flow [46,47]. According to the anti-weathering ability of rock, the lithology was divided into four types: soil, soft rock, hard rock and extremely hard rock.
The lower the vegetation coverage is, the more easily the rock mass becomes weathered, and the more likely it is that debris flow will occur. In this study, the vegetation coverage of the eastern Changbai Mountains is more than 80%, while that of the western plain is less than 20%. The vegetation coverage in the central region is mainly between 20% and 50%. According to these critical values, vegetation coverage was divided into four classes: low (<20%), moderate (20%–50%), high (50%–80%), and very high (>80%).
Rivers will erode the rock mass at the bottom of a slope, affecting the stability of the slope. In general, the likelihood of debris flow decreases as the distance to the water system increases [13]. In this study, the distance to the water system was divided into six classes:0–500 m, 500–1000 m, 1000–1500 m, 1500–2000 m, 2000–2500 m, and >2500 m.
4. Methodology
The methods that were used in this study can be summarized in three parts. The first part is the IF method for improving the controlling factors. The second part is the models for DFSM. Six models, i.e., SI, AHP, SI-AHP, SI-RF, AHP-RF, and SI-AHP-RF, were used to develop the debris flow susceptibility maps. The third part is the method that is used to verify the results. ROC curve analysis is used to verify the success rate and prediction rate of the debris flow susceptibility maps, and the Spearman’s rank correlation coefficients are used to verify the spatial consistency of the debris flow susceptibility maps. Figure 4 shows the flow of the research methods.
Figure 4.
The flowchart of the research methods in this study.
4.1. The Information Fusion Method
The IF method is based on the Minkowski distance and Dempster-Shafer theory. The Minkowski distance, which is a distance function that is defined on eigenvector space [48,49] is used to measure the similarity between the controlling factors. The Dempster-Shafer theory is used to calculate the credibility degree of each controlling factor. The credibility degree is used as a weight to improve the layer of each controlling factor, and it is the result of the IF method. The calculation process of the credibility degree is as follows:
Step 1: Assign the grade of each controlling factor from small to large while using a value from 1 to 4 or a value from 1 to 6. For example, for annual precipitation, the intervals 0–600 mm, 600–800 mm, 800–1000 mm, and >1000 mm was assigned values of 1, 2, 3, and 4, respectively. Finally, the column vector was obtained according to the values of all the disaster points in a controlling factor.
Step 2: Calculate the Minkowski distance according to column vectors of the controlling factors:
where is the Minkowski distance between controlling factor i and controlling factor j; and are the values of a disaster point in a column vector; and, m is a variable parameter.
Step 3: Obtain the similarity measure Matrix, according to the Minkowski distance.
Step 4: Calculate the support degree of the controlling factors:
where is the support degree of controlling factor i; is the controlling factor i; and n is the number of controlling factors.
Step 5: Calculate the credibility degree of controlling factor i:
where is the credibility degree of the controlling factor i; is the controlling factor i; is the controlling factor j; is the support degree of controlling factor i; is the support degree of controlling factor j; and, n is the number of controlling factors.
4.2. The Models for DFSM
4.2.1. The Statistical Index Model
The SI model is a binary statistical method, whose result can reflect the weights of the controlling factors. [50,51]. The weights are obtained by the following formula.
where is the weight of grade j in controlling factor i; is the debris flow density of grade j in controlling factor i; is the total density of debris flow within the map; is the number of debris flow events of grade j in controlling factor i; is the number of debris flow events in the map; is the number of pixels of grade j in controlling factor i; and, is the total number of pixels in the map.
4.2.2. The Analytic Hierarchy Process Model
The AHP model is a multistandard decision-making process and a common method for determining subjective weight [52,53]. There will be some uncertainty results due to the evaluation of different experts. This model can be described in four steps, as follows:
Step 1: Establish a hierarchical analysis structure model for DFSM.
Step 2: Construct a pairwise comparison matrix:
where A is the pairwise comparison matrix and is the result of comparison between controlling factor i and controlling factor j.
Step 3: Calculate the weight vector from the pairwise comparison matrix. Determine the weight of each controlling factor.
Step 4: Check the consistency of the weights, and when the consistency ratio (CR) is less than or equal to 0.1 the result is considered reasonable.
where, CR is the consistency rate; is the weighting and vector mean component; n is the number of controlling factors; and, RI is the degree of freedom index.
4.2.3. The Random Forest Model
The RF model, which can analyze the importance of classification features and determine the weight of each controlling factor, is a classification model that is composed of many decision trees [54,55]. The RF model includes two main kinds of algorithms: the GINI index algorithm and the out-of-bag (OOB) error rate replacement algorithm. The GINI index is used to calculate the impurity of nodes to measure the weight. The calculation process is as follows:
Step1: Calculate the GINI index of node C:
where is the GINI index of node C; is the K category of node C; and, is the proportion of category K in node C.
Step 2: Calculate the importance of factor j in node C:
where is the importance of factor j in node C; and, and represent the GINI values at the two new nodes that branch down.
Step 3: Calculate the weight of controlling factor j:
where is the weight of controlling factor j; n is the number of decision trees; and, m is the number of controlling factors.
4.2.4. The Integrated Model
Integrated models can make up for the shortcomings of individual models because of their ability to solve high-dimensional problems and high identification accuracy [7], which lead to more accurate results. Therefore, in this study, the SI model, the AHP model, and the RF model were integrated to get four integrated models: SI-AHP, SI-RF, AHP-RF, and SI-AHP-RF. The integration of individual models is achieved through the following steps:
Step 1: Obtain the weight of each grade of the controlling factor according to the individual models.
Step 2: Standardize these weights in the data analysis module of Statistical Product and Service Solutions (SPSS) software.
Step 3: Obtain the new weights of each grade of the controlling factors in the integrated model by using the following formula:
where is the new weights of each grade of the controlling factors in the integrated model and are the standardized weights of each grade of the controlling factors in the individual models.
4.2.5. Combination of the IF Method and Six Models
The credibility degree that was obtained by the IF method is regarded as the improved weight of each controlling factor. Then, according to the standardized weight of each grade of the controlling factors obtained by using SI, AHP, SI-AHP, SI-RF, AHP-RF, and SI-AHP-RF models, the improved weights of the controlling factors were obtained by the following formula:
where is the improved weight of controlling factor i, is the credibility degree of controlling factor i, and is the standardized weight of controlling factor i that was obtained by the selected model.
4.3. Validation of Debris Flow Susceptibility Maps
To compare the performance of different debris flow susceptibility maps and ensure the reliability of the IF method, the ROC curve and the Spearman’s rank correlation coefficient were selected.
4.3.1. ROC Curve
The ROC curve is drawn from a series of two-category methods (demarcation values or decision thresholds) and it uses sensitivity as the ordinate and specificity as the abscissa. The area under the curve is between 0.5 and 1. The larger the area is, the better the effect of the model [41].
4.3.2. Spearman’s Rank Correlation Coefficient
Spatial consistency can be interpreted as the similarity of the debris flow susceptibility assessment results in the spatial distribution. The Spearman’s rank correlation coefficient, which is a different nonparametric measure of the correlation of variables, was used to evaluate the spatial consistency between the two different debris flow susceptibility maps. The calculation process is as follows:
Step 1: Obtain the column vectors according to the grade of debris flow susceptibility of all the debris flow points. The grades of debris flow susceptibility were assigned values of 1 to 4 from small to large.
Step 2: Calculate the difference between two column vectors:
where is the difference between two column vectors; X and Y are the column vectors that were obtained from different debris flow susceptibility maps; and are the value of the debris flow susceptibility grade corresponding to a disaster point; and, is the number of debris flow points.
Step 3: Calculate the correlation between two debris flow susceptibility maps:
where is the Spearman’s rank correlation coefficient; is the difference between two column vectors; and, is the number of debris flow points.
5. Results
5.1. The Results of the Information Fusion Method
The correlation between the selected controlling factors was expressed in terms of the magnitude of the Minkowski distance value. The smaller the Minkowski distance between the controlling factors, the higher the similarity between them. The support degree and the credibility degree of the selected controlling factors are shown in Table 2. The topography has the highest credibility degree value (0.157), and the lowest credibility degree value is 0.086 for elevation.
Table 2.
The Minkowski distance value.
5.2. DFSM using Six Models Based on Original Controlling Factors
Based on the original controlling factors, the standardized weights, which are shown in the Table 3, were calculated by the selected six models, i.e., SI, AHP, SI-AHP, SI-RF, AHP-RF, and SI-AHP-RF. The debris flow susceptibility maps (Figure 5) were finally obtained by superimposing the layers of the controlling factors according these weights in the ArcGIS software. The susceptibility of debris flow was divided into four grades—low, moderate, high, and very high—according to the natural fracture method [56].
Table 3.
The standardized weights of the original controlling factors.
Figure 5.
Debris flow susceptibility maps based on the controlling factors before information fusion (IF): (a) debris flow susceptibility mapping (DFSM)using statistical index (SI); (b) DFSM using analytic hierarchy process (AHP); (c) DFSM using SI-AHP; (d) DFSM using SI-random forest (SI-RF); (e) DFSM using AHP-RF; and, (f) DFSM using SI-AHP-RF.
5.3. DFSM Using Six Models Based on Improved Controlling Factors
Based on the improved controlling factors, the new standardized weights of controlling factors, which are shown in the Table 4, were calculated by the selected six models i.e., IF-SI, IF-AHP, IF-SI-AHP, IF-SI-RF, IF-AHP-RF, and IF-SI-AHP-RF. The debris flow susceptibility maps (Figure 6) were finally obtained by superimposing the layers of the improved controlling factors, according to these new weights in the ArcGIS software.
Table 4.
The standardized weights of the improved controlling factors.

Figure 6.
Debris flow susceptibility maps based on the controlling factors after IF: (a) DFSM using IF-SI; (b) DFSM using IF-AHP; (c) DFSM using IF-SI-AHP; (d) DFSM using IF-SI-RF; (e) DFSM using IF-AHP-RF; and, (f) DFSM using IF-SI-AHP-RF.
5.4. Validation
5.4.1. Results of the ROC Curve
The success rate comes from the training dataset, and the prediction rate comes from the validation dataset. The pairwise comparison results between the models are show in Figure 7; Figure 8 shows, which reveal an improvement in the performance of the models based on the IF method. For the debris flow susceptibility maps based on the original controlling factors, the success rates of SI, AHP, SI-AHP, SI-RF, AHP-RF, and SI-AHP-RF were 0.813, 0.801, 0.843, 0.887, 0.854, and 0.817, and their prediction rates were 0.817, 0.803, 0.853, 0.888, 0.857, and 0.832, respectively. For the debris flow susceptibility maps that were based on the improved controlling factors, the success rates of IF-SI, IF-AHP, IF-SI-AHP, IF-SI-RF, IF-AHP-RF, and IF-SI-AHP-RF were 0.854, 0.856, 0.900, 0.930, 0.905, and 0.870, and their prediction rates were 0.868, 0.866, 0.910, 0.940, 0.922, and 0.903, respectively. These results reveal the better performance of the debris flow susceptibility maps that are based on the improved controlling factors.
Figure 7.
Comparison of success rate curves: (a1) SI and IF-SI; (a2) AHP and IF-AHP; (a3) SI-AHP and IF-SI-AHP; (a4) SI-RF and IF-SI-RF; (a5) AHP-RF and IF-AHP-RF; and, (a6) SI-AHP-RF and IF-SI-AHP-RF.
Figure 8.
Comparison of prediction rate curves: (b1) SI and IF-SI; (b2) AHP and IF-AHP; (b3) SI-AHP and IF-SI-AHP; (b4) SI-RF and IF-SI-RF; (b5) AHP-RF and IF-AHP-RF; and, (b6) SI-AHP-RF and IF-SI-AHP-RF.
5.4.2. Results of Spatial Consistency Analysis
The smaller the Spearman’s rank correlation coefficient is, the greater the spatial consistency between the two debris flow susceptibility maps. As shown in Table 5, the Spearman’s rank correlation coefficients between the debris flow susceptibility maps that were obtained by the same models, such as SI and IF-SI, are obviously smaller than the coefficients between other maps. This phenomenon indicates that there was a high spatial consistency between the debris flow susceptibility maps that were obtained by the same models, which proves that the IF method is indeed effective. In addition, the results show that there is also a high degree of spatial consistency between IF-SI-AHP-RF, IF-SI-RF, and IF-AHP-RF, and low spatial consistency between the remaining maps.
Table 5.
The Spearman’s rank correlation coefficients between the debris flow susceptibility maps.
6. Discussion
6.1. Comparison of Debris Flow Susceptibility Maps
As shown in Figure 7 and Figure 8, the success rate and the prediction rate of the twelve debris flow susceptibility maps are more than 0.8, which indicates that the debris flow susceptibility maps are credible. The IF-SI-RF model, which has the highest success and prediction rates, can be considered to be the best model. The results of the best IF-SI-RF model show that the area ratios of low, moderate, high, and very high were 37.7%, 21.4%, 25.5%, and 15.4%, respectively, and the areas with high susceptibility are distributed mainly in the middle and low mountain areas in the east of the study area.
The success and prediction rates of debris flow susceptibility maps that are based on the improved controlling factors are significantly better than those that are based on the original controlling factors. As shown in Table 6, the success rates of SI, AHP, SI-AHP, SI-RF, AHP-RF, and SI-AHP-RF increased by 4.1%, 5.5%, 5.7%, 4.3%, 5.1%, and 5.3%, respectively, and the prediction rates increased by 5.1%, 6.3%, 5.7%, 5.2%, 6.5%, and 7.1%, respectively, which proves that the IF method can improve the rationality of the controlling factors. In addition, the results of six different types of models were significantly improved, which shows that the scope of application of the IF method is extensive.
Table 6.
Improvement of success rate and prediction rate.
6.2. Why the IF Method Can Improve the Controlling Factors
It is also necessary to analyze the reasons why the IF method can improve the controlling factors. First, there is an inevitable correlation between the selected controlling factors, because a controlling factor will have an impact on other controlling factors. For example, in the plain area, the slope angle is relatively small. In addition, when the study area is relatively large, the geological environment conditions are complex and diversified, and the same controlling factors will play different roles in different areas. Thus, there will be conflicts between the controlling factors. The IF method can weaken the correlations and conflicts between the controlling factors. In addition, when the principal component analysis method is used to analyze the controlling factors, the number of controlling factors will be reduced. However, the IF method can make full use of the original data, therefore, it can improve the controlling factors, further improving the performance of the debris flow susceptibility maps.
6.3. Spatial Consistency Analysis of Debris Flow Susceptibility Maps
The improvement in the success rate and prediction rate of debris flow susceptibility maps is not enough to show the effectiveness of the IF method. Therefore, the spatial consistency of the debris flow susceptibility maps is further analyzed. When the two debris flow susceptibility maps, which were obtained by the same model that was based on the original controlling factors and the improved controlling factors, show high spatial consistency, the improvement in the controlling factors is persuasive. In contrast, if the spatial consistency varies greatly, the improvement in the controlling factors is incorrect. Therefore, to ensure the reliability of the IF method, it is necessary to test the spatial consistency between the debris flow susceptibility maps. The final analysis results show that there is good spatial consistency between the two debris flow susceptibility maps that were based on the original controlling factors and the improved controlling factors, which further proves that the IF method is effective.
7. Conclusions
The IF method is proposed and verified for DFSM by taking Jilin province, China, as a study area. Based on field investigations and historical data, nine debris flow controlling factors were selected and improved by the IF method. The SI, AHP, SI-AHP, SI-RF, AHP-RF, and SI-AHP-RF models were used to develop debris flow susceptibility maps, and the results were compared and verified. The conclusions are as follows.
The success and prediction rates of debris flow susceptibility maps that are based on improved controlling factors are significantly better than those based on the original controlling factors. In addition, according to the results of spatial consistency analysis, the two debris flow susceptibility maps, which were obtained by the same model based on the original controlling factors and the improved controlling factors, have a high spatial consistency. Therefore, the conclusion that the IF method can improve the controlling factors for DFSM is trustworthy.
Regarding the results of the best IF-SI-RF model, areas with high susceptibility are mainly distributed mainly in middle and low mountain areas in the east of the study area. The results of this study can provide reliable information for the prevention and management of debris flow disasters in the study area and they have significance for reducing and avoiding the losses that are caused by debris flow.
Author Contributions
Conceptualization, S.Q. (Shengwu Qin); Formal analysis, Q.D.; Funding acquisition, S.Q. (Shengwu Qin); Methodology, Q.D.; Project administration, S.Q. (Shengwu Qin); Resources, Y.Z.; Software, X.H. and F.L.; Supervision, S.Q. (Shuangshuang Qiao); Validation, Z.M.; Visualization, J.C.; Writing—original draft, Q.D.; Writing—review & editing, Q.D.
Funding
This research was funded by the Jilin Provincial Science and Technology Department (No.20190303103SF, No.20170101001JC). National Natural Science Foundation of China (Grant no 41202197).
Conflicts of Interest
The authors declare no conflict of interest.
Abbreviations
| IF | Information fusion |
| SI | Statistical index |
| AHP | Analytic hierarchy process |
| RF | Random forest |
| SI-RF | Statistical index model and the random forest model combination |
| SI-AHP | Statistical index model and the analytic hierarchy process combination |
| AHP-RF | Analytic hierarchy process and the random forest model combination |
| SI-AHP-RF | Statistical index model with the analytic hierarchy process and the random forest model combination |
| IF-SI | Statistical index model based on controlling factors after Information fusion |
| IF-AHP | Analytic hierarchy process model based on controlling factors after Information fusion |
| IF-SI-AHP | Statistical index model and the analytic hierarchy process combination based on controlling factors after Information fusion |
| IF-SI-RF | Statistical index model and the random forest model combination based on controlling factors after Information fusion |
| IF-AHP-RF | Analytic hierarchy process and the random forest model combination based on controlling factors after Information fusion |
| IF-SI-AHP-RF | Statistical index model with the analytic hierarchy process and the random forest model combination based on controlling factors after Information fusion |
| DEM | Digital elevation model |
| ROC | Receiver operating characteristic |
| DFSM | Debris flow susceptibility mapping |
| AUC | Area under the ROC curve |
| OOB | Out of bag |
References
- Elkadiri, R.; Mohamed, S.; Ahmed, M.Y.; Tamer, E.; Ronald, C.; Ali, B.B.; Mohamed, M.A. A Remote Sensing-Based Approach for Debris-Flow Susceptibility Assessment Using Artificial Neural Networks and Logistic Regression Modeling. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 4818–4835. [Google Scholar] [CrossRef]
- Qiao, S.S.; Qin, S.W.; Chen, J.J.; Hu, X.Y.; Ma, Z.J. The Application of a Three-Dimensional Deterministic Model in the Study of Debris Flow Prediction Based on the Rainfall-Unstable Soil Coupling Mechanism. Processes 2019, 7, 99. [Google Scholar] [CrossRef]
- Chevalier, G.G.; Medina, V.; Hürlimann, M.; Bateman, A. Debris-flow susceptibility analysis using fluvio-morphological parameters and data mining: Application to the Central-Eastern Pyrenees. Nat. Hazards 2013, 67, 213–238. [Google Scholar] [CrossRef]
- Wang, Q.; Kong, Y.Y.; Zhang, W.; Chen, J.P.; Xu, P.H.; Li, H.Z.; Xue, Y.G.; Yuan, X.Q.; Zhan, J.W.; Zhu, Y.J. Regional debris flow susceptibility analysis based on principal component analysis and self-organizing map: A case study in Southwest China. Arab. J. Geosci. 2016, 9. [Google Scholar] [CrossRef]
- Oh, H.; Biswajeet, P. Application of a neuro-fuzzy model to landslide-susceptibility mapping for shallow landslides in a tropical hilly area. Comput. Geosci. 2011, 37, 1264–1276. [Google Scholar] [CrossRef]
- Cao, C.; Song, S.Y.; Chen, J.P.; Zheng, L.J.; Kong, Y.Y. An Approach to Predict Debris Flow Average Velocity. Water 2017, 9, 205. [Google Scholar] [CrossRef]
- Chen, W.; Pourghasemi, H.R.; Kornejady, A.; Zhang, N. Landslide spatial modeling: Introducing new ensembles of RF, MaxEnt, and SVM machine learning techniques. Geoderma 2017, 305, 314–327. [Google Scholar] [CrossRef]
- Xu, W.B.; Yu, W.J.; Jing, S.C.; Zhang, G.P.; Huang, J.X. Debris flow susceptibility assessment by GIS and information value model in a large-scale region, Sichuan Province (China). Nat. Hazards 2013, 65, 1379–1392. [Google Scholar] [CrossRef]
- Chen, W.; Hamid, R.P.; Zhao, Z. A GIS-based comparative study of Dempster-Shafer, logistic regression and artificial neural network models for landslide susceptibility mapping. Geocarto Int. 2017, 32, 367–385. [Google Scholar] [CrossRef]
- Zhang, Z.W.; Yang, F.; Chen, H.; Wu, Y.L.; Li, T.; Li, W.P.; Wang, Q.Q.; Liu, P. GIS-basedlandslidesusceptibilityanalysisusingfrequencyratioandevidentialbelieffunctionmodels. Environ. Earth Sci. 2016, 75. [Google Scholar] [CrossRef]
- Rahmati, O.; Pourghasemi, H.R.; Zeinivand, H. Flood susceptibility mapping using frequency ratio and weights-of-evidence models in the Golastan Province, Iran. Geocarto Int. 2016, 31, 42–70. [Google Scholar] [CrossRef]
- Regmi, A.D.; Yoshida, K.; Pourghasemi, H.R.; Dhital, M.R.; Pradhan, B. Landslide susceptibility mapping along Bhalubang-Shiwapur area of mid-Western Nepal using frequency ratio and conditional probability models. J. Mountain Sci. 2014, 11, 1266–1285. [Google Scholar] [CrossRef]
- Razavizadeh, S.; Solaimani, K.; Massironi, M.; Kavian, A. Mapping landslide susceptibility with frequency ratio, statistical index, and weights of evidence models: A case study in northern Iran. Environ. Earth Sci. 2017, 76. [Google Scholar] [CrossRef]
- Addison, P.; Oommen, T.; Sha, Q.Y. Assessment of post-wildfire debris flow occurrence using classifier tree. Geomat. Nat. Hazards Risk 2019, 10, 505–518. [Google Scholar] [CrossRef]
- Brito, M.M.; Weber, E.J.; Filho, L.C. Multi-Criteria Analysis Applied to Landslide Susceptibility Mapping. Rev. Bras. Geomorfol. 2017, 18. [Google Scholar] [CrossRef]
- Stanley, T.; Kirschbaum, D.B. A heuristic approach to global landslide susceptibility mapping. Nat. Hazards 2017, 87, 145–164. [Google Scholar] [CrossRef]
- Kia, M.B.; Pirasteh, S.; Pradhan, B.; Mahmud, A.R.; Sulaiman, W.N.A.; Moradi, A. An artificial neural network model for flood simulation using GIS: Johor River Basin, Malaysia. Environ. Earth Sci. 2012, 67, 251–264. [Google Scholar] [CrossRef]
- Park, S.; Choi, C.; Kim, B.; Kim, J. Landslide susceptibility mapping using frequency ratio, analytic hierarchy process, logistic regression, and artificial neural network methods at the Inje area, Korea. Environ. Earth Sci. 2013, 68, 1443–1464. [Google Scholar] [CrossRef]
- Wang, H.B.; Sassa, K. Rainfall-induced landslide hazard assessment using artificial neural networks. Earth Surf. Process. Topogr. 2006, 31, 235–247. [Google Scholar] [CrossRef]
- Melchiorre, C.; Matteucci, M.; Azzoni, A.; Zanchi, A. Artificial neural networks and cluster analysis in landslide susceptibility zonation. Geomorphology 2008, 94, 379–400. [Google Scholar] [CrossRef]
- Park, S.J.; Lee, C.W.; Lee, S.; Lee, M.J. Landslide Susceptibility Mapping and Comparison Using Decision Tree Models: A Case Study of Jumunjin Area, Korea. Remote Sens. 2018, 10, 1545. [Google Scholar] [CrossRef]
- Pham, B.T.; Prakash, I.; Bui, D.T. Spatial prediction of landslides using a hybrid machine learning approach based on Random Subspace and Classification and Regression Trees. Geomorphology 2018, 303, 256–270. [Google Scholar] [CrossRef]
- Rajesh, S.; Jain, S.; Sharma, P. Inherent vulnerability assessment of rural households based on socio-economic indicators using categorical principal component analysis: A case study of Kimsar region, Uttarakhand. Ecol. Indic. 2018, 85, 93–104. [Google Scholar] [CrossRef]
- Abdul-Wahab, S.A.; Bakheit, C.S.; Al-Alawi, S.M. Principal component and multiple regression analysis in modelling of ground-level ozone and factors affecting its concentrations. Environ. Model. Softw. 2005, 20, 1263–1271. [Google Scholar] [CrossRef]
- Rajab, J.M.; MatJafri, M.Z.; Lim, H.S. Combining multiple regression and principal component analysis for accurate predictions for column ozone in Peninsular Malaysia. Atmos. Environ. 2013, 71, 36–43. [Google Scholar] [CrossRef]
- Shi, M.Y.; Chen, J.P.; Song, Y.; Zhang, W.; Song, S.Y.; Zhang, X.D. Assessing debris flow susceptibility in Heshigten Banner, Inner Mongolia, China, using principal component analysis and an improved fuzzy C-means algorithm. Bull. Eng. Geol. Environ. 2016, 75, 909–922. [Google Scholar] [CrossRef]
- Nandi, A.; Mandal, A.; Wilson, M.; Smith, D. Flood hazard mapping in Jamaica using principal component analysis and logistic regression. Environ. Earth Sci. 2016, 75. [Google Scholar] [CrossRef]
- Liu, H.; Reid, D.D. A new approach for embedding causal sets into Minkowski space. Class. Quantum Gravity 2018, 35, 124002. [Google Scholar] [CrossRef]
- Jin, L.Y.; Zhi, X.B.; Zhao, S.D. Enhanced subspace clustering through combining Minkowski distance and Cosine dissimilarity. J. Intell. Fuzzy Syst. 2018, 35, 5541–5556. [Google Scholar] [CrossRef]
- Shirani, K.; Pasandi, M.; Arabameri, A. Landslide susceptibility assessment by Dempster–Shafer and Index of Entropy models, Sarkhoun basin, Southwestern Iran. Nat. Hazards 2018, 93, 1379–1418. [Google Scholar] [CrossRef]
- Wang, R.; Guiochet, J.; Motet, G.; Schön, W. Safety case confidence propagation based on Dempster–Shafer theory. Int. J. Approx. Reason. 2019, 107, 46–64. [Google Scholar] [CrossRef]
- Razi, S.; Mollaei, M.; Ghasemi, J. A novel method for classification of BCI multi-class motor imagery task based on Dempster–Shafer theory. Inf. Sci. 2019, 484, 14–26. [Google Scholar] [CrossRef]
- Akgun, A.; Dag, S.; Bulut, F. Landslide susceptibility mapping for a landslide-prone area (Findikli, NE of Turkey) by likelihood-frequency ratio and weighted linear combination models. Environ. Geol. 2008, 54, 1127–1143. [Google Scholar] [CrossRef]
- Kannan, M.; Saranathan, E.; Anabalagan, R. Landslide vulnerability mapping using frequency ratio model: A geospatial approach in Bodi-Bodimettu Ghat section, Theni district, Tamil Nadu, India. Arab. J. Geosci. 2013, 6, 2901–2913. [Google Scholar] [CrossRef]
- Wang, Q.Q.; Li, W.P.; Yan, S.S.; Wu, Y.L.; Pei, Y.B. GIS based frequency ratio and index of entropy models to landslide susceptibility mapping (Daguan, China). Environ. Earth Sci. 2016, 75. [Google Scholar] [CrossRef]
- Zhang, S.J.; Wei, F.Q.; Liu, D.L.; Yang, H.J.; Jiang, Y.H. A regional-scale method of forecasting debris flow events based on water-soil coupling mechanism. J. Mountain Sci. 2014, 11, 1531–1542. [Google Scholar] [CrossRef]
- Carrara, A.; Crosta, G.; Frattini, P. Comparing models of debris-flow susceptibility in the alpine environment. Geomorphology 2008, 94, 353–378. [Google Scholar] [CrossRef]
- Bregoli, F.; Medina, V.; Chevalier, G.; Marcel, H.; Bateman, A. Debris-flow susceptibility assessment at regional scale: Validation on an alpine environment. Landslides 2015, 12, 437–454. [Google Scholar] [CrossRef]
- Cama, M.; Conoscenti, C.; Lombardo, L.; Rotigliano, E. Exploring relationships between grid cell size and accuracy for debris-flow susceptibility models: A test in the Giampilieri catchment (Sicily, Italy). Environ. Earth Sci. 2016, 75. [Google Scholar] [CrossRef]
- Wang, X.; Chen, X.; Cui, Q.; Yang, Z.F. An improved two-step parameter adjustment method for the optimization of a reservoir operation function model based on repeated principal component analysis and a genetic algorithm. J. Hydroinform. 2019, 21, 1–12. [Google Scholar] [CrossRef]
- Stancanelli, L.M.; Peres, D.J.; Cancelliere, A.; Foti, E. A combined triggering-propagation modeling approach for the assessment of rainfall induced debris flow susceptibility. J. Hydrol. 2017, 550, 130–143. [Google Scholar] [CrossRef]
- Yu, B.; Wang, T.; Zhu, Y.; Zhu, Y.B. Topographical and rainfall factors determining the formation of gully-type debris flows caused by shallow landslides in the Dayi area, Guizhou Province, China. Environ. Earth Sci. 2016, 75. [Google Scholar] [CrossRef]
- Su, Q.M.; Zhang, J.; Zhao, S.M.; Wang, L.; Liu, J.; Guo, J.L. Comparative Assessment of Three Nonlinear Approaches for Landslide Susceptibility Mapping in a Coal Mine Area. ISPRS Int. J. Geo Inf. 2017, 6, 228. [Google Scholar] [CrossRef]
- Guzzetti, F.; Reichenbach, P.; Ardizzone, F.; Cardinali, M.; Galliet, M. Estimating the quality of landslide susceptibility models. Geomorphology 2006, 81, 166–184. [Google Scholar] [CrossRef]
- Prenner, D.; Kaitna, R.; Mostbauer, K.; Hrachowitz, M. The Value of Using Multiple Hydrometeorological Variables to Predict Temporal Debris Flow Susceptibility in an Alpine Environment. Water Resour. Res. 2018, 54, 6822–6843. [Google Scholar] [CrossRef]
- Chang, M.; Tang, C.; Zhang, D.D.; Ma, G.C. Debris flow susceptibility assessment using a probabilistic approach: A case study in the Longchi area, Sichuan province, China. J. Mountain Sci. 2014, 11, 1001–1014. [Google Scholar] [CrossRef]
- Ma, Z.J.; Qin, S.W.; Chen, J.J.; Lv, J.F.; Chen, J.P.; Zhao, X.L. A probabilistic method for evaluating wedge stability based on blind data theory. Bull. Eng. Geol. Environ. 2019, 78, 1927–1936. [Google Scholar] [CrossRef]
- Bui, D.T.; Lofman, O.; Revhaug, I.; Dick, O. Landslide susceptibility analysis in the Hoa Binh province of Vietnam using statistical index and logistic regression. Nat. Hazards 2011, 59, 1413–1444. [Google Scholar] [CrossRef]
- Felicísimo, A.; Aurora, C.; Remondo, J.; Quirós, E. Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: A comparative study. Landslides 2013, 10, 175–189. [Google Scholar] [CrossRef]
- Ma, Z.J.; Qin, S.W.; Cao, C.; Lv, J.F.; Li, G.J.; Qiao, S.S.; Hu, X.Y. The Influence of Different Knowledge-Driven Methods on Landslide Susceptibility Mapping: A Case Study in the Changbai Mountain Area, Northeast China. Entropy 2019, 21, 372. [Google Scholar] [CrossRef]
- Zhao, H.L.; Yao, L.H.; Mei, G.; Liu, T.Y.; Ning, Y.S. A Fuzzy Comprehensive Evaluation Method Based on AHP and Entropy for a Landslide Susceptibility Map. Entropy 2017, 19, 396. [Google Scholar] [CrossRef]
- Zabihi, M.; Pourghasemi, H.R.; Pourtaghi, Z.S.; Behzadfar, M. GIS-based multivariate adaptive regression spline and random forest models for groundwater potential mapping in Iran. Environ. Earth Sci. 2016, 75. [Google Scholar] [CrossRef]
- Lagomarsino, D.; Tofani, V.; Segoni, S.; Catani, F.; Casagli, N. A Tool for Classification and Regression Using Random Forest Methodology: Applications to Landslide Susceptibility Mapping and Soil Thickness Modeling. Environ. Model. Assess. 2017, 22, 201–214. [Google Scholar] [CrossRef]
- Li, Y.; Wang, H.G.; Chen, J.P.; Shang, Y.J. Debris Flow Susceptibility Assessment in the Wudongde Dam Area, China Based on Rock Engineering System and Fuzzy C-Means Algorithm. Water 2017, 9, 669. [Google Scholar] [CrossRef]
- Gómez-Tostón, C.; Barrena, M.; Cortés, Á. Characterizing the optimal pivots for efficient similarity searches in vector space databases with Minkowski distances. Appl. Math. Comput. 2018, 328, 203–223. [Google Scholar] [CrossRef]
- Sujatha, E.R.; Sridhar, V. Mapping debris flow susceptibility using analytical network process in Kodaikkanal Hills, Tamil Nadu (India). J. Earth Syst. Sci. 2017, 126. [Google Scholar] [CrossRef]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).









