Next Article in Journal
Evaluation of Informative Bands Used in Different PLS Regressions for Estimating Leaf Biochemical Contents from Hyperspectral Reflectance
Next Article in Special Issue
Shallow Landslide Prediction Using a Novel Hybrid Functional Machine Learning Algorithm
Previous Article in Journal
Synergetic Aerosol Layer Observation After the 2015 Calbuco Volcanic Eruption Event
Previous Article in Special Issue
Ice-Gouging Topography of the Exposed Aral Sea Bed
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection

1
Department of Geoinformatics—Z_GIS, University of Salzburg, 5020 Salzburg, Austria
2
Department of Remote Sensing and GIS, University of Tabriz, Tabriz 5166616471, Iran
3
Discipline of Geography and Spatial Sciences, University of Tasmania, Hobart 7005, Australia
*
Author to whom correspondence should be addressed.
Remote Sens. 2019, 11(2), 196; https://doi.org/10.3390/rs11020196
Submission received: 22 November 2018 / Revised: 16 January 2019 / Accepted: 16 January 2019 / Published: 20 January 2019
(This article belongs to the Special Issue Mass Movement and Soil Erosion Monitoring Using Remote Sensing)

Abstract

:
There is a growing demand for detailed and accurate landslide maps and inventories around the globe, but particularly in hazard-prone regions such as the Himalayas. Most standard mapping methods require expert knowledge, supervision and fieldwork. In this study, we use optical data from the Rapid Eye satellite and topographic factors to analyze the potential of machine learning methods, i.e., artificial neural network (ANN), support vector machines (SVM) and random forest (RF), and different deep-learning convolution neural networks (CNNs) for landslide detection. We use two training zones and one test zone to independently evaluate the performance of different methods in the highly landslide-prone Rasuwa district in Nepal. Twenty different maps are created using ANN, SVM and RF and different CNN instantiations and are compared against the results of extensive fieldwork through a mean intersection-over-union (mIOU) and other common metrics. This accuracy assessment yields the best result of 78.26% mIOU for a small window size CNN, which uses spectral information only. The additional information from a 5 m digital elevation model helps to discriminate between human settlements and landslides but does not improve the overall classification accuracy. CNNs do not automatically outperform ANN, SVM and RF, although this is sometimes claimed. Rather, the performance of CNNs strongly depends on their design, i.e., layer depth, input window sizes and training strategies. Here, we conclude that the CNN method is still in its infancy as most researchers will either use predefined parameters in solutions like Google TensorFlow or will apply different settings in a trial-and-error manner. Nevertheless, deep-learning can improve landslide mapping in the future if the effects of the different designs are better understood, enough training samples exist, and the effects of augmentation strategies to artificially increase the number of existing samples are better understood.

Graphical Abstract

1. Introduction

Mass movements such as landslides are a major natural hazard in mountainous regions all over the world [1]. Although landslides mostly occur locally, they can cause extensive damage to natural and human infrastructures at different scales in hilly and mountainous areas [2]. Along with the physical damage, landslides have a direct long-term economic and social impact on an area of human habitation [3]. Several studies have investigated mitigation strategies, and landslide susceptibility mapping is increasingly applied [4,5,6,7,8]. Hazard and risk mapping generally aim to identify and spatially delineate hazard-prone areas, while analyzing the potential risk from that in a targeted study [9]. To carry out a risk analysis, it is necessary to analyze the occurrence, characteristics, and impacts of past hazard events, to relate them to the present situation and to generate predictions for the future. Several studies have been carried out to this end with various knowledge-based methods [10,11], with some also including several machine learning (ML) methods [12,13,14,15,16]. Almost all of the ML methods that have been used to analyze the potential risk of landslides heavily depend on an inventory data set of the spatial extent of known landslides or at least one characterizing GPS point location per known landslide in the target study area. ML methods require such data sets for both training and validation steps [17]. Although some knowledge-based methods are independent of the existence of landslide inventory data sets for the generation of hazard and risk maps, the resulting maps require accuracy assessment and sensitivity analysis steps and, therefore, need accurate inventory data sets [18]. Thus, it is crucial to have access to an accurate landslide inventory data set at all stages when monitoring, modeling and mapping landslide risk. Moreover, landslides often trigger emergency situations if the occur in the vicinity of human habitations or infrastructures, e.g., power lines, roads, bridges and settlement areas, which means there is often time pressure to detect and delineate landslides affecting certain areas to carry out tasks such as timely support planning and crisis responses [19,20]. Although some advanced field surveying techniques exist, e.g., laser rangefinder binoculars along with a GPS receiver [21], gaining access to the affected areas and conducting field surveys are too difficult or dangerous in most cases [22]. Thus, Earth observation (EO) data, including very high resolution (VHR) images, are widely considered as the most accessible data providing critical up-to-date information necessary for supporting humanitarian response [23]. Analysis and classification of the EO data for extracting mass movements, land displacements and landslides have a long history in remote sensing domain. We may distinguish two main approaches for the classification and information extraction from the satellite images, namely object-based and pixel-based. Image analysis based on objects has become more widespread [24], yet pixel-based approaches still dominate. Both object-based and pixel-based image analysis have been integrated with different ML methods and used in various applications [25]. Generally, ML methods are considered effective methods for remote sensing applications with emphasis on image classification and object recognition [26]. Different ML methods and classifiers have already been used in landslide detection studies. Recurrent neural network (RNN) and multilayer perceptron neural network (MLP-NN) methods typically require an input data set from sources like orthophotos or LiDAR-derived data. This also applies to the use of textural features for landslide detection as in the study by Mezaal et al. [27]. A maximum likelihood ratio (MLR) and an artificial neural network (ANN) were compared as automatic landslide detection methods using multi-spectral advanced spaceborne thermal emission and reflection radiometer (ASTER) images by Danneels et al. [28]. In another study, Bui et al. [29] used five different ML methods for landslide prediction models in the case study of the Son La hydropower basin in Vietnam, namely support vector machines (SVM), multi-layer perceptron neural networks (MLP Neural Nets), radial basis function neural networks (RBF Neural Nets), kernel logistic regression (KLR), and logistic model trees (LMT). The authors tested the mentioned ML methods at the pixel level. For this case study, MLP Neural Nets and SVM achieved the best overall accuracies. Some other studies exist which integrate ML methods with mathematical theories to address the limitations and to improve the overall accuracy of landslide detection. Mezaal et al. [9] optimized the performance of ML methods in landslide detection by using Dempster–Shafer theory (DST) based on the probabilistic output from object-based SVM, K-nearest neighbor (KNN) and RF methods.
More recently, deep-learning methods and, above all, convolutional neural networks (CNNs) have achieved fairly good results in various image analysis tasks in computer vision [30,31]. CNNs have also been used for VHR image classification and segmentation [32,33], semantic segmentation [34], scene annotation using different high spatial resolution and aerial images [35], and object detection [36]. The majority of studies involving CNNs aim to detect objects and, particularly, search for distinct objects such as vehicles, roads and airplanes [37]. Well-known are the community-based crowd-sourced approaches using a large number of labeled images for object recognition [38] like classifying to contain a cat or a dog. The prerequisite for applying CNN methods is using a training data set of labeled data. Such a supervised learning approach usually achieves good results if enough training samples exist [39]. CNNs apply supervised machine learning, where clots of labeled data are used to push learnable, i.e., adaptive, and feature extractor filters to minimize a loss function [40]. The performance of CNNs strongly depends on the availability of larger training datasets, optimized network architectures and faster GPUs [41].
While CNNs have reached good accuracies for object recognition in aerial images, only a few studies exist that use deep-learning methods and CNNs for landslide detection. Yu et al. [42] used a CNN for their research together with an improved region growing algorithm (RSG_R) for landslide detection. They trained their CNN method on a set of landslide images, extracted the discriminant information such as area and boundary of the landslides with the RSG_R algorithm, and concluded to yield high detection accuracy for identifying landslide characteristics. Ding et al. [43] evaluated their CNN method for landslide detection from GF-1 images with four spectral bands and 8 m spatial resolution for Shenzhen, China. Their automated landslide detection method achieved a landslide detection rate of 72.5%, a false positives rate of 10.2%, and an overall accuracy of 67%. This literature review shows that our study is not the first to use CNNs for landslide detection, but that the potential of CNNs in this field has not been fully explored yet. In the remainder of this article, we apply CNN methods to landslide detection based on optical satellite imagery from the Rapid Eye sensor. We compare the results from the CNN methods with those from two state-of-the-art ML methods, namely ANN, SVM and RF. By using the spectral information of the Rapid Eye images separately along with topographic factors, we illustrate the performance of each method and the impact of the used spectral and topographic factors on the landslide detection. The areas identified as having landslide occurrences are then validated using common remote sensing and GIS validation metrics and the mean intersection-over-union (mIOU) validation method from computer vision.

2. Overview of the Study Area

Our case study area lies in the southern part of the Rasuwa district in Nepal along a highway that connects Nepal to China (see Figure 1). The district has an area of about 1544 km2. The elevation in the region ranges from 734 m.a.s.l. to 4050 m.a.s.l. The land cover is predominantly forest, followed by shrub land and agriculture, grassland and rural habituated areas. The study area is located in the higher Himalayas and is considered to be one of the most landslide-prone regions along the Trishuli River, with mainly quartzite, schist, and gneiss rock formations. The Main Central thrust (MCT) lies near the boundary of the Rasuwa district. MCT is a subduction zone where the Asian and Indian plates collide, making this an earthquake-prone area. The climate is sub-tropical and humid with cooler temperatures in higher areas, particularly in the mid hills zone above 2500 m. Orographic monsoon precipitation brings rain to the area, and the annual average rainfall is 691 mm, most of which occurs during the monsoon season.
Some of the known landslides affect built-up areas and have already caused casualties in several villages. Some households were entirely abandoned, and people are forced to live in shelters provided by Non-Governmental Organizations. Agricultural land is severely affected by landslide occurrences. Moreover, landslides have dammed the Trishuli River at several locations and have caused damage to hydropower plant project sites that are planned in the study area. Landslides have also affected the bridges and roads of the main transport corridor between Nepal and China.
Several studies have been carried out analyzing landslide hazard and risk for this case study area, some of which are presented in Table 1. Several studies used GNSS-based field survey data for landslide mapping, to carry out hazard and risk analysis and runoff assessment. However, the field survey is very challenging as most of the villages have no road connection, and many landslide affected areas are difficult to reach. It is often dangerous to trek along steep sides along the hills. Several other studies have been carried out by governmental organizations or private companies, but such reports were not accessible for our study.

3. Workflow

3.1. Overall Methodology

The spectral information from Rapid Eye images was used in combination with some conditioning topographic factors to evaluate the performance of two ML methods, namely ANN, SVM and RF, and differently structured CNNs at the pixel level for the detection of landslide affected areas. The workflow of this study is as follow:
  • Designing two different training data sets, a) spectral information only, and b) a data set containing both spectral information and topographic factors.
  • Applying ANN, SVM and RF methods for landslide detection based on both training data sets and validating the performance for the study area.
  • Generating CNN-based patches by considering multiple window sizes from small to large ones.
  • Developing a data augmentation approach for increasing the number of training data sets used for CNNs.
  • Structuring CNNs with different layer depths in regard to the range of input window size CNN patches to determine the most efficient CNN setting.
  • Testing and validating the performances of each method by using multiple parameters.
The descriptions and the experimental results of this workflow are organized in the following sections. Further explanations and discussions regarding the impact of using different input data sets and methods can be found in the conclusion section.

3.2. Data

RapidEye is a constellation of five Earth-observing satellites that deliver sun synchronous 5 m spatial resolution imagery. The height of their orbits is 680 km, and the swath width is 77 km with a 5-day revisit time period [47]. Three RapidEye cloud-free satellite images from the 28 November 2015 were acquired to cover the study area. We used four out of the five spectral bands of RapidEye, namely blue (440–510 nm), green (520–590 nm), red (630–685 nm), and near-infrared (760–850 nm). In addition, the normalized difference vegetation index (NDVI), which is a very common and widely used ratio [48], was calculated from the near-infrared and the red spectral bands.
Surface topography is considered to be one of the main factors for ground instabilities in hilly and mountainous areas. Steep slopes usually bear the highest probability for landslides, although other factors, such as the geology, can play an even more important role. Within our study area, the geology does not vary significantly and, therefore, topographic factors such as slope and aspect play the key role in the occurrence of landslides. Topography also influences other factors such as wind, sunlight and precipitation [1]. Plan curvature is a particular topographic factor with relevance to landslides that can easily be derived from a digital elevation model (DEM). Earlier studies revealed that hillsides with planar plan curvature are generally more prone to landslides [49]. Although more conditioning topographical factors are used in literature, we consider slope, aspect and plan curvature as the most important ones and want to limit the complexity of the training data set for the different methods tested (see Figure 2). All these topographical factors were derived from a 5 m resolution DEM acquired from the Japanese aerospace exploration agency JAXA ALOS sensor from 2016.
As mentioned, we used two different data sets. We trained our methods using the four original spectral bands plus the NDVI. Then, the topographic factors slope, aspect and plan curvature were added to both the training and testing data sets.
An extensive field survey to locate landslide areas in the Rasuwa district in the higher Himalayas was carried out over two months in the summer of 2018 using a GPS device (Garmin Etre 20X) (see Figure 3). The GPS polygons that refer to the landslide areas were later manually enhanced using the satellite images for visual inspection and plausibility checks. The final landslide inventory data set was created using the GPS data from the field survey, correcting or deleting instances, and eventually adding landslide areas clearly visible in the image but not mapped in the field. This was done in the Geographic Information System ArcGIS 10.3. This integration results in the production of a precise and reliable landslide inventory, while eliminating false positives that occurred in several previous studies when solely using remote sensing-based approaches.

3.3. Random Forest (RF)

The RF method is based on multiple decision trees. It was introduced by [50] and has been used in a wide range of remote sensing applications [51]. As the RF method uses the training data set to create multiple deeper decision trees, it is less sensitive to the over-fitting problem caused by complex datasets compared to other decision trees. Each decision tree of the RF predicts an output and each output is weighted by the value derived from the votes that it receives. The majority voting on an output and a degree of convergence in fitting results in the final classification [52]. The cited literature and others report RF classifications to yield good results for satellite image classifications. Therefore, RF is considered to be one of the most effective non-parametric ensemble learning methods in image analysis [53] and RF was chosen as a ML method for landslide detection in our study. For training the method, 3500 random points were prepared from the landslide polygons in both training zones using the random point tool in ArcMap 10.3. To avoid the resampling of each pixel that was signed by the random points, a minimum distance of 5 m was considered for the generated points due to the spatial resolution of Rapid Eye. We used 100 trees with a single randomly split variable to grow the trees.

3.4. Support Vector Machines (SVM)

The SVM [54] is a machine learning method that maps the dataset of the problem into a higher-dimensional space through the non-linear transformers, where an optimal hyperplane is created for separating the dataset features. The optimal hyperplane will be found when the separating margins between the defined classes are maximal [55]. These maximum separating margins are called support vectors. The SVM method has been used for data classification and regression analysis in several domains. It has also been used for landslide detection [29]. The resulting SVM classifications are affected by the choice of the kernel function (e.g., polynomial, sigmoid, and radial basis function (RBF) [9]. In our study, we applied the widely used RBF kernel [55]. In our study, a gamma parameter (γ) of 0.9 was found to yield the most accurate landslide detections. Both training zones were used for training the SVM with the same data set used for the RF method.

3.5. Artificial Neural Network (ANN)

The ANNs mimic human brain performance, and they can find solution for complex nonlinear problems by discovering their patterns [56]. In the present study, an ANN method was used by a multilayer perceptron (MLP) architecture and trained with the backpropagation algorithm (BPA), which is the most common algorithm for training the ANN method. The number of hidden layer units of any MLP depends on the complexity of the problem [57]. For our case, the network was feed-forwarded with the same training input data set with a hidden layer of 30 neurons. The initial weights are randomly selected by the BPA. Then, the difference between the output values and expected ones are obtained across all observations. This comparison and feeding forward signals and back-propagating errors is done for every terrain unit until the mean-square error stabilizes at an adequate low level [58]. Therefore, all weightings that were randomly selected at first are updated by the backward process during each cycle to minimize the error.

3.6. Convolution Neural Network (CNN)

CNNs have become a hot topic in computer vision and image processing and have introduced state-of-the-art results for these domains [59]. CNN’s multi-layer feed-forward neural networks can obtain the effective feature representations of an image, which makes it possible for these networks to recognize the visual laws in the image without human-designed complex rules [43]. CNNs have a specific architecture, where each so-called hidden layer typically contains convolutional and pooling layers, whereby the convolutional layer is considered to be the main building block of any CNN. The original input image is convolving with a set of trainable kernels that scan across the entire input image resulting in a group of feature maps. Each feature map results from the convolution of the kernel, with its corresponding local region on the original input image. Moreover, an elementwise non-linear activation function (e.g., sigmoid, ReLU, hyperbolic tangent) is taken out of the results of a convolutional layer for non-linearity amplification [60]. The pooling layer is usually computed immediately after a convolutional layer and is used to down/sub-sample output of the convolutional layer to generate a condensed set of feature maps. The max-pooling is the most common and widely used pooling layer, which makes it possible to keep only the maximal values of the feature maps. The max-pooling is considered to be the main operation of any CNN. It reduces the spatial size of feature maps significantly and, consequently, the computation volume for the next layers to be processed. The main operations performed in any CNN can be summarized by the following equation [59]:
O l = P   ( σ ( O l 1 × W l + b l ) )
where the O l 1 represents the output feature map from the previous layer of the lth layer, the W l and the b l indicate the weights and biases of the layer, respectively, that convolve the O l 1 by the linear convolution*, and the σ(⋅) denotes the non-linearity function outside the convolutional layer. These steps are often followed by a pooling operation which is represented by P in Equation (1).

3.6.1. Multiple Input Window Size CNNs

In this study, multiple input window sizes were used for landslide detection. Two input window sizes 32 × 32 and 48 × 48 pixels were considered as our large input window sizes and 12 × 12, 16 × 16, and 22 × 22 were used as three different versions for the small ones (see Figure 4). These input window sizes were selected based on a wide range of sizes from smaller than 12 × 12 up to 64 × 64 based on cross-validation. Multiple input window sizes were used because of the complexity of the shape and size of the features. There are some quite large landslides and several very small ones, often with quite different shapes. Some are elongated and potentially thin and can almost look like an unsealed road rather than a landslide. There are different aspects, slopes and flow directions in the study area and a single landslide may include different aspects. Most landslides exhibit a mixture of topographic features, which makes them difficult to recognize. Zhang et al. [59] used different sizes of input windows and developed a similar approach for the detection of complex shape objects in urban areas.

3.6.2. Augmentation of the Training Data Set

Deep-learning methodologies and CNN in particular, need a huge number of sample patches for efficient training [61] in the form of a so-called labeled training dataset [62]. Preparing such large training datasets can be problematic in practice. The accuracy of the training dataset is also vital, and only a training dataset with a high quality and quantity of samples will yield accurate results. Thus, the size of the training dataset plays a crucial role in the results of CNNs. Therefore, some augmentation techniques have been developed to artificially multiply an existing training dataset. Although the potential impacts of different data augmentation techniques on the results are not clear from the literature, they are believed to improve the performance of the training of the CNNs. The techniques used are also called data distortion since they use particular deformations to increase the volume of the dataset. A deformation applied to a training data set can be a translation or a rotation, or randomly mirroring the image or window shifting. All of these techniques may have particular pros and cons [41]. In this study, because of the dispersion of landslides and their sizes, shapes and directions, we used a random window shifting technique. Applying this data augmentation technique increases the size of our training dataset from the randomly selected 3500 points that were also used for our ML methods (ANN, SVM and RF) to approximately 7000 image sample patches. It means we used 3500 original samples plus 3500 shifted samples to train the CNNs used.

3.6.3. Different Layer Depth CNNs

Structuring the finest architecture and the optimal layer depth for any specific application of the CNN is an ongoing discussion in the deep learning domain [63]. In this study, to account for multiple input window sizes, we structured two CNNs with different layer depths (see Figure 5). A four-layer depth CNN was structured and trained separately with all of our input window sizes (12 × 12, 16 × 16, 22 × 22, 32 × 32 and 48 × 48). A deeper CNN with a seven-layer depth (we call it D-CNN) was structured to apply only large input window sizes (32 × 32 and 48 × 48). The numbers of CNN layers were selected based on cross-validation. By using two different layer depth CNNs, we could also investigate the impacts of the number of CNN layers on the object detection along with the different input window sizes.
The four-layer depth CNN was fed separately with all input window sizes of training sample patches ones using the four original spectral bands (R,G,B, NIR) and the NDVI. Since we fed this CNN with five-layer images, the input sample patch had a × a × 5 units (where a is: 12, 16, 22, 32, and 48). In another training process, we also used three topographic layers, namely slope, aspect and plan curvature, in addition to this spectral information, to train the method. In this case, the CNN was fed by the input sample patches with a × a × 8 units, where a × a is the size of one layer of sample patches, and 8 is the number of different layers used for the analysis. We used 40 feature maps, hence the number of a × a × 5 × 12 different weights were trained during the first hidden layer of the four-layer depth CNN by using input window sizes of a × a. As a result, 12 feature maps with (a − 4) × (a − 4) × 1 units were obtained through a convolution layer with the kernel size of 5. A max-pooling layer of 2 × 2 was used immediately after the first hidden layer, which reduced the units to ( a 4 2 ) × ( a 4 2 ) × 1 in the same number of feature maps from the previous layer. The resulting feature maps were then used as input data for the second convolution layer. Consequently, convolution with a kernel size of 3 led to 20 feature maps with [ ( a 4 2 ) 2 ] × [ ( a 4 2 ) 2 ] × 1 units. We selected the kernel sizes and the number of feature maps with consideration of the used input window sizes and the spread of our target objects. The last convolution layer was used with a kernel size of 3, which resulted in 40 feature maps with [ ( a 4 2 ) 4 ] × [ ( a 4 2 ) 4 ] × 1 units. The processes were executed in Trimble’s eCognition software environment with the CNN implementation based on the Google TensorFlow library. During the training process, the gradients for each weight were calculated in each hidden layer, i.e., estimated using backpropagation. Moreover, a statistical gradient descent function was used to optimize the weights. We found that the following variables resulted in the best object detection rate in our study; a learning rate of 0.0001, 6000 training steps, and a batch size of 50.
Only two large input window sizes of training sample patches were used for the deeper method of the seven-layer depth CNN (D-CNN). In this case, the CNN was prepared using the same structure of a convolution layer with a kernel size of 5 as the first layer and continuing with further convolution layers with the kernel size of 3 and max-pooling layers of 2 × 2. This D-CNN was also trained with the input sample patches of a × a × 5 and a × a × 8 units.

4. Results

The described ML and CNN methods using all mentioned parameters were used on the study in two training zones and tested for another zone. For all tests, we removed those detected landslide objects which were smaller than 70 pixels to account for geometric inaccuracies between the fieldwork samples and the satellite imagery. For the CNN methods the optimal thresholds were used. The statistical analysis (e.g., minimum, maximum, sum, mean and standard deviation) of the landslide detection resulting maps were represented in Table 2. As described earlier, the main goals were (a) to compare the ML methods ANN, SVM and RF to CNNs and, (b) to investigate the impact of the input window size and the layer depth of a CNN on the accuracy of landslide detection. The samples resulting from multiple input window sizes for two non-landslide areas and two landslide areas are presented in Figure 6.
A total of eighteen landslide maps was generated based on all methods and parameters used. Figure 7 shows the 20 resulting maps. For the ML methods ANN, SVM and RF, we implemented these methods first with five spectral layers from the RapidEye images (R, G, B, NIR) and the NDVI and called the resulting maps ANN5, SVM 5 and RF 5 . In addition, we created eight layers including the five mentioned spectral bands plus the three topographical layers slope, aspect and plan curvature. We called these resulting layers ANN8, SVM 8 and RF 8 . This means that four landslide maps in total were created with ML methods. More parameters were used for implementing the CNN methods: in addition to using both five and eight-layer training data sets, different input window sizes and depths also were used. Therefore, for the CNN p , q , the index of p corresponds to the size of the convolution input window, while q indicates the number of input layers that were used for training the method. CNN refers to the four-layer depth CNN, and D-CNN refers to the deeper seven-layer CNN. Figure 8 shows an enlargment of two different sub-areas from the test area to illustrate some differences of the identified landslides for different types of input data layers and methodologies.

5. Accuracy Assessment

In this section, we outline some accuracy assessment methods, which are common and widely used in the remote sensing and the computer vision domains, which were used to evaluate the effectiveness and performance of the applied methods of MLs and CNNs by analyzing the conformity between the landslide inventory dataset and the products of the applied methods. Therefore, existing any uncertainty among the distribution, location, and boundaries of the areas where specified as the landslide in the inventory data set affecting the results of the accuracy assessment processes [9].

5.1. Quantitative Methods

The comparison of accuracies is based on three kinds of classified pixels, namely, true positive (TP), false positive (FP), and false negative (FN) [31,41]. TPs are the pixels that were correctly detected as landslide areas. FPs correspond to pixels that were identified as landslide areas based on the classification but are not landslides according to the inventory data set. FNs indicate inventory landslide areas that are not recognized as such by the applied method (see Figure 9). The corresponding area statistics for these three cases is represented in Table 3 for different methods and parameters. To statistically describe the resulting total areas for TPs, FPs and FNs, three different parameters were used, namely precision, recall, and F1. Precision (P) was used to define how much of the classified areas are really landslides. Recall (R) was used to determine how much of the actual (field-measured) landslide areas were classified in the images. The well-known F1 measure was additionally used to calculate the balance between the two mentioned accuracy descriptors. The three measures are explained as Equations (2)–(4), respectively.
Precision =   True   Positives / ( True   Positives   +   False   Positives )
Recall =   True   Positives / ( True   Positives   +   False   Negatives )
f 1   measure = 2   × precision × recall / ( precision + recall )
Table 3 quantitatively illustrates the results of the ML and CNN methods of landslide detection approaches. For our case study area, the CNN 16 , 5 method achieved the highest precision value of 83.31% closely followed by RF 5 and RF 8 , which achieved precision values of 81.95 and 80.9%, respectively. The CNN 22 , 5 produced the lowest FN value and obtained the best recall metrics value (92.85%). However, the resulting precision and F1 values of this method were lower. CNN 16 , 5 yielded the highest F1 value of 87.8%.

5.2. Mean Intersection-over-Union (mIOU)

The mIOU is a validation metric used to measure the accuracy of the result of a predictor method on a particular dataset. This validation metric is widely used in computer vision, particularly for object detection challenges [64]. Generally, mIOU is a general validation metric where any method that produces bounding polygons can be validated by using mIoU based on a precise inventory dataset of target polygons (see Figure 10). It is described as the mean of the following Equation (5):
IOU = Area   of   Overlap / Area   of   Union
The mIOU value for each resulting landslide map was calculated and represented in Table 3. Based on these values, CNN 16 , 5 yielded the best landslide detection results with the highest mIOU value of 78.26, followed by 70.62 and 66.9 obtained with CNN 16 , 8 and RF   8 , respectively (see Table 3).

6. Discussion

This study proves that it is important to select the appropriate methods and parameters. It is not as simple as to generally compare, for instance, ML methods with CNNs. It turns out that there are multiple options to design a CNN and that a CNN will not automatically outperform other methods—as popular science articles and magazines may imply. Also, for the same method used, different training strategies will influence the results. In this study, we used two different training data sets, different numbers of layers and different depths of CNNs. First, we only focused on the spectral information. This five-layer training data set showed more accurate results than using the eight-layer training data set which also had three topographic layers. Almost all of the applied methods (except for CNN 32 ) yielded better results when using only the spectral information (four original bands plus NDVI). Therefore, we may say that in this study the topographical information did not improve the results. This was somewhat unexpected, and it is unclear whether topographical information could be used in other study areas with other settings to improve the overall classification accuracy or not. Although topographical information slightly reduced the overall accuracy of the results, it was very helpful for distinguishing between settlement areas and the landslide areas, which have a similar spectral behavior (see Figure 11). Here, especially the slope layer was useful because most of the landslides typically occur on steep slopes which are unlikely to be settlement areas. Thus, most of the eight-layer-based results clearly distinguished settlement areas from landslide areas. However, since almost all landslides are located in steep areas, all used ML and CNN methods overestimate landslides in steep areas. The drawback of having these misclassifications associated with using eight layers was more significant in our case study compared to the advantage of having improved detection of settlements since there were only relatively few and small settlement areas.
For CNNs, the effects of using different input window sizes were evaluated. Any used CNN input window size yielded similar results between the two training datasets (see Figure 12) while different CNN input window sizes resulted in different accuracies. Unfortunately, these differences are not systematic. For instance, increasing the CNN input window size from CNN 12 to CNN 16 improved the accuracy but further increases led to lower overall accuracies and especially lower mIOU values. This is presumably due to the fact that larger input windows negatively influenced the classification of random points distributed within the landslide polygons of the inventory data. Some of the randomly distributed points will be close to the border of a landslide area. Thus, non-landslide areas will increase along with increased input window sizes. Nevertheless, increasing the layer depth from four to seven layers in the CNN method resulted in a better performance of larger input window sizes for both training datasets (see Figure 13). Although the deeper structured D-CNN was limited to input window sizes of 32 and 48, this method could significantly improve these two input window sizes accuracies compared to using the 4 layers CNN. Most significantly, the mIOU value for CNN 32 , 5 increased from 48.42% to 63.66% by using the deeper method (D- DCNN 32 , 5 ).

7. Conclusions

The increasing availability of VHR remotely sensed imagery opens many options for landslide mapping and for producing and updating landslide inventories. Landslide mapping is still a challenging task due to the complexity of factors triggering landslides and the many forms, sizes, and shapes landslides can take. Landslide inventory data sets are traditionally generated by field surveys and visual interpretation of satellite images. However, such surveys are time-consuming, expensive and often dangerous. There are also some semi-automated and case-based automatic methods for landslide detection and even classification [65]. Recently, ML, deep-learning methods and particularly CNNs have been shown to be powerful in object detection from images if a huge number of training samples exist. In this study, we analyzed and compared current ML and CNN methods for landslide detection in a case study in the higher Himalayas. We considered different training data sets and parameters. Our experiments revealed that CNN do not necessarily outperform ANN, SVM and RF, rather only in ideal cases. How can a user know in advance which CNN structure is ideal? In our case, CNNs based only on spectral information and 16 pixel input window size yielded the best results. As described, we expected that adding topographic layers would improve the accuracy, but this was not the case. We also observed that larger input window sizes for the same CNN structure tendentially decreased accuracies. Conversely, deeper CNN layer structures can be positive. However, how can users know this beforehand? We may conclude that CNNs have a high potential for landslide detection, but users should not be misled by results from crowdsourcing campaigns in computer vision (is there a cat in an image).
On the positive side, deep-learning object detection methods require less human supervision than traditional methods and can be easily transformed to other regions by retraining the model with other related training data; however, the resulted accuracy of the transition of these methods for extracting the landslides on a global scale is still unclear. CNN may be the most efficient for the recognition of complex image patterns and for semantic classifications. CNNs and deep-learning methods at the pixel level seem to be problematic for deriving exact borders of landslides. For our future work, we aim to develop an object-based CNN method for landslide detection. Instead of using CNN input windows based on random points, which yielded moderate accuracies—especially for larger input windows—in this study, we want to use object segments with precise boundaries and define the optimal input window size based on a hierarchical patch dynamics paradigm [66] or some simple ways like the bounding box of a landslide. However, such a methodology will require even more training data. Augmentation strategies may help—as in this study—but their effects on the results are currently not fully understood.

Author Contributions

Conceptualization, O.G., T.B., D.T., and J.A.; Data curation, S.R.M.; Funding acquisition, T.B.; Investigation, O.G.; Methodology, O.G., K.G. and D.T.; Supervision, T.B. and J.A.; Validation, O.G., and K.G.; Visualization, O.G.; Writing—original draft, O.G.; Writing—review & editing, T.B., D.T., and J.A. All authors read and approved the final manuscript.

Funding

This research is partly funded by the Austrian Science Fund (FWF) through the GIScience Doctoral College (DK W 1237-N23). The work of Dirk Tiede has been supported by the Austrian Science Fund through the project MORPH (Mapping, Monitoring and Modeling the Spatio-Temporal Dynamics of Land Surface Morphology; FWF-P29461-N29).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pourghasemi, H.; Gayen, A.; Park, S.; Lee, C.-W.; Lee, S. Assessment of landslide-prone areas and their zonation using logistic regression, logitboost, and naïvebayes machine-learning algorithms. Sustainability 2018, 10, 3697. [Google Scholar] [CrossRef]
  2. Hölbling, D.; Füreder, P.; Antolini, F.; Cigna, F.; Casagli, N.; Lang, S. A semi-automated object-based approach for landslide detection validated by persistent scatterer interferometry measures and landslide inventories. Remote Sens. 2012, 4, 1310–1336. [Google Scholar] [CrossRef]
  3. Hong, H.; Chen, W.; Xu, C.; Youssef, A.M.; Pradhan, B.; Tien Bui, D. Rainfall-induced landslide susceptibility assessment at the Chongren area (China) using frequency ratio, certainty factor, and index of entropy. Geocarto Int. 2017, 32, 139–154. [Google Scholar] [CrossRef]
  4. Pradhan, B. A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput. Geosci. 2013, 51, 350–365. [Google Scholar] [CrossRef] [Green Version]
  5. Solway, L. Socio-economic perspective of developing country megacities vulnerable to flood and landslide hazards. In Floods and Landslides: Integrated Risk Assessment; Springer: Berlin, Heidelberg, 1999; pp. 245–260. [Google Scholar]
  6. Svalova, V. Landslide risk management for urbanized territories. In Risk Management Treatise for Engineering Practitioners; Oduoza, C.F., Ed.; IntechOpen: London, UK, 2018. [Google Scholar]
  7. Cascini, L.; Bonnard, C.; Corominas, J.; Jibson, R.; Montero-Olarte, J. Landslide hazard and risk zoning for urban planning and development. In Landslide Risk Management; Hungr, O., Fell, R., Couture, R., Eberhardt, E., Eds.; Taylor & Francis: London, UK, 2005; pp. 199–235. [Google Scholar]
  8. Martire, D.; De Rosa, M.; Pesce, V.; Santangelo, M.A.; Calcaterra, D. Landslide hazard and land management in high-density urban areas of Campania region, Italy. Nat. Hazards Earth Syst. Sci. 2012, 12, 905–926. [Google Scholar] [CrossRef] [Green Version]
  9. Mezaal, M.; Pradhan, B.; Rizeei, H. Improving landslide detection from airborne laser scanning data using optimized Dempster–Shafer. Remote Sens. 2018, 10, 1029. [Google Scholar] [CrossRef]
  10. Feizizadeh, B.; Ghorbanzadeh, O. GIS-based interval pairwise comparison matrices as a novel approach for optimizing an analytical hierarchy process and multiple criteria weighting. GI_Forum 2017, 1, 27–35. [Google Scholar] [CrossRef]
  11. Myronidis, D.; Papageorgiou, C.; Theophanous, S. Landslide susceptibility mapping based on landslide history and analytic hierarchy process (AHP). Nat. Hazards 2016, 81, 245–263. [Google Scholar] [CrossRef]
  12. Aghdam, I.N.; Varzandeh, M.H.M.; Pradhan, B. Landslide susceptibility mapping using an ensemble statistical index (wi) and adaptive neuro-fuzzy inference system (ANFIS) model at Alborz mountains (Iran). Environ. Earth Sci. 2016, 75, 553. [Google Scholar] [CrossRef]
  13. Sezer, E.A.; Pradhan, B.; Gokceoglu, C. Manifestation of an adaptive neuro-fuzzy model on landslide susceptibility mapping: Klang valley, Malaysia. Expert Syst. Appl. 2011, 38, 8208–8219. [Google Scholar] [CrossRef]
  14. Pourghasemi, H.R.; Kerle, N. Random forests and evidential belief function-based landslide susceptibility assessment in western Mazandaran province, Iran. Environ. Earth Sci. 2016, 75, 185. [Google Scholar] [CrossRef]
  15. Chen, W.; Xie, X.; Wang, J.; Pradhan, B.; Hong, H.; Bui, D.T.; Duan, Z.; Ma, J. A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility. Catena 2017, 151, 147–160. [Google Scholar] [CrossRef]
  16. Ghorbanzadeh, O.; Blaschke, T.; Aryal, J.; Gholaminia, K. A new GIS-based technique using an adaptive neuro-fuzzy inference system for land subsidence susceptibility mapping. J. Spat. Sci. 2018, 1–17. [Google Scholar] [CrossRef]
  17. Ghorbanzadeh, O.; Rostamzadeh, H.; Blaschke, T.; Gholaminia, K.; Aryal, J. A new GIS-based data mining technique using an adaptive neuro-fuzzy inference system (ANFIS) and k-fold cross-validation approach for land subsidence susceptibility mapping. Nat. Hazards 2018, 1–21. [Google Scholar] [CrossRef]
  18. Feizizadeh, B.; Roodposhti, M.S.; Jankowski, P.; Blaschke, T. A GIS-based extended fuzzy multi-criteria evaluation for landslide susceptibility mapping. Comput. Geosci. 2014, 73, 208–221. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Voigt, S.; Kemper, T.; Riedlinger, T.; Kiefl, R.; Scholte, K.; Mehl, H. Satellite image analysis for disaster and crisis-management support. IEEE Trans. Geosci. Remote Sens. 2007, 45, 1520–1528. [Google Scholar] [CrossRef]
  20. Plank, S.; Twele, A.; Martinis, S. Landslide mapping in vegetated areas using change detection based on optical and polarimetric SAR data. Remote Sens. 2016, 8, 307. [Google Scholar] [CrossRef]
  21. Guzzetti, F.; Mondini, A.C.; Cardinali, M.; Fiorucci, F.; Santangelo, M.; Chang, K.-T. Landslide inventory maps: New tools for an old problem. Earth-Sci. Rev. 2012, 112, 42–66. [Google Scholar] [CrossRef] [Green Version]
  22. Manconi, A.; Casu, F.; Ardizzone, F.; Bonano, M.; Cardinali, M.; De Luca, C.; Gueguen, E.; Marchesini, I.; Parise, M.; Vennari, C. Brief communication: Rapid mapping of landslide events: The 3 December 2013 Montescaglioso landslide, Italy. Nat. Hazards Earth Syst. Sci. 2014, 14, 1835–1841. [Google Scholar] [CrossRef]
  23. Lang, S.; Schöpfer, E.; Zeil, P.; Riedler, B. Earth observation for humanitarian assistance. GI Forum–J. Geogr. Inf. Sci. 2017, 1, 157–165. [Google Scholar] [CrossRef]
  24. Blaschke, T. Object based image analysis for remote sensing. ISPRS J. Photogramm. Remote Sens. 2010, 65, 2–16. [Google Scholar] [CrossRef] [Green Version]
  25. Duro, D.C.; Franklin, S.E.; Dubé, M.G. A comparison of pixel-based and object-based image analysis with selected machine learning algorithms for the classification of agricultural landscapes using Spot-5 HRG imagery. Remote Sens. Environ. 2012, 118, 259–272. [Google Scholar] [CrossRef]
  26. Roodposhti, M.S.; Aryal, J.; Bryan, B.A. A novel algorithm for calculating transition potential in cellular automata models of land-use/cover change. Environ. Model. Softw. 2019, 112, 70–81. [Google Scholar] [CrossRef]
  27. Mezaal, M.R.; Pradhan, B.; Sameen, M.I.; Mohd Shafri, H.Z.; Yusoff, Z.M. Optimized neural architecture for automatic landslide detection from high-resolution airborne laser scanning data. Appl. Sci. 2017, 7, 730. [Google Scholar] [CrossRef]
  28. Danneels, G.; Pirard, E.; Havenith, H.-B. Automatic landslide detection from remote sensing images using supervised classification methods. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2007, Barcelona, Spain, 23–28 July 2007; IEEE: Barcelona, Spain, 2007; pp. 3014–3017. [Google Scholar]
  29. Bui, D.T.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar]
  30. Zhu, X.X.; Tuia, D.; Mou, L.; Xia, G.-S.; Zhang, L.; Xu, F.; Fraundorfer, F. Deep learning in remote sensing: A comprehensive review and list of resources. IEEE Geosci. Remote Sens. Mag. 2017, 5, 8–36. [Google Scholar] [CrossRef]
  31. Ghorbanzadeh, O.; Tiede, D.; Dabiri, Z.; Sudmanns, M.; Lang, S. Dwelling extraction in refugee camps using CNN—First experiences and lessons learnt. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2018, 161–166. [Google Scholar] [CrossRef]
  32. Längkvist, M.; Kiselev, A.; Alirezaie, M.; Loutfi, A. Classification and segmentation of satellite orthoimagery using convolutional neural networks. Remote Sens. 2016, 8, 329. [Google Scholar] [CrossRef]
  33. Maggiori, E.; Tarabalka, Y.; Charpiat, G.; Alliez, P. Convolutional neural networks for large-scale remote-sensing image classification. IEEE Trans. Geosci. Remote Sens. 2017, 55, 645–657. [Google Scholar] [CrossRef]
  34. Long, J.; Shelhamer, E.; Darrell, T. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 3431–3440. [Google Scholar]
  35. Qayyum, A.; Malik, A.S.; Saad, N.M.; Iqbal, M.; Faris Abdullah, M.; Rasheed, W.; Rashid Abdullah, T.A.; Bin Jafaar, M.Y. Scene classification for aerial images based on CNN using sparse coding technique. Int. J. Remote Sens. 2017, 38, 2662–2685. [Google Scholar] [CrossRef]
  36. Radovic, M.; Adarkwa, O.; Wang, Q. Object recognition in aerial images using convolutional neural networks. J. Imaging 2017, 3, 21. [Google Scholar] [CrossRef]
  37. Deng, Z.; Sun, H.; Zhou, S.; Zhao, J.; Lei, L.; Zou, H. Fast multiclass object detection in optical remote sensing images using region based convolutional neural networks. In Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA, 23–28 July 2017; pp. 858–861. [Google Scholar]
  38. Dahmane, M.; Foucher, S.; Beaulieu, M.; Riendeau, F.; Bouroubi, Y.; Benoit, M. Object detection in pleiades images using deep features. In Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China, 10–15 July 2016; pp. 1552–1555. [Google Scholar]
  39. Längkvist, M.; Alirezaie, M.; Kiselev, A.; Loutfi, A. Interactive learning with convolutional neural networks for image labeling. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), New York, NY, USA, 9–15 July 2016. [Google Scholar]
  40. Yang, H.L.; Lunga, D.; Yuan, J. Toward country scale building detection with convolutional neural network using aerial images. In Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA, 23–28 July 2017; pp. 870–873. [Google Scholar]
  41. Guirado, E.; Tabik, S.; Alcaraz-Segura, D.; Cabello, J.; Herrera, F. Deep-learning convolutional neural networks for scattered shrub detection with google earth imagery. arXiv, 2017; arXiv:1706.00917. [Google Scholar]
  42. Yu, H.; Ma, Y.; Wang, L.; Zhai, Y.; Wang, X. A landslide intelligent detection method based on CNN and rsg_r. In Proceedings of the 2017 IEEE International Conference on Mechatronics and Automation (ICMA), Takamatsu, Japan, 6–9 August 2017; IEEE: Takamatsu, Japan, 2017; pp. 40–44. [Google Scholar]
  43. Ding, A.; Zhang, Q.; Zhou, X.; Dai, B. Automatic recognition of landslide based on CNN and texture change detection. In Proceedings of the Chinese Association of Automation (YAC), Youth Academic Annual Conference, Wuhan, China, 11–13 November 2016; IEEE: Wuhan, China, 2016; pp. 444–448. [Google Scholar]
  44. Roback, K.; Clark, M.K.; West, A.J.; Zekkos, D.; Li, G.; Gallen, S.F.; Chamlagain, D.; Godt, J.W. The size, distribution, and mobility of landslides caused by the 2015 Gorkha earthquake, Nepal. Geomorphology 2018, 301, 121–138. [Google Scholar] [CrossRef]
  45. Regmi, A.D.; Dhital, M.R.; Zhang, J.-Q.; Su, L.-J.; Chen, X.-Q. Landslide susceptibility assessment of the region affected by the 25 April 2015 Gorkha earthquake of Nepal. J. Mt. Sci. 2016, 13, 1941–1957. [Google Scholar] [CrossRef]
  46. Gnyawali, K.R.; Adhikari, B.R. Spatial relations of earthquake induced landslides triggered by 25 April 2015 Gorkha earthquake, mw 7.8, Nepal. In Proceedings of the EGU General Assembly, Vienna, Austria, 17–22 April 2016; p. 18429. [Google Scholar]
  47. Mahdianpari, M.; Salehi, B.; Rezaee, M.; Mohammadimanesh, F.; Zhang, Y. Very deep convolutional neural networks for complex land cover mapping using multispectral remote sensing imagery. Remote Sens. 2018, 10, 1119. [Google Scholar] [CrossRef]
  48. Modzelewska, A.; Stereńczak, K.; Mierczyk, M.; Maciuk, S.; Bałazy, R.; Zawiła-Niedźwiecki, T. Sensitivity of vegetation indices in relation to parameters of Norway spruce stands. Folia Forestalia Polonica 2017, 59, 85–98. [Google Scholar] [CrossRef] [Green Version]
  49. Ohlmacher, G.C. Plan curvature and landslide probability in regions dominated by earth flows and earth slides. Eng. Geol. 2007, 91, 117–134. [Google Scholar] [CrossRef]
  50. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  51. Melville, B.; Lucieer, A.; Aryal, J. Object-based random forest classification of Landsat ETM+ and worldview-2 satellite imagery for mapping lowland native grassland communities in Tasmania, Australia. Int. J. Appl. Earth Obs. Geoinf. 2018, 66, 46–55. [Google Scholar] [CrossRef]
  52. Xu, R.; Lin, H.; Lü, Y.; Luo, Y.; Ren, Y.; Comber, A. A modified change vector approach for quantifying land cover change. Remote Sens. 2018, 10, 1578. [Google Scholar] [CrossRef]
  53. Chen, T.; Trinder, J.C.; Niu, R. Object-oriented landslide mapping using ZY-3 satellite imagery, random forest and mathematical morphology, for the three-gorges reservoir, china. Remote Sens. 2017, 9, 333. [Google Scholar] [CrossRef]
  54. Vapnik, V. Statistical Learning Theory; Wiley: New York, NY, USA, 1998; Volume 3. [Google Scholar]
  55. Janik, P.; Lobos, T. Automated classification of power-quality disturbances using SVM and RBF networks. IEEE Trans. Power Deliv. 2006, 21, 1663–1669. [Google Scholar] [CrossRef]
  56. Goldarag, Y.J.; Mohammadzadeh, A.; Ardakani, A. Fire risk assessment using neural network and logistic regression. J. Indian Soc. Remote Sens. 2016, 44, 885–894. [Google Scholar] [CrossRef]
  57. Paola, J.; Schowengerdt, R. A review and analysis of backpropagation neural networks for classification of remotely-sensed multi-spectral imagery. Int. J. Remote Sens. 1995, 16, 3033–3058. [Google Scholar] [CrossRef]
  58. Park, S.; Choi, C.; Kim, B.; Kim, J. Landslide susceptibility mapping using frequency ratio, analytic hierarchy process, logistic regression, and artificial neural network methods at the Inje area, Korea. Environ. Earth Sci. 2013, 68, 1443–1464. [Google Scholar] [CrossRef]
  59. Zhang, C.; Sargent, I.; Pan, X.; Li, H.; Gardiner, A.; Hare, J.; Atkinson, P.M. An object-based convolutional neural network (OCNN) for urban land use classification. Remote Sens. Environ. 2018, 216, 57–70. [Google Scholar] [CrossRef]
  60. Strigl, D.; Kofler, K.; Podlipnig, S. Performance and scalability of GPU-based convolutional neural networks. In Proceedings of the 2010 18th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Pisa, Italy, 17–19 February 2010; IEEE: Pisa, Italy, 2010; pp. 317–324. [Google Scholar]
  61. Dixit, M.; Kwitt, R.; Niethammer, M.; Vasconcelos, N. Attribute-guided augmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 7455–7463. [Google Scholar]
  62. Liu, B.; Dixit, M.; Kwitt, R.; Vasconcelos, N. Feature space transfer for data augmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–20 June 2018; pp. 9090–9098. [Google Scholar]
  63. Csillik, O.; Cherbini, J.; Johnson, R.; Lyons, A.; Kelly, M. Identification of citrus trees from unmanned aerial vehicle imagery using convolutional neural networks. Drones 2018, 2, 39. [Google Scholar] [CrossRef]
  64. Chen, L.-C.; Barron, J.T.; Papandreou, G.; Murphy, K.; Yuille, A.L. Semantic image segmentation with task-specific edge detection using cnns and a discriminatively trained domain transform. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 26 June–1 July 2016; pp. 4545–4554. [Google Scholar]
  65. Dou, J.; Chang, K.-T.; Chen, S.; Yunus, A.P.; Liu, J.-K.; Xia, H.; Zhu, Z. Automatic case-based reasoning approach for landslide detection: Integration of object-oriented image analysis and a genetic algorithm. Remote Sens. 2015, 7, 4318–4342. [Google Scholar] [CrossRef]
  66. Blaschke, T.; Piralilou, S.T. The near-decomposability paradigm re-interpreted for place-based GIS. In Proceedings of the 1st Workshop on Platial Analysis (PLATIAL’18), Heidelberg, Germany, 20–21 September 2018; pp. 20–21. [Google Scholar]
Figure 1. A true color composite of RapidEye bands 3/2/1 acquired on 18 June 2015, illustrating the geographic location of the study area.
Figure 1. A true color composite of RapidEye bands 3/2/1 acquired on 18 June 2015, illustrating the geographic location of the study area.
Remotesensing 11 00196 g001
Figure 2. (a) NDVI and the produced topographic factors (b) slope, (c) aspect and (d) plan curvature.
Figure 2. (a) NDVI and the produced topographic factors (b) slope, (c) aspect and (d) plan curvature.
Remotesensing 11 00196 g002
Figure 3. Field photographs showing landslide areas during field survey in the Rasuwa district.
Figure 3. Field photographs showing landslide areas during field survey in the Rasuwa district.
Remotesensing 11 00196 g003
Figure 4. The multiple CNN input window sizes for (a) landslide and (b) non-landslide areas.
Figure 4. The multiple CNN input window sizes for (a) landslide and (b) non-landslide areas.
Remotesensing 11 00196 g004
Figure 5. The CNN architectures with (a) seven-layer depth (D-CNN) and (b) four-layer depth CNN, training separately with five spectral layers and eight layers that included spectral plus with topographical ones. Input window sizes of 32 × 32 and 48 × 48 were used for D-CNN, and window sizes of 12 × 12, 16 × 16, 22 × 22, 32 × 32 and 48 × 48 were used for CNN.
Figure 5. The CNN architectures with (a) seven-layer depth (D-CNN) and (b) four-layer depth CNN, training separately with five spectral layers and eight layers that included spectral plus with topographical ones. Input window sizes of 32 × 32 and 48 × 48 were used for D-CNN, and window sizes of 12 × 12, 16 × 16, 22 × 22, 32 × 32 and 48 × 48 were used for CNN.
Remotesensing 11 00196 g005
Figure 6. An illustration of convolution input window sizes from (a) two non-landslide areas and (b) two landslide areas.
Figure 6. An illustration of convolution input window sizes from (a) two non-landslide areas and (b) two landslide areas.
Remotesensing 11 00196 g006
Figure 7. Landslide detection results using different ML and CNN methods, training datasets and parameters. In any CNN p , q and ML q the index of p corresponds to the size of the convolution input window, and q indicates the number of input layers that were used for training.
Figure 7. Landslide detection results using different ML and CNN methods, training datasets and parameters. In any CNN p , q and ML q the index of p corresponds to the size of the convolution input window, and q indicates the number of input layers that were used for training.
Remotesensing 11 00196 g007
Figure 8. Enlarged maps of two different sub-areas from the test area. Landslide detection results are overlayed on the inventory data. In any CNN p , q and ML q the index of p corresponds to the size of the convolution input window, and q indicates the number of input layers that were used for training.
Figure 8. Enlarged maps of two different sub-areas from the test area. Landslide detection results are overlayed on the inventory data. In any CNN p , q and ML q the index of p corresponds to the size of the convolution input window, and q indicates the number of input layers that were used for training.
Remotesensing 11 00196 g008
Figure 9. Inventory map with GPS-measured landslide polygons optimized by manual detection and correction using RapidEye satellite images (upper raw). The three sample areas of a, b and, c in the lower raw illustrate true positives (TP), false positives (FP) and false negatives (FN).
Figure 9. Inventory map with GPS-measured landslide polygons optimized by manual detection and correction using RapidEye satellite images (upper raw). The three sample areas of a, b and, c in the lower raw illustrate true positives (TP), false positives (FP) and false negatives (FN).
Remotesensing 11 00196 g009
Figure 10. Illustration of the area of overlap (a) and area of union (b) for a detected landslide as compared to the corresponding area from the inventory data set.
Figure 10. Illustration of the area of overlap (a) and area of union (b) for a detected landslide as compared to the corresponding area from the inventory data set.
Remotesensing 11 00196 g010
Figure 11. Enlarged maps of (a) CNN (16,5) and (b) CNN (16,8).
Figure 11. Enlarged maps of (a) CNN (16,5) and (b) CNN (16,8).
Remotesensing 11 00196 g011
Figure 12. The influence of multiple input window sizes on the F1 measure (left) and mIOU (right) of CNN method for both 5 and 8 layer training datasets.
Figure 12. The influence of multiple input window sizes on the F1 measure (left) and mIOU (right) of CNN method for both 5 and 8 layer training datasets.
Remotesensing 11 00196 g012
Figure 13. Differences between CNNs and deeper CNN methods (D-CNNs): F1 measure (left) and mIOU (right) for large input window sizes and 5 and 8 layer datasets.
Figure 13. Differences between CNNs and deeper CNN methods (D-CNNs): F1 measure (left) and mIOU (right) for large input window sizes and 5 and 8 layer datasets.
Remotesensing 11 00196 g013
Table 1. Landslide inventories prepared as part of different studies.
Table 1. Landslide inventories prepared as part of different studies.
Prepared by# LandslidesShapeCoverageStudy Reference
USGS24915PolygonGorkha, Nepal[44]
IMHE, CAS2645PolygonGorkha, Nepal[45]
Gnyawali and Adhikari 201619332PointGorkha, Nepal[46]
USGS: United States Geological Survey. IMHE, CAS: Institute of Mountain Hazards and Environment, Chinese Academy of Sciences.
Table 2. Statistical analysis of the landslide detection results. The count of the extracted polygons as landslide areas, the minimum and maximum sizes of the landslide areas, total detected areas, mean and standard deviation values.
Table 2. Statistical analysis of the landslide detection results. The count of the extracted polygons as landslide areas, the minimum and maximum sizes of the landslide areas, total detected areas, mean and standard deviation values.
MethodCountMinimum (ha)Maximum (ha)Sum (ha)Mean (ha)Standard Deviation (ha)
CNN 12 , 5 2470.17577.16555,631.545.37
CNN 12 , 8 3080.254.65530,631.996.51
CNN 16 , 5 2810.17596.32480,521.154.97
CNN 16 , 8 3210.18157.64447,281.657.08
CNN 22 , 5 2860.175136.85524,321.899.75
CNN 22 , 8 3410.185170.05426,050.964.59
CNN 32 , 5 3060.21208.48784,791.9311.08
CNN 32 , 8 3350.205174.4508,312.166.43
CNN 48 , 5 3140.2204.8426,832.3711.23
CNN 48 , 8 3850.22154.72478,021.13.46
D- CNN 32 , 5 2680.19558.87467,521.274.28
D- CNN 32 , 8 2770.276509,831.846.45
D- CNN 48 , 5 3060.2265.6589,931.494.39
D- CNN 48 , 8 3190.2274.08505,432.36.59
SVM 5 4210.175322.16754,852.2414.31
SVM 8 5140.175352.09798,721.6915.77
RF 5 3330.175117.9565,931.477.29
RF 8 4590.18282.02568,111.2711.62
ANN54890.175117.95991,981.054.64
ANN85460.175153.951125,750.994.5
Table 3. Landslide detection results for the test zone for the ML and CNN methods trained with two different training datasets of five and eight layers. For CNNs, multiple input window sizes and layer depths were applied. Accuracies are stated as precision, recall, F1-measure, and mIOU.
Table 3. Landslide detection results for the test zone for the ML and CNN methods trained with two different training datasets of five and eight layers. For CNNs, multiple input window sizes and layer depths were applied. Accuracies are stated as precision, recall, F1-measure, and mIOU.
MethodTP (ha)FP (ha)FN (ha)Precision (%)Recall (%)F1 (%)mIOU (%)
CNN 12 , 5 368113.2374.476.4783.1879.6866.23
CNN 12 , 8 344.07146.6439.9270.1189.678.6764.84
CNN 16 , 5 397.859.6323.0983.3192.887.878.26
CNN 16 , 8 345.2763.8938.1279.3386.5482.7870.62
CNN 22 , 5 351.02145.8927.4170.5192.8579.9466.8
CNN 22 , 8 260.4587.9877.6274.7477.0375.8761.13
CNN 32 , 5 325.24380.0379.5253.8882.6965.2548.42
CNN 32 , 8 279.58182.9745.7660.4485.9370.9655
CNN 48 , 5 210110.27106.5666.0267.9566.9750.35
CNN 48 , 8 226.61156.3295.0959.1266.3370.2847.29
D- CNN 32 , 5 297.63124.5345.3670.586.7777.7963.66
D- CNN 32 , 8 301.08162.3146.4464.9786.6374.2559.05
D- CNN 48 , 5 298.55201.3690.0259.7276.8367.250.6
D- CNN 48 , 8 273.14194.6937.658.3887.970.5654.04
SVM 5 385.9318.2850.6754.888.3967.6551.12
SVM 8 403.07395.6558. 4750.5187.3864.0147.07
RF 5 393.986.7185.3281.9582.1982.0769.6
RF 8 380.289.698.3180.979.4580.1766.9
ANN5499,83152,03340,1276,759, 5367,0350,41
ANN8445,9459,81220,0449,2266,9556,7339,6

Share and Cite

MDPI and ACS Style

Ghorbanzadeh, O.; Blaschke, T.; Gholamnia, K.; Meena, S.R.; Tiede, D.; Aryal, J. Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection. Remote Sens. 2019, 11, 196. https://doi.org/10.3390/rs11020196

AMA Style

Ghorbanzadeh O, Blaschke T, Gholamnia K, Meena SR, Tiede D, Aryal J. Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection. Remote Sensing. 2019; 11(2):196. https://doi.org/10.3390/rs11020196

Chicago/Turabian Style

Ghorbanzadeh, Omid, Thomas Blaschke, Khalil Gholamnia, Sansar Raj Meena, Dirk Tiede, and Jagannath Aryal. 2019. "Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection" Remote Sensing 11, no. 2: 196. https://doi.org/10.3390/rs11020196

APA Style

Ghorbanzadeh, O., Blaschke, T., Gholamnia, K., Meena, S. R., Tiede, D., & Aryal, J. (2019). Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection. Remote Sensing, 11(2), 196. https://doi.org/10.3390/rs11020196

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop