Next Article in Journal
Effect of Selenium Nanocomposites Based on Natural Polymer Matrices on the Biomass and Storage of Potato Tubers in a Field Experiment
Next Article in Special Issue
Soil Chemical Properties and Trace Elements after Wildfire in Mediterranean Croatia: Effect of Severity, Vegetation Type and Time-Since-Fire
Previous Article in Journal
Apple Fruit Size QTLs on Chromosomes 8 and 16 Characterized in ‘Honeycrisp’-Derived Germplasm
Previous Article in Special Issue
Food and Medicinal Uses of Ancestral Andean Grains in the Districts of Quinua and Acos Vinchos (Ayacucho-Peru)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Flood Mapping Method for Land Use Management in Small-Size Water Bodies: Validation of Spectral Indexes and a Machine Learning Technique

by
Lorena Lombana
* and
Antonio Martínez-Graña
Department of Geology, Faculty of Sciences, University of Salamanca, Plaza de la Merced s/n, 37008 Salamanca, Spain
*
Author to whom correspondence should be addressed.
Agronomy 2022, 12(6), 1280; https://doi.org/10.3390/agronomy12061280
Submission received: 24 February 2022 / Revised: 17 May 2022 / Accepted: 25 May 2022 / Published: 26 May 2022
(This article belongs to the Special Issue Resilience in Soils and Land Use)

Abstract

:
The assessment of flood disasters is considered an essential factor in land use management, being necessary to understand and define the magnitude of past events. In this regard, several flood diagnoses have been developed using Sentinel-2 multispectral imagery, especially in large water bodies. However, one of the main challenges is still related to floods, where water surfaces have sizes similar to the spatial resolution of the analyzed satellite images, being difficult to detect and map. Therefore, the present study developed a combined methodology for flood mapping in small-sized water bodies using Sentinel-2 MSI imagery. The method consisted of evaluating the effectiveness of the application and combination of (a) a super-resolution algorithm to improve image resolution, (b) a set of seven spectral indices for highlighting water-covered areas, such as AWE indices, and (c) two methods for flood mapping, including a machine learning method based on unsupervised classification (EM cluster) and 14 thresholding methods for automatic determination. The processes were evaluated in the Carrión River, Palencia, Spain. It was determined that the approach with the best results in flood mapping was the one that combined AWE spectral indices with methods such as Huang and Wang, Li and Tam, Otsu, moment preservation, and EM cluster classification, showing global accuracy and Kappa coefficient values higher than 0.88 and 0.75, respectively, when applying the quantitative accuracy index.

1. Introduction

Floods are among the most common and disastrous worldwide natural events, being the cause of many human life losses, heritage and environmental damages, and economic costs [1]. Those events are progressively more recurrent and catastrophic not only due to extreme weather-related to climate change but also owing to the increase in the number of exposed elements triggered by socio-economic activities that settle in fluvial spaces [2]. Thus, in order to implement flood risk management measures, it is necessary to understand the magnitude, severity, and frequency of past events [3]. Hence, flood disaster assessment must determine the flood event extension, which depends on timely and effective monitoring at a regional scale [4]. However, this diagnosis has difficulties due to the lack of coverage and density of meteorological and gauging stations [5].
Thus, taking advantage of the accelerated development of remote sensing techniques, massive satellite data have become an available practical source for flood monitoring [6]. Several studies have been carried out for this purpose, applying different methods [7,8]. Nevertheless, identifying the surfaces covered by water is the main task in flooding diagnoses, and hence it is necessary to implement remote sensing techniques that provide high spatial images with high acquisition frequency [4].
In this regard, the European Space Agency (ESA) has developed Earth observation missions under the Copernicus program for land monitoring, including the Sentinel-2 mission, which supports global satellite data with a wide width and multispectral band [9]. This mission provides freely available multispectral Images (MSI) with a 5-day revisit frequency and a 10 m spatial resolution for visible bands, making Sentinel-2 imagery an excellent resource for surface water mapping [7,10,11], becoming one of the main satellites used to detect flood extents [12,13].
In this aspect, during the last decades, different methodologies have been developed for the digital processing of satellite images, reducing costs, time, and efforts [11]. These methodologies involve several techniques, which can be classified into four main classes: single band, spectral index, machine learning, and spectral unmixing based methods [7].
Among the techniques applied, water spectral indexes are the most widely used methods to classify surfaces into water and non-water using multispectral images. These indices allow enhancing the contrast between pixels of these two categories, exploiting the information of different bands [8,14]. In this regard, the near-infrared (NIR) band is usually used to identify water and non-water pixels due to its high absorption and low reflectance rates [15,16].
The water indices have been improved to achieve good performances in the delineation of water bodies. In 1996, McFeeters developed the Normalized Difference Water Index (NDWI) [17], which uses green and NIR bands to obtain an index representing water with positive values and non-water with negative values. However, it was found that this index is not efficient in built-up suppressing, so the identified water surfaces can mix water and built-up land noises. Thus, the modified Normalized Difference Water Index (mNDWI) was implemented by using the shortwave-infrared (SWIR) band instead of the NIR band [18]. On the other hand, considering the classification issues caused by shadow noises, the AWE indexes (AWEIsh and AWEInsh) were developed [19]. These use different band combinations and weightings, including the blue, green, NIR, and SWIR bands, where water pixel values are above 0 and non-water pixels below 0. It was determined that AWEInsh is appropriate for images where clouds do not represent a problem, suppressing dark surfaces in urban areas. On the other hand, AWEIsh successfully removes shadow noises. Likewise, other studies have used vegetation indices to extract water features [20].
Although the spectral water indexes normally represent water and non-water pixels with values above or below 0, it is necessary to determine a suitable threshold value depending on the characteristics of the water surfaces in each multispectral image [11]. Different studies have established optimal threshold values by visual interpretation of frequency histograms; however, it is not a useful method, especially when analyzing large multispectral images [8]. Therefore, automatic histogram-based thresholding algorithms have been applied for this purpose, with Otsu’s method being the most widely used algorithm to improve the classification accuracy [11,15]. This one operates with the maximum between-class variance to recognize objects and background [10,21].
On the other hand, surface water mapping methodologies based on machine learning have been applied [7,22], including the classification algorithms, which can be categorized into supervised and unsupervised methods depending on the human training needed [7]. The unsupervised methods have been demonstrated to be effective in differentiating between water from non-water pixels [12,23]. One of the methods used is the Clustering algorithm, where points with similar features are automatically grouped into the same cluster, such as K-means clustering, mean-shift clustering, and expectation–maximization (EM) clustering [7,12].
Although many methods have been developed to improve water surface mapping, there are still many challenges in this subject, including the issues related to the delineation accuracy considering the resolution of remotely sensed images and the water bodies’ sizes. In this case, when water surfaces have sizes similar to the spatial resolution of the satellite images used, they are hard to detect and map [7,10,24].
In an effort to address part of these challenges, some studies have improved the spatial resolution of remotely sensed images by using sharpening algorithms; however, these have been applied to large water bodies, for instance, Poyang Lake in China, Aras River in Turkey, or Po River in Italy [6,10,25,26]. Some of them had used panchromatic bands with high resolution to sharper the bands with lower resolution and improved the delineation accuracy [25]. Nevertheless, while many satellite imagery includes a panchromatic band, some do not, such as the Sentinel-2 images. For these images, other sharpening algorithms have been applied [11,26]. These methods can be classified into three categories: pan-sharpening per band, inverting an explicit imaging model, and machine learning methods [27]. The pan-sharpening per band method increases the spatial resolution of each band independently, mixing information from a high-resolution band [11,25,28]. However, the complexity of these processes can restrict the efficiency of water body mapping [10]. On the other hand, the model-based methods obtain a high-resolution image by minimizing residual errors in a single optimization for all bands simultaneously [27]. Accordingly, Brodu [29] developed an algorithm for super-resolved multiresolution images, showing good results in agricultural lands with large uniform areas. It separates band-dependent spectral information from information that is common across all bands (“geometry of scene elements”), super-resolving the low-resolution bands while preserving their overall reflectance [29]. Although this method proved efficient in improving Sentinel-2 images’ spatial resolution, it has not been directly tested to enhance flood mapping.
Considering the above, and taking into consideration the challenges in surface water mapping and the diversity of methods and techniques developed around this issue, flood mapping is considered an open research topic because no single method has been found suitable for all data sets and all conditions [7].
So, by combining the advantages of data held in Sentinel-2 multispectral images (MSI), image pre-processing techniques, spectral indexes, and machine learning methods, the present study developed a combined automated methodology for flood mapping for one of the biggest challenges: small-sized water bodies. Accordingly, this study involved four main phases (Figure 1): (1) Improve Sentinel-2 images resolution by applying and comparing resample, and a super-resolving algorithm; (2) Assess and compare the effectiveness of seven spectral indexes in highlight flood surfaces; (3) Apply and compare different flood extent mapping methods including 14 thresholding algorithms and a machine learning method for unsupervised classification; (4) Evaluate the performance of the flood mapping methods used; to finally define the most accurate combined method for flood mapping in a small-sized water body.
The water body selected to validate the methodology framework was the Carrión River in the Duero basin, located in Palencia, Spain (Figure 2). It has a channel length of 197 km, with a drainage area of 3368 km2, average values of width of 40 m, river level of 0.5 m, discharges of 12 m3/s, and an annual maximum of 38 m3/s [30]. This river presents a marked lateral migration with a wide alluvial plain, where an active and/or abandoned drainage network can be identified [31]. Throughout its alluvial plain, agricultural activities and urban land uses can be found, so different elements can be exposed to flood risks [32]. Thus, although the Carrión river can be classified as a narrow water body, it presents recurrent flood events due to its morphometric characteristics, fluvial dynamics, occupations, land cover, and land uses.

2. Methods

2.1. Data Source Selection and Pre-Processing

The imagery selected for the study was the high-resolution optical images provided by the Sentinel-2 satellite mission under the Copernicus Earth Observation program led by the European Commission and operated by the European Space Agency (ESA) (https://earth.esa.int/web/sentinel/home, accessed on 10 July 2021).
The time and areas of study in Carrion River were selected to assess post-flood events, considering diverse criteria. Regarding the selection of study dates, the hydrological year analyses were considered. The period selected corresponded to the 2019–2020 wet year, when the Duero basin presented rainfalls 105% higher than the 1981–2020 average value, with November and December being the most important months [33]. On the other hand, the study area selection in the basin took into consideration that the water and non-water surfaces should be diverse and include principal channels, reservoirs, vegetation, buildings, bare land, and fields. Accordingly, Level-2A S2 images were downloaded employing the Sentinel-2 Toolbox for ArcGIS 10.8, using a maximum cloud percentage of 20%, between 1 November 2019 and 31 December 2019. The detailed description of the image obtained is shown in Table 1.
The Level-2A images downloaded were pre-processed using the Sentinel Application Platform (SNAP), developed by ESA (https://earth.esa.int/eogateway/tools/snap, accessed on 10 July 2021). This process consisted of three different steps. First, to work over the areas belonging to the Carrion River, the image subsetting was carried out to obtain a raster of 3132 × 6540 pixels. Sentinel-2A data have multiple bands that include four 10 m visible and near-infrared bands, six 20 m vegetation red edge and short wave infrared bands, and three 60 m coastal aerosol, water vapor, and SWIR-Cirrus bands [34].
Considering that the bands have different resolutions, it is necessary to obtain images of high quality and the same pixel size. In order to obtain high spatial resolution data of 10 m-bands, two methods were applied and compared. Firstly, a resampling algorithm was used, implementing a tool available in Raster-Geometry subset tools of SNAP. It was used by taking as reference data the Blue band and a nearing algorithm. Secondly, a super-resolution method was applied using the Sen2Res tool in SNAP [29]. This method was developed for multispectral and multiresolution images, such as Sentinel-2, which obtains information from pixels that have the highest resolution and reproduce these details to all other bands; thus, an image with the best resolution can be obtained [29]. The method works for uniform areas such as agricultural lands. Finally, a land use band combination was composited (using the 11, 8A, and 4 bands) and the two methods applied were compared by visual interpretation.

2.2. Processing of Sentinel-2 Data: Spectral Indices

For the study, a set of 7 spectral indices were applied to highlight surfaces covered by water before applying flood extent mapping methods. This kind of indices used complex ratios of multiple bands, which are selected depending on the spectral characteristics of the target element studied [7] (a further description of the algorithms can be found in Table 2). In this case, to calculate the indexes, the Raster Math tool available in SNAP was implemented.

2.3. Flood Extent Mapping

The flood extent mapping consists of the surface water detection and delineation using remote sensing images; therefore, multiple studies have been carried out on this subject [7,8,14,38]. In this study, from the indices calculated, two different methods were evaluated, the unsupervised classification and the Automatic Threshold Determination.
The unsupervised classification was performed using the expectation–maximization (EM) algorithm, which considers the number of clusters that must be established as well as their center points which are randomly initialized [7]. In the study, eight classes were performed according to the cover land. For that, the EM cluster analysis tool supported in SNAP was applied to each spectral index.
On the other hand, the Automatic Threshold Determination, a single band method, consisted of the application of image binarization algorithms to classify pixels into different categories, in this case, water and non-water surfaces [7]. A total of 14 automatic thresholding methods were evaluated for each spectral index applied using the ImageJ software [39]. The algorithms used are presented in Table 3.

2.4. Accuracy Assessment of the Flood Mapping Methods

Regarding the assessment of the extraction of the flooded areas, the quantitative accuracy index was applied [10]. This widely used method involves the generation of random water and non-water sample points where water extraction results are verified. Thus, for the study, this process included four steps:
a. Generation of 200 water and non-water sample points over the study area by applying the Create Random Points tool in ArcGIS.
b. Once the points were generated, high-resolution images were employed to verify and adjust the random sample points. In this case, the images were obtained from the Spanish National Geographic Institute, IGN, provided by National Aerial Orthophotography Plan (PNOA 2017). Additionally, scatter plots were plotted to verify the reflectance of the water sample points, which is expected to be low if the Red and NIR-1 bands are contrasted.
c. Subsequently, it was identified whether each of the random points corresponded to surfaces classified as water or non-water according to each flood mapping method applied, so the total number of points correctly identified was calculated.
d. The accuracy of each method was evaluated by applying four assessment indices. For this purpose, a confusion matrix-based approach was developed (Table 4). Thus, by comparing the extracted water and non-water points with the reference data, four types of pixels were obtained: true positive (TP), the number of correctly extracted water pixels; false negative (FN), the number of undetected water pixels; false positive (FP), the number of incorrectly extracted water pixels; and true negative (TN), the number of correctly rejected non-water pixels [25]. Based on the four types of pixels, the producer accuracy, user accuracy, overall accuracy, and Kappa coefficient were applied according to the Equations (1)–(4), respectively [10].
P r o d u c e r s a c c u r a c y = T P T P + F N
U s e r s a c c u r a c y = T P T P + F P
O v e r a l l a c c u r a c y = T P + T N T
K a p p a c o e f f i c i e n t = T ( T P + T N ) ( ( T P + F P ) ( T P + F N ) + ( F N + T N ) ( F P + T N ) ) T 2 ( ( T P + F P ) ( T P + F N ) + ( F N + T N ) ( F P + T N ) )

2.5. Validation of the Index’s Effectiveness

By comparing the mean value of the index values in water and non-water points, the ability of each index to distinguish flood surfaces can be assessed [10], so the formula of the Contrast Value (CV) was applied (Equation (5)).
CV = E − K
where, E and K denote the mean value of each index in terms of the water and non-water surface, respectively.

3. Results

3.1. Image Processing

After pre-processing Sentinel-2 images, an atmosphere corrected subset image was obtained. By applying the resample and super-resolution methods, two different images were produced; thus, the 11 and 12 bands were improved to a 10 m-resolution (Figure 3). As can be seen, although two images with the same pixel size were obtained, the image produced by the super-resolution method shows better results than the resampling method. This is because the resampling method resamples each band independently, while super-resolution obtains information from other bands that have higher resolution [29]. In this sense, this method allowed obtaining high-resolution bands that are necessary to calculate spectral indexes, in this case, the shortwave-infrared bands (11-band and 12-band), going from 20 m to 10 m resolution. Consequently, the following steps in the proposed methodology were applied to the super-resolution image obtained.

3.2. Application of the Spectral Indices and Effectiveness Validation

A total of seven indices were effectively applied to the Sentinel-2 image to identify flooded areas, as shown in Figure 4. By comparing the results with a false-color composite image created to enhance water surfaces (Figure 4b) through a visual interpretation, it can be indicated that the AWEInsh and AWEIsh indices show accurate results in reflecting flooded areas, but in any case, with diffuse boundaries. It seems that, although the MNDWI index does not fully bright flooded areas, it manages to delimit the zones with sharper boundaries, as does the NDWI index, although this shows lesser precision. The NDPI, NDVI, and SAVI indices do not seem to represent good results in flooded areas reflecting. However, the effectiveness of each index and the application of flood extent mapping methods will be evaluated later through quantitative methods.
As a product, histograms of each spectral index were also obtained (Figure 5). As shown, AWEInsh and AWEIsh index show a marked unimodal distribution, while MNDWI, NDWI, and NDPI tend to represent a bimodal distribution. The other indices present different peaks, so they show multimodal distributions.
In order to assess the effectiveness of each spectral index used, the mean and contrast values were calculated for the water and non-water random points previously generated. Accordingly, boxplots were made to compare the distribution of the data (Figure 6). As shown, in terms of the water points, the mean values of all indexes are closer to zero (between −0.23 and 0.05) than the values obtained for non-Water points (which are between −1.08 and 0.37). The AWEIsh, MNDWI, and NDWI indices showed similar mean values for both water and non-water points. The same case occurred with the NDVI and SAVI indices. Therefore, the highest contrast value was calculated for the AWEInsh index, followed by the MNDWI and NDPI indices. At first, it suggests that AWEInsh, MNDWI, and NDPI spectral indices are more effective in highlighting water surfaces in Sentinel-2 images.

3.3. Flood Extent Mapping Performance

As could be interpreted from visual interpretation and contrast values, AWEInsh highlights water surfaces brighter than the other indexes. However, it is necessary to generate flood maps by different methods to develop spatial and quantitative analysis, as was previously detailed. Firstly, the threshold values were calculated for each index, as shown in Table 5. As can be seen, different thresholding methods provide the same results, such as the AWEInsh index applying the Huang’s, Li, and Otsu algorithms, and the AWEish index using the Huang’s, Isodata, and Percentile methods. In these cases, the flood maps produced are the same.
Each threshold value was used to classify pixels into the water and non-water surfaces; therefore, thematic maps from a single-band image were created by using discrete binary colors. Figure 7 shows the performances of each spectral index by applying the different thresholding methods and the unsupervised classification. It should have been previously indicated that Minimum, Yen, and Shanbhang thresholding methods did not work to classify pixels for any of the indices used.
As seen in Figure 7, mean and percentile algorithms extracted water bodies worse than the other methods for all indices. In addition, maximum entropy and intermodel methods only worked for the MNDWI and NDPI indexes. In contrast, the unsupervised classification method (EM Cluster) illustrated the water surfaces better than the thresholding methods in most cases.
Furthermore, the methods of Huang, Isodata, and Li and Tam only worked for the AWEInsh index. Thus, it seems that the best performances are obtained by classifying the AWEInsh index pixels with different methods.
Although the accuracy of each method can be explained by visual interpretation, it is important to analyze the results of the assessment methods implemented for this purpose. Thus, the overall accuracy and the Kappa coefficient values are summarized in Figure 8 and plotted in Figure 9 (the detailed results obtained by applying the producer accuracy, user accuracy, overall accuracy, and Kappa coefficient can be found in the Appendix A).
As can be seen, the AWE indexes showed the highest accuracy in water and non-water pixels categorization. In these cases, the most effective methods used were the EM unsupervised classification (which represents an overall accuracy of 0.94 and a Kappa coefficient of 0.88 in both cases) and the moment-preserving thresholding method (reaching overall and Kappa values of 0.94 and 0.89, respectively).
However, thresholding methods such as Huang and Wang, Li and Tam, Isodata and Otsu also illustrated optimal results for the AWEInsh index, with overall and Kappa values over 0.88 and 0.75, respectively.
The NDVI, NDWI, and SAVI indices presented low accurate results in most of the cases, showing the NDVI the worst performance. Nevertheless, the NDWI index reaches an overall accuracy of 0.85 and a kappa coefficient of 0.69 by using the Triangle thresholding method.

4. Discussion

The super-resolve method developed by Brodu [29] showed good results in improving the resolution of Sentinel-2 images in the area of interest, confirming its efficiency when applied to uniform areas representative of agricultural lands. This algorithm simulates a panchromatic sharpening, which can extract geometric information and use it to separate the low-resolution pixels while preserving their overall reflectance [29], so it is effective for super-resolved bands needed to calculate water spectral indexes, especially the SWIR bands [7,11,19,26].
Once the Sentinel-2 bands had the same high resolution (10 m), it was possible to apply the seven spectral indexes. Regarding the histograms obtained for each index (Figure 5), previous studies have shown similar results for MNDWI, NDWI, and AWEIsh histograms, where peak values usually represent land or water surfaces [15]. In this regard, different authors have demonstrated that index histograms with bimodal and multimodal distributions are appropriate for implementing thresholding methods [8]; however, this study was successful in using automatic thresholding algorithms in index histograms with a unimodal distribution, such as the case of the AWE indices. Additionally, it is important to emphasize that in all histograms, except for the SAVI, the maximum peaks were close to the index mean values calculated for non-water points.
By analyzing the contrast values, the AWEInsh, MNDWI, and NDPI represented the maximum values, suggesting that they may be more effective in highlighting water surfaces. However, as verified in the flood mapping performances, the AWE indexes showed the most accurate results, considering the overall and Kappa coefficients; on the contrary, the MNDWI and NDPI did not present satisfactory results, obtaining accuracy values below 0.84. On the other hand, spectral indexes, such as NDWI and mNDWI, usually used in water surface identification, were not useful for detecting flood areas in the analyzed small-size water body [10]. Consequently, as revealed, the AWE indexes were the most suitable for extracting flood areas in the study zone. This type of spectral indices has been structured to identify water features, so Ish was designed to remove shadow pixels and Insh for areas with an urban background [19]. Although the method was built for Landsat images, it works effectively for the Sentinel-2 image used in this study, as reported by other authors [14,15].
As for flood mapping methods, among the automatic thresholding algorithms employed, the moment-preserving thresholding method showed the most accurate results, followed by the Huang and Wang, Li and Tam, and Otsu algorithms, which also presented good outputs. However, the machine learning method for unsupervised classification (the EM cluster) showed the best results in all cases, only reached by the Moment-preserving thresholding algorithm. Consequently, according to the visual interpretation and the accuracy values obtained, it can be said that the flood mapping methods with Overall and Kappa values over 0.88 and 0.75, respectively, represent effective methods for the identification of flooded areas.
In this regard, the recommended combination of spectral indexes, thresholding algorithms, and/or unsupervised methods for flood mapping in small rivers is as follows: (a) Super-resolved Sentinel-2 images by applying the super-resolution method developed by Brodu [29], and obtain 10-m-resolution bands, including SWIR bands. (b) Calculate the AWEInsh index and apply the Huang and Wang, Li and Tam, Otsu, and EM cluster methods. (c) Calculate the AWEIsh index and use the moment-preserving algorithm and EM cluster method for pixel categorization. (d) Assess the accuracy of the flood mapping methods by using validation points and a derived confusion matrix.
The method proposed involves the use of simple and available access data and tools such as Sentinel-2 MSI, the SNAP (and its Sen2Res tool, Raster Math, cluster analysis packages), and ImageJ software. It holds the advantages of free access to information and an unsupervised algorithm that does not depend on training data efficiency [7], which makes the methodology a practical process for mapping floods in narrow rivers. However, further studies could evaluate the use of other machine learning techniques that can keep the pros of unsupervised classification [23].
Furthermore, the accuracy assessment results suggest that the method developed in this study is not just simple but effective in narrow rivers. Nevertheless, one limitation is related to the analysis of the effect of the presence of mixed spectra in the Sentinel-2 image pixels, so further studies should include this kind of evaluation [7]. In this regard, these analyses should include other accuracy assessment methods that allow measuring the results obtained by comparing flood maps with a Very High Resolution (VHR) imagery of the exact date of the flood event. It may involve new methods to quantitatively evaluate the extraction accuracy taking into account the spatial position [15]. On the other hand, it can be recommended that further investigations evaluate the methodology proposed using SAR images.

5. Conclusions

The combined automated method for water mapping developed in the present study contributes to improving the detection of surfaces covered by water during flood events in narrow rivers. This methodology exploits the advantages of data held in Sentinel-2 MSI by applying image pre-processing techniques and simple methods such as single band techniques, which include thresholding algorithms, combined with spectral indexes and unsupervised machine learning techniques. This can show accurate results in highlighting flood areas in small-size waterbodies whose width is similar to the size pixel.
It can be emphasized the effectiveness of AWE indexes in water surface detection in narrow rivers by combining mapping methods such as Huang and Wang, Li and Tam, and Otsu, moment-preserving thresholding algorithms, and EM cluster classification. In this order, the super-resolution method developed by Brodu was efficient in SWIR band super-resolving, enhancing the water spectral indexes application.
The method such as the proposed becomes a practical procedure for mapping floods in small-size water bodies due to it involves the use of free access data and software, such as Sentinel-2 MSI and SNAP, holding the advantages of unsupervised algorithms.
Future studies can evaluate this combined methodology in other situations in addition to agricultural lands, for instance, agricultural areas plus settlements and mixed water plus other vegetation conditions, among others. Another analysis could be performed, considering spectral mixture in pixels, different machine learning techniques based on unsupervised classification, and the application of other quantitative methods to accuracy assessment in flood mapping.

Author Contributions

Conceptualization, L.L.; methodology L.L. and A.M.-G.; software, L.L.; validation, L.L. and A.M.-G.; formal analysis, L.L.; investigation, L.L. and A.M.-G.; resources, L.L. and A.M.-G.; data curation, L.L. and A.M.-G.; writing—original draft preparation, L.L.; writing—review and editing, L.L.; visualization, L.L.; supervision, L.L. and A.M.-G.; project administration, A.M.-G.; funding acquisition, A.M.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

This work has been supported by the Regional Government of Castilla y León 2014–2020, European Social Fund Operational Programme, and the GEAPAGE research group (Environmental Geomorphology and Geological Heritage) of the University of Salamanca.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Results obtained in the accuracy assessment of the flood mapping methods by applying the producer accuracy, user accuracy, overall accuracy, and Kappa coefficient.

References

  1. European Court of Auditors—ECA. Floods Directive: Progress in Assessing Risks, While Planning and Implementation Need to Improve; Special Report No 25, 2018; ECA: Luxembourg, 2019. [Google Scholar] [CrossRef]
  2. Alfieri, L.; Burek, P.; Feyen, L.; Forzieri, G. Global warming increases the frequency of river floods in Europe. Hydrol. Earth Syst. Sci. 2015, 19, 2247–2260. [Google Scholar] [CrossRef] [Green Version]
  3. Sofia, G.; Nikolopoulos, E.I. Floods and rivers: A circular causality perspective. Sci. Rep. 2020, 10, 5175. [Google Scholar] [CrossRef] [Green Version]
  4. Dong, Z.; Wang, G.; Amankwah, S.O.Y.; Wei, X.; Hu, Y.; Feng, A. Monitoring the summer flooding in the Poyang Lake area of China in 2020 based on Sentinel-1 data and multiple convolutional neural networks. Int. J. Appl. Earth Obs. Geoinf. 2021, 102, 102400. [Google Scholar] [CrossRef]
  5. Caballero, I.; Ruiz, J.; Navarro, G. Sentinel-2 satellites provide near-real time evaluation of catastrophic floods in the West Mediterranean. Water 2019, 11, 2499. [Google Scholar] [CrossRef] [Green Version]
  6. Goffi, A.; Stroppiana, D.; Brivio, P.A.; Bordogna, G.; Boschetti, M. Towards an automated approach to map flooded areas from Sentinel-2 MSI data and soft integration of water spectral features. Int. J. Appl. Earth Obs. Geoinf. 2020, 84, 101951. [Google Scholar] [CrossRef]
  7. Bijeesh, T.V.; Narasimhamurthy, K.N. Surface water detection and delineation using remote sensing images: A review of methods and algorithms. Sustain. Water Resour. Manag. 2020, 6, 68. [Google Scholar] [CrossRef]
  8. Sekertekin, A. Potential of global thresholding methods for the identification of surface water resources using Sentinel-2 satellite imagery and normalized difference water index. J. Appl. Remote Sens. 2019, 13, 044507. [Google Scholar] [CrossRef]
  9. European Space Agency. SENTINEL-2 User Handbook; European Space Agency: Paris, France, 2015; Available online: https://sentinel.esa.int/documents/247904/685211/Sentinel-2_User_Handbook (accessed on 30 July 2021).
  10. Jiang, W.; Ni, Y.; Pang, Z.; Li, X.; Ju, H.; He, G.; Lv, J.; Yang, K.; Fu, J.; Qin, X. An Effective Water Body Extraction Method with New Water Index for Sentinel-2 Imagery. Water 2021, 13, 1647. [Google Scholar] [CrossRef]
  11. Du, Y.; Zhang, Y.; Ling, F.; Wang, Q.; Li, W.; Li, X. Water Bodies’ Mapping from Sentinel-2 Imagery with Modified Normalized Difference Water Index at 10-m Spatial Resolution Produced by Sharpening the SWIR Band. Remote Sens. 2016, 8, 354. [Google Scholar] [CrossRef] [Green Version]
  12. Landuyt, L.; Verhoest, N.E.C.; Van Coillie, F.M.B. Flood Mapping in Vegetated Areas Using an Unsupervised Clustering Approach on Sentinel-1 and -2 Imagery. Remote Sens. 2020, 12, 3611. [Google Scholar] [CrossRef]
  13. Hu, P.; Zhang, Q.; Shi, P.; Chen, B.; Fang, J. Flood-induced mortality across the globe: Spatiotemporal pattern and influencing factors. Sci. Total Environ. 2018, 643, 171–182. [Google Scholar] [CrossRef]
  14. Zhou, Y.; Dong, J.; Xiao, X.; Xiao, T.; Yang, Z.; Zhao, G.; Zou, Z.; Qin, Y. Open Surface Water Mapping Algorithms: A Comparison of Water-Related Spectral Indices and Sensors. Water 2017, 9, 256. [Google Scholar] [CrossRef]
  15. Yue, H.; Li, Y.; Qian, J.; Liu, Y. A new accuracy evaluation method for water body extraction. Int. J. Remote Sens. 2020, 41, 7311–7342. [Google Scholar] [CrossRef]
  16. Wang, H.; Chu, Y.; Huang, Z.; Hwang, C.; Chao, N. Robust, Long-term Lake Level Change from Multiple Satellite Altimeters in Tibet: Observing the Rapid Rise of Ngangzi Co over a New Wetland. Remote Sens. 2019, 11, 558. [Google Scholar] [CrossRef] [Green Version]
  17. McFeeters, S.K. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 1996, 17, 1425–1432. [Google Scholar] [CrossRef]
  18. Xu, H. Modification of normalised difference water index (NDWI) to enhance open water features in remotely sensed imagery. Int. J. Remote Sens. 2006, 27, 3025–3033. [Google Scholar] [CrossRef]
  19. Feyisa, G.L.; Meilby, H.; Fensholt, R.; Proud, S.R. Automated Water Extraction Index: A new technique for surface water mapping using Landsat imagery. Remote Sens. Environ. 2014, 140, 23–35. [Google Scholar] [CrossRef]
  20. Rokni, K.; Ahmad, A.; Selamat, A.; Hazini, S. Water Feature Extraction and Change Detection Using Multitemporal Landsat Imagery. Remote Sens. 2014, 6, 4173–4189. [Google Scholar] [CrossRef] [Green Version]
  21. Otsu, N. Threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. SMC 1979, 9, 62–66. [Google Scholar] [CrossRef] [Green Version]
  22. Huang, X.; Xie, C.; Fang, X.; Zhang, L. Combining Pixel-and Object-Based Machine Learning for Identification of Water-Body Types from Urban High-Resolution Remote-Sensing Imagery. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 2097–2110. [Google Scholar] [CrossRef]
  23. Zhang, Q.; Zhang, P.; Hu, X. Unsupervised GRNN flood mapping approach combined with uncertainty analysis using bi-temporal Sentinel-2 MSI imageries. Int. J. Digit. Earth 2021, 14, 1561–1581. [Google Scholar] [CrossRef]
  24. Bhangale, U.; More, S.; Shaikh, T.; Patil, S.; More, N. Analysis of Surface Water Resources Using Sentinel-2 Imagery. Procedia Comput. Sci. 2020, 171, 2645–2654. [Google Scholar] [CrossRef]
  25. Acharya, T.D.; Subedi, A.; Lee, D.H. Evaluation of Water Indices for Surface Water Extraction in a Landsat 8 Scene of Nepal. Sensors 2018, 18, 2580. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Yang, X.; Zhao, S.; Qin, X.; Zhao, N.; Liang, L.; Kuenzer, C.; Mishra, D.R.; Huang, W.; Thenkabail, P.S. Mapping of Urban Surface Water Bodies from Sentinel-2 MSI Imagery at 10 m Resolution via NDWI-Based Image Sharpening. Remote Sens. 2017, 9, 596. [Google Scholar] [CrossRef] [Green Version]
  27. Lanaras, C.; Bioucas-Dias, J.; Galliani, S.; Baltsavias, E.; Schindler, K. Super-resolution of Sentinel-2 images: Learning a globally applicable deep neural network. ISPRS J. Photogramm. Remote Sens. 2018, 146, 305–319. [Google Scholar] [CrossRef] [Green Version]
  28. Gašparović, M.; Jogun, T. The effect of fusing Sentinel-2 bands on land-cover classification. Int. J. Remote Sens. 2017, 39, 822–841. [Google Scholar] [CrossRef]
  29. Brodu, N. Super-Resolving Multiresolution Images with Band-Independent Geometry of Multispectral Pixels. IEEE Trans. Geosci. Remote Sens. 2017, 55, 4610–4617. [Google Scholar] [CrossRef] [Green Version]
  30. Confederación Hidrográfica del Duero—CHD. Demarcación Hidrográfica del Duero—Revisión y Actualización de la Evaluación Preliminar del Riesgo de Inundación 2o Ciclo. Available online: https://www.chduero.es/documents/20126/704746/0_REVISION_EPRI_DUERO_MEMORIA.pdf/d42faee0-dd95-f352-f820-b150904a7131?t=1566469961152 (accessed on 20 August 2021).
  31. Lombana, L.; Martínez-Graña, A. Hydrogeomorphological analysis for hydraulic public domain definition: Case study in Carrión River (Palencia, Spain). Environ. Earth Sci. 2021, 80, 193. [Google Scholar] [CrossRef]
  32. Lombana, L.; Martínez-Graña, A. Multiscale Hydrogeomorphometric Analysis for Fluvial Risk Management. Application in the Carrión River, Spain. Remote Sens. 2021, 13, 2955. [Google Scholar] [CrossRef]
  33. Agencia Estatal de Meteorología—AEMET. Año Hidrológico 2019–2020. Available online: https://www.miteco.gob.es (accessed on 10 July 2021).
  34. Yan, L.; Roy, D.P.; Zhang, H.; Li, J.; Huang, H. An Automated Approach for Sub-Pixel Registration of Landsat-8 Operational Land Imager (OLI) and Sentinel-2 Multi Spectral Instrument (MSI) Imagery. Remote Sens. 2016, 8, 520. [Google Scholar] [CrossRef] [Green Version]
  35. Rouse, J.; Haas, R.; Schell, J.; Deering, D. Monitoring Vegetation Systems in the Great Plains with Erts. NASA Spec. Publ. 1974, 351, 309. [Google Scholar]
  36. Huete, A.R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 1988, 25, 295–309. [Google Scholar] [CrossRef]
  37. Lacaux, J.P.; Tourre, Y.M.; Vignolles, C.; Ndione, J.A.; Lafaye, M. Classification of ponds from high-spatial resolution remote sensing: Application to Rift Valley Fever epidemics in Senegal. Remote Sens. Environ. 2007, 106, 66–74. [Google Scholar] [CrossRef]
  38. Rouibah, K.; Belabbas, M. Applying Multi-Index Approach from Sentinel-2 Imagery to Extract Urban Areas in Dry Season (Semi-Arid Land in North East Algeria). Rev. Teledetec. 2020, 56, 89–101. [Google Scholar] [CrossRef]
  39. Landini, G. Auto Threshold. Available online: https://imagej.net/plugins/auto-threshold (accessed on 8 January 2022).
  40. Huang, L.K.; Wang, M.J.J. Image thresholding by minimizing the measures of fuzziness. Pattern Recognit. 1995, 28, 41–51. [Google Scholar] [CrossRef]
  41. Tsai, W.H. Moment-preserving thresolding: A new approach. Comput. Vis. Graph. Image Process. 1985, 29, 377–393. [Google Scholar] [CrossRef]
  42. Prewitt, J.M.S.; Mendelsohn, M.L. The Analysis of Cell Images. Ann. N. Y. Acad. Sci. 1966, 128, 1035–1053. [Google Scholar] [CrossRef]
  43. Ridler, T.W.; Calvard, S. Picture Thresholding Using an Iterative Selection Method. IEEE Trans. Syst. Man Cybern. 1978, 8, 630–632. [Google Scholar] [CrossRef]
  44. Doyle, W. Operations Useful for Similarity-Invariant Pattern Recognition. J. ACM (JACM) 1962, 9, 259–267. [Google Scholar] [CrossRef]
  45. Li, C.H.; Tam, P.K.S. An iterative algorithm for minimum cross entropy thresholding. Pattern Recognit. Lett. 1988, 19, 771–776. [Google Scholar] [CrossRef]
  46. Kapur, J.N.; Sahoo, P.K.; Wong, A.K.C. A new method for gray-level picture thresholding using the entropy of the histogram. Comput. Vis. Graph. Image Process. 1985, 29, 273–285. [Google Scholar] [CrossRef]
  47. Shanbhag, A.G. Utilization of Information Measure as a Means of Image Thresholding. CVGIP Graph. Models Image Process. 1994, 56, 414–419. [Google Scholar] [CrossRef]
  48. Glasbey, C.A. An Analysis of Histogram-Based Thresholding Algorithms. CVGIP Graph. Models Image Process. 1993, 55, 532–537. [Google Scholar] [CrossRef]
  49. Zack, G.W.; Rogers, W.E.; Latt, S.A. Automatic measurement of sister chromatid exchange frequency. J. Histochem. Cytochem. 1977, 25, 741–753. [Google Scholar] [CrossRef]
  50. Yen, J.C.; Chang, F.J.; Chang, S. A New Criterion for Automatic Multilevel Thresholding. IEEE Trans. Image Process. 1995, 4, 370–378. [Google Scholar] [CrossRef]
Figure 1. Methodology of the study.
Figure 1. Methodology of the study.
Agronomy 12 01280 g001
Figure 2. Location of the study area and Spectral indices’ images obtained from Sentinel-2: (a) General localization. (b) Studied River. (c) AWEInsh. (d) AWEIsh. (e) MNDWI. (f) NDPI. (g) NDVI. (h) NDWI. (i) SAVI.
Figure 2. Location of the study area and Spectral indices’ images obtained from Sentinel-2: (a) General localization. (b) Studied River. (c) AWEInsh. (d) AWEIsh. (e) MNDWI. (f) NDPI. (g) NDVI. (h) NDWI. (i) SAVI.
Agronomy 12 01280 g002
Figure 3. Sentinel-2 images using composite bands: (a) Raw image. (b) Resampled image. (c) Super-resolution image.
Figure 3. Sentinel-2 images using composite bands: (a) Raw image. (b) Resampled image. (c) Super-resolution image.
Agronomy 12 01280 g003
Figure 4. Flooded areas detected by each index used in the study area. (a) Real color image. (b) Composite image (R: 8, G: 11, B: 4). (c) AWEInsh. (d) AWEIsh. (e) MNDWI. (f) NDPI. (g) NDVI. (h) NDWI. (i) SAVI.
Figure 4. Flooded areas detected by each index used in the study area. (a) Real color image. (b) Composite image (R: 8, G: 11, B: 4). (c) AWEInsh. (d) AWEIsh. (e) MNDWI. (f) NDPI. (g) NDVI. (h) NDWI. (i) SAVI.
Agronomy 12 01280 g004
Figure 5. Histograms obtained from different spectral indices. (a) AWEInsh. (b) AWEIsh. (c) MNDWI. (d) NDPI. (e) NDVI. (f) NDWI. (g) SAVI.
Figure 5. Histograms obtained from different spectral indices. (a) AWEInsh. (b) AWEIsh. (c) MNDWI. (d) NDPI. (e) NDVI. (f) NDWI. (g) SAVI.
Agronomy 12 01280 g005
Figure 6. Box plots classified into water (W) and non-water (NW) categories and contrast values obtained for each index.
Figure 6. Box plots classified into water (W) and non-water (NW) categories and contrast values obtained for each index.
Agronomy 12 01280 g006
Figure 7. Flood-areas extraction performances generated using all indexes by each mapping method (refer to Table 5).
Figure 7. Flood-areas extraction performances generated using all indexes by each mapping method (refer to Table 5).
Agronomy 12 01280 g007
Figure 8. Accuracy values obtained for each index and mapping method: (a) Overall accuracy. (b) Kappa coefficient.
Figure 8. Accuracy values obtained for each index and mapping method: (a) Overall accuracy. (b) Kappa coefficient.
Agronomy 12 01280 g008
Figure 9. Accuracy values plotted for each index and mapping method by Overall and Kappa algorithms.
Figure 9. Accuracy values plotted for each index and mapping method by Overall and Kappa algorithms.
Agronomy 12 01280 g009
Table 1. Sentinel-2A image features.
Table 1. Sentinel-2A image features.
FeatureValue
NameS2A_MSIL2A_20191228T111451_N0213_R137_T30TUN_20191228T122432
SectorSuperior
Cloud cover percentage14.108462
Cloud shadow percentage2.133657
Ingestion Date2019-12-28T20:40:54.449Z
Orbit number (start)23586
Pass directionDescending
Vegetation percentage29.200098
Water percentage0.675628
Table 2. Summary of indexes used in the study.
Table 2. Summary of indexes used in the study.
IndexAlgorithms-Sentinel-2 *Category
NDWI [17](B03 − B08)/(B03 + B08)Water
NDVI [35](B08 − B04)/(B08 + B04)Vegetation
SAVI [36](B08 − B04)/(B08 + B04 + 0.428) × (1.428)Vegetation
mNDWI [18](B03 − B11)/(B03 + B11)Water
AWEInsh [19]4 × (B03 − B11) − (0.25 × B08 + 2.75 × B11)Water
AWEIsh [19]B02 + 2.5 × B03 − 1.5 × (B08 + B11) − 0.25 × B12Water
NDPI [37](B12 − B03)/(B12 + B03)Water
* B: Band.
Table 3. Thresholding methods applied in the study.
Table 3. Thresholding methods applied in the study.
NoThreshold Automatic Algorithm NoThreshold Automatic Algorithm
1Huang and Wang’s [40]8Moment-preserving [41]
2Intermode [42]9Otsu’s thresholding [21]
3Isodata [43]10Percentile (p-tile) [44]
4Li and Tam’s [45]11Renyi’s entropy [46]
5Maximum entropy [46]12Shanbhang’s [47]
6Mean [48]13Triangle [49]
7Minimum [42]14Yen’s [50]
Table 4. Confusion matrix-based applied in the study.
Table 4. Confusion matrix-based applied in the study.
Reference DataUser
WaterNon-Water
Classified dataWaterTPFPTP + FP
Non-WaterFNTNFN + TN
ProducerTP + FNFP + TNT = TP + FP + FN + TN
Table 5. Thresholding values obtained from each method.
Table 5. Thresholding values obtained from each method.
Thresholding MethodsThreshold Value
AWEInshAWEIshMNDWINDPINDVINDWISAVI
Huang and Wang’s (Hu)−0.88−0.44−0.530.330.57−0.520.26
Intermode (Int)−3.92−1.370.1−0.21−0.050.050.05
Isodata (Iso)−0.93−0.44−0.480.270.5−0.520.26
Li and Tam (Li)−0.88−0.39−0.480.290.48−0.520.25
Maximum entropy (Me)0.910.38−0.03−0.110.05−0.110.04
Mean −1.08−0.41−0.50.310.45−0.520.24
Minimum (Min)−6.31−2.260.39−0.47−0.250.36−0
Moment-preserving (Mp)−0.93−0.27−0.330.170.43−0.40.25
Otsu (Ot)−0.88−0.41−0.190.290.5−0.50.26
Percentile (p-tile) (Per)−1.13−0.44−0.520.320.43−0.520.23
Renyi’s entropy (Ren)0.610.38−0.05−0.110.07−0.120.05
Shanbhag’s (Sh)2.660.080.46−0.40.490.010.26
Triangle (Tri)−0.03−0.09−0.260.020.09−0.230.46
Yen’ (Y)0.960.480−0.140.03−0.10.03
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Lombana, L.; Martínez-Graña, A. A Flood Mapping Method for Land Use Management in Small-Size Water Bodies: Validation of Spectral Indexes and a Machine Learning Technique. Agronomy 2022, 12, 1280. https://doi.org/10.3390/agronomy12061280

AMA Style

Lombana L, Martínez-Graña A. A Flood Mapping Method for Land Use Management in Small-Size Water Bodies: Validation of Spectral Indexes and a Machine Learning Technique. Agronomy. 2022; 12(6):1280. https://doi.org/10.3390/agronomy12061280

Chicago/Turabian Style

Lombana, Lorena, and Antonio Martínez-Graña. 2022. "A Flood Mapping Method for Land Use Management in Small-Size Water Bodies: Validation of Spectral Indexes and a Machine Learning Technique" Agronomy 12, no. 6: 1280. https://doi.org/10.3390/agronomy12061280

APA Style

Lombana, L., & Martínez-Graña, A. (2022). A Flood Mapping Method for Land Use Management in Small-Size Water Bodies: Validation of Spectral Indexes and a Machine Learning Technique. Agronomy, 12(6), 1280. https://doi.org/10.3390/agronomy12061280

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop