Using the Improved YOLOv5-Seg Network and Sentinel-2 Imagery to Map Glacial Lakes in High Mountain Asia

Yin, Lichen; Wang, Xin; Du, Wentao; Yang, Chengde; Wei, Junfeng; Wang, Qiong; Lei, Dongyu; Xiao, Jingtao

doi:10.3390/rs16122057

Open AccessTechnical Note

Using the Improved YOLOv5-Seg Network and Sentinel-2 Imagery to Map Glacial Lakes in High Mountain Asia

by

Lichen Yin

¹

,

Xin Wang

^1,2,*,

Wentao Du

^2,3,

Chengde Yang

¹

,

Junfeng Wei

¹

,

Qiong Wang

^1,4

,

Dongyu Lei

¹ and

Jingtao Xiao

¹

School of Earth Sciences and Spatial Information Engineering, Hunan University of Science and Technology, Xiangtan 411201, China

²

State Key Laboratory of Cryospheric Science, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China

³

University of Chinese Academy of Sciences, Beijing 100049, China

⁴

School of Geography and Tourism, Shaanxi Normal University, Xi’an 710062, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2024, 16(12), 2057; https://doi.org/10.3390/rs16122057

Submission received: 19 April 2024 / Revised: 28 May 2024 / Accepted: 29 May 2024 / Published: 7 June 2024

Download

Browse Figures

Versions Notes

Abstract

:

Continuously monitoring and mapping glacial lake variation is of great importance for determining changes in water resources and potential hazards in alpine cryospheric regions. The semi-automated glacial lake mapping methods used currently are hampered by inherent subjectivity and inefficiency. This study used improved YOLOv5 strategies to extract glacial lake boundaries from Sentinel-2 imagery. These strategies include using the space-to-depth technique to identify small glacial lakes, and adopting the coordinate attention and the convolution block attention modules to improve mapping performance and adaptability. In terms of glacial lake extraction, the improved YOLOv5-seg network achieved values of 0.95, 0.93, 0.96, and 0.94 for precision (P), recall (R), mAP_0.5, and the F1 score, respectively, indicating an overall improvement in performance of 12% compared to that of the newest YOLOv8 networks. In High Mountain Asia (HMA), 23,108 glacial lakes with a total area of 1847.5 km² were identified in imagery from 2022 using the proposed method. Compared with the use of manual interpretation for lake boundary extraction in test sites of HMA, the proposed method achieved values of 0.89, 0.87, and 0.86 for P, R, and the F1 score, respectively. Our proposed deep learning method has improved accuracy in glacial lake extraction because it can address the challenge represented by frozen or high-turbidity glacial lakes in HMA.

Keywords:

High Mountain Asia; Sentinel-2; glacial lake; YOLOv5 network

1. Introduction

Glacial lakes are formed by glaciation or glacial meltwater recharge and are generally located in areas of glaciation, including glacial depression lakes, lateral moraine dammed lakes, ice-dammed lakes, etc. [1,2]. High Mountain Asia (HMA) contains the largest area of glaciers outside the polar regions [3,4]. Over past decades, global warming has caused substantial glacier retreat and the acceleration of negative glacier mass in HMA [5,6]. Meanwhile, the total area and the number of glacial lakes in the HMA region have shown a consistent trend of increase [2,7,8,9,10]. The continuous increase in both the number and the size of glacial lakes might impound glacier meltwater and aggravate the risk of glacial lake outburst floods [11,12,13]. Therefore, the rapid and accurate mapping of glacial lakes is crucial for water resource management and disaster assessment in alpine cryospheric regions [2,14,15,16].

Recent advances in remote sensing and computer vision have accelerated the widespread use of multisource remote sensing images for glacial lake mapping. Semiautomatic extraction methods, based on the geometric characteristics of glacial lakes, include the normalized difference water index [17], normalized difference snow index [18], and modified normalized difference water index [19]. However, these methods need manual postprocessing to fine-tune results in areas with complex terrain and extreme climatic conditions, such as those of the HMA region [20].

Machine learning frameworks (e.g., the Random Forest method) have been used previously for glacial lake segmentation [14,21]. These approaches, which excel at extracting glacial lakes characterized by spectral reflectance, are challenged when segmenting areas characterized by diminished spectral reflectance [14]. In recent years, deep learning methods have shown substantial potential for automatic segmentation in glacial lake mapping, and in overcoming the constraints encountered in traditional machine learning methods [16,20,22,23,24,25,26,27]. Neural networks exhibit strong segmentation capabilities; however, certain limitations remain, e.g., the challenge in extracting the boundaries of frozen or partially frozen glacial lakes [28], the misleading errors caused by mixed pixels of ice and water and fragmented wet ice when classifying lake edges [27], and the technical challenges associated with handling training datasets at inconsistent temporal and spatial scales [23]. Typically, machine learning methods require large numbers of images for training neural networks. Although such methods have been applied to several subregions of HMA, they were often found to be inadequate for the boundary extraction of small-area glacial lakes, and lacked applicability to the large-scale HMA region.

In this paper, we introduce a new deep learning method based on the improved YOLOv5 network and Sentinel-2 imagery to extract the glacial lakes boundaries in HMA in 2022 (Figure 1). The proposed method is a reproducibly automated glacial lake boundary extraction framework that integrates comprehensive technical strategies in data preprocessing and model training using multiband Sentinel-2 imagery.

2. Materials and Methods

2.1. Data Sources

Overall, 1787 high-quality Sentinel-2 images (143 in 2018 and 1644 in 2022) with 10 m spatial resolution were obtained from the Copernicus Open Access Hub (https://scihub.copernicus.eu/dhus/#/home/; last accessed on 5 May 2023) for use in this study (Figure 2). The selected images were captured in summer and autumn (June–November) with cloud coverage of <10% in 2022. To train the glacial lake extraction models, we used 143 Sentinel-2 images from 2018 and the High Asia Glacial Lake dataset [2]. The Second Chinese Glacier Inventory and RGI 6.0 were used to determine a 10 km buffer area from glaciers [29,30]. The Shuttle Radar Topography Mission digital elevation model with spatial resolution of 1 arcsecond (http://imagico.de/map/demsearch.php; last accessed on 5 September 2023) was used to extract glacial lake elevation information.

2.2. Methods

2.2.1. Data Preprocessing

Preprocessing techniques included upsampling, slicing, image enhancement, culling and replacement, and automatic labeling (Figure 3). First, a band 11 image (shortwave infrared) was upsampled to 10 m resolution and synthesized with bands 8, 4, 3, and 2 (near-infrared, red, green, and blue). Second, original and corresponding mask images were sliced to a size of 640 × 640 pixels. All missing values were converted to 0. Automatic labeling using the glacial lake inventory and randomly selected pairs according to a ratio of 7:3 was used to create the training set and the validation set. Finally, the images of the training set were flipped and rotated, and pretzel noise was added randomly.

2.2.2. Improved Convolutional Neural Network

The basic network used for the segmentation of glacial lake is the YOLOv5-seg v7.0 model. The coordinate attention (CA) module, convolution block attention module (CBAM), and the small object detection layer were integrated with the YOLOv5-seg network [31,32,33]. The C3 module was also replaced by the C2f module with the purpose of obtaining additional information on gradient flow. The space-to-depth convolution (SPD-Conv) module was introduced to retain more spatial information [34].

(1): Attention mechanism

This study focused on glacial lakes and considered other ground features such as glaciers, bare ground, and mountain shadows as background information. It has been reported that the YOLOv5-seg algorithm is prone to interference [35,36]. Therefore, we adopted the CA module and the CBAM to enhance the learning of glacial lake characteristics by the YOLOv5-seg network (Figure 4 and Figure 5). By adding different attention mechanisms to two similar networks, the network can flexibly and independently learn the characteristics of glacial lakes and consequently improve extraction accuracy. The algorithms facilitate the calculation of the weighted aggregation of other information by eliminating background details. By specifying the name of the glacial lake information as an input for the attention feature, the model directs its attention to the relevant information, thereby improving the extraction of glacial lake features [31,32].

The CA module encodes channel relationships and long-term dependencies in precise location information through coordinate embedding and coordinate attention generation [31]. The process of coordinate embedding overcomes global pooling in the channel and enhances the retention of positional information. To capture cross-channel information and long-term dependencies in one spatial direction and to retain accurate positional information in another, the features are considered along the two spatial directions to yield a pair of direction-aware feature maps that facilitate the accurate segmentation of glacial lakes in remote sensing images by the neural network. The process can be expressed as follows:

z_{C}^{H} (h) = \frac{1}{W} \sum_{0 \leq i < H} x_{C} (H, i)

(1)

z_{C}^{W} (h) = \frac{1}{H} \sum_{0 \leq j < H} x_{C} (j, W)

(2)

z_{C} = \frac{1}{H \times W} \sum_{i = 1}^{H} \sum_{j = 1}^{W} x_{C} (i, j)

(3)

where

x_{C} (H, i)

and

x_{c} (j, W)

represent the C channel at height h for the i-th value and the C channel at height H and width W for the j-th value, respectively. The output of

z_{C}^{H}

is the C channel at height h and that of

z_{C}^{W}

is the C channel at width W. The information derived in the two dimensions needs to be combined via concatenation, convolution, and normalization. This step can be expressed as follows:

f = δ (F_{1} ([z^{H}, z^{ω}]))

(4)

where

δ

is a nonlinear activation function, and

f \in R^{\frac{c \times (H + W)}{r}}

is the intermediate feature map that encodes spatial information in both the horizontal and vertical directions.

Convolution is performed for W and H separately, and the final output can be expressed using the following formulas:

g^{H} = σ (F_{h} (f^{H}))

(5)

g^{W} = σ (F_{ω} (f^{ω}))

(6)

y_{C} (i, j) = x_{C} (i, j) \times g_{C}^{H} (i) \times g_{C}^{ω} (j)

(7)

where F_h and F_w are 1 × 1 convolutional transformation used to transform f^H and f^W, respectively, to tensors with the same channel number as the input X, and f^H and f^W are the horizontal and vertical components of f, respectively.

The CBAM contains two submodules: the channel attention module (CAM) and the spatial attention module (SAM) [32]. The input feature map F(

F \in R^{c \times h \times w}

) is passed through the maximum pooling layer and the average pooling layer in the CAM. The two summed 1D vectors become the intermediate feature map

M_{c}

(

M_{c} \in R^{c \times 1 \times 1}

) through the fully connected layer. The channel attention is multiplied by the input element F to obtain the adjusted feature map F′. Similarly, the feature map F is passed through the SAM to obtain 2D convolution

M_{s}

(

M_{s} \in R^{1 \times h \times w}

), which is then multiplied by F′ to obtain the feature map F″. The process of generating attention by the CBAM can be expressed as follows:

F^{'} = M_{c} (F) \otimes F

(8)

F^{″} = M_{c} (F^{'}) \otimes F

(9)

where

\otimes

denotes the weighted multiplication, F′ denotes the result obtained after passing through the CAM, and F″ denotes the result obtained after passing through the SAM.

(2): Space-to-depth (SPD) module

In this study, downsampling was replaced by the SPD-Conv module to reduce the misjudgment of small-area glacial lakes. A non-spanning convolutional layer retains more detailed information about the glacial lake by rearranging the pixel blocks of the input feature map, and it reduces the number of parametric quantities to a certain extent [34]. The SPD-Conv module acquires four 2-fold downsampled sub-maps, each containing the spatial information, through mapping and slicing the input feature map. The submaps can splice along the channel dimensions and adjust them through the non-spanning convolutional layer. Additionally, the SPD-Conv module preserves global spatial feature information in the channel dimensions, as illustrated in Figure 6.

(3): Small target layers

The target detection layer of the YOLOv5 network has an output range of 80 × 80, 40 × 40, and 20 × 20 image pixels, with a minimum acceptance range of 8 × 8. To enhance the capability of the algorithm in segmenting small-area glacial lakes, the detection header for small targets is integrated into the original network. The aim here is improving the accuracy of extracting glacial lakes smaller than 8 × 8 pixels in size and refining the precision of boundary extraction. The glacial extraction network structure is shown in Figure 7.

2.2.3. Postprocessing and Accuracy Assessment

To evaluate the results produced by the improved neural network algorithm, an Intersection over Union (IOU) threshold of 0.5 was used to extract glacial lake boundaries. To eliminate misclassification, slopes and shadow relief that were larger than 10° and 0.25, respectively, were removed based on the digital elevation model data [10,37]. Precision (P), recall (R), and mean average precision (mAP) were used to assess the performance of the algorithm. The R and P can be expressed as follows:

R = \frac{TP}{TP + FN}

(10)

P = \frac{TP}{TP + FP}

(11)

where TP is true positive, FP is false positive, and FN is false negative. If TP is positive and the true value is also positive, the positive samples are correctly identified. If FP is positive and the true value is negative, the negative samples are incorrectly identified. IF FN is negative and the true value is positive, positive samples are missed. The mAP_0.5 (i.e., mAP with an IOU threshold of 0.5), used as an evaluation index, can express the recognition accuracy and the number of effective recognitions. The F1 score effectively represents the overall capability in identifying glacial lakes. The formulas for calculation of the mAP, IOU, and F1 score can be expressed as follows:

mAP = \frac{1}{n} \sum_{i = 1}^{n - 1} A P_{(i)} \times 100 %

(12)

IOU = \frac{a \cap b}{a \cup b}

(13)

F 1 score = \frac{2 \times P \times R}{P + R}

(14)

We used S_accuracy as a measurement of the accuracy of glacial lake extraction, which can be expressed as follows:

S_{accuracy} = 1 - \frac{|S_{TP} - S_{recall}|}{S_{recall}}

(15)

where S_TP is the network-mapped glacial lake area and S_recall is the glacial lake area detected by manual vectorization mapping.

3. Results

3.1. Performance of the Proposed Method

The segmentation algorithm exhibits promising results after 1000 training epochs. The accuracies of the improved YOLOv5-seg and other commonly used instance segmentation models in relation to our constructed dataset, listed in Table 1, demonstrate that our improved YOLOv5-seg network exhibits the best performance. The F1 score is 0.94 and 0.81 for the synthesized group of bands 8, 4 and 3 and 11, 4 and 3, respectively.

The YOLOv8 network was used as a benchmark for our ablation experiment to compare the effects of different modules on accuracy (Table 2). The YOLOv5-seg (CBAM) network, refined through ablation experiments, achieves the optimal F1 score of 0.94, which is 12% higher than that of YOLOv8-seg. Additionally, the F1 score of the improved YOLOv5-seg (CA) is 2% higher than that of the YOLOv8-seg. The combination of the improved algorithms (bands 11, 8, 4, 3, 2) achieves at least 4% higher F1 scores in the test sites relative to the other algorithms(Table 3).

3.2. Mapping of Glacial Lakes in HMA

In 2022 imagery, 23,108 glacial lakes covering a total area of 1847.5 km² were identified in HMA (Figure 8). The eastern Himalayas and Inner Tibet have the highest numbers of glacial lakes. The eastern Himalayas have 3675 lakes covering 322.5 km² (17.5% of the total area), while Inner Tibet has 3568 lakes covering 303.4 km² (16.4% of the total area). Meanwhile, only 100 lakes with an area of 7.5 km² (0.4% of the total area) and 213 lakes with an area of 14.9 km² (0.8% of the total area) are found in the Qilian Mountains and the Hissar–Alay region. The elevations of the glacial lakes in HMA in 2022 are in the range 1700–6400 m. A bimodal distribution pattern is observed for the entire HMA region and most of its subregions. The main reason for this phenomenon lies in the bimodal distribution of unconnected glacial lakes and proglacial lakes at different altitudes [42]. In the HMA region, the primary peak in glacial lake area occurs at 5000–5500 m, with a secondary peak at 4000–4500 m. In the different subregions, the peak in the glacial lake area varies from 2000–2500 m in the Hissar–Alay region to 5000–5500 m in the Central Himalaya, Karakoram, and Western Kunlun regions (Figure 8).

4. Discussion

4.1. Advantages of the Improved Strategies

The rapid changes in glacial lakes caused by climate change demand a fast, reliable, reproducible, and automated mapping method [43]. Manual mapping and semi-automated methods demand extensive and labor-intensive post-processing [14,19,28]. We developed a deep learning method by combining several improved YOLOv5-seg networks. The proposed method can independently assimilate the merits of multiple networks in relation to images with different band combinations, and analysis results reveal that our method can automatically map glacial lakes in HMA. The SPD-Conv module included in the network of our proposed method can rearrange the pixels of input feature maps and improve the capability of lake boundary extraction. The combination of the CA module and the CBAM can improve the attention degree of glacial lake boundaries [31,32]. The small target layer is used to extract small-area glacial lakes [33]. Transfer learning and large spatial scale glacial lake datasets were used in the proposed deep learning method to improve network generalization. Our proposed method enhances the network’s learning ability, showing improved performance in extracting glacial lake boundaries (Figure 9). It is particularly effective for lakes with high turbidity, frozen surfaces, shallow water, and small areas (<0.01 km²), which have raised challenges in deep learning-based detection [14,26]. The proposed method offers several advantages: (1) The single algorithm merged with the SPD-Conv and attention modules can retain more detailed information and focus attention on glacial lake information [31,32,34]. (2) The combination of the improved algorithms can compensate for misjudgements caused by a single network. (3) The use of the large volume of glacial lake data on different temporal and spatial scales allows for the synthesis of images as a dataset based on the reflectance of glacial lakes in different bands. Supported by this dataset, the network can learn the different spectral characteristics of glacial lakes on different temporal and spatial scales [2,10,15,44].

The false positive detection of glacial lakes could be largely excluded in the postprocessing step in our proposed method (Figure 9); however, certain limitations remain. First, a lake with an area covering fewer than 8 × 8 pixels might be misclassified as background information owing to the increase in the number of layers and the minimum recognition size of 8 × 8 pixels in the output layer. Second, the uncertainty is often derived from glacial lakes with extreme spectral characteristics, even with the assistance of multiband input data, because of the similarity with other background features (e.g., seasonal snow and rock) [28]. Additionally, the combination of several networks demands a large amount of memory storage and computer processing time; thus, it is still expected for the algorithms to improve in terms of increasing the identification accuracy and the calculation speed of glacial lake boundary extraction.

4.2. Reliability of the Present Glacial Lake Inventory

A number of glacial lake inventory datasets have been published in previous years, most of which were produced using long-term Landsat images of the HMA region [2,8,10,42,45,46,47]. First, the glacial lake inventory of the China–Pakistan Economic Corridor published in 2018 [2] and that published in 2020 [15], produced by visual examination, were selected as references to appraise the lake boundary extraction reliability of our improved deep learning method. Seven Sentinel-2 satellite images acquired in 2018 and 2020 were used to produce the glacial lake inventories in the China–Pakistan Economic Corridor using our proposed method. The results indicate that 91.0% and 84.0% of individual glacial lakes identified by our extraction method coincided with the glacial lakes in the inventories from [2,15], respectively, and the differences in total glacial lake area were 2.2% and 4.7%, respectively. Second, 80.2% of the glacial lakes (area ≥ 0.0064 km²) in the present glacial lake inventory coincided with those in the inventory in HMA [2], despite the use of different source images obtained at different times. Moreover, given the P, R, and F1 score values of 0.886, 0.872, and 0.86, respectively, in the present glacial lake inventory, the numbers and areas of glacial lakes in HMA in 2022 were determined to be in the ranges 20,473–26,500 and 1636–2118 km², respectively. Overall, the present glacial lake inventory produced by the improved deep learning method is considered reliable.

The area of glacial lakes varies seasonally owing to variations in inflow from the melting of the parent glacier and precipitation, although most source images were obtained in summer [2,10]. The available glacial lake inventories usually exclude uncertainties resulting from the use of imagery from different months. Here, 200 Sentinel-2 satellite images with less than 10% cloud coverage were collected (June to October, 2022) to detect monthly glacial lake changes. In this case, only glacial lakes in nine subregions of HMA were derived for analysis, specifically located in the Hengduan Shan, Southeast Tibet, Inner Tibet, Center Himalaya, Western Himalaya, Western Kunlun, Karakoram, Pamir, and Western Tien Shan regions. The relative anomaly of monthly change in glacial lake area was used to depict the lake area variation in different months of 2022. During June–October, notable monthly variation in glacial lake area was found across different subregions of HMA (Figure 10). In the subregions of HMA, 75% of the relative anomaly in monthly glacial lake area variation during June–October was within ±2–4%, and the maximum monthly variation was ±6%.

5. Conclusions

This study proposed an improved deep learning method using the YOLOv5-seg network to extract glacial lake boundaries based on Sentinel-2 imagery. The SPD-Conv module, CA module, CBAM attention module, and small target detection layer, integrated to extract the boundary of glacial lakes, achieved values of 0.95, 0.93, 0.96, and 0.94 for P, R, mAP_0.5, and the F1 score, respectively. The results confirm that the improved machine learning method has evident advantages in overcoming the problems of extracting glacial lakes with high turbidity and a frozen surface through the improved capability of network generalization. Due to its higher accuracy and reproducibility, the proposed method significantly outperforms existing glacial lake mapping techniques.

We selected 1644 Sentinel-2 images from 2022 to map glacial lakes in the HMA region using the improved machine learning method. Overall, 23,108 glacial lakes were identified with a total area of 1847.5 km² distributed at elevations of 1700–6400 m. The P, R, and F1 score values were 0.89, 0.87, and 0.86, respectively, when compared to the manual interpretation of lake boundary extraction in test sites of the HMA region. The uncertainty in glacial lake area for the entire HMA region resulting from the use of source images from different months was determined as ±2–4%.

Author Contributions

L.Y.: conceptualization, methodology, formal analysis, investigation, writing—original draft. X.W.: conceptualization, methodology, investigation, writing—review and editing, supervision, funding acquisition. W.D.: conceptualization, investigation, writing—review and editing. C.Y.: formal analysis, investigation, writing—review and editing, validation. J.W.: conceptualization, investigation, writing—review and editing. Q.W.: conceptualization, investigation, validation. D.L.: formal analysis, investigation, validation. J.X.: formal analysis, investigation. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (No. 42361144874, No. U23A2011 and 42171137).

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

We thank the European Space Agency and NASA for sharing archival Sentienl-2 images and SRTM DEMs, respectively.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Qin, D.; Yao, T.; Ding, Y. Glossary of Cryosphere Science, 2nd ed.; China Meteorological Press: Beijing, China, 2014. [Google Scholar]
Wang, X.; Guo, X.; Yang, C.; Liu, Q.; Wei, J.; Zhang, Y.; Liu, S.; Zhang, Y.; Jiang, Z.; Tang, Z. Glacial lake inventory of high-mountain Asia in 1990 and 2018 derived from Landsat images. Earth Syst. Sci. Data 2020, 12, 2169–2182. [Google Scholar] [CrossRef]
Guillet, G.; King, O.; Lv, M.; Ghuffar, S.; Benn, D.; Quincey, D.; Bolch, T. A regionally resolved inventory of High Mountain Asia surge-type glaciers, derived from a multi-factor remote sensing approach. Cryosphere 2022, 16, 603–623. [Google Scholar] [CrossRef]
Wang, X.; Ran, W.; Wei, J.; Yin, Y.; Liu, S.; Bolch, T.; Zhang, Y.; Xue, X.; Ding, Y.; Liu, Q.; et al. Spatially resolved glacial meltwater retainment in glacial lakes exerts increasing impacts in High Mountain Asia. J. Hydrol. 2024, 633, 130967. [Google Scholar] [CrossRef]
Bhattacharya, A.; Bolch, T.; Mukherjee, K.; King, O.; Menounos, B.; Kapitsa, V.; Neckel, N.; Yang, W.; Yao, T. High Mountain Asian glacier response to climate revealed by multi-temporal satellite observations since the 1960s. Nat. Commun. 2021, 12, 4133. [Google Scholar] [CrossRef]
Yao, T.; Bolch, T.; Chen, D.; Gao, J.; Immerzeel, W.; Piao, S.; Su, F.; Thompson, L.; Wada, Y.; Wang, L.; et al. The imbalance of the Asian water tower. Nat. Rev. Earth Environ. 2022, 3, 618–632. [Google Scholar] [CrossRef]
Nie, Y.; Liu, Q.; Wang, J.; Zhang, Y.; Sheng, Y.; Liu, S. An inventory of historical glacial lake outburst floods in the Himalayas based on remote sensing observations and geomorphological analysis. Geomorphology 2018, 308, 91–106. [Google Scholar] [CrossRef]
Shugar, D.H.; Burr, A.; Haritashya, U.K.; Kargel, J.S.; Watson, C.S.; Kennedy, M.C.; Bevington, A.R.; Betts, R.A.; Harrison, S.; Strattman, K. Rapid worldwide growth of glacial lakes since 1990. Nat. Clim. Chang. 2020, 10, 939–945. [Google Scholar] [CrossRef]
Shrestha, F.; Steiner, J.F.; Shrestha, R.; Dhungel, Y.; Joshi, S.P.; Inglis, S.; Ashraf, A.; Wali, S.; Walizada, K.M.; Zhang, T. HMAGLOFDB v1. 0–a comprehensive and version controlled database of glacier lake outburst floods in high mountain Asia. Earth Syst. Sci. Data Discuss. 2023, 15, 3941–3961. [Google Scholar] [CrossRef]
Chen, F.; Zhang, M.; Guo, H.; Allen, S.; Kargel, J.S.; Haritashya, U.K.; Watson, C.S. Annual 30 m dataset for glacial lakes in High Mountain Asia from 2008 to 2017. Earth Syst. Sci. Data 2021, 13, 741–766. [Google Scholar] [CrossRef]
Rounce, D.R.; Hock, R.; Maussion, F.; Hugonnet, R.; Kochtitzky, W.; Huss, M.; Berthier, E.; Brinkerhoff, D.; Compagno, L.; Copland, L.; et al. Global glacier change in the 21st century: Every increase in temperature matters. Science 2023, 379, 78–83. [Google Scholar] [CrossRef] [PubMed]
Zhao, F.; Long, D.; Li, X.; Huang, Q.; Han, P. Rapid glacier mass loss in the Southeastern Tibetan Plateau since the year 2000 from satellite observations. Remote Sens. Environ. 2022, 270, 112853. [Google Scholar] [CrossRef]
Nie, Y.; Deng, Q.; Pritchard, H.D.; Carrivick, J.L.; Ahmed, F.; Huggel, C.; Liu, L.; Wang, W.; Lesi, M.; Wang, J. Glacial lake outburst floods threaten Asia’s infrastructure. Sci. Bull. 2023, 68, 1361–1365. [Google Scholar] [CrossRef]
Wangchuk, S.; Bolch, T. Mapping of glacial lakes using Sentinel-1 and Sentinel-2 data and a random forest classifier: Strengths and challenges. Sci. Remote Sens. 2020, 2, 100008. [Google Scholar] [CrossRef]
Lesi, M.; Nie, Y.; Shugar, D.H.; Wang, J.; Deng, Q.; Chen, H.; Fan, J. Landsat- and Sentinel-derived glacial lake dataset in the China–Pakistan Economic Corridor from 1990 to 2020. Earth Syst. Sci. Data 2022, 14, 5489–5512. [Google Scholar] [CrossRef]
Wang, S.; Peppa, M.V.; Xiao, W.; Maharjan, S.B.; Joshi, S.P.; Mills, J.P. A second-order attention network for glacial lake segmentation from remotely sensed imagery. ISPRS J. Photogramm. Remote Sens. 2022, 189, 289–301. [Google Scholar] [CrossRef]
Gao, B.-c. NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space. Remote Sens. Environ. 1996, 58, 257–266. [Google Scholar] [CrossRef]
Salomonson, V.V.; Appel, I. Estimating fractional snow cover from MODIS using the normalized difference snow index. Remote Sens. Environ. 2004, 89, 351–360. [Google Scholar] [CrossRef]
Xu, H. A study on information extraction of water body with the modified normalized difference water index (MNDWI). J. Remote Sens. 2005, 9, 595. [Google Scholar]
Wang, J.; Chen, F.; Zhang, M.; Yu, B. NAU-Net: A New Deep Learning Framework in Glacial Lake Detection. IEEE Geosci. Remote Sens. Lett. 2022, 19, 2000905. [Google Scholar] [CrossRef]
Dirscherl, M.; Dietz, A.J.; Kneisel, C.; Kuenzer, C. Automated mapping of Antarctic supraglacial lakes using a machine learning approach. Remote Sens. 2020, 12, 1203. [Google Scholar] [CrossRef]
Thati, J.; Ari, S. A systematic extraction of glacial lakes for satellite imagery using deep learning based technique. Measurement 2022, 192, 110858. [Google Scholar] [CrossRef]
Wu, R.; Liu, G.; Zhang, R.; Wang, X.; Li, Y.; Zhang, B.; Cai, J.; Xiang, W. A Deep Learning Method for Mapping Glacial Lakes from the Combined Use of Synthetic-Aperture Radar and Optical Satellite Images. Remote Sens. 2020, 12, 4020. [Google Scholar] [CrossRef]
Jiang, D.; Li, X.; Zhang, K.; Marinsek, S.; Hong, W.; Wu, Y. Automatic Supraglacial Lake Extraction in Greenland Using Sentinel-1 SAR Images and Attention-Based U-Net. Remote Sens. 2022, 14, 4998. [Google Scholar] [CrossRef]
Cao, Y.; Bai, X.; Pan, M.; Lei, R.; Du, P. Refined glacial lake extraction in high Asia region by Deep Neural Network and Superpixel-based Conditional Random Field. Cryosphere Discuss. 2023, 2023, 1–21. [Google Scholar] [CrossRef]
Dirscherl, M.; Dietz, A.J.; Kneisel, C.; Kuenzer, C. A Novel Method for Automated Supraglacial Lake Mapping in Antarctica Using Sentinel-1 SAR Imagery and Deep Learning. Remote Sens. 2021, 13, 197. [Google Scholar] [CrossRef]
Qayyum, N.; Ghuffar, S.; Ahmad, H.M.; Yousaf, A.; Shahid, I. Glacial lakes mapping using multi satellite PlanetScope imagery and deep learning. ISPRS Int. J. Geo-Inf. 2020, 9, 560. [Google Scholar] [CrossRef]
Kaushik, S.; Singh, T.; Joshi, P.K.; Dietz, A.J. Automated mapping of glacial lakes using multisource remote sensing data and deep convolutional neural network. Int. J. Appl. Earth Obs. Geoinf. 2022, 115, 103085. [Google Scholar] [CrossRef]
Guo, W.; Liu, S.; Xu, J.; Wu, L.; Shangguan, D.; Yao, X.; Wei, J.; Bao, W.; Yu, P.; Liu, Q.; et al. The second Chinese glacier inventory: Data, methods and results. J. Glaciol. 2015, 61, 357–372. [Google Scholar] [CrossRef]
Pfeffer, W.T.; Arendt, A.A.; Bliss, A.; Bolch, T.; Cogley, J.G.; Gardner, A.S.; Hagen, J.-O.; Hock, R.; Kaser, G.; Kienholz, C.; et al. The Randolph Glacier Inventory: A globally complete inventory of glaciers. J. Glaciol. 2014, 60, 537–552. [Google Scholar] [CrossRef]
Hou, Q.; Zhou, D.; Feng, J. Coordinate attention for efficient mobile network design. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 19–25 June 2021; pp. 13713–13722. [Google Scholar]
Woo, S.; Park, J.; Lee, J.-Y.; Kweon, I.S. Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
Zhu, X.; Lyu, S.; Wang, X.; Zhao, Q. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 10–17 October 2021; pp. 2778–2788. [Google Scholar]
Sunkara, R.; Luo, T. No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects. In Proceedings of the Machine Learning and Knowledge Discovery in Databases, Turin, Italy, 18–22 September 2023; Springer: Cham, Switzerland, 2023; pp. 443–459. [Google Scholar]
Wan, D.; Lu, R.; Wang, S.; Shen, S.; Xu, T.; Lang, X. YOLO-HR: Improved YOLOv5 for Object Detection in High-Resolution Optical Remote Sensing Images. Remote Sens. 2023, 15, 614. [Google Scholar] [CrossRef]
Bian, L.; Li, B.; Wang, J.; Gao, Z. Multi-branch stacking remote sensing image target detection based on YOLOv5. Egypt. J. Remote Sens. Space Sci. 2023, 26, 999–1008. [Google Scholar] [CrossRef]
Li, J.; Sheng, Y. An automated scheme for glacial lake dynamics mapping using Landsat imagery and digital elevation models: A case study in the Himalayas. Int. J. Remote Sens. 2012, 33, 5194–5213. [Google Scholar] [CrossRef]
Wang, C.-Y.; Yeh, I.-H.; Liao, H.-Y.M. You only learn one representation: Unified network for multiple tasks. arXiv 2021, arXiv:2105.04206. [Google Scholar]
Wang, C.-Y.; Bochkovskiy, A.; Liao, H.-Y.M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 7464–7475. [Google Scholar]
Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015, Munich, Germany, 5–9 October 2015; Springer: Cham, Switzerland, 2015; pp. 234–241. [Google Scholar]
Badrinarayanan, V.; Kendall, A.; Cipolla, R. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 2481–2495. [Google Scholar] [CrossRef] [PubMed]
Dou, X.; Fan, X.; Wang, X.; Yunus, A.P.; Xiong, J.; Tang, R.; Lovati, M.; van Westen, C.; Xu, Q. Spatio-Temporal Evolution of Glacial Lakes in the Tibetan Plateau over the Past 30 Years. Remote Sens. 2023, 15, 416. [Google Scholar] [CrossRef]
Abe, C.; Fujita, K.; Kawamoto, S.; Narama, C.; Nishimura, K.; Tadono, T.; Tomiyama, N.; Uda, T.; Ukita, J.; Yabuki, H.; et al. Glacial lake inventory of Bhutan using ALOS data: Methods and preliminary results. Ann. Glaciol. 2011, 52, 65–71. [Google Scholar] [CrossRef]
Xu, J.; Feng, M.; Sui, Y.; Yan, D.; Zhang, K.; Shi, K. Identifying Alpine Lakes in the Eastern Himalayas Using Deep Learning. Water 2023, 15, 229. [Google Scholar] [CrossRef]
Zhang, M.; Chen, F.; Guo, H.; Yi, L.; Zeng, J.; Li, B. Glacial Lake Area Changes in High Mountain Asia during 1990–2020 Using Satellite Remote Sensing. Research 2022, 2022, 9821275. [Google Scholar] [CrossRef]
Yin, Y.; Wang, X.; Liu, S.; Guo, X.; Zhang, Y.; Ran, W.; Wang, Q. Variation characteristics and influencing factors of glacial lakes in China from 1990 to 2020. Lake Sci. 2023, 35, 358–367. [Google Scholar]
Zhang, M.; Chen, F.; Zhao, H.; Wang, J.; Wang, N. Recent Changes of Glacial Lakes in the High Mountain Asia and Its Potential Controlling Factors Analysis. Remote Sens. 2021, 13, 3757. [Google Scholar] [CrossRef]

Figure 1. Distribution of glacial lakes of different sizes in the glaciered region of HMA (pink). Black squares identify where monthly variations in glacial lakes area were detected.

Figure 2. Temporal phase of remote sensing images in High Mountain Asia (143 Sentinel-2 images from 2018 were used to train the deep learning model; 1644 Sentinel-2 images from 2022 were used for glacial lake boundary extraction).

Figure 3. Flowchart of glacial lake boundary extraction method.

Figure 4. Coordinate attention module (CONV represents a convolution layer, and the kernel size is 1. X AVGPOOL and Y AVGPOOL means the average adaptive pooling of X and Y, respectively. Split represents the splitting of the tensor into 1

\times

H

\times

C, 1

\times

W

\times

C. W represents the width. H represents the height. C represents the number of channels).

Figure 4. Coordinate attention module (CONV represents a convolution layer, and the kernel size is 1. X AVGPOOL and Y AVGPOOL means the average adaptive pooling of X and Y, respectively. Split represents the splitting of the tensor into 1

\times

H

\times

C, 1

\times

W

\times

C. W represents the width. H represents the height. C represents the number of channels).

Figure 5. Convolution block attention module.

Figure 6. Structure of the SPD-Conv module.

Figure 7. Network structure adopted in this study (convolution layer 3

\times

3: convolution kernel is a convolution layer with 3 steps of 2, convolution layer 1

\times

1: convolution kernel is 1, step size is 1).

Figure 7. Network structure adopted in this study (convolution layer 3

\times

3: convolution kernel is a convolution layer with 3 steps of 2, convolution layer 1

\times

1: convolution kernel is 1, step size is 1).

Figure 8. Glacial lake distribution results for the HMA and its subregions in 2022.

Figure 9. Contribution of different factors in glacial lake detection (a—RGB, b—synthesis of bands 8, 4, 3, c—synthesis of bands 11, 4, 3, d—adding terrain factors).

Figure 10. Relative anomaly of glacial lake area during June–October in HMA in 2022 (orange line represents the average of the relative anomaly of monthly glacial lake area; the left and right limits of the box represent the upper and lower quartiles of the monthly relative anomaly, respectively; and the whisker lines indicate the maximum relative anomaly of the different subregions of HMA).

Table 1. Comparison of mainstream instance segmentation algorithms (data for YOLOv5-seg and YOLOv8-seg were obtained from Ultralytics Inc. (Los Angeles, CA, USA); data for YOLOv7 and YOLOR were obtained from [38,39,40,41]), respectively).

Algorithm	Precision	Recall	mAP_0.5	F1 Score	Bands
YOLOv5-seg	0.6	0.64	0.56	0.62	8 4 3
YOLOv7-seg	0.651	0.71	0.78	0.70	8 4 3
YOLOR-seg	0.583	0.731	0.60	0.790	8 4 3
YOLOv8-seg	0.916	0.75	0.77	0.82	8 4 3
Improved YOLOv5-seg	0.702	0.711	0.75	0.71	4 3 2
YOLOv5-seg	0.673	0.701	0.724	0.687	4 3 2
Improved YOLOv5-seg(CA)	0.87	0.801	0.88	0.84	8 4 3
Improved YOLOv5-seg(CBAM)	0.95	0.928	0.96	0.94	8 4 3
Improved YOLOv5-seg(CA)	0.916	0.734	0.80	0.81	11 4 3

Table 2. Results of ablation experiments.

Index	CA	CBAM	Small-Object Detection Layer	SPD-Conv	C2f	mAP_0.5	F1 Score
YOLOv5-seg	-	-	-	-	-	0.56	0.62
1	√	-	-	-	-	0.62	0.65
2	√	-	√	-	-	0.73	0.70
YOLOv8-seg	-	-	-	-	√	0.77	0.82
4	√	-	-	-	√	0.88	0.84
5	-	√	-	-	√	0.88	0.85
6	√	-	-	√	√	0.87	0.83
Improved YOLOv5-seg (CBAM)	-	√	-	√	√	0.96	0.94

Table 3. Results of convolutional neural network in relation to the test sites.

Algorithm	Recall	Precision	S_accuracy	F1 Score	Bands
Improved YOLOv5-seg	0.70	0.80	0.95	0.75	(4,3,2)
Improved YOLOv5-seg (CA)	0.76	0.86	0.95	0.81	(8,4,3)
Improved YOLOv5-seg (CA)	0.81	0.84	0.94	0.82	(11,4,3)
Improved YOLOv5-seg (CBAM)	0.82	0.89	0.92	0.85	(8,4,3)
Combined the improved algorithms	0.87	0.89	0.96	0.89	(11,8,4,3,2)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yin, L.; Wang, X.; Du, W.; Yang, C.; Wei, J.; Wang, Q.; Lei, D.; Xiao, J. Using the Improved YOLOv5-Seg Network and Sentinel-2 Imagery to Map Glacial Lakes in High Mountain Asia. Remote Sens. 2024, 16, 2057. https://doi.org/10.3390/rs16122057

AMA Style

Yin L, Wang X, Du W, Yang C, Wei J, Wang Q, Lei D, Xiao J. Using the Improved YOLOv5-Seg Network and Sentinel-2 Imagery to Map Glacial Lakes in High Mountain Asia. Remote Sensing. 2024; 16(12):2057. https://doi.org/10.3390/rs16122057

Chicago/Turabian Style

Yin, Lichen, Xin Wang, Wentao Du, Chengde Yang, Junfeng Wei, Qiong Wang, Dongyu Lei, and Jingtao Xiao. 2024. "Using the Improved YOLOv5-Seg Network and Sentinel-2 Imagery to Map Glacial Lakes in High Mountain Asia" Remote Sensing 16, no. 12: 2057. https://doi.org/10.3390/rs16122057

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Using the Improved YOLOv5-Seg Network and Sentinel-2 Imagery to Map Glacial Lakes in High Mountain Asia

Abstract

1. Introduction

2. Materials and Methods

2.1. Data Sources

2.2. Methods

2.2.1. Data Preprocessing

2.2.2. Improved Convolutional Neural Network

2.2.3. Postprocessing and Accuracy Assessment

3. Results

3.1. Performance of the Proposed Method

3.2. Mapping of Glacial Lakes in HMA

4. Discussion

4.1. Advantages of the Improved Strategies

4.2. Reliability of the Present Glacial Lake Inventory

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI