Gully Erosion Monitoring Based on Semi-Supervised Semantic Segmentation with Boundary-Guided Pseudo-Label Generation Strategy and Adaptive Loss Function
Abstract
:1. Introduction
- (1)
- To improve the reliability of pseudo-labels, we propose a boundary-guided pseudo-label generation strategy (BPGS), which is composed of an object boundary generator and a multi-evidence fusion strategy. First, an object boundary generator based on CNNs and superpixel segmentation is proposed to output boundary maps of geographic objects, aiming to exploit the structural prior information and neighborhood correlation of pixels. On this basis, guided by these boundary maps, a multi-evidence fusion strategy is designed to fully utilize historical labels as well as the output of the student model and the teacher model to generate high-quality pseudo-labels.
- (2)
- To alleviate the impact of pseudo-label noise, an adaptive loss function based on centroid similarity is developed. In this loss function, centroid similarity (CSIM) is designed to measure the reliability of pseudo-labels and adjust the loss value. In this manner, the loss value is weighted lower when the reliability is lower. Consequently, we can endure pseudo-label noise through this loss function during the training process.
- (3)
- To validate our method, two benchmark datasets for gully erosion monitoring are constructed according to the satellite data acquired in northeastern China.
2. Method
2.1. Boundary-Guided Pseudo-Label Generation Strategy
2.1.1. Object Boundary Generator
- (1)
- Dual attention network: As shown in Figure 3, the dual attention network consists of three parts, taking an image as the input and the predictions of this image as the output. The first part is features encoding, consisting of three convolution blocks. Each block includes a convolution layer, a batch normalization layer, and a ReLU activation function layer. The second part is dual attention enhancement, which is beneficial for focusing on the information of interest and suppressing irrelevant background information. In this part, we adopt a channel attention module and a spatial attention module, as shown in Figure 4. As for the third part, a convolution layer, a batch normalization layer, and an argmax method are used to decode the features. With the above three parts, pixels with similar features can be aggregated into an object.
- (2)
- SLIC refinement: The dual attention network aims to aggregate pixels with similar features into one object. However, it is also preferable for objects to be spatially continuous [29]. Meanwhile, the SLIC method considers the spatial correlation of pixels in images when generating irregular pixel regions (superpixels). Therefore, we adopt the SLIC method to refine the output of the dual attention network. Specifically, based on the SLIC method, we first extract the superpixel set from the input image, where O is the total number of superpixels, and denotes a set of the index of pixels belonging to the oth superpixel. Then, on the basis of the predictions obtained from the dual attention network and , we can obtain (the predictions for pixels belonging to ). Next, we force the most frequent category in as the category for all pixels of . Finally, iterate the above steps until all superpixels are updated. Based on the above operations, continuous pixels with similar features are aggregated into the same object.
2.1.2. Multi-Evidence Fusion Strategy
2.2. The Adaptive Loss Function Based on Centroid Similarity
2.3. Model Training and Testing
2.3.1. Iterative Training Framework
Algorithm 1 Iterative Training Framework |
Input: Historical data (labeled data) DL, the latest monitoring data (unlabeled data) DU. Output: Trained model and . 1 Initialize and with different pre-trained weights. 2 Train on DL. 3 Train on DL. 4 for , do 5 Predict on DU with . 6 Predict on DU with . 7 Use BPGS to fuse historical labels and the output of and pseudo-labeled data DPL. 8 Fine tune from on both DL and latest DPL with the adaptive loss . 9 Predict on DU with . 10 Use BPGS to fuse historical labels and the output of and pseudo-labeled data DPL. 11 Fine tune from on both DL and latest DPL with the adaptive loss . 12 end for |
2.3.2. Testing Process
3. Experiments and Results
3.1. Datasets
3.1.1. Dataset Description
3.1.2. Label of Dataset
3.2. Experimental Design and Implementation Details
3.2.1. Experimental Design
3.2.2. Implementation Details
3.3. Experiment 1: HC2012 to HC2020
3.4. Experiment 2: HC2020 to HC2012
4. Discussion
4.1. Ablation Study
4.2. Analysis of the Setting of Similarity Regulation Indicator
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
References
- Valentin, C.; Poesen, J.; Li, Y. Gully Erosion: Impacts, Factors and Control. Catena 2005, 63, 132–153. [Google Scholar] [CrossRef]
- Hitouri, S.; Varasano, A.; Mohajane, M.; Ijlil, S.; Essahlaoui, N.; Ali, S.A.; Essahlaoui, A.; Pham, Q.B.; Waleed, M.; Palateerdham, S.K.; et al. Hybrid Machine Learning Approach for Gully Erosion Mapping Susceptibility at a Watershed Scale. ISPRS Int. J. Geo-Inf. 2022, 11, 401. [Google Scholar] [CrossRef]
- Wang, Z.; Zhang, G.; Wang, C.; Xing, S. Assessment of the Gully Erosion Susceptibility Using Three Hybrid Models in One Small Watershed on the Loess Plateau. Soil Tillage Res. 2022, 223, 105481. [Google Scholar] [CrossRef]
- Kong, H.; Wu, D.; Yang, L. Quantification of Soil Erosion in Small Watersheds on the Loess Plateau Based on a Modified Soil Loss Model. Water Supply 2022, 22, 6308–6320. [Google Scholar] [CrossRef]
- Rafique, N.; Bhat, M.S.; Muntazari, T.H. Identification and Mapping of Land Degradation through Remote Sensing in Budgam District of Jammu and Kashmir, India. Indian J. Ecol. 2022, 49, 602–606. [Google Scholar]
- Wang, R.; Sun, H.; Yang, J.; Zhang, S.; Fu, H.; Wang, N.; Liu, Q. Quantitative Evaluation of Gully Erosion Using Multitemporal UAV Data in the Southern Black Soil Region of Northeast China: A Case Study. Remote Sens. 2022, 14, 1479. [Google Scholar] [CrossRef]
- Liu, G.; Zheng, F.; Wilson, G.V.; Xu, X.; Liu, C. Three decades of ephemeral gully erosion studies. Soil Tillage Res. 2021, 212, 105046. [Google Scholar] [CrossRef]
- Slimane, A.B.; Raclot, D.; Rebai, H.; Le Bissonnais, Y.; Planchon, O.; Bouksila, F. Combining field monitoring and aerial imagery to evaluate the role of gully erosion in a Mediterranean catchment (Tunisia). Catena 2018, 170, 73–83. [Google Scholar] [CrossRef]
- Evans, M.; Lindsay, J. High resolution quantification of gully erosion in upland peatlands at the landscape scale. Earth Surf. Proc. Land 2010, 35, 876–886. [Google Scholar] [CrossRef]
- Li, H.; Cruse, R.M.; Bingner, R.L.; Gesch, K.R.; Zhang, X. Evaluating ephemeral gully erosion impact on Zea mays L. yield and economics using AnnAGNPS. Soil Till. Res. 2016, 155, 157–165. [Google Scholar] [CrossRef]
- Guo, Y.; Liu, Y.; Georgiou, T.; Lew, M.S. A review of semantic segmentation using deep neural networks. Int. J. Multimed. Inf. Retr. 2018, 7, 87–93. [Google Scholar] [CrossRef]
- Shivappriya, S.N.; Priyadarsini, M.J.P.; Stateczny, A.; Puttamadappa, C.; Parameshachari, B.D. Cascade object detection and remote sensing object detection method based on trainable activation function. Remote Sens. 2021, 13, 200. [Google Scholar] [CrossRef]
- Xie, S.; Hu, H. Facial Expression Recognition Using Hierarchical Features with Deep Comprehensive Multipatches Aggregation Convolutional Neural Networks. IEEE Trans. Multimed. 2018, 21, 211–220. [Google Scholar] [CrossRef]
- Song, J.; Gao, S.; Zhu, Y.; Ma, C. A Survey of Remote Sensing Image Classification Based on CNNs. Big Earth Data 2019, 3, 232–254. [Google Scholar] [CrossRef]
- Wang, G.; Fan, B.; Xiang, S.; Pan, C. Aggregating rich hierarchical features for scene classification in remote sensing imagery. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 4104–4115. [Google Scholar] [CrossRef]
- Wang, C.; Qiu, X.; Huan, H.; Wang, S.; Zhang, Y.; Chen, X.; He, W. Earthquake-Damaged Buildings Detection in Very High-Resolution Remote Sensing Images Based on Object Context and Boundary Enhanced Loss. Remote Sens. 2021, 13, 3119. [Google Scholar] [CrossRef]
- Yu, M.; Chen, X.; Zhang, W.; Liu, Y. AGs-Unet: Building Extraction Model for High Resolution Remote Sensing Images Based on Attention Gates U Network. Sensors 2022, 22, 2932. [Google Scholar] [CrossRef]
- Zhu, Q.; Li, Z.; Zhang, Y.; Guan, Q. Building Extraction from High Spatial Resolution Remote Sensing Images via Multiscale-Aware and Segmentation-Prior Conditional Random Fields. Remote Sens. 2020, 12, 3983. [Google Scholar] [CrossRef]
- Zhang, P.; Zhang, B.; Zhang, T.; Chen, D.; Wang, Y.; Wen, F. Prototypical pseudo label denoising and target structure learning for domain adaptive semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Online, 19–25 June 2021; pp. 12414–12424. [Google Scholar]
- Zhang, Q.; Zhang, J.; Liu, W.; Tao, D. Category anchor-guided unsupervised domain adaptation for semantic segmentation. In Proceedings of the Conference and Workshop on Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019; p. 32. [Google Scholar]
- Yang, L.; Zhuo, W.; Qi, L.; Shi, Y.; Gao, Y. St++: Make Self-Training Work Better for Semi-Supervised Semantic Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 19–23 June 2022; pp. 4268–4277. [Google Scholar]
- Xu, Y.; Shang, L.; Ye, J.; Qian, Q.; Li, Y.F.; Sun, B.; Jin, R. Dash: Semi-supervised learning with dynamic thresholding. In Proceedings of the International Conference on Machine Learning, Online, 18–24 July 2021; pp. 11525–11536. [Google Scholar]
- Zou, Y.; Zhang, Z.; Zhang, H.; Li, C.L.; Bian, X.; Huang, J.B.; Pfister, T. Pseudoseg: Designing pseudo labels for semantic segmentation. arXiv 2020, arXiv:2010.09713. [Google Scholar]
- Zuo, S.; Yu, Y.; Liang, C.; Jiang, H.; Er, S.; Zhang, C.; Zha, H. Self-training with differentiable teacher. arXiv 2021, arXiv:2109.07049. [Google Scholar]
- Zhang, W.; Zhu, L.; Hallinan, J.; Zhang, S.; Makmur, A.; Cai, Q.; Ooi, B.C. Boostmis: Boosting Medical Image Semi-Supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 19–23 June 2022; pp. 20666–20676. [Google Scholar]
- Wang, Y.; Wang, H.; Shen, Y.; Fei, J.; Li, W.; Jin, G.; Le, X. Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 19–23 June 2022; pp. 4248–4257. [Google Scholar]
- Yao, H.; Hu, X.; Li, X. Enhancing Pseudo Label Quality for Semi-Supervised Domain-Generalized Medical Image Segmentation. arXiv 2022, arXiv:2201.08657. [Google Scholar] [CrossRef]
- He, R.; Yang, J.; Qi, X. Re-Distributing Biased Pseudo Labels for Semi-Supervised Semantic Segmentation: A Baseline Investigation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Online, 19–25 June 2021; pp. 6930–6940. [Google Scholar]
- Kanezaki, A. Unsupervised Image Segmentation by Backpropagation. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 15–20 April 2018; pp. 1543–1547. [Google Scholar]
- Wang, C.; Shi, A.; Wang, X.; Wu, F.; Huang, F.; Xu, L. A novel multi-scale segmentation algorithm for high resolution remote sensing images based on wavelet transform and improved JSEG algorithm. Light Electron. Opt. 2014, 125, 5588–5595. [Google Scholar] [CrossRef]
- Achanta, R.; Shaji, A.; Smith, K.; Lucchi, A.; Fua, P.; Süsstrunk, S. SLIC Superpixels Compared to State-of-the-art Superpixel Methods. IEEE Trans. Pattern Anal. 2012, 34, 2274–2282. [Google Scholar] [CrossRef] [PubMed]
- Csillik, O. Fast segmentation and classification of very high resolution remote sensing data using SLIC superpixels. Remote Sens. 2017, 9, 243. [Google Scholar] [CrossRef] [Green Version]
- Feng, Z.; Zhou, Q.; Gu, Q.; Tan, X.; Cheng, G.; Lu, X.; Ma, L. Dmt: Dynamic Mutual Training for Semi-Supervised Learning. Pattern Recogn. 2022, 2022, 108777. [Google Scholar] [CrossRef]
- Wang, C.; Liu, H.; Shen, Y.; Zhao, K.; Xing, H.; Wu, H. High-Resolution Remote-Sensing Image-Change Detection Based on Morphological Attribute Profiles and Decision Fusion. Complexity 2020, 171, 8360361. [Google Scholar] [CrossRef]
- Shi, A.; Gao, G.; Shen, S. Change detection of bitemporal multispectral images based on FCM and DS theory. EURASIP J. Adv. Sig. Process. 2016, 2016, 96. [Google Scholar] [CrossRef] [Green Version]
- Dempster, A.P. Upper and lower probabilities induced by multivalue mapping. Ann. Math. Stat. 1967, 38, 325–339. [Google Scholar] [CrossRef]
- Wang, C.; Zhang, Y.; Chen, X.; Jiang, H.; Mukherjee, M.; Wang, S. Automatic Building Detection from High-Resolution Remote Sensing Images Based on Joint Optimization and Decision Fusion of Morphological Attribute Profiles. Remote Sens. 2021, 13, 357. [Google Scholar] [CrossRef]
- Trivedi, M.M.; Mills, J.K. Centroid calculation of the blastomere from 3D Z-Stack image data of a 2-cell mouse embryo. Biomed Signal. Proces. 2020, 57, 101726. [Google Scholar] [CrossRef]
- Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Brunet, D.; Vrscay, E.R.; Wang, Z. On the mathematical properties of the structural similarity index. IEEE Trans. Image Process. 2011, 21, 1488–1499. [Google Scholar] [CrossRef] [PubMed]
- Yu, C.; Gao, C.; Wang, J.; Yu, G.; Shen, C.; Sang, N. Bisenet v2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation. Int. J. Comput. Vision 2021, 129, 3051–3068. [Google Scholar] [CrossRef]
- Xie, E.; Wang, W.; Yu, Z.; Anandkumar, A.; Alvarez, J.M.; Luo, P. SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. In Proceedings of the Conference and Workshop on Neural Information Processing Systems, Vancouver, BC, Canada, 6–14 December 2021; pp. 12077–12090. [Google Scholar]
- Pozzer, S.; Rezazadeh Azar, E.; Dalla Rosa, F.; Chamberlain Pravia, Z.M. Semantic segmentation of defects in infrared thermographic images of highly damaged concrete structures. J. Perform. Constr. Facil. 2021, 35, 04020131. [Google Scholar] [CrossRef]
- Xia, L.; Zhang, R.; Chen, L.; Li, L.; Yi, T.; Wen, Y.; Xie, C. Evaluation of Deep Learning Segmentation Models for Detection of Pine Wilt Disease in Unmanned Aerial Vehicle Images. Remote Sens. 2021, 13, 3594. [Google Scholar] [CrossRef]
- Peng, X.; Zhong, R.; Li, Z.; Li, Q. Optical remote sensing image change detection based on attention mechanism and image difference. IEEE Trans. Geosci. Remote 2020, 59, 7296–7307. [Google Scholar] [CrossRef]
- He, N.; Fang, L.; Plaza, A. Hybrid first and second order attention Unet for building segmentation in remote sensing images. Sci. China Inform. Sci. 2020, 63, 140305. [Google Scholar] [CrossRef]
- Lin, T.Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Zitnick, C.L. Microsoft Coco: Common Objects in Context. In Proceedings of the European Conference on Computer Vision (ECCV), Zurich, Switzerland, 5–12 September 2014; pp. 740–755. [Google Scholar]
- Deng, J.; Dong, W.; Socher, R.; Li, L.J.; Li, K.; Fei-Fei, L. Imagenet: A Large-Scale Hierarchical Image Database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Miami, FL, USA, 20–25 June 2009; pp. 248–255. [Google Scholar]
Methods | Precision | Recall | F1 score | IoU |
---|---|---|---|---|
Bisenetv2 [41] | 50.9% | 40.9% | 45.4% | 18.9% |
Segformer [42] | 52.7% | 39.4% | 45.1% | 18.6% |
U2PL [26] | 43.2% | 83.8% | 57% | 41.1% |
DMT [33] | 44.8% | 85% | 58.7% | 42.4% |
The proposed method | 58.7% | 60.4% | 59.5% | 64.5% |
Methods | Precision | Recall | F1 score | IoU |
---|---|---|---|---|
Bisenetv2 | 58.8% | 26.6% | 36.6% | 18.1% |
Segformer | 57.4% | 25.9 % | 35.7% | 16.2 % |
U2PL | 41.9% | 82.8% | 55.6% | 39.3% |
DMT | 44.5% | 85.5% | 58.5% | 42.1% |
The proposed method | 63.9% | 65.2% | 64.5% | 60.4% |
Experiment | Method | BPGS | Adaptive Loss | IoU | Δ(IoU) |
---|---|---|---|---|---|
Experiment 1 | Baseline | – | – | 44.7% | +0.0% |
Baseline | √ | – | 56.1% | +11.4% | |
Baseline | √ | √ | 64.5% | +19.8% | |
Experiment 2 | Baseline | – | – | 43.6% | +0.0% |
Baseline | √ | – | 53.3% | +9.7% | |
Baseline | √ | √ | 60.4% | +16.8% |
Experiment 1 | 0 | 0.05 | 0.1 | 0.15 | 0.2 | 0.25 | 0.3 | 0.35 | 0.4 | 0.45 | 0.5 | |
IoU (%) | 51.4 | 52.1 | 53.5 | 54.7 | 56.3 | 57.5 | 59.8 | 60.7 | 62.2 | 63.4 | 62.7 | |
0.55 | 0.6 | 0.65 | 0.7 | 0.75 | 0.8 | 0.85 | 0.9 | 0.95 | 1 | |||
IoU (%) | 63.3 | 64.5 | 62.9 | 60.3 | 58.7 | 56.8 | 55.4 | 54.8 | 54.3 | 53.6 | ||
Experiment 2 | 0 | 0.05 | 0.1 | 0.15 | 0.2 | 0.25 | 0.3 | 0.35 | 0.4 | 0.45 | 0.5 | |
IoU (%) | 49.8 | 49.6 | 50.7 | 51.2 | 53.9 | 55.3 | 57.1 | 59.5 | 58.3 | 59.4 | 60.1 | |
0.55 | 0.6 | 0.65 | 0.7 | 0.75 | 0.8 | 0.85 | 0.9 | 0.95 | 1 | |||
IoU (%) | 60.9 | 60.4 | 59.7 | 58.1 | 57.8 | 55.9 | 53.4 | 53.6 | 52.5 | 52.1 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhao, C.; Shen, Y.; Su, N.; Yan, Y.; Liu, Y. Gully Erosion Monitoring Based on Semi-Supervised Semantic Segmentation with Boundary-Guided Pseudo-Label Generation Strategy and Adaptive Loss Function. Remote Sens. 2022, 14, 5110. https://doi.org/10.3390/rs14205110
Zhao C, Shen Y, Su N, Yan Y, Liu Y. Gully Erosion Monitoring Based on Semi-Supervised Semantic Segmentation with Boundary-Guided Pseudo-Label Generation Strategy and Adaptive Loss Function. Remote Sensing. 2022; 14(20):5110. https://doi.org/10.3390/rs14205110
Chicago/Turabian StyleZhao, Chunhui, Yi Shen, Nan Su, Yiming Yan, and Yong Liu. 2022. "Gully Erosion Monitoring Based on Semi-Supervised Semantic Segmentation with Boundary-Guided Pseudo-Label Generation Strategy and Adaptive Loss Function" Remote Sensing 14, no. 20: 5110. https://doi.org/10.3390/rs14205110
APA StyleZhao, C., Shen, Y., Su, N., Yan, Y., & Liu, Y. (2022). Gully Erosion Monitoring Based on Semi-Supervised Semantic Segmentation with Boundary-Guided Pseudo-Label Generation Strategy and Adaptive Loss Function. Remote Sensing, 14(20), 5110. https://doi.org/10.3390/rs14205110