A Novel Post-Processing Method Based on a Weighted Composite Filter for Enhancing Semantic Segmentation Results
Abstract
:1. Introduction
- A new post-processing strategy with a unified framework is proposed. It consists of three consecutive sub-strategies, including (i) Selection and normalization, (ii) Weighted composite filter (WCF), and (iii) Argmax. It provides a new enhancement solution for semantic segmentation results outside the framework based on CRFs.
- A novel WCF is proposed, in which a local guided image filter and a minimum spanning tree (MST)-based filter are combined by adjustable weights.
- Compared with the complex theory and structure of CRFs, the proposed enhancement method can combine advantages of the local characteristics of boundary protection and the global characteristics of recognizing global similarity, and is qualified to solve the inherent problems in the semantic segmentation algorithms with theoretical simplicity.
2. Related Work
2.1. Local Image Filter
2.2. Minimum Spanning Tree (MST) for Global Filtering
3. Method
3.1. Definition of Weighted Composite Filter (WCF)
3.1.1. Guided Image Filter for Local Filtering
3.1.2. MST-Based Filter for Global Filtering
3.2. Enhancement Method Based on WCF
- (i)
- Selection and normalization. Since the size of the label assignment probability computed by a deep convolutional neural network (DCNN) [37] is different from the original image, an up-sampling operation ought to be used to resize the probability. For original probability, we utilize bi-linear interpolation to reach the original image resolution. For the assumption that only the classes occurring in the coarse semantic image segmentation mask would influence the segmentation results, only the assignment probability of class labels is chosen for improvement methods instead of using all classes, which reduces the computed quantity. Meanwhile, the uncertain range of values and output values of DCNN for each pixel and each class are normalized to the same order as the image pixel values. Only in this way can the label assignment probability get updated, and the filtering process would be effective. Blue arrows in Figure 3 represent the step (i).
- (ii)
- (iii)
- Argmax. The argmax function [38] of each pixel-bit depth vector is used to decompose the predicted values into segmentation masks and to get the enhancement result. The enhancement method based on WCF is then employed to improve the segmentation result and better capture the object boundaries. Red arrows in Figure 3 represent the step (iii).
4. Experiments
4.1. Training and Parameters Selection
4.2. Evaluation
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Zhang, Y.; David, P.; Foroosh, H.; Gong, B. A curriculum domain adaptation approach to the semantic segmentation of urban scenes. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 1823–1841. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Arsalan, M.; Baek, N.R.; Owais, M.; Mahmood, T.; Park, K.R. Deep learning-based detection of pigment signs for analysis and diagnosis of retinitis pigmentosa. Sensors 2020, 20, 12. [Google Scholar] [CrossRef] [PubMed]
- Lee, H.; Park, J.; Hwang, J.Y. Channel attention module with multiscale grid average pooling for breast cancer segmentation in an ultrasound image. IEEE Trans. Ultrason. Ferroelectr. Freq. Control. 2020, 67, 1344–1353. [Google Scholar] [PubMed]
- Shelhamer, E.; Long, J.; Darrell, T. Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 640–651. [Google Scholar] [CrossRef] [PubMed]
- Li, L.; Li, Z.; Li, Y.; Chen, C.; Yu, J.; Zhang, C. Small infrared target detection based on local difference adaptive measure. IEEE Geosci. Remote. Sens. Lett. 2020, 17, 1258–1262. [Google Scholar] [CrossRef]
- Liu, X.; Xu, Q.; Adkins, G.; Medwedeff, E.; Lin, L.; Yan, S. Learning semisupervised multilabel fully convolutional network for hierarchical object parsing. IEEE Trans. Neural Netw. Learn. Syst. 2020, 31, 2500–2509. [Google Scholar] [CrossRef]
- Lin, C.Y.; Chiu, Y.C.; Ng, H.F.; Shih, T.K.; Lin, K.H. Global-and-local context network for semantic segmentation of street view images. Sensors 2020, 20, 10. [Google Scholar] [CrossRef]
- Fu, J.; Liu, J.; Jiang, J.; Li, Y.; Bao, Y.; Lu, H. Scene segmentation with dual relation-aware attention network. IEEE Trans. Neural Netw. Learn. Syst. 2020, 1–14. [Google Scholar] [CrossRef]
- Zhang, Y.; Hou, Y.; Zhou, S.; Ouyang, K. Encoding time series as multi-scale signed recurrence plots for classification using fully convolutional networks. Sensors 2020, 20, 14. [Google Scholar] [CrossRef] [Green Version]
- Chai, D.; Newsam, S.; Huang, J. Aerial image semantic segmentation using DCNN predicted distance maps. ISPRS J. Photogramm. Remote. Sens. 2020, 161, 309–322. [Google Scholar] [CrossRef]
- Mi, L.; Chen, Z. Superpixel-enhanced deep neural forest for remote sensing image semantic segmentation. ISPRS J. Photogramm. Remote. Sens. 2020, 159, 140–152. [Google Scholar] [CrossRef]
- Alberto, G.G.; Sergio, O.E.; Sergiu, O.; Victor, V.M.; Jose, G.R. A Review on deep learning techniques applied to semantic segmentation. arXiv 2017, arXiv:1704.06857v1. [Google Scholar]
- Hao, S.; Zhou, Y.; Guo, Y. A brief survey on semantic segmentation with deep learning. Neurocomputing 2020, 406, 302–321. [Google Scholar] [CrossRef]
- Chandra, S.; Kokkinos, I. Fast, exact and multi-scale inference for semantic image segmentation with deep gaussian CRFs. In Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands, 8–16 October 2016; pp. 402–418. [Google Scholar]
- Guosheng, L.; Chunhua, S.; Anton, V.D.H.; Ian, R. Efficient piecewise training of deep structured models for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 3194–3203. [Google Scholar]
- Gould, S.; Rodgers, J.; Cohen, D.; Elidan, G.; Koller, D. Multi-class segmentation with relative location prior. Int. J. Comput. Vis. 2008, 80, 300–316. [Google Scholar] [CrossRef]
- Krähenbühl, P.; Koltun, V. Efficient inference in fully connected CRFs with Gaussian edge potentials. arXiv 2012, arXiv:1210.5644. [Google Scholar]
- Wang, P.; Shen, C.; van den Hengel, A. Efficient SDP inference for fully-connected CRFs Based on low-rank decomposition. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; pp. 3222–3231. [Google Scholar]
- Chen, L.C.; Papandreou, G.; Kokkinos, I.; Murphy, K.; Yuille, A.L. DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 40, 834–848. [Google Scholar] [CrossRef]
- Zhong, Z.; Li, J.; Clausi, D.A.; Wong, A. Generative adversarial networks and conditional random fields for hyperspectral image classification. IEEE Trans. Cybern. 2020, 50, 3318–3329. [Google Scholar] [CrossRef] [Green Version]
- Miclea, V.C.; Nedevschi, S. Real-time semantic segmentation-based stereo reconstruction. IEEE Trans. Intell. Transp. Syst. 2020, 21, 1514–1524. [Google Scholar] [CrossRef]
- Zhang, Y.; Huynh, C.P.; Ngan, K.N. Feature fusion with predictive weighting for spectral image classification and segmentation. IEEE Trans. Geosci. Remote. Sens. 2019, 57, 6792–6807. [Google Scholar] [CrossRef]
- Himayat, N.; Kassam, S. Approximate performance abalysis of edge-preserving filters. IEEE Trans. Singal Process. 1993, 41, 2764–2777. [Google Scholar] [CrossRef]
- Tomasi, C.; Manduchi, R. Bilateral filtering for gray and color images. In Proceedings of the Sixth International Conference on Computer Vision, Bombay, India, India, 7 January 1998; pp. 839–846. [Google Scholar]
- Li, Z.; Zheng, J.; Zhu, Z.; Yao, W.; Wu, S.; Rahardja, S. Content adaptive bilateral filtering. In Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), San Jose, CA, USA, 15–19 July 2013. [Google Scholar]
- Ghosh, S.; Gavaskar, R.G.; Panda, D.; Chaudhury, K.N. Fast scale-adaptive bilateral texture smoothing. IEEE Trans. Circuits Syst. Video Technol. 2020, 30, 2015–2026. [Google Scholar] [CrossRef]
- Guo, X.; Yu, L.; Ma, J.; Ling, H. Mutually guided image filtering. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 694–707. [Google Scholar] [CrossRef] [PubMed]
- He, K.; Sun, J.; Tang, X. Guided image filtering. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 1397–1409. [Google Scholar] [CrossRef] [PubMed]
- Weiss, B. Fast median and bilateral filtering. ACM Trans. Graph. 2006, 25, 519–526. [Google Scholar] [CrossRef]
- Lai, Q.; Nie, Y.; Sun, H.; Xu, Q.; Zhang, Z.; Xiao, M. Video super-resolution via pre-frame constrained and deep-feature enhanced sparse reconstruction. Pattern Recognit. 2020, 100, 107139. [Google Scholar] [CrossRef]
- Economou, G.; Pothos, V.; Ifantis, A. Geodesic distance and MST based image segmentation. In Proceedings of the 2004 12th European Signal Processing Conference, Vienna, Austria, 6–10 September 2004; pp. 941–944. [Google Scholar]
- Yang, Q. Stereo matching using tree Filtering. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 37, 834–846. [Google Scholar] [CrossRef]
- Li, J.; Wang, X.; Wang, X. A scaled-MST-based clustering algorithm and application on image segmentation. J. Intell. Inf. Syst. 2020, 54, 501–525. [Google Scholar] [CrossRef]
- Sureshkumar, G.; Baskaran, R.; Sathya, A.; Deivamani, A. Automatic image annotation using colour entropy and region contours. In Proceedings of the 2009 IEEE International Advance Computing Conference, VOLS 1-3, Patiala, India, 6–7 March 2009. [Google Scholar]
- Zahn, C.T. Graph-theoretical methods for detecting and describing gestalt clusters. IEEE Trans. Comput. 1971, 20, 68–86. [Google Scholar] [CrossRef] [Green Version]
- Zhang, L.; Wang, L.; Zhang, X.; Shen, P.; Bennamoun, M.; Zhu, G.; Shah, S.A.A.; Song, J. Semantic scene completion with dense CRF from a single depth image. Neurocomputing 2018, 318, 182–195. [Google Scholar] [CrossRef]
- Essien, A.; Giannetti, C. A deep learning model for smart manufacturing using convolutional LSTM neural network autoencoders. IEEE Trans. Ind. Inform. 2020, 16, 6069–6078. [Google Scholar] [CrossRef] [Green Version]
- Lee, J.; Kim, D.; Ponce, J.; Ham, B. SFNet: Learning object-aware semantic correspondence. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), Long Beach, CA, USA, 16–20 June 2019; pp. 2273–2282. [Google Scholar]
- Wu, H.; Zhang, J.; Huang, K.; Liang, K.; Yu, Y. FastFCN: Rethinking Dilated Convolution in the Backbone for Semantic Segmentatio. arXiv 2019, arXiv:1903.11816. [Google Scholar]
- Zhao, H.; Shi, J.; Qi, X.; Wang, X.; Jia, J. Pyramid scene parsing network. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), Honolulu, HI, USA, 21–26 July 2017; pp. 6230–6239. [Google Scholar]
- Teichmann, M.; Cipolla, R. Convolutional CRFs for semantic segmentation. arXiv 2018, arXiv:1805.04777v2. [Google Scholar]
Methods of Image Semantic Segmentation | ||||||||
---|---|---|---|---|---|---|---|---|
FCN | FastFCN | DeepLab | PSPNet | |||||
FCN | FCN + WCF | FastFCN | FastFCN + WCF | DeepLab | DeepLab + WCF | PSPNet | PSPNet + WCF | |
MIOU | 0.52448 | 0.53490 | 0.62857 | 0.64315 | 0.62937 | 0.64314 | 0.79395 | 0.80239 |
PA | 0.88645 | 0.89071 | 0.90289 | 0.90904 | 0.90306 | 0.90895 | 0.95031 | 0.95289 |
Methods of Image Semantic Segmentation | |||
---|---|---|---|
PSPNet | |||
PSPNet | PSPNet+CRF | PSPNet+WCF | |
MIOU | 0.79395 | 0.79584 | 0.80239 |
PA | 0.95031 | 0.95080 | 0.95289 |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cheng, X.; Liu, H. A Novel Post-Processing Method Based on a Weighted Composite Filter for Enhancing Semantic Segmentation Results. Sensors 2020, 20, 5500. https://doi.org/10.3390/s20195500
Cheng X, Liu H. A Novel Post-Processing Method Based on a Weighted Composite Filter for Enhancing Semantic Segmentation Results. Sensors. 2020; 20(19):5500. https://doi.org/10.3390/s20195500
Chicago/Turabian StyleCheng, Xin, and Huashan Liu. 2020. "A Novel Post-Processing Method Based on a Weighted Composite Filter for Enhancing Semantic Segmentation Results" Sensors 20, no. 19: 5500. https://doi.org/10.3390/s20195500
APA StyleCheng, X., & Liu, H. (2020). A Novel Post-Processing Method Based on a Weighted Composite Filter for Enhancing Semantic Segmentation Results. Sensors, 20(19), 5500. https://doi.org/10.3390/s20195500