An Efficient Hybrid Linear Clustering Superpixel Decomposition Framework for Traffic Scene Semantic Segmentation
Abstract
:1. Introduction
- Redundant computations still exist in superpixels during region growing, since boundary pixels might be inspected more than once;
- The greedy strategy performed in a priority queue is prone to premature convergence;
- Once there is no global iterative updating, the single online average clustering is vulnerable to seed initialization.
- Different from grid sampling seeds for clustering pixels, the convergence results of a SLIC-like iterative clustering structure are utilized as inceptions for superpixel region clustering in the first stage. Accordingly, new centers are context-aware without falling into local optimum trap;
- Inspired by the process of online average clustering in SNIC, an efficient region-growing based label expansion structure is proposed to generate superpixels in a one-pass manner. Substantially, it is a series of re-labelling operations on the clustering results of the previous stage;
- To further reduce the computational cost of conventional linear clustering frameworks, two acceleration strategies are proposed for iterative and online average clustering, respectively. Consequently, the efficiency of optimized structures can be both guaranteed;
- The stage-by-stage clustering processes hybridize an integrated framework, wherein the two stages generate a synergistic effect to produce higher-quality superpixels. Compared with other state-of-the-art methods, it is evenly matched in terms of segmentation accuracy, spatial compactness and running efficiency.
2. Related Works
2.1. Feature Optimizations
2.2. Structure Optimizations
2.3. Clustering Optimizations
3. Proposed Framework
3.1. Fast Clustering Convergence
- 1.
- Modeling: The joint 5-dimensional feature of a pixel in an image with elements in CIELab color space can be modeled as . Specifically, and represent the 3-dimensional color feature and 2-dimensional position feature of , respectively.
- 2.
- Initialization: A set of seeds is evenly sampled from grids in , wherein is a hyper-parameter that expects the superpixel number. That is, is initially partitioned to regular grids with a step of .
- 3.
- Inspection: For each seed , it acts as the initial cluster center and then searches its square context region to calculate the correlation distance with the unlabeled pixels therein
- 4.
- Assignment: Once is inspected by all cluster centers whose search region contains itself, it acquires the same label of with the maxima correlation, i.e.,
- 5.
- Updating: Once all pixels within are labeled, the cluster centers are updated via k-means averaging
- If then keeps active and continues the SLIC procedure in the next loop;
- If , then is marked with the state of pre-convergence. Furthermore, if all adjacent clusters of are concurrently pre-convergent, then is marked with the state of convergence. In this case, all elements in are settled with the current label and excluded from the subsequent loops.
3.2. Efficient Label Expansion
- 1.
- Initialization (additional): A small-root priority queue is introduced, which returns the element with a minima key value. The grid sampled seeds are pushed in with zero key value;
- 2.
- Inspection: For each seed , it acts as the initial growing point of the cluster and then checks the four neighboring pixels to calculate the correlation distance with the unlabeled one through Equation (1). The distance is then recorded as the key value of in with a temporary label
- 3.
- Assignment: The top-most element is popped from , and the latest temporary label is settled, i.e., ,
- 4.
- Updating: becomes a steady member of the cluster , and then upgrades the center to as follows
3.3. Hybrid Implementation
- The number of clusters that participate in global updating gradually decreases within each loop, since the pre-converged superpixels are increasing. Compared with the original framework, the computational efficiency can be significantly boosted;
- The criterion of global iteration termination becomes adaptive for different input images and parameters. As a substitute, the overall iteration process could dynamically terminate itself only if all superpixels are marked as pre-converged.
4. Experiments and Discussions
4.1. Experiment Setup
4.2. Synergistic Effect Analysis
4.3. SOTA Comparisons
Methods | Expected Superpixel Number | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
50 | 100 | 150 | 200 | 250 | 300 | 350 | 400 | 450 | 500 | |
SCALP | 571 | 607 | 614 | 605 | 605 | 577 | 573 | 593 | 565 | 543 |
FLIC | 33 | 35 | 36 | 37 | 38 | 39 | 40 | 42 | 42 | 43 |
DBSCAN | - | - | - | 35 | 34 | 34 | 33 | 33 | 33 | 33 |
SLIC | 50 | 51 | 51 | 52 | 52 | 52 | 53 | 54 | 54 | 55 |
SNIC | 41 | 41 | 42 | 43 | 43 | 44 | 44 | 45 | 45 | 45 |
HLC | 35 | 37 | 38 | 38 | 39 | 40 | 39 | 40 | 40 | 40 |
5. Application
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
SLIC | simple linear iterative clustering |
SNIC | simple non-iterative clustering |
SOTA | state-of-the-art |
HLC | hybrid linear clustering |
WS | watershed superpixels |
SCALP | superpixels with contour adherence using linear path |
FLIC | fast linear iterative clustering |
DBSCAN | density-based spatial clustering of applications with noise |
GMM | Gaussian mixture model |
EM | expectation maximization |
BR | boundary recall |
UE | under-segmentation error |
ASA | achievable segmentation accuracy |
SC | shape compactness |
FCC | fast clustering convergence |
ELE | efficient label expansion |
MS | mean shift |
References
- Ren, X.; Malik, J. Learning a classification model for segmentation. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Nice, France, 13–16 October 2003; pp. 10–17. [Google Scholar]
- Wang, N.; Zeng, X.; Duan, Y.; Deng, B.; Mo, Y.; Xie, Z.; Duan, P. Multi-scale superpixel-guided structural profiles for hyperspectral image classification. Sensors 2022, 22, 8502. [Google Scholar] [CrossRef] [PubMed]
- He, Y.; Jin, L.; Wang, H.; Huo, Z.; Wang, G.; Sun, X. Automatic ROI setting method based on LSC for a traffic congestion area. Sustainability 2022, 14, 16126. [Google Scholar] [CrossRef]
- Farmakis, I.; Karantanellis, E.; Hutchinson, D.J.; Vlachopoulos, N.; Marinos, V. Superpixel and supervoxel segmentation assessment of landslides using UAV-derived models. Remote Sens. 2022, 14, 5668. [Google Scholar] [CrossRef]
- Wei, W.; Chen, W.; Xu, M. Co-saliency detection of RGBD image based on superpixel and hypergraph. Symmetry 2022, 14, 2393. [Google Scholar] [CrossRef]
- Wei, X.; Yang, Q.; Gong, Y.; Ahuja, N.; Yang, M. Superpixel Hierarchy. IEEE Trans. Image Process. 2018, 27, 4838–4849. [Google Scholar] [CrossRef]
- Xiao, X.; Zhou, Y.; Gong, Y. Content-adaptive superpixel segmentation. IEEE Trans. Image Process. 2018, 27, 2883–2896. [Google Scholar] [CrossRef]
- Achanta, R.; Shaji, A.; Smith, K.; Lucchi, A.; Fua, P.; Susstrunk, S. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 34, 2274–2282. [Google Scholar] [CrossRef] [Green Version]
- Levinshtein, A.; Stere, A.; Kutulakos, K.N.; Fleet, D.J.; Dickinson, S.J.; Siddiqi, K. Turbopixels: Fast superpixels using geometric flows. IEEE Trans. Pattern Anal. Mach. Intell. 2009, 31, 2290–2297. [Google Scholar] [CrossRef] [Green Version]
- Lloyd, S. Least squares quantization in PCM. IEEE Trans. Inf. Theory 1982, 28, 129–137. [Google Scholar] [CrossRef] [Green Version]
- Achanta, R.; Susstrunk, S. Superpixels and polygons using simple non-iterative clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 4895–4904. [Google Scholar]
- Hu, Z.; Zou, Q.; Li, Q. Watershed superpixel. In Proceedings of the IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada, 27–30 September 2015; pp. 349–353. [Google Scholar]
- Machairas, V.; Faessel, M.; Cardenas, D.; Chabardes, T.; Walter, T.; Decencière, E. Waterpixels. IEEE Trans. Image Process. 2015, 24, 3707–3716. [Google Scholar] [CrossRef]
- Liu, Y.; Yu, M.; Li, B.; He, Y. Intrinsic manifold SLIC: A simple and efficient method for computing content-sensitive superpixels. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 40, 653–666. [Google Scholar] [CrossRef]
- Giraud, R.; Ta, V.; Papadakis, N. Robust superpixels using color and contour features along linear path. Comput. Vis. Image Underst. 2018, 170, 1–13. [Google Scholar] [CrossRef] [Green Version]
- Choi, K.; Oh, K. Subsampling-based acceleration of simple linear iterative clustering for superpixel segmentation. Comput. Vis. Image Underst. 2016, 146, 1–8. [Google Scholar] [CrossRef]
- Zhao, J.; Hou, Q.; Ren, B.; Cheng, M.; Rosin, P. FLIC: Fast linear iterative clustering with active search. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), New Orleans, LA, USA, 2–7 February 2018; pp. 7574–7581. [Google Scholar]
- Shen, J.; Hao, X.; Liang, Z.; Liu, Y.; Wang, W.; Shao, L. Real-time superpixel segmentation by DBSCAN clustering algorithm. IEEE Trans. Image Process. 2016, 25, 5933–5942. [Google Scholar] [CrossRef] [Green Version]
- Ban, Z.; Liu, J.; Cao, L. Superpixel segmentation using Gaussian mixture model. IEEE Trans. Image Process. 2018, 27, 4105–4117. [Google Scholar] [CrossRef] [Green Version]
- Kang, X.; Zhu, L.; Ming, A. Dynamic random walk for superpixel segmentation. IEEE Trans. Image Process. 2020, 29, 3871–3884. [Google Scholar] [CrossRef]
- Wang, P.; Zeng, G.; Gan, R.; Wang, J.; Zha, H. Structure-sensitive superpixels via geodesic distance. Int. J. Comput. Vis. 2013, 103, 1–21. [Google Scholar] [CrossRef] [Green Version]
- Arbelaez, P.; Maire, M.; Fowlkes, C.; Malik, J. Contour detection and hierarchical image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 898–916. [Google Scholar] [CrossRef] [Green Version]
- Stutz, D.; Hermans, A.; Leibe, B. Superpixels: An evaluation of the state-of-the-art. Comput. Vis. Image Underst. 2018, 166, 1–27. [Google Scholar] [CrossRef] [Green Version]
- Liao, N.; Guo, B.; Li, C.; Liu, H.; Zhang, C. BACA: Superpixel segmentation with boundary awareness and content adaptation. Remote Sens. 2022, 14, 4572. [Google Scholar] [CrossRef]
- Liu, M.; Tuzel, O.; Ramalingam, S.; Chellappa, R. Entropy rate superpixel segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA, 20–25 June 2011; pp. 2097–2104. [Google Scholar]
- Jiang, J.; Ma, J.; Chen, C.; Wang, Z.; Cai, Z.; Wang, L. SuperPCA: A superpixelwise PCA approach for unsupervised feature extraction of hyperspectral imagery. IEEE Trans. Geosci. Remote Sens. 2018, 56, 4581–4593. [Google Scholar] [CrossRef] [Green Version]
- Kolekar, S.; Gite, s.; Pradhan, B.; Alamri, A. Explainable AI in scene understanding for autonomous vehicles in unstructured traffic environments on Indian roads using the inception U-Net model with Grad-CAM visualization. Sensors 2022, 22, 9677. [Google Scholar] [CrossRef] [PubMed]
- Comaniciu, D.; Meer, P. Mean shift: A robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 603–619. [Google Scholar] [CrossRef] [Green Version]
- Brostow, G.J.; Fauqueur, J.; Cipolla, R. Semantic object classes in video: A high-definition ground truth database. Pattern Recogn. Lett. 2009, 30, 88–97. [Google Scholar] [CrossRef]
- Huang, C.; Wang, W.; Wang, W.; Lin, S.; Lin, Y. USEAQ: Ultra-fast superpixel extraction via adaptive sampling from quantized regions. IEEE Trans. Image Process. 2018, 27, 4916–4931. [Google Scholar] [CrossRef]
- Li, Z.; Wu, X.; Chang, S. Segmentation using superpixels: A bipartite graph partitioning approach. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Providence, RI, USA, 16–21 June 2012; pp. 789–796. [Google Scholar]
SCALP | FLIC | DBSCAN | SLIC | SNIC | |
---|---|---|---|---|---|
Number | ✓ | ✓ | ✗ | ✓ | ✓ |
Compactness | ✓ | ✗ | ✓ | ✗ | ✓ |
Iteration | ✓ | ✓ | ✗ | ✓ | ✗ |
Complexity | |||||
Implementation | C++/Matlab | C/C++ | C++/Matlab | C/C++ | C++/Matlab |
SLIC | FCC | SNIC | ELE | HLC | |
---|---|---|---|---|---|
Initialization | 10 | - | 8 | - | 11 |
Clustering | 40 | 16 | 35 | 11 | 27 |
Post-processing | 2 | - | - | - | - |
Total time | 52 | - | 43 | - | 38 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhong, D.; Li, T.; Dong, Y. An Efficient Hybrid Linear Clustering Superpixel Decomposition Framework for Traffic Scene Semantic Segmentation. Sensors 2023, 23, 1002. https://doi.org/10.3390/s23021002
Zhong D, Li T, Dong Y. An Efficient Hybrid Linear Clustering Superpixel Decomposition Framework for Traffic Scene Semantic Segmentation. Sensors. 2023; 23(2):1002. https://doi.org/10.3390/s23021002
Chicago/Turabian StyleZhong, Dan, Tiehu Li, and Yuxuan Dong. 2023. "An Efficient Hybrid Linear Clustering Superpixel Decomposition Framework for Traffic Scene Semantic Segmentation" Sensors 23, no. 2: 1002. https://doi.org/10.3390/s23021002
APA StyleZhong, D., Li, T., & Dong, Y. (2023). An Efficient Hybrid Linear Clustering Superpixel Decomposition Framework for Traffic Scene Semantic Segmentation. Sensors, 23(2), 1002. https://doi.org/10.3390/s23021002