Pedestrian Re-Identification Based on Weakly Supervised Multi-Feature Fusion
Abstract
:1. Introduction
2. Related Works
2.1. Pedestrian Re-ID Research
2.2. Strongly Supervised Pedestrian Re-Identification Research
2.2.1. Traditional Methods Based on Feature Expression and Metric Learning
2.2.2. Pedestrian Re-Identification Based on Deep Learning
2.3. Weakly Supervised Pedestrian Re-Identification Research
2.3.1. Semi-Supervised Re-ID
2.3.2. Unsupervised Re-ID
2.4. The Problems That Need to Be Further Addressed in Pedestrian Re-Identification
3. Our Approach
3.1. Weakly Supervised Multi-Feature Fusion Pedestrian Re-Recognition Algorithm Network Architecture
3.2. Image Pseudo-Label Loss Function
3.3. Multi-Feature Fusion Algorithm
3.4. Weak-Supervised Triplet Loss
4. Experiment
4.1. Experimental Setup and Environment
4.2. Experimental Dataset
4.3. Experimental Results
4.4. Ablation Experiment
4.5. Comparison with Existing Technology
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Zhang, J.; Jiang, F. Multi-level supervised network for person re-identification. In Proceedings of the 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 12–17 May 2019; pp. 2072–2076. [Google Scholar]
- Li, W.; Zhu, X.; Gong, S. Harmonious attention network for person re-identification. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 2285–2294. [Google Scholar]
- Yuan, Y.; Chen, W.; Yang, Y.; Wang, Z. In defense of the triplet loss again: Learning robust person re-identification with fast approximated triplet loss and label distillation. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA, 14–19 June 2020; pp. 1454–1463. [Google Scholar]
- Wang, X. Intelligent multi-camera video surveillance: A review. Pattern Recognit. Lett. 2013, 34, 3–19. [Google Scholar] [CrossRef]
- Huang, T.; Russell, S. Object identification in a Bayesian context. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, San Diego, CA, USA, 23–29 August 1997; Morgan Kaufmann Publishers Inc.: San Francisco, CA, USA, 1997; pp. 1276–1282. [Google Scholar]
- Javed, O.; Shafique, K.; Shah, M. Appearance modeling for tracking in multiple non-overlapping cameras. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA, 20–25 June 2005; pp. 26–33. [Google Scholar]
- Chen, K.W.; Lai, C.C.; Lee, P.J.; Chen, C.S.; Hung, Y.P. Adaptive learning for target tracking and true linking discovering across multiple non-overlapping cameras. IEEE Trans. Multimed. 2011, 13, 625–638. [Google Scholar] [CrossRef]
- Zajdel, W.; Zivkovic, Z.; Krose, B.J.A. Keeping track of humans: Have I seen this person before? In Proceedings of the 2005 IEEE International Conference on Robotics and Automation, Barcelona, Spain, 18–22 April 2005; pp. 2081–2086. [Google Scholar]
- Gheissari, N.; Sebastian, T.B.; Hartley, R. Person reidentification using spatiotemporal appearance. In Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, USA, 17–22 June 2006; pp. 1528–1535. [Google Scholar]
- Dikmen, M.; Akbas, E.; Huang, T.S.; Ahuja, N. Pedestrian recognition with a learned metric. In Proceedings of the Computer Vision-ACCV 2010, Queenstown, New Zealand, 8–12 November 2010; Springer: Heidelberg, Germany, 2011; pp. 501–512. [Google Scholar]
- Zheng, W.S.; Gong, S.; Xiang, T. Person re-identification by probabilistic relative distance comparison. In Proceedings of the CVPR 2011, Colorado Springs, CO, USA, 20–25 June 2011; pp. 649–656. [Google Scholar]
- Wang, Y.; Hu, R.; Liang, C.; Zhang, C.; Leng, Q. Camera compensation using feature projection matrix for person re-identification. In Proceedings of the 2013 IEEE International Conference on Multimedia and Expo (ICME), San Jose, CA, USA, 15–19 July 2013; pp. 1–6. [Google Scholar]
- Wang, Z.; Hu, R.; Liang, C.; Yu, Y.; Jiang, J.; Ye, M.; Leng, Q. Zero-Shot person re-identification via Cross-View consistency. IEEE Trans. Multimed. 2016, 18, 260–272. [Google Scholar] [CrossRef]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
- Yi, D.; Lei, Z.; Liao, S.; Li, S.Z. Deep metric learning for person re-identification. In Proceedings of the 2014 22nd International Conference on Pattern Recognition, Stockholm, Sweden, 24–28 August 2014; pp. 34–39. [Google Scholar]
- Li, W.; Zhao, R.; Xiao, T.; Malik, K. Deep ReID: Deep filter pairing neural network for person re-identification. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 152–159. [Google Scholar]
- Zheng, L.; Shen, L.; Tian, L.; Wang, S.; Wang, J.; Tian, Q. Scalable person pe-identification: A benchmark. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 1116–1124. [Google Scholar]
- Wang, W.C. Research on Pedestrian Re-identification Algorithm Based on Weak Supervision. Master’s Thesis, University of Electronic Science and Technology, Hangzhou, China, 2023. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Bak, S.; Carr, P. One-Shot metric learning for person re-identification. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 1571–1580. [Google Scholar]
- Chen, D.; Li, H.; Xiao, T.; Yi, S.; Wang, X. Video person re-identification with competitive snippet-similarity aggregation and co-attentive snippet embedding. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 1169–1178. [Google Scholar]
- He, K.; Fan, H.; Wu, Y.; Xie, S.; Girshick, R. Momentum contrast for unsupervised visual representation learning. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 9726–9735. [Google Scholar]
- Xia, D.; Guo, F.; Liu, H.; Xia, Y. A review on the progress of open pedestrian re-identification. Data Acquis. Process. 2021, 36, 449–467. [Google Scholar]
- Wang, S.; Xiao, S. A Review of Pedestrian Re-identification Research. J. Beijing Inst. Technol. 2022, 48, 1100–1112. [Google Scholar]
- Zhao, R.; Ouyang, W.; Wang, X. Unsupervised salience learning for person re-identification. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 23–28 June 2013; pp. 3586–3593. [Google Scholar]
- Li, W.; Wang, X. Locally aligned feature transforms across views. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 23–28 June 2013; pp. 3594–3601. [Google Scholar]
- Kviatkovsky, I.; Adam, A.; Rivlin, E. Color invariants for person reidentification. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 1622–1634. [Google Scholar] [CrossRef]
- Zhao, R.; Ouyang, W.; Wang, X. Person re-identification by salience matching. In Proceedings of the 2013 IEEE International Conference on Computer Vision, Sydney, Australia, 1–8 December 2013; pp. 2528–2535. [Google Scholar]
- Farenzena, M.; Bazzani, L.; Perina, A.; Murino, V.; Cristani, M. Person re-identification by symmetry-driven accumulation of local features. In Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 13–18 June 2010; pp. 2360–2367. [Google Scholar]
- Gray, D.; Tao, H. Viewpoint invariant pedestrian recognition with an ensemble of localized features. In Proceedings of the Computer Vision-ECCV 2008, Marseille, France, 12–18 October 2008; Springer: Heidelberg, Germany, 2008; pp. 262–275. [Google Scholar]
- Zheng, W.S.; Gong, S.; Xiang, T. Reidentification by relative distance comparison. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 653–668. [Google Scholar] [CrossRef]
- Köstinger, M.; Hirzer, M.; Wohlhart, P.; Roth, P.M.; Bischof, H. Large scale metric learning from equivalence constraints. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 2288–2295. [Google Scholar]
- Varior, R.R.; Shuai, B.; Lu, J.; Xu, D.; Wang, G. A siamese long short-term memory architecture for human re-identification. In Proceedings of the Computer Vision-ECCV 2016, Amsterdam, The Netherlands, 11–14 October 2016; Springer: Cham, Switzerland, 2016; pp. 135–153. [Google Scholar]
- Sun, Y.; Zheng, L.; Yang, Y.; Tian, Q.; Wang, S. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the Computer Vision-ECCV 2018, Munich, Germany, 8–14 September 2018; Springer: Cham, Switzerland, 2018; pp. 501–508. [Google Scholar]
- Wang, G.; Yuan, Y.; Chen, X.; Li, J.; Zhou, X. Learning discriminative features with multiple granularities for person re-identification. In Proceedings of the 26th ACM International Conference on Multimedia, Seoul, Republic of Korea, 22–26 October 2018; Association for Computing Machinery: New York, NY, USA, 2018; pp. 274–282. [Google Scholar]
- Zhang, Z.; Lan, C.; Zeng, W.; Jin, X.; Chen, Z. Relation-aware global attention for person re-identification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 14–19 June 2020; pp. 3183–3192. [Google Scholar]
- Gu, J.; Wang, K.; Luo, H.; Chen, C.; Jiang, W.; Fang, Y.; Zhao, J. Msinet: Twins contrastive search of multi-scale interaction for object reid. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 19243–19253. [Google Scholar]
- Zhang, M.; Yu, Z.; Han, Y.; Li, T. A review of pedestrian re-identification for complex scenes. Comput. Sci. 2022, 49, 138–150. [Google Scholar]
- Qi, L.; Yu, P.; Gao, Y. A review of pedestrian re-identification research in weakly supervised scenarios. J. Softw. 2020, 31, 2883–2902. [Google Scholar]
- Agrawala, A. Learning with a probabilistic teacher. IEEE Trans. Inf. Theory 1970, 16, 373–379. [Google Scholar] [CrossRef]
- Wu, Y.; Lin, Y.; Dong, X.; Yan, Y.; Ouyang, W.; Yang, Y. Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 5177–5186. [Google Scholar]
- Xin, X.; Wang, J.; Xie, R.; Zhou, S.; Huang, W.; Zheng, N. Semi-supervised person re-identification using multi-view clustering. Pattern Recognit. 2019, 88, 285–297. [Google Scholar] [CrossRef]
- Yu, H.X.; Zheng, W.S.; Wu, A.; Guo, X.; Gong, S.; Lai, J.H. Unsupervised person re-identification by soft multilabel learning. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 16–20 June 2019; pp. 2143–2152. [Google Scholar]
- Yang, Q.; Yu, H.X.; Wu, A.; Zheng, W.S. Patch-based discriminative feature learning for unsupervised person re-identification. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 16–20 June 2019; pp. 3628–3637. [Google Scholar]
- Chen, L.; Ye, F.; Huang, T.; Huang, L.; Weng, B.; Xu, C.; Hu, J. An unsupervised pedestrian re-identification method based on inter-domain merging in camera domain. Comput. Res. Dev. 2023, 60, 415–425. [Google Scholar]
- Wang, J.; Zhu, X.; Gong, S.; Li, W. Transferable joint attribute-identity deep learning for unsupervised person re-identification. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 2275–2284. [Google Scholar]
- Wang, G.; Wang, G.; Zhang, X.; Lai, J.; Yu, Z.; Lin, L. Weakly-supervised person Re-ID: Differentiable graphical learning and a new benchmark. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 2142–2156. [Google Scholar] [CrossRef] [PubMed]
- Yang, J.; Zhang, C.; Li, Z.; Tang, Y.; Wang, Z. Discriminative feature mining with relation regularization for person re-identification. Inf. Process. Management. 2023, 60, 103295. [Google Scholar] [CrossRef]
- Wang, Z.; Feng, J.; Zhang, Y. Pedestrian detection in infrared image based on depth transfer learning. Multimed. Tools Appl. 2022, 81, 39655–39674. [Google Scholar] [CrossRef]
- Zhou, R.; Chang, X.; Shi, L.; Shen, Y.D.; Yang, Y.; Nie, F. Person reidentification via Multi-Feature fusion with adaptive graph learning. IEEE Trans. Neural Netw. Learn. Syst. 2020, 31, 1592–1601. [Google Scholar] [CrossRef]
- Weinberger, K.Q.; Saul, L.K. Distance metric learning for large margin nearest neighbor classification. J. Mach. Learn. Res. 2009, 10, 207–244. [Google Scholar]
- Sun, Y.; Zheng, L.; Deng, W.; Wang, S. SVDNet for Pedestrian Retrieval. In Proceedings of the IEEE international Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 3800–3808. [Google Scholar]
- Cui, X.; Liang, Y.; Zhang, W. Dual-Branch Person Re-Identification Algorithm Based on Multi-Feature Representation. Electronics 2023, 12, 1869. [Google Scholar] [CrossRef]
- Li, M.; Yuan, L.; Wen, X.; Wang, J.; Xie, G.; Jia, Y. Multi-Scale Attention Network Based on Multi-Feature Fusion for Person Re-Identification. In Proceedings of the 2021 International Joint Conference on Neural Networks (IJCNN), Shenzhen, China, 18–22 July 2021; pp. 1–8. [Google Scholar] [CrossRef]
- Yu, H.X.; Wu, A.; Zheng, W.S. Cross-view asymmetric metric learning for unsupervised person re-identification. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 994–1002. [Google Scholar]
- Xiao, T.; Li, S.; Wang, B.; Lin, L.; Wang, X. Joint Detection and Identification Feature Learning for Person Search. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 3376–3385. [Google Scholar] [CrossRef]
- Chen, Y.; Zhu, X.; Gong, S. Deep association learning for unsupervised video person re-identification. arXiv 2018, arXiv:1808.07301. [Google Scholar]
- Ding, G.; Khan, S.H.; Tang, Z.; Zhang, J.; Porikli, F.M. Towards better Validity: Dispersion based Clustering for Unsupervised Person Re-identification. arXiv 2019, arXiv:1906.01308. [Google Scholar]
- Li, M.; Gong, S. Unsupervised tracklet person re-identification. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 1770–1782. [Google Scholar] [CrossRef] [PubMed]
- Bilen, H.; Vedaldi, A. Weakly supervised deep detection networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, Nevada, USA, 27–30 June 2016; pp. 2846–2854. [Google Scholar]
- Dong, Q.; Zhu, X.; Gong, S. Single-label multi-class image classification by deep logistic regression. Proc. AAAI Conf. Artif. Intell. 2019, 33, 3486–3493. [Google Scholar] [CrossRef]
Configuration | Parameter |
---|---|
CPU | 13th Gen Intel(R) Core(TM) i5-13600 @ 3.50 GHz |
GPU | NVIDIA GeForce RTX 4090 |
Python | 3.9 |
Pytorch | 1.12.0 |
CUDA | 11.4 |
cuDNN | 8.0.2 |
Module | Whether to Use | Markt-1501 | CUHK03 | ||
---|---|---|---|---|---|
mAP | Rank-1 | mAP | Rank-1 | ||
Probability map and multi-feature fusion module | - | 57.4 | 81.9 | 38.5 | 56.4 |
Probability Map Module | √ | 61.1 | 87.2 | 43.3 | 62.3 |
multi-feature fusion module | √ | 63.5 | 89.6 | 45.8 | 67.5 |
Image module + feature fusion module | √ | 65.4 | 91.1 | 49.8 | 71.2 |
Category | CUHK03 | ||
---|---|---|---|
Rank-1 | Rank-5 | Rank-10 | |
36 | 12.6 | 35.6 | 59.6 |
227 | 34.3 | 50.3 | 64.3 |
467 | 52.6 | 64.5 | 75.0 |
767 | 71.2 | 80.3 | 88.0 |
ID/Wrap | CUHK03 | PRID2011 | ||
---|---|---|---|---|
mAP | Rank-1 | mAP | Rank-1 | |
1 (Full supervision) | 68.1 | 71.5 | 56.3 | 68 |
2 | 61.4 | 68 | 52.6 | 63 |
3 | 55.7 | 59.4 | 48.2 | 60.4 |
10 | 46.5 | 48.2 | 39.9 | 54.7 |
Stochastic Averaging (5) | 62.3 | 69.6 | 53 | 65.4 |
Method | Way | Market-1501 | DuKeMTMC | CUHK03 | Effectiveness | |||
---|---|---|---|---|---|---|---|---|
mAP | Rank-1 | mAP | Rank-1 | mAP | Rank-1 | |||
PCB | Strong supervision | 81.6 | 93.8 | 69.2 | 83.3 | 57.5 | 63.7 | More Complicated |
PCB + RPP [34] | Strong supervision | 80.9 | 93.3 | 68.1 | 82.9 | - | - | More Complicated |
MGN | Strong supervision | 86.9 | 95.7 | 78.4 | 88.7 | 68.0 | 67.4 | Better, more complicated |
SVDNet [52] | Strong supervision | 62.1 | 82.3 | 56.8 | 76.7 | - | - | More Complicated |
MSMG-Net [53] | Strong supervision | 88.6 | 96.3 | 80.1 | 89.9 | - | - | Multi-scale and multi-granularity supervision, difficult to deploy |
MSAN [54] | Strong supervision | 88.6 | 96.1 | - | - | 68.6 | 72.2 | More Complicated |
CAMEL | Unsupervised | 26.3 | 54.5 | - | - | 31.9 | 39.4 | Worse |
PAUL | Unsupervised | 40.1 | 68.5 | 53.2 | 72.0 | - | - | Worse |
OIMI [56] | Unsupervised | 13.5 | 33.7 | 43.8 | 51.1 | - | - | Poor |
DAL [57] | Unsupervised | 23.0 | 49.3 | - | - | - | - | Poor |
DBC [58] | Unsupervised | 43.8 | 64.3 | 66.1 | 75.2 | - | - | Worse |
UTAL [59] | Unsupervised | 35.2 | 49.9 | 36.6 | 48.3 | - | - | Worse |
CV-MIML | Weakly supervised | - | - | 59.53 | 78.5 | - | - | Better |
WSDDN [60] | Weakly supervised | 47.1 | 59.2 | 60.7 | 65.4 | - | - | Better |
HSLR [61] | Weakly supervised | 35.8 | 56.4 | 54.7 | 61.7 | - | - | Worse |
SSLR [61] | Weakly supervised | 31.2 | 51.9 | 50.0 | 56.3 | - | - | Worse |
OURS | Weakly supervised | 65.4 | 86.6 | 59.6 | 79.3 | 49.8 | 61.2 | Better, lower cost, easier to deploy |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Qin, C.; Wang, Z.; Zhang, L.; Peng, Q.; Lin, G.; Lu, G. Pedestrian Re-Identification Based on Weakly Supervised Multi-Feature Fusion. Algorithms 2024, 17, 426. https://doi.org/10.3390/a17100426
Qin C, Wang Z, Zhang L, Peng Q, Lin G, Lu G. Pedestrian Re-Identification Based on Weakly Supervised Multi-Feature Fusion. Algorithms. 2024; 17(10):426. https://doi.org/10.3390/a17100426
Chicago/Turabian StyleQin, Changming, Zhiwen Wang, Linghui Zhang, Qichang Peng, Guixing Lin, and Guanlin Lu. 2024. "Pedestrian Re-Identification Based on Weakly Supervised Multi-Feature Fusion" Algorithms 17, no. 10: 426. https://doi.org/10.3390/a17100426
APA StyleQin, C., Wang, Z., Zhang, L., Peng, Q., Lin, G., & Lu, G. (2024). Pedestrian Re-Identification Based on Weakly Supervised Multi-Feature Fusion. Algorithms, 17(10), 426. https://doi.org/10.3390/a17100426