Improving Person Re-Identification with Distance Metric and Attention Mechanism of Evaluation Features
Abstract
:1. Introduction
2. Related Work
2.1. Feature Extraction
2.2. Attention Mechanisms
2.3. Distance Metrics
3. Methods
3.1. MGN Architecture
3.2. Attention Mechanisms
Algorithm 1: Attention mechanism of MGA |
Input: , Global features with reduced dimensions: , Concatenated features of global features with reduced dimensions: Features with reduced dimensions on : Features with reduced dimensions on : Attention value of global features in :, Attention value of local features in :; when , , and when , Global features before the attention: , Local features before the attention: ; when , , and when , Output: Global features for identification after the attention: , Local features for identification after the attention: ; when , , and when , |
1: 2: 3: 4: 5: 6: |
3.3. Distance Metrics
- (1)
- Euclidean distance
- (2)
- Mahalanobis distance
- (3)
- Correlation distance
- (4)
- Cosine distance
- (5)
- Squared Euclidean distance
4. Experiments
4.1. Datasets and Protocols
4.2. Implementation Details
4.3. Comparison with State-of-the-Art Methods
4.4. Discussion
4.4.1. Experimental Results of Attention Mechanisms
4.4.2. Experimental Results for Distance Metrics
4.4.3. Visualization Experiment Results
5. Conclusions
Funding
Data Availability Statement
Conflicts of Interest
References
- Wu, L.; Shen, C.; Hengel, A. PersonNet: Person re-identification with deep convolutional neural networks. arXiv 2016, arXiv:1601.07255. [Google Scholar]
- Chen, J.; Zhang, Z.; Wang, Y. Relevance Metric Learning for Person Re-Identification by Exploiting Listwise Similarities. IEEE Trans. Image Process. 2015, 24, 4741–4755. [Google Scholar] [CrossRef]
- Ding, S.; Lin, L.; Wang, G.; Chao, H. Deep feature learning with relative distance comparison for person re-identification. Pattern Recognit. 2015, 48, 2993–3003. [Google Scholar] [CrossRef]
- Shi, X. Person Re-identification Based on Improved Residual Neural Networks. In Proceedings of the 2021 5th International Conference on Communication and Information Systems (ICCIS), Chongqing, China, 15–17 October 2021; pp. 170–174. [Google Scholar]
- Wang, J.; Zhou, S.; Wang, J.; Hou, Q. Deep ranking model by large adaptive margin learning for person re-identification. Pattern Recognit. 2018, 74, 241–252. [Google Scholar] [CrossRef]
- Fu, M.; Sun, S.; Gao, H.; Wang, D.; Tong, X.; Liu, Q. Liang Improving Person Reidentification Using a Self-Focusing Network in Internet of Things. IEEE Internet Things J. 2022, 9, 9342–9353. [Google Scholar] [CrossRef]
- Zhang, Z.; Si, T.; Liu, S. Integration convolutional neural network for person re-identification in camera networks. IEEE Access 2018, 6, 36887–36896. [Google Scholar] [CrossRef]
- Tian, H.; Hu, J. Self-Regulation Feature Network for Person Reidentification. IEEE Trans. Instrum. Meas. 2023, 72, 1–8. [Google Scholar] [CrossRef]
- Lin, L.; Wu, T.; Porway, J.; Xu, Z. A stochastic graph grammar for compositional object representation and recognition. Pattern Recognit. 2009, 42, 1297–1307. [Google Scholar] [CrossRef]
- Li, W.; Zhao, R.; Xiao, T.; Wang, X. DeepReID: Deep filter pairing neural network for person re-identification. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, 23–28 June 2014; pp. 152–159. [Google Scholar]
- Van De Weijer, J.; Schmid, C.; Verbeek, J.; Larlus, D. Learning color names for real-world applications. IEEE Trans. Image Process. 2009, 18, 1512–1523. [Google Scholar] [CrossRef] [PubMed]
- Liu, H.; Feng, J.; Qi, M.; Jiang, J.; Yan, S. End-to-End comparative attention networks for person re-identification. A Publication of the IEEE Signal Processing Society. IEEE Trans. Image Process. 2017, 26, 3492–3506. [Google Scholar] [CrossRef]
- Feng, Z.; Lai, J.; Xie, X. Learning view-specific deep networks for person re-identification. IEEE Trans. Image Process. 2018, 27, 3472–3483. [Google Scholar] [CrossRef]
- Zhou, S.; Wang, J.; Meng, D.; Xin, X.; Li, Y.; Gong, Y.; Zheng, N. Deep self-paced learning for person re-identification. Pattern Recognit. 2018, 76, 739–751. [Google Scholar] [CrossRef]
- Zheng, W.; Gong, S.; Xiang, T. Re-identification by relative distance comparison. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 35, 653–668. [Google Scholar] [CrossRef]
- Liu, X.; Song, M.; Zhao, Q.; Tao, D.; Chen, C.; Bu, J. Attribute restricted latent topic model for person re-identification. Pattern Recognit. 2012, 45, 4204–4213. [Google Scholar] [CrossRef]
- Liang, X.; Lin, L. Look into person: Joint body parsing & pose estimation network and a new benchmark. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 41, 871–885. [Google Scholar]
- Zheng, L.; Huang, Y.; Lu, H.; Yang, Y. Pose-invariant embedding for deep person re-identification. IEEE Trans. Image Process. 2019, 28, 4500–4509. [Google Scholar] [CrossRef] [PubMed]
- Wu, L.; Wang, Y.; Li, X.; Gao, J. What-and-where to match: Deep spatially multiplicative integration networks for person re-identification. Pattern Recognit. 2018, 76, 727–738. [Google Scholar] [CrossRef]
- Chen, Y.; Zhu, X.; Zheng, W.; Lai, J. Person re-identification by camera correlation aware feature augmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 40, 392–408. [Google Scholar] [CrossRef] [PubMed]
- Yao, H.; Zhang, S.; Hong, R.; Zhang, Y.; Xu, C.; Tian, Q. Deep representation learning with part loss for person re-identification. IEEE Trans. Image Process. 2019, 28, 2860–2871. [Google Scholar] [CrossRef]
- Yang, F.; Yan, K.; Lu, S.; Jia, H.; Xie, X.; Gao, W. Attention driven person re-identification. Pattern Recognit. 2019, 86, 143–155. [Google Scholar] [CrossRef]
- Xu, B.; He, L.; Liang, J.; Sun, Z. Learning Feature Recovery Transformer for Occluded Person Re-Identification. IEEE Trans. Image Process. 2022, 31, 4651–4661. [Google Scholar] [CrossRef] [PubMed]
- Zhao, R.; Oyang, W.; Wang, X. Person re-identification by saliency learning. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 356–370. [Google Scholar] [CrossRef] [PubMed]
- Ma, X.; Zhu, X.; Gong, S.; Xie, X.; Hu, J.; Lam, K.; Zhong, Y. Person re-identification by unsupervised video matching. Pattern Recognit. 2017, 65, 197–210. [Google Scholar] [CrossRef]
- Gao, Z.; Gao, L.; Zhang, H.; Cheng, Z.; Hong, R.; Chen, S. DCR: A Unified Framework for Holistic/PartialPerson ReID. IEEE Trans. Multimed. 2021, 23, 3332–3345. [Google Scholar] [CrossRef]
- Shu, X.; Wang, X.; Zang, X.; Zang, S.; Zhang, S.; Chen, Y.; Li, G.; Tian, Q. Large-Scale Spatio-Temporal Person Re-Identification: Algorithms and Benchmark. IEEE Trans. Circuits Syst. Video Technol. 2022, 32, 4390–4403. [Google Scholar] [CrossRef]
- Zhou, Q.; Zhong, B.; Liu, X.; Ji, R. Attention-Based Neural Architecture Search for Person Re-Identification. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 6627–6639. [Google Scholar] [CrossRef]
- Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M.; et al. Imagenet large scale visual recognition challenge. Int. J. Comput. Vis. 2015, 115, 211–252. [Google Scholar] [CrossRef]
- Tan, H.; Liu, X.; Yin, B.; Li, X. MHSA-Net: Multihead Self-Attention Network for Occluded Person Re-Identification. IEEE Trans. Neural Netw. Learn. Syst. 2022. online ahead of print. [Google Scholar] [CrossRef]
- Zheng, L.; Shen, L.; Tian, L.; Wang, S.; Wang, J.; Tian, Q. Scalable Person Re-identification: A Benchmark. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 18 February 2016. [Google Scholar]
- Peng, W.; Zhong, X.; Zou, C.; Zhang, J.; Ci, Q.; Zhong, L. Label-noise Robust Person Re-identification via Symmetric Learning. In Proceedings of the 7th Annual International Conference on Network and Information Systems for Computers, Guiyang, China, 23–25 July 2021. [Google Scholar]
- Wang, G.; Yuan, Y.; Chen, X.; Li, J.; Zhou, X. Learning discriminative features with multiple granularities for person re-Identification. In Proceedings of the 26th ACM International Conference on Multimedia(MM), Seoul, Republic of Korea, 22–26 October 2018; pp. 274–282. [Google Scholar]
- Su, C.; Zhang, S.; Xing, J.; Gao, W.; Tian, Q. Multi-type attributes driven multi-camera person re-identification. Pattern Recognit. 2018, 75, 77–89. [Google Scholar] [CrossRef]
- Wang, J.; Li, S. Query-driven iterated neighborhood graph search for large scale indexing. In Proceedings of the 20th ACM International Conference on Multimedia (MM), Nara, Japan, 29 October–2 November 2012; pp. 179–188. [Google Scholar]
- Wang, L.; Zhang, Y.; Feng, J. On the Euclidean distance of images. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1334–1339. [Google Scholar] [CrossRef]
- Melter, R.A. Some characterizations of city block distance. Pattern Recognit. Lett. 1987, 6, 235–240. [Google Scholar] [CrossRef]
- McLachlan, G.J. Mahalanobis distance. Resonance 1999, 4, 20–26. [Google Scholar] [CrossRef]
- Mercioni, M.A.; Holban, S. A Survey of Distance Metrics in Clustering Data Mining Techniques. In Proceedings of the 3rd International Conference on Graphics and Signal Processing (ICGSP), Hong Kong, China, 1–3 June 2019; pp. 44–47. [Google Scholar]
- Huang, Y.; Xu, J.; Wu, Q.; Zheng, Z.; Zhang, Z.; Zhang, J. Multipseudo regularized label for generated data in person re-identification. IEEE Trans. Image Process. 2018, 28, 1391–1403. [Google Scholar] [CrossRef]
- Zheng, Z.; Zheng, L.; Yang, Y. A discriminatively learned cnn embedding for person reidentification. ACM Trans. Multimed. Comput. Commun. Appl. 2017, 14, 1–20. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Ahmed, E.; Jones, M.; Marks, T.K. An improved deep learning architecture for person re-identification. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; pp. 3908–3916. [Google Scholar]
- Sun, Y.; Zheng, L.; Yang, Y.; Tian, Q.; Wang, S. Beyond part models: Person retrieval with refined part pooling. In Proceedings of the 2018 European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 480–496. [Google Scholar]
- Su, C.; Li, J.; Zhang, S.; Xing, J.; Gao, W.; Tian, Q. Pose-driven deep convolutional model for person re-identification. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 3960–3969. [Google Scholar]
- Zhou, B.; Khosla, A.; Lapedriza, A.; Oliva, A.; Torralba, A. Learning deep features for discriminative localization. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 2921–2929. [Google Scholar]
- Li, W.; Zhu, X.; Gong, S. Harmonious attention network for person re-Identification. In Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 2285–2294. [Google Scholar]
- Yang, W.; Huang, H.; Zhang, Z.; Chen, X.; Huang, K.; Zhang, S. Towards rich feature discovery with class activation maps augmentation for person re-Identification. In Proceedings of the 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 1389–1398. [Google Scholar]
- Xu, Y.; Zhao, L.; Qin, F. Dual attention-based method for occluded person re-identification. Knowl. Based Syst. 2020, 212, 106554.1–106554.12. [Google Scholar] [CrossRef]
- Chen, B.; Deng, W.; Hu, J. Mixed high-Order attention network for person re-identification. In Proceedings of the 2019 IEEE International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 371–381. [Google Scholar]
- Chen, G.; Gu, T.; Lu, J.; Bao, J.A.; Zhou, J. Person re-identification via Attention Pyramid. IEEE Trans. Image Process. 2021, 30, 7663–7676. [Google Scholar] [CrossRef] [PubMed]
- Hu, X.; Chen, Y.; Ma, X.; Liang, Y. Research on person re-Identification method based on fine-tune ResNet50. In Proceedings of the 2020 International Conference on Computer Network, Electronic and Automation (ICCNEA), Xi’an, China, 25–27 September 2020. [Google Scholar]
- Wu, W.; Yang, Z.; Tao, D.; Zhang, Q.; Cheng, J. On comparing different metric learning schemes for deep feature based person re-identification with camera adaption. In Proceedings of the 2019 IEEE International Conference on Real-time Computing and Robotics (RCAR), Irkutsk, Russia, 4–9 August 2019. [Google Scholar]
- Ma, L.; Yang, X.; Tao, D. Person re-identification over camera networks using multi-task distance metric learning. IEEE Trans. Image Process. 2014, 23, 3656–3670. [Google Scholar]
- Lu, L.; Zhou, J. A route-like demand location problem based on squared-euclidean distance. In Proceedings of the IEEE 2016 International Conference on Logistics, Informatics and Service Sciences (LISS), Sydney, Australia, 24–27 July 2016. [Google Scholar]
- Shehu, G.; Ashir, A.; Eleyan, A. Character recognition using correlation & hamming distance. In Proceedings of the 2015 Signal Processing and Communications Applications Conference, Malatya, Turkey, 22 June 2015. [Google Scholar]
Symbol | Definition |
---|---|
Branches of the MGN | |
Number of stripes of the branches ; . | |
Features obtained via global max pooling for each of the three MGN branch features | |
, , ; , | of the MGN |
continue to be extracted to obtain three 256-dimensional global features | |
, , ; , | continue to be extracted to obtain five 256-dimensional local features |
1 × 1 convolution | |
BN | batch normalization |
ReLU | ReLU activation function |
Sigmoid | Sigmoid activation function |
continued feature extraction | |
are concatenated into one 768-dimensional global feature | |
continued feature extraction | |
continued feature extraction | |
dimension, the attention values of global features | |
, , ; , | dimension, the attention values of the local feature |
Global attention features | |
, , ; , | Local attention features |
n-dimensional feature vector for distance metrics | |
n-dimensional feature vector for distance metrics | |
Distance between vectors and | |
Covariance matrix of vectors and | |
Correlation coefficient of vectors and | |
Variance of vector | |
Variance of vector | |
Covariance of vectors and | |
Cosine similarity of vectors and | |
Angle between vectors and | |
Dot product of vectors and | |
Length of vector | |
Length of vector | |
MGAE | Euclidean distance used in MGA |
MGAM | Mahalanobis distance used in MGA |
MGACO | Correlation distance used in MGA |
MGAC | Cosine distance used in MGA |
MGAS | Squared Euclidean distance used in MGA |
Method | Top-1 (%) | mAP (%) | |
---|---|---|---|
DaRe | 86.4 | 69.3 | |
AOS | 86.5 | 70.4 | |
OSNet | 94.8 | 84.9 | |
G | DG-Net | 94.8 | 86.0 |
DML | 87.7 | 68.8 | |
PAN | 82.8 | 63.4 | |
SVDNet | 82.3 | 62.1 | |
SL-ReID | 84.5 | 65.4 | |
SOMAnet | 73.9 | 47.9 | |
PCB + RPP | 93.8 | 81.6 | |
MGN | 95.7 | 86.9 | |
MSCAN | 80.3 | 57.5 | |
MR | HPM | 94.2 | 82.7 |
MultiScale | 88.9 | 73.1 | |
PAR | 81.0 | 63.4 | |
DCR | 93.8 | 84.7 | |
MultiRegion | 66.4 | 41.2 | |
PABR | 91.7 | 79.6 | |
PGFA | 91.2 | 76.8 | |
PDC | 84.4 | 63.4 | |
AG | AACN | 85.9 | 66.9 |
PN-GAN | 89.4 | 72.6 | |
MGCAM | 83.8 | 74.3 | |
LaST_Cloth | 93.1 | 81.7 | |
SPReID | 92.5 | 81.3 | |
GLAD | 89.9 | 73.9 | |
HA-CNN | 91.2 | 75.7 | |
Mancs | 93.1 | 82.3 | |
CASN | 94.4 | 82.8 | |
A | IANet | 94.4 | 83.1 |
DuATM | 91.4 | 76.6 | |
CAMA | 94.7 | 84.5 | |
MHN-6 | 95.1 | 85.0 | |
reID-NAS+ | 95.1 | 85.7 | |
SFNet | 95.3 | 87.7 | |
MHSA-Net | 94.6 | 84.0 | |
MGAC | 95.1 | 88.2 |
Method | Top-1 (%) | mAP (%) |
---|---|---|
TriNet | 86.7 | 81.1 |
AOS | 88.7 | 83.3 |
AACN | 88.7 | 83.0 |
PSE + ECN | 90.3 | 84.0 |
LuNet | 84.59 | 75.62 |
GP-reid | 92.2 | 90.0 |
CamStyle | 89.5 | 71.5 |
MHSA-Net | 95.5 | 93.0 |
DaRe | 88.3 | 82.0 |
MGN | 96.6 | 94.2 |
MGAC | 96.2 | 94.9 |
Method | Top-1 (%) | mAP (%) |
---|---|---|
MGN | 95.7 | 86.9 |
MGA | 95.0 | 87.6 |
Method | Top-1 (%) | mAP (%) |
---|---|---|
MGN | 96.6 | 94.2 |
MGA | 96.0 | 94.8 |
Method | Top-1 (%) | mAP (%) |
---|---|---|
MGAE (Euclidean) | 96.0 | 94.8 |
MGAM (Mahalanobis) | 95.9 | 94.2 |
MGACO (correlation) | 95.9 | 94.7 |
MGAC (cosine) | 96.2 | 94.9 |
MGAS (squared Euclidean) | 96.4 | 94.7 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhou, J. Improving Person Re-Identification with Distance Metric and Attention Mechanism of Evaluation Features. Electronics 2023, 12, 4298. https://doi.org/10.3390/electronics12204298
Zhou J. Improving Person Re-Identification with Distance Metric and Attention Mechanism of Evaluation Features. Electronics. 2023; 12(20):4298. https://doi.org/10.3390/electronics12204298
Chicago/Turabian StyleZhou, Jieqian. 2023. "Improving Person Re-Identification with Distance Metric and Attention Mechanism of Evaluation Features" Electronics 12, no. 20: 4298. https://doi.org/10.3390/electronics12204298
APA StyleZhou, J. (2023). Improving Person Re-Identification with Distance Metric and Attention Mechanism of Evaluation Features. Electronics, 12(20), 4298. https://doi.org/10.3390/electronics12204298