Next Article in Journal
Drilling Optimization by Means of Decision Matrices and VB Tool Applied to Torsional Vibration and MSE Surveillance
Previous Article in Journal
Power Grid Load Forecasting Using a CNN-LSTM Network Based on a Multi-Modal Attention Mechanism
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Local Diversity-Guided Weakly Supervised Fine-Grained Image Classification Method

by
Yuebo Meng
1,2,*,
Xianglong Luo
1,2,
Hua Zhan
1,2,
Bo Wang
1,2,
Shilong Su
1,2 and
Guanghui Liu
1,2
1
College of Information and Control Engineering, Xi’an University of Architecture and Technology, Yanta, Xi’an 710055, China
2
Xi’an Key Laboratory of Intelligent Technology for Building Manufacturing, Xi’an 710311, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(5), 2437; https://doi.org/10.3390/app15052437
Submission received: 7 November 2024 / Revised: 24 January 2025 / Accepted: 26 January 2025 / Published: 25 February 2025

Abstract

For fine-grained recognition, capturing distinguishable features and effectively utilizing local information play a key role, since the objects of recognition exhibit subtle differences in different subcategories. Finding subtle differences between subclasses is not straightforward. To address this problem, we propose a weakly supervised fine-grained classification network model with Local Diversity Guidance (LDGNet). We designed a Multi-Attention Semantic Fusion Module (MASF) to build multi-layer attention maps and channel–spatial interaction, which can effectively enhance the semantic representation of the attention maps. We also introduce a random selection strategy (RSS) that forces the network to learn more comprehensive and detailed information and more local features from the attention map by designing three feature extraction operations. Finally, both the attention map obtained by RSS and the feature map are employed for prediction through a fully connected layer. At the same time, a dataset of ancient towers is established, and our method is applied to ancient building recognition for practical applications of fine-grained image classification tasks in natural scenes. Extensive experiments conducted on four fine-grained datasets and explainable visualization demonstrate that the LDGNet can effectively enhance discriminative region localization and detailed feature acquisition for fine-grained objects, achieving competitive performance over other state-of-the-art algorithms.
Keywords: fine-grained classification; random selection strategy; discriminative features fine-grained classification; random selection strategy; discriminative features

Share and Cite

MDPI and ACS Style

Meng, Y.; Luo, X.; Zhan, H.; Wang, B.; Su, S.; Liu, G. Local Diversity-Guided Weakly Supervised Fine-Grained Image Classification Method. Appl. Sci. 2025, 15, 2437. https://doi.org/10.3390/app15052437

AMA Style

Meng Y, Luo X, Zhan H, Wang B, Su S, Liu G. Local Diversity-Guided Weakly Supervised Fine-Grained Image Classification Method. Applied Sciences. 2025; 15(5):2437. https://doi.org/10.3390/app15052437

Chicago/Turabian Style

Meng, Yuebo, Xianglong Luo, Hua Zhan, Bo Wang, Shilong Su, and Guanghui Liu. 2025. "Local Diversity-Guided Weakly Supervised Fine-Grained Image Classification Method" Applied Sciences 15, no. 5: 2437. https://doi.org/10.3390/app15052437

APA Style

Meng, Y., Luo, X., Zhan, H., Wang, B., Su, S., & Liu, G. (2025). Local Diversity-Guided Weakly Supervised Fine-Grained Image Classification Method. Applied Sciences, 15(5), 2437. https://doi.org/10.3390/app15052437

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop