Next Article in Journal
Improving Churn Detection in the Banking Sector: A Machine Learning Approach with Probability Calibration Techniques
Previous Article in Journal
Hybrid Control Strategy for LLC Converter Based on Improved Fruit Fly Optimization Algorithm
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Text-Guided Unknown Pseudo-Labeling for Open-World Object Detection

School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
*
Author to whom correspondence should be addressed.
Electronics 2024, 13(22), 4528; https://doi.org/10.3390/electronics13224528
Submission received: 26 October 2024 / Revised: 16 November 2024 / Accepted: 16 November 2024 / Published: 18 November 2024

Abstract

Open-world object detection (OWOD) focuses on training models with partially known class labels, enabling the detection of objects from known classes while concurrently identifying objects from unknown classes. Current models often perform suboptimally in generating pseudo-labels for unknown objects based on objectness scores due to inherent biases towards known classes. To address this issue, we propose a cross-modal learning model named Text-Guided Unknown Pseudo-Labeling for Open-world Object Detection(TGOOD) building on the Featurized Query R-CNN (FQR-CNN) framework. Specifically, we introduce a module called Similarity-Random-Similarity (SRS) to guide the model in detecting unknown objects during training. Additionally, we replace the one-to-one label assignment strategy in FQR-CNN with a one-to-many (OTM) label assignment strategy to provide more supervisory information during training. Moreover, we propose the ROI features Refinement Module (RRM) to enhance the discriminability of all objects. Experimental evaluations on the PASCAL VOC, MS-COCO, and COCO-O benchmarks demonstrate TGOOD’s superior open-world detection capability.
Keywords: open-world object detection; cross-modal learning; pseudo-labeling open-world object detection; cross-modal learning; pseudo-labeling

Share and Cite

MDPI and ACS Style

Wang, X.; Xu, D. Text-Guided Unknown Pseudo-Labeling for Open-World Object Detection. Electronics 2024, 13, 4528. https://doi.org/10.3390/electronics13224528

AMA Style

Wang X, Xu D. Text-Guided Unknown Pseudo-Labeling for Open-World Object Detection. Electronics. 2024; 13(22):4528. https://doi.org/10.3390/electronics13224528

Chicago/Turabian Style

Wang, Xuefei, and Dong Xu. 2024. "Text-Guided Unknown Pseudo-Labeling for Open-World Object Detection" Electronics 13, no. 22: 4528. https://doi.org/10.3390/electronics13224528

APA Style

Wang, X., & Xu, D. (2024). Text-Guided Unknown Pseudo-Labeling for Open-World Object Detection. Electronics, 13(22), 4528. https://doi.org/10.3390/electronics13224528

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop