Next Article in Journal
Robust Reversible Watermarking Scheme in Video Compression Domain Based on Multi-Layer Embedding
Next Article in Special Issue
Deep Learning-Based Stopped Vehicle Detection Method Utilizing In-Vehicle Dashcams
Previous Article in Journal
A Secure and Efficient Authentication Scheme for Large-Scale IoT Devices Based on Zero-Knowledge Proof
Previous Article in Special Issue
Ancient Painting Inpainting Based on Multi-Layer Feature Enhancement and Frequency Perception
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

FusionNetV2: Explicit Enhancement of Edge Features for 6D Object Pose Estimation

1
Department of Artificial Intelligence Convergence, Graduate School, Pukyong National University, 45 Yongso-ro, Nam-gu, Busan 48513, Republic of Korea
2
Division of Electronics and Communications Engineering, Pukyong National University, 45 Yongso-ro, Nam-gu, Busan 48513, Republic of Korea
*
Author to whom correspondence should be addressed.
Electronics 2024, 13(18), 3736; https://doi.org/10.3390/electronics13183736
Submission received: 22 August 2024 / Revised: 9 September 2024 / Accepted: 19 September 2024 / Published: 20 September 2024
(This article belongs to the Special Issue AI Synergy: Vision, Language, and Modality)

Abstract

FusionNet is a hybrid model that incorporates convolutional neural networks and Transformers, achieving state-of-the-art performance in 6D object pose estimation while significantly reducing the number of model parameters. Our study reveals that FusionNet has local and global attention mechanisms for enhancing deep features in two paths and the attention mechanisms play a role in implicitly enhancing features around object edges. We found that enhancing the features around object edges was the main reason for the performance improvement in 6D object pose estimation. Therefore, in this study, we attempt to enhance the features around object edges explicitly and intuitively. To this end, an edge boosting block (EBB) is introduced that replaces the attention blocks responsible for local attention in FusionNet. EBB is lightweight and can be directly applied to FusionNet with minimal modifications. EBB significantly improved the performance of FusionNet in 6D object pose estimation in experiments on the LINEMOD dataset.
Keywords: object pose estimation; convolutional neural network; Transformer; hybrid model; edge boosting object pose estimation; convolutional neural network; Transformer; hybrid model; edge boosting

Share and Cite

MDPI and ACS Style

Ye, Y.; Park, H. FusionNetV2: Explicit Enhancement of Edge Features for 6D Object Pose Estimation. Electronics 2024, 13, 3736. https://doi.org/10.3390/electronics13183736

AMA Style

Ye Y, Park H. FusionNetV2: Explicit Enhancement of Edge Features for 6D Object Pose Estimation. Electronics. 2024; 13(18):3736. https://doi.org/10.3390/electronics13183736

Chicago/Turabian Style

Ye, Yuning, and Hanhoon Park. 2024. "FusionNetV2: Explicit Enhancement of Edge Features for 6D Object Pose Estimation" Electronics 13, no. 18: 3736. https://doi.org/10.3390/electronics13183736

APA Style

Ye, Y., & Park, H. (2024). FusionNetV2: Explicit Enhancement of Edge Features for 6D Object Pose Estimation. Electronics, 13(18), 3736. https://doi.org/10.3390/electronics13183736

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop