Next Article in Journal
Motion Artifacts in Dynamic EEG Recordings: Experimental Observations, Electrical Modelling, and Design Considerations
Previous Article in Journal
Machine Learning-Empowered Real-Time Acoustic Trapping: An Enabling Technique for Increasing MRI-Guided Microbubble Accumulation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

EnNet: Enhanced Interactive Information Network with Zero-Order Optimization

1
State Key Laboratory of Integrated Services Networks, Xidian University, Xi’an 710071, China
2
Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province, Xi’an 710071, China
3
School of Computer Science and Technology, Xidian University, Xi’an 710126, China
*
Authors to whom correspondence should be addressed.
Sensors 2024, 24(19), 6361; https://doi.org/10.3390/s24196361
Submission received: 29 August 2024 / Revised: 19 September 2024 / Accepted: 29 September 2024 / Published: 30 September 2024
(This article belongs to the Section Sensing and Imaging)

Abstract

Interactive image segmentation extremely accelerates the generation of high-quality annotation image datasets, which are the pillars of the applications of deep learning. However, these methods suffer from the insignificance of interaction information and excessively high optimization costs, resulting in unexpected segmentation outcomes and increased computational burden. To address these issues, this paper focuses on interactive information mining from the network architecture and optimization procedure. In terms of network architecture, the issue mentioned above arises from two perspectives: the less representative feature of interactive regions in each layer and the interactive information weakened by the network hierarchy structure. Therefore, the paper proposes a network called EnNet. The network addresses the two aforementioned issues by employing attention mechanisms to integrate user interaction information across the entire image and incorporating interaction information twice in a design that progresses from coarse to fine. In terms of optimization, this paper proposes a method of using zero-order optimization during the first four iterations of training. This approach can reduce computational overhead with only a minimal reduction in accuracy. The experimental results on GrabCut, Berkeley, DAVIS, and SBD datasets validate the effectiveness of the proposed method, with our approach achieving an average NOC@90 that surpasses RITM by 0.35.
Keywords: interactive image segmentation; self-attention mechanism; global features; fine-grained features; semi-supervised optimization interactive image segmentation; self-attention mechanism; global features; fine-grained features; semi-supervised optimization

Share and Cite

MDPI and ACS Style

Shao, Y.; Chen, Y.; Yang, P.; Cheng, F. EnNet: Enhanced Interactive Information Network with Zero-Order Optimization. Sensors 2024, 24, 6361. https://doi.org/10.3390/s24196361

AMA Style

Shao Y, Chen Y, Yang P, Cheng F. EnNet: Enhanced Interactive Information Network with Zero-Order Optimization. Sensors. 2024; 24(19):6361. https://doi.org/10.3390/s24196361

Chicago/Turabian Style

Shao, Yingzhao, Yanxin Chen, Pengfei Yang, and Fei Cheng. 2024. "EnNet: Enhanced Interactive Information Network with Zero-Order Optimization" Sensors 24, no. 19: 6361. https://doi.org/10.3390/s24196361

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop