Next Article in Journal
Radiation Detectors and Sensors in Medical Imaging
Previous Article in Journal
Θ-Net: A Deep Neural Network Architecture for the Resolution Enhancement of Phase-Modulated Optical Micrographs In Silico
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

KSL-POSE: A Real-Time 2D Human Pose Estimation Method Based on Modified YOLOv8-Pose Framework

School of Computer, Jiangsu University of Science and Technology, Zhenjiang, 212100, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Sensors 2024, 24(19), 6249; https://doi.org/10.3390/s24196249
Submission received: 15 August 2024 / Revised: 23 September 2024 / Accepted: 25 September 2024 / Published: 26 September 2024
(This article belongs to the Section Intelligent Sensors)

Abstract

Two-dimensional human pose estimation aims to equip computers with the ability to accurately recognize human keypoints and comprehend their spatial contexts within media content. However, the accuracy of real-time human pose estimation diminishes when processing images with occluded body parts or overlapped individuals. To address these issues, we propose a method based on the YOLO framework. We integrate the convolutional concepts of Kolmogorov–Arnold Networks (KANs) through introducing non-linear activation functions to enhance the feature extraction capabilities of the convolutional kernels. Moreover, to improve the detection of small target keypoints, we integrate the cross-stage partial (CSP) approach and utilize the small object enhance pyramid (SOEP) module for feature integration. We also innovatively incorporate a layered shared convolution with batch normalization detection head (LSCB), consisting of multiple shared convolutional layers and batch normalization layers, to enable cross-stage feature fusion and address the low utilization of model parameters. Given the structure and purpose of the proposed model, we name it KSL-POSE. Compared to the baseline model YOLOv8l-POSE, KSL-POSE achieves significant improvements, increasing the average detection accuracy by 1.5% on the public MS COCO 2017 data set. Furthermore, the model also demonstrates competitive performance on the CrowdPOSE data set, thus validating its generalization ability.
Keywords: human pose estimation; Kolmogorov–Arnold networks; YOLOv8-pose; multi-scale feature fusion human pose estimation; Kolmogorov–Arnold networks; YOLOv8-pose; multi-scale feature fusion

Share and Cite

MDPI and ACS Style

Lu, T.; Cheng, K.; Hua, X.; Qin, S. KSL-POSE: A Real-Time 2D Human Pose Estimation Method Based on Modified YOLOv8-Pose Framework. Sensors 2024, 24, 6249. https://doi.org/10.3390/s24196249

AMA Style

Lu T, Cheng K, Hua X, Qin S. KSL-POSE: A Real-Time 2D Human Pose Estimation Method Based on Modified YOLOv8-Pose Framework. Sensors. 2024; 24(19):6249. https://doi.org/10.3390/s24196249

Chicago/Turabian Style

Lu, Tianyi, Ke Cheng, Xuecheng Hua, and Suning Qin. 2024. "KSL-POSE: A Real-Time 2D Human Pose Estimation Method Based on Modified YOLOv8-Pose Framework" Sensors 24, no. 19: 6249. https://doi.org/10.3390/s24196249

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop