Development of Marker-Based Motion Capture Using RGB Cameras: A Neural Network Approach for Spherical Marker Detection
Abstract
1. Introduction
- Development of a neural network model to infer digitized coordinates of spherical markers from RGB images.
- Validation of the 3D reconstruction accuracy of markers based on the inferred digitized coordinates.
- Demonstration that the proposed method enables infrared motion capture (IR MoCap) level accuracy using only RGB cameras, simplifying 3D motion data acquisition.
- Section 2 describes the experimental procedures, the fine-tuning process, and the accuracy evaluation.
- Section 3 presents the results of the accuracy evaluation and robustness evaluation.
- Section 4 mainly discusses the accuracy of the proposed method and provides a comparison with IR MoCap and Markerless MoCap.
- Section 5 presents the conclusions of this study.
2. Methods
2.1. Data Collection
2.2. Development of Neural Network Model for Detecting Spherical Markers
2.2.1. Generation of Virtual Spheres
2.2.2. Generation of Virtual Tape
2.2.3. Combination of Virtual Sphere and Virtual Tape Images
2.2.4. Insertion of Virtual Markers into Non-Marker Trial Images and Blurring Process
2.2.5. Fine-Tuning
2.2.6. Detection of Spherical Markers
2.3. Verification of Accuracy of 3D Coordinates
2.3.1. Estimation of Camera Parameters
2.3.2. Reconstruction of 3D Marker Positions Based on Proposed Method
2.3.3. Determination of Gold Standard
2.4. Accuracy Evaluation of Proposed Method
2.5. Robustness Evaluation of the Proposed Method
- A spherical marker was randomly selected.
- For the selected marker, random noise was added to the digitized coordinates detected by the NN model, and the 3D coordinates were reconstructed using these noisy coordinates. The magnitude of the noise was set such that it falls within a circular area obtained by reprojecting a sphere with a diameter of 12 mm (half the size of the spherical marker) centered at the 3D coordinates reconstructed by the proposed method onto the image plane.
- The distance between the reconstructed 3D coordinates obtained in step 2 and those reconstructed by the proposed method was calculated.
- Steps 1 to 3 were repeated 1,000,000 times to obtain the average and maximum distances.
3. Results
4. Discussion
5. Conclusions
6. Patents
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Manjila, S.; Singh, G.; Alkhachroum, A.M.; Ramos-Estebanez, C. Understanding Edward Muybridge: Historical review of behavioral alterations after a 19th-century head injury and their multifactorial influence on human life and culture. Neurosurg. Focus 2015, 39, E4. [Google Scholar] [CrossRef] [PubMed]
- Silverman, M.E. Etienne-Jules Marey: 19th Century cardiovascular physiologist and inventor of cinematography. Clin. Cardiol. 1996, 19, 339–341. [Google Scholar] [CrossRef] [PubMed]
- Fenn, W.O. Work against gravity and work due to velocity changes in running. Am. J. Physiol. 1930, 93, 433–462. [Google Scholar] [CrossRef]
- Elftman, H. The work done by muscles in running. Am. J. Physiol. 1940, 129, 672–684. [Google Scholar] [CrossRef]
- Fenn, W.O. Frictional and kinetic factors in the work of sprint running. Am. J. Physiol. 1930, 92, 583–611. [Google Scholar] [CrossRef]
- Abdel-Aziz, Y.I.; Karara, H.M. Direct Linear Transformation from Comparator Coordinates into Object Space Coordinates in Close-Range Photogrammetry. In Proceedings of the ASP/UI Symposium on Close-Range Photogrammetry, Urbana, IL, USA, 26–29 January 1971; pp. 1–18. [Google Scholar]
- Neal, R.J.; Wilson, B.D. 3D kinematics and kinetics of the golf swing. Int. J. Sport Biomech. 1985, 1, 221–232. [Google Scholar] [CrossRef]
- Elliott, B.C.; Marsh, T.; Blanksby, B.A. A three-dimensional cinematographic analysis of the tennis serve. Int. J. Sport Biomech. 1986, 2, 260–271. [Google Scholar] [CrossRef]
- Ae, M.; Sakatani, Y.; Yokoi, T.; Hashihara, Y.; Shibukawa, K. Biomechanical analysis of the preparatory motion for takeoff in the Fosbury Flop. Int. J. Sport Biomech. 1986, 2, 66–77. [Google Scholar] [CrossRef]
- Ae, M.; Nagahara, R.; Ohshima, Y.; Koyama, H.; Takamoto, M.; Shibayama, K. Biomechanical analysis of the top three male high jumpers at the 2007 World Championships in Athletics. New Stud. Athl. 2008, 23, 45–52. [Google Scholar]
- Ito, A.; Fukuda, K.; Kijima, K. Mid-phase movements of Tyson Gay and Asafa Powell in the 100 meters at the 2007 World Championships in Athletics. New Stud. Athl. 2008, 23, 39–43. [Google Scholar]
- Ohyama, B.K.; Fujii, H.; Murakami, M.; Endo, T.; Takesako, H.; Gomi, K.; Tauchi, K. A biomechanical analysis of the men’s shot put at the 2007 World Championships in Athletics. New Stud. Athl. 2008, 23, 53–62. [Google Scholar]
- Nakano, N.; Sakura, T.; Ueda, K.; Omura, L.; Kimura, A.; Iino, Y.; Fukashiro, S.; Yoshioka, S. Evaluation of 3D markerless motion capture accuracy using OpenPose with multiple video cameras. Front. Sports Act. Living 2020, 2, 50. [Google Scholar] [CrossRef]
- Noorbhai, H.; Monn, S.; Fukushima, T. A conceptual framework and review of multi-method approaches for 3D markerless motion capture in sports and exercise. J. Sports Sci. 2025, 43, 1167–1174. [Google Scholar] [CrossRef] [PubMed]
- Carse, B.; Meadows, B.; Bowers, R.; Rowe, P. Affordable clinical gait analysis: An assessment of the marker tracking accuracy of a new low-cost optical 3D motion analysis system. Physiotherapy 2013, 99, 347–351. [Google Scholar] [CrossRef] [PubMed]
- Figueroa, P.J.; Leite, N.J.; Barros, R.M.L. A flexible software for tracking of markers used in human motion analysis. Comput. Methods Programs Biomed. 2003, 72, 155–165. [Google Scholar] [CrossRef] [PubMed]
- Needham, L.; Evans, M.; Cosker, D.; Wade, L.; McGuigan, P.; Bilzon, J.; Colyer, S. The Accuracy of Several Pose Estimation Methods For 3D Joint Centre Localisation. Sci. Rep. 2021, 11, 20673. [Google Scholar] [CrossRef]
- Kanko, R.M.; Laende, E.K.; Davis, E.M.; Selbie, W.S.; Deluzio, K.J. Concurrent assessment of gait kinematics using marker-based and markerless motion capture. J. Biomech. 2021, 127, 110665. [Google Scholar] [CrossRef]
- Wang, C.Y.; Bochkovskiy, A.; Liao, H.Y.M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv 2022, arXiv:2207.02696. [Google Scholar]
- Wang, C.Y.; Yeh, I.H.; Mark Liao, H.Y. YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information. arXiv 2024, arXiv:2402.13616. [Google Scholar] [CrossRef]
- Khanam, R.; Hussain, M. YOLOv11: An Overview of the Key Architectural Enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar]
- Suzuki, Y.; Ae, M.; Takenaka, S.; Fujii, N. Comparison of support leg kinetics between side-step and cross-step cutting techniques. Sports Biomech. 2014, 13, 144–153. [Google Scholar] [CrossRef] [PubMed]
- Dapena, J.; Harman, E.A.; Miller, J.A. Three-dimensional cinematography with control object of unknown shape. J. Biomech. 1982, 15, 11–19. [Google Scholar] [CrossRef] [PubMed]
- Zhang, S.; Fu, Q. Wand-Based Calibration of Unsynchronized Multiple Cameras for 3D Localization. Sensors 2024, 21, 284. [Google Scholar] [CrossRef] [PubMed]
- Cereatti, A.; Bonci, T.; Akbarshahi, M.; Aminian, K.; Barré, A.; Begon, M.; Benoit, D.L.; Charbonnier, C.; Dal Maso, F.; Fantozzi, S.; et al. Standardization proposal of soft tissue artefact description for data sharing in human motion measurements. J. Biomech. 2017, 62, 5–13. [Google Scholar] [CrossRef]
- Merriaux, P.; Dupuis, Y.; Boutteau, R.; Vasseur, P.; Savatier, X. A Study of Vicon System Position Performance. Sensors 2017, 17, 1591. [Google Scholar] [CrossRef]
Component | Fixed Bias (Mean ± SD) | Proportinal Bias (Slope) |
---|---|---|
X | −0.192 ± 1.40 mm ** | 0.000176 * |
Y | −0.402 ± 1.86 mm ** | −0.000251 |
Z | −0.467 ± 1.11 mm ** | −0.000437 ** |
Number of Cameras | Average Distance (mm) | Maximum Distance (mm) |
---|---|---|
3 | 5.23 | 24.68 |
4 | 4.06 | 15.45 |
5 | 3.42 | 13.53 |
6 | 2.91 | 10.61 |
7 | 2.61 | 8.77 |
8 | 2.43 | 7.66 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ohshima, Y. Development of Marker-Based Motion Capture Using RGB Cameras: A Neural Network Approach for Spherical Marker Detection. Sensors 2025, 25, 5228. https://doi.org/10.3390/s25175228
Ohshima Y. Development of Marker-Based Motion Capture Using RGB Cameras: A Neural Network Approach for Spherical Marker Detection. Sensors. 2025; 25(17):5228. https://doi.org/10.3390/s25175228
Chicago/Turabian StyleOhshima, Yuji. 2025. "Development of Marker-Based Motion Capture Using RGB Cameras: A Neural Network Approach for Spherical Marker Detection" Sensors 25, no. 17: 5228. https://doi.org/10.3390/s25175228
APA StyleOhshima, Y. (2025). Development of Marker-Based Motion Capture Using RGB Cameras: A Neural Network Approach for Spherical Marker Detection. Sensors, 25(17), 5228. https://doi.org/10.3390/s25175228