EM-Sign: A Non-Contact Recognition Method Based on 24 GHz Doppler Radar for Continuous Signs and Dialogues
Abstract
:1. Introduction
2. State-Of-The-Art of Radar Sensing
2.1. State-Of-The-Art and Characteristic of Radar Sensing
2.2. State-Of-The-Art of 24 Ghz Radar Integrated Implementation
3. Sign Language Representation Based on Doppler Radar
3.1. Micro-Doppler Signatures of Sign Language
3.2. Continuous Sign Representation at the Sentential Level
3.3. CEMD-Based Time-Frequency Analysis
4. Yolov3-Based Sign Language Detection
4.1. Network Input
4.2. Yolov3 and Yolov3-Tiny
4.3. Improved Yolov3-Tiny Network
5. Experiments
5.1. Doppler Radar Sensor Prototype System
5.2. Dataset
5.3. Classification with Improved Yolov3-Tiny Network
6. Results and Analysis
6.1. Continuous Sign Recognition
6.2. Dialogue Recognition
6.3. Discussion on Reuse of 24 Ghz Automotive Radar in Sign Language Recognition
7. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Zhang, X.; Wu, Q.; Zhao, D. Dynamic Hand Gesture Recognition Using FMCW Radar Sensor for Driving Assistance. In Proceedings of the 2018 10th International Conference on Wireless Communications and Signal Processing (WCSP), Hangzhou, China, 18–20 October 2018; pp. 1–6. [Google Scholar]
- Yang, S.; Premaratne, P.; Vial, P. Hand gesture recognition: An overview. In Proceedings of the 2013 5th IEEE International Conference on Broadband Network Multimedia Technology, Guilin, China, 17–19 November 2013; pp. 63–69. [Google Scholar]
- Abraham, E.; Nayak, A.; Iqbal, A. Real-Time Translation of Indian Sign Language using LSTM. In Proceedings of the 2019 Global Conference for Advancement in Technology (GCAT), Bangaluru, India, 18–20 October 2019; pp. 1–5. [Google Scholar]
- Feng, W.; Xia, C.; Zhang, Y.; Yu, J.; Jiang, W. Research on Chinese Sign Language Recognition Methods Based on Mechanomyogram Signals Analysis. In Proceedings of the 2019 IEEE 4th International Conference on Signal and Image Processing (ICSIP), Wuxi, China, 19–21 July 2019; pp. 46–50. [Google Scholar]
- Krishnan, P.T.; Balasubramanian, P. Detection of Alphabets for Machine Translation of Sign Language Using Deep Neural Net. In Proceedings of the 2019 International Conference on Data Science and Communication (IconDSC), Bangalore, India, 1–2 March 2019; pp. 1–3. [Google Scholar]
- Ku, Y.; Chen, M.; King, C. A Virtual Sign Language Translator on Smartphones. In Proceedings of the 2019 Seventh International Symposium on Computing and Networking Workshops (CANDARW), Nagasaki, Japan, 26–29 November 2019; pp. 445–449. [Google Scholar]
- Liao, Y.; Xiong, P.; Min, W.; Min, W.; Lu, J. Dynamic Sign Language Recognition Based on Video Sequence With BLSTM-3D Residual Networks. IEEE Access 2019, 7, 38044–38054. [Google Scholar] [CrossRef]
- Lien, J.; Gillian, N.; Karagozler, M.E.; Amihood, P.; Schwesig, C.; Olson, E.; Raja, H.; Poupyrev, I. Soli: Ubiquitous gesture sensing with millimeter wave radar. ACM Trans. Graph. (TOG) 2016, 35, 1–19. [Google Scholar] [CrossRef] [Green Version]
- Zhang, S.; Li, G.; Ritchie, M.; Fioranelli, F.; Griffiths, H. Dynamic hand gesture classification based on radar micro-Doppler signatures. In Proceedings of the 2016 CIE International Conference on Radar (RADAR), Guangzhou, China, 10–13 October 2016; pp. 1–4. [Google Scholar]
- Kim, Y.; Toomajian, B. Application of Doppler radar for the recognition of hand gestures using optimized deep convolutional neural networks. In Proceedings of the 2017 11th European Conference on Antennas and Propagation (EUCAP), Paris, France, 19–24 March 2017; pp. 1258–1260. [Google Scholar]
- Kulhandjian, H.; Sharma, P.; Kulhandjian, M.; D’Amours, C. Sign Language Gesture Recognition Using Doppler Radar and Deep Learning. In Proceedings of the 2019 IEEE Globecom Workshops (GC Wkshps), Waikoloa, HI, USA, 9–13 December 2019; pp. 1–6. [Google Scholar]
- Hou, J.; Aoki, Y. A real-time interactive non-verbal communication system through semantic feature extraction. In Proceedings of the IEEE International Conference on Multimedia and Expo, Lausanne, Switzerland, 26–29 August 2002; Volume 2, pp. 425–428. [Google Scholar]
- Fioranelli, F.; Le Kernec, J.; Shah, S.A. Radar for Health Care: Recognizing Human Activities and Monitoring Vital Signs. IEEE Potentials 2019, 38, 16–23. [Google Scholar] [CrossRef] [Green Version]
- Choi, J.; Va, V.; Gonzalez-Prelcic, N.; Daniels, R.; Bhat, C.R.; Heath, R.W. Millimeter-Wave Vehicular Communication to Support Massive Automotive Sensing. IEEE Commun. Mag. 2016, 54, 160–167. [Google Scholar] [CrossRef] [Green Version]
- Lan, S.; Yang, C.; Liu, B.; Qiu, J.; Denisov, A. Indoor real-time multiple moving targets detection and tracking using UWB antenna arrays. In Proceedings of the 2015 International Symposium on Antennas and Propagation (ISAP), Hobart, Australia, 9–12 November 2015; pp. 1–4. [Google Scholar]
- Shah, S.A.; Fioranelli, F. RF Sensing Technologies for Assisted Daily Living in Healthcare: A Comprehensive Review. IEEE Aerosp. Electron. Syst. Mag. 2019, 34, 26–44. [Google Scholar] [CrossRef] [Green Version]
- Liu, J.; Ding, H.; Shahroudy, A.; Duan, L.Y.; Jiang, X.; Wang, G.; Kot, A.C. Feature Boosting Network For 3D Pose Estimation. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 494–501. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Yang, S.; Liu, J.; Lu, S.; Er, M.H.; Kot, A.C. Collaborative learning of gesture recognition and 3D hand pose estimation with multi-order feature analysis. In Proceedings of the European Conference on Computer Vision, Glasgow, UK, 23–28 August 2020. [Google Scholar]
- Zhang, Z.; Pu, J.; Zhuang, L.; Zhou, W.; Li, H. Continuous Sign Language Recognition via Reinforcement Learning. In Proceedings of the 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan, 22–25 September 2019; pp. 285–289. [Google Scholar]
- Gurbuz, S.Z.; Gurbuz, A.C.; Malaia, E.A.; Griffin, D.J.; Crawford, C.; Rahman, M.M.; Aksu, R.; Kurtoglu, E.; Mdrafi, R.; Anbuselvam, A.; et al. A Linguistic Perspective on Radar Micro-Doppler Analysis of American Sign Language. In Proceedings of the 2020 IEEE International Radar Conference (RADAR), Washington, DC, USA, 28–30 April 2020; pp. 232–237. [Google Scholar]
- Tanaka, T.; Mandic, D.P. Complex Empirical Mode Decomposition. IEEE Signal Process 2007, 14, 101–104. [Google Scholar] [CrossRef]
- Huang, N.; Shen, Z.; Long, S.; Wu, M.; Shih, H.; Zheng, Q.; Yen, N.; Tung, C.; Liu, H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. A Math. Phys. Eng. Sci. 1998, 454, 903–995. [Google Scholar] [CrossRef]
- Suh, J.S.; Ryu, S.; Han, B.; Choi, J.; Kim, J.; Hong, S. 24 GHz FMCW Radar System for Real-Time Hand Gesture Recognition Using LSTM. In Proceedings of the 2018 Asia-Pacific Microwave Conference (APMC), Kyoto, Japan, 6–9 November 2018; pp. 860–862. [Google Scholar]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar]
- Redmon, J.; Farhadi, A. YOLOv3: An Incremental Improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Izidio, D.M.F.; Ferreira, A.P.A.; Barros, E.N.S. An Embedded Automatic License Plate Recognition System Using Deep Learning. In Proceedings of the 2018 VIII Brazilian Symposium on Computing Systems Engineering (SBESC), Salvador, Brazil, 6–9 November 2018; pp. 38–45. [Google Scholar]
- Carolis, B.D.; Ladogana, F.; Macchiarulo, N. YOLO TrashNet: Garbage Detection in Video Streams. In Proceedings of the 2020 IEEE Conference on Evolving and Adaptive Intelligent Systems (EAIS), Bari, Italy, 27–29 May 2020; pp. 1–7. [Google Scholar]
- Kau, L.; Su, W.; Yu, P.; Wei, S. A real-time portable sign language translation system. In Proceedings of the 2015 IEEE 58th International Midwest Symposium on Circuits and Systems (MWSCAS), Fort Collins, CO, USA, 2–5 August 2015; pp. 1–4. [Google Scholar]
- Kamrul Hasan, S.M.; Ahmad, M. A new approach of sign language recognition system for bilingual users. In Proceedings of the 2015 International Conference on Electrical & Electronic Engineering (ICEEE), Rajshahi, Bangladesh, 4–6 November 2015; pp. 33–36. [Google Scholar]
- Rao, G.A.; Syamala, K.; Kishore, P.V.V.; Sastry, A.S.C.S. Deep convolutional neural networks for sign language recognition. In Proceedings of the 2018 Conference on Signal Processing Additionally, Communication Engineering Systems (SPACES), Vijayawada, India, 4–5 January 2018; pp. 194–197. [Google Scholar]
- Bantupalli, K.; Xie, Y. American Sign Language Recognition using Deep Learning and Computer Vision. In Proceedings of the 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 10–13 December 2018; pp. 4896–4899. [Google Scholar]
- He, S. Research of a Sign Language Translation System Based on Deep Learning. In Proceedings of the 2019 International Conference on Artificial Intelligence and Advanced Manufacturing (AIAM), Dublin, Ireland, 16–18 October 2019; pp. 392–396. [Google Scholar]
- Camgoz, N.C.; Hadfield, S.; Koller, O.; Bowden, R. SubUNets: End-to-End Hand Shape and Continuous Sign Language Recognition. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 3075–3084. [Google Scholar]
- Pu, J.; Zhou, W.; Li, H. Iterative Alignment Network for Continuous Sign Language Recognition. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 4160–4169. [Google Scholar]
- Wei, C.; Zhao, J.; Zhou, W.; Li, H. Semantic Boundary Detection with Reinforcement Learning for Continuous Sign Language Recognition. IEEE Trans. Circuits Syst. Video Technol. 2020. [Google Scholar] [CrossRef]
Input: original radar time series . |
Output: spectrogram obtained by CEMD. |
(1) Pass the Fourier transform of the signal through an ideal bandpass filter to get the positive and negative frequency components, and . |
(2) Inverse Fourier transform to and and take the real part. The real sequence and are obtained. |
(3) Perform standard EMD to and . The order of the intrinsic mode functions (IMF) is N. |
(4) Iteration from 1 to N. Do the following operations. |
(a) Hilbert transform to and obtain the matrix . |
(b) Add the newly obtained matrix to the previous matrix. |
(5) End of iteration |
(6) The final matrix is the spectrogram of the gesture data. The horizontal axis of the spectrogram is time and the vertical axis is frequency. |
(7) Follow Equations (4) and (5) to sharpen for . |
Parameter | Value |
---|---|
Epoch | 300 |
Batch size | 16 |
Initial learning rate | 0.005 |
Weight decay | 0.0005 |
IOU threshold | 0.5 |
Confidence threshold | 0.3 |
Optimizer | Adam |
Dataset Size | Precision | Recall | mAP | F1-Measure |
---|---|---|---|---|
800 | 0.812 | 0.958 | 0.932 | 0.879 |
1600 | 0.924 | 0.993 | 0.99 | 0.957 |
Data Acquisition | Classification | mAP |
---|---|---|
Data glove [29] | Orientation and motion | 94% |
Video [30] | SVM | 96% |
Video [31] | CNN | 92.88% |
Video [32] | Inception+RCNN | 93% |
Image [33] | Faster-RCNN+3D CNN+LSTM | 99% |
Radar(our method) | Improved yolov3-tiny | 99% |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ye, L.; Lan, S.; Zhang, K.; Zhang, G. EM-Sign: A Non-Contact Recognition Method Based on 24 GHz Doppler Radar for Continuous Signs and Dialogues. Electronics 2020, 9, 1577. https://doi.org/10.3390/electronics9101577
Ye L, Lan S, Zhang K, Zhang G. EM-Sign: A Non-Contact Recognition Method Based on 24 GHz Doppler Radar for Continuous Signs and Dialogues. Electronics. 2020; 9(10):1577. https://doi.org/10.3390/electronics9101577
Chicago/Turabian StyleYe, Linting, Shengchang Lan, Kang Zhang, and Guiyuan Zhang. 2020. "EM-Sign: A Non-Contact Recognition Method Based on 24 GHz Doppler Radar for Continuous Signs and Dialogues" Electronics 9, no. 10: 1577. https://doi.org/10.3390/electronics9101577
APA StyleYe, L., Lan, S., Zhang, K., & Zhang, G. (2020). EM-Sign: A Non-Contact Recognition Method Based on 24 GHz Doppler Radar for Continuous Signs and Dialogues. Electronics, 9(10), 1577. https://doi.org/10.3390/electronics9101577