Next Article in Journal
The Extended Exponential-Weibull Accelerated Failure Time Model with Application to Sudan COVID-19 Data
Previous Article in Journal
Peristalsis of Nanofluids via an Inclined Asymmetric Channel with Hall Effects and Entropy Generation Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Emotional Speaker Verification Using Novel Modified Capsule Neural Network

1
Computer Engineering Department, University of Sharjah, Sharjah 27272, United Arab Emirates
2
Electrical Engineering Department, University of Sharjah, Sharjah 27272, United Arab Emirates
3
Computer Science Department, University of Sharjah, Sharjah 27272, United Arab Emirates
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(2), 459; https://doi.org/10.3390/math11020459
Submission received: 14 December 2022 / Revised: 5 January 2023 / Accepted: 10 January 2023 / Published: 15 January 2023
(This article belongs to the Section E1: Mathematics and Computer Science)

Abstract

Capsule Neural Network (CapsNet) models are regarded as efficient substitutes for convolutional neural networks (CNN) due to their powerful hierarchical representation capability. Nevertheless, CNN endure their inability of recording spatial information in spectrograms. The main constraint of CapsNet is related to the compression method which can be implemented in CNN models but cannot be directly employed in CapsNet. As a result, we propose a novel architecture based on dual-channel long short-term memory compressed CapsNet (DC-LSTM–COMP CapsNet) for speaker verification in emotional as well as stressful talking environments. The proposed approach is perceived as a modified Capsule network that attempts to overcome the limitations that exist within the original CapsNet, as well as in CNN while enhancing the verification performance. The proposed architecture is assessed on four distinct databases. The experimental analysis reveals that the average speaker verification performance is improved in comparison with CNN, the original CapsNet, as well as the conventional classifiers. The proposed algorithm notably achieves the best verification accuracy across the four speech databases. For example, using the Emirati dataset, the average percentage equal error rates (EERs) obtained is 10.50%, based on the proposed architecture which outperforms other deep and classical models.
Keywords: capsule neural networks; deep neural network; speaker verification capsule neural networks; deep neural network; speaker verification

Share and Cite

MDPI and ACS Style

Nassif, A.B.; Shahin, I.; Nemmour, N.; Hindawi, N.; Elnagar, A. Emotional Speaker Verification Using Novel Modified Capsule Neural Network. Mathematics 2023, 11, 459. https://doi.org/10.3390/math11020459

AMA Style

Nassif AB, Shahin I, Nemmour N, Hindawi N, Elnagar A. Emotional Speaker Verification Using Novel Modified Capsule Neural Network. Mathematics. 2023; 11(2):459. https://doi.org/10.3390/math11020459

Chicago/Turabian Style

Nassif, Ali Bou, Ismail Shahin, Nawel Nemmour, Noor Hindawi, and Ashraf Elnagar. 2023. "Emotional Speaker Verification Using Novel Modified Capsule Neural Network" Mathematics 11, no. 2: 459. https://doi.org/10.3390/math11020459

APA Style

Nassif, A. B., Shahin, I., Nemmour, N., Hindawi, N., & Elnagar, A. (2023). Emotional Speaker Verification Using Novel Modified Capsule Neural Network. Mathematics, 11(2), 459. https://doi.org/10.3390/math11020459

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop