Ali, Zulfiqar and Alsulaiman, Mansour and Muhammad, Ghulam and Elamvazuthi, Irraivan and Al-Nasheri, Ahmed and Mesallam, Tamer A and Farahat, Mohamed and Malki, Khalid H (2017) Intra- and Inter-database Study for Arabic, English, and German Databases: Do Conventional Speech Features Detect Voice Pathology? Journal of Voice, 31 (3). 386.e1-386.e8. DOI https://doi.org/10.1016/j.jvoice.2016.09.009
Ali, Zulfiqar and Alsulaiman, Mansour and Muhammad, Ghulam and Elamvazuthi, Irraivan and Al-Nasheri, Ahmed and Mesallam, Tamer A and Farahat, Mohamed and Malki, Khalid H (2017) Intra- and Inter-database Study for Arabic, English, and German Databases: Do Conventional Speech Features Detect Voice Pathology? Journal of Voice, 31 (3). 386.e1-386.e8. DOI https://doi.org/10.1016/j.jvoice.2016.09.009
Ali, Zulfiqar and Alsulaiman, Mansour and Muhammad, Ghulam and Elamvazuthi, Irraivan and Al-Nasheri, Ahmed and Mesallam, Tamer A and Farahat, Mohamed and Malki, Khalid H (2017) Intra- and Inter-database Study for Arabic, English, and German Databases: Do Conventional Speech Features Detect Voice Pathology? Journal of Voice, 31 (3). 386.e1-386.e8. DOI https://doi.org/10.1016/j.jvoice.2016.09.009
Abstract
A large population around the world has voice complications. Various approaches for subjective and objective evaluations have been suggested in the literature. The subjective approach strongly depends on the experience and area of expertise of a clinician, and human error cannot be neglected. On the other hand, the objective or automatic approach is noninvasive. Automatic developed systems can provide complementary information that may be helpful for a clinician in the early screening of a voice disorder. At the same time, automatic systems can be deployed in remote areas where a general practitioner can use them and may refer the patient to a specialist to avoid complications that may be life threatening. Many automatic systems for disorder detection have been developed by applying different types of conventional speech features such as the linear prediction coefficients, linear prediction cepstral coefficients, and Mel-frequency cepstral coefficients (MFCCs). This study aims to ascertain whether conventional speech features detect voice pathology reliably, and whether they can be correlated with voice quality. To investigate this, an automatic detection system based on MFCC was developed, and three different voice disorder databases were used in this study. The experimental results suggest that the accuracy of the MFCC-based system varies from database to database. The detection rate for the intra-database ranges from 72% to 95%, and that for the inter-database is from 47% to 82%. The results conclude that conventional speech features are not correlated with voice, and hence are not reliable in pathology detection.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Humans; Voice Disorders; Diagnosis, Computer-Assisted; Speech Production Measurement; Reproducibility of Results; Predictive Value of Tests; Language; Voice Quality; Speech Acoustics; Signal Processing, Computer-Assisted; Databases, Factual; Pattern Recognition, Automated |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 09 Apr 2020 14:07 |
Last Modified: | 30 Oct 2024 16:37 |
URI: | http://repository.essex.ac.uk/id/eprint/27215 |
Available files
Filename: Intra_and_Inter_Database_Study_for_Arabic_English_and_German_Databases.pdf