Mesallam, Tamer A and Farahat, Mohamed and Malki, Khalid H and Alsulaiman, Mansour and Ali, Zulfiqar and Al-nasheri, Ahmed and Muhammad, Ghulam (2017) Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms. Journal of Healthcare Engineering, 2017. pp. 1-13. DOI https://doi.org/10.1155/2017/8783751
Abstract
A voice disorder database is an essential element in research on automatic voice disorder detection and classification. Ethnicity affects the voice characteristics of a person, so it is necessary to develop a database by collecting voice samples from the targeted ethnic group. Understanding the characteristics of a local group will enhance the chances of arriving at a global solution for the accurate and reliable diagnosis of voice disorders. Motivated by this idea, an Arabic voice pathology database (AVPD) is designed and developed in this study by recording three vowels, running speech, and isolated words. For each recorded sample, the perceptual severity is also provided, which is a unique aspect of the AVPD. During the development of the AVPD, the shortcomings of different voice disorder databases were identified so that they could be avoided in the AVPD. In addition, the AVPD is evaluated by using six different types of speech features and four types of machine learning algorithms. The results of detection and classification of voice disorders obtained with the sustained vowel and the running speech are also compared with the results of an English-language disorder database, the Massachusetts Eye and Ear Infirmary (MEEI) database.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Humans; Voice Disorders; Diagnosis, Computer-Assisted; Laryngoscopy; Speech Production Measurement; Reproducibility of Results; Language; Voice; Voice Quality; Speech Acoustics; Algorithms; Acoustics; Signal Processing, Computer-Assisted; Video Recording; Databases, Factual; Pattern Recognition, Automated; Adult; Middle Aged; Saudi Arabia; Female; Male; Young Adult; Machine Learning |
Divisions: | Faculty of Science and Health; Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 09 Apr 2020 11:08 |
Last Modified: | 30 Oct 2024 16:37 |
URI: | http://repository.essex.ac.uk/id/eprint/27206 |
Available files
Filename: JHE2017-8783751.pdf
Licence: Creative Commons: Attribution 3.0