Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms

Mesallam, Tamer A and Farahat, Mohamed and Malki, Khalid H and Alsulaiman, Mansour and Ali, Zulfiqar and Al-nasheri, Ahmed and Muhammad, Ghulam (2017) Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms. Journal of Healthcare Engineering, 2017. pp. 1-13. DOI https://doi.org/10.1155/2017/8783751

Abstract

A voice disorder database is essential for research on automatic voice disorder detection and classification. Ethnicity affects the voice characteristics of a person, so it is necessary to develop a database by collecting voice samples from the targeted ethnic group. Understanding the characteristics of a local group in this way improves the chances of arriving at a global solution for the accurate and reliable diagnosis of voice disorders. Motivated by this idea, an Arabic voice pathology database (AVPD) is designed and developed in this study by recording three vowels, running speech, and isolated words. For each recorded sample, the perceptual severity is also provided, which is a unique aspect of the AVPD. During the development of the AVPD, the shortcomings of different voice disorder databases were identified so that they could be avoided in the AVPD. In addition, the AVPD is evaluated by using six different types of speech features and four types of machine learning algorithms. The results of detection and classification of voice disorders obtained with the sustained vowel and the running speech are also compared with the results of an English-language disorder database, the Massachusetts Eye and Ear Infirmary (MEEI) database.

Item Metadata

Item Type:	Article
Uncontrolled Keywords:	Humans; Voice Disorders; Diagnosis, Computer-Assisted; Laryngoscopy; Speech Production Measurement; Reproducibility of Results; Language; Voice; Voice Quality; Speech Acoustics; Algorithms; Acoustics; Signal Processing, Computer-Assisted; Video Recording; Databases, Factual; Pattern Recognition, Automated; Adult; Middle Aged; Saudi Arabia; Female; Male; Young Adult; Machine Learning
Divisions:	Faculty of Science and Health; Faculty of Science and Health > Computer Science and Electronic Engineering, School of
SWORD Depositor:	Unnamed user with email elements@essex.ac.uk
Depositing User:	Unnamed user with email elements@essex.ac.uk
Date Deposited:	09 Apr 2020 11:08
Last Modified:	16 Aug 2025 03:51
URI:	http://repository.essex.ac.uk/id/eprint/27206

Available files

Published Version

Filename: JHE2017-8783751.pdf

Licence: Creative Commons: Attribution 3.0

