Jalali-najafabadi, Farideh and Gadepalli, Chaitanya and Jarchi, Delaram and Cheetham, Barry MG (2021) Acoustic analysis and digital signal processing for the assessment of voice quality. Biomedical Signal Processing and Control, 70. p. 103018. DOI https://doi.org/10.1016/j.bspc.2021.103018
Abstract
Purpose: This paper addresses the application of digital signal processing (DSP) techniques to the robust measurement of acoustical features of the human voice. It then addresses the use of regression-based techniques for the estimation of grade, roughness, breathiness, asthenia and strain from these acoustical features. These five properties of voice are the basis of the widely used ‘GRBAS’ characterisation of voice disorders.

Method: A well-known cross-correlation technique has been enhanced to measure more reliably the fundamental frequency of vowels, which is crucial for the derivation of acoustic features such as the harmonic-to-noise ratio, jitter and shimmer. Regression techniques including K-Nearest Neighbour Regression and Multiple Linear Regression are employed to derive the GRBAS properties.

Results: The enhanced cross-correlation technique has been validated against well-established published or commercially available techniques by analysing synthetic sustained vowels. In the context of our experiments, the enhanced method was found to produce more reliable and robust measurements than the well-established Praat technique and the Multi-Dimensional-Voice-Program (MDVP) software, especially where the signal-to-noise ratio is low. Estimates of the GRBAS components obtained with our methods have been found to be in good agreement with traditional GRBAS scoring by speech and language therapists (SLTs).

Conclusion: Voice analysis using DSP to extract acoustic features has the potential for objective and computerised GRBAS voice assessment. Such assessment can usefully augment GRBAS assessment as traditionally carried out subjectively by SLTs.
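As a rough illustration of the kind of measurement the abstract refers to, the sketch below estimates the fundamental frequency of a synthetic sustained vowel with a plain normalised cross-correlation between a reference frame and lagged frames. It is not the enhanced technique described in the paper, nor the Praat or MDVP algorithm; the function names `synth_vowel` and `estimate_f0_ncc`, the harmonic model of the vowel, and all parameter values are assumptions made for this example only.

```python
import numpy as np


def synth_vowel(f0, fs, duration, n_harmonics=5, snr_db=None, seed=0):
    """Crude stand-in for a sustained vowel: harmonics of f0 with 1/k amplitudes.

    If snr_db is given, white Gaussian noise is added at that signal-to-noise ratio.
    (Hypothetical helper, not taken from the paper.)
    """
    t = np.arange(int(fs * duration)) / fs
    x = sum(np.sin(2 * np.pi * k * f0 * t) / k for k in range(1, n_harmonics + 1))
    if snr_db is not None:
        rng = np.random.default_rng(seed)
        noise_power = np.mean(x ** 2) / (10 ** (snr_db / 10))
        x = x + rng.normal(0.0, np.sqrt(noise_power), size=x.shape)
    return x


def estimate_f0_ncc(x, fs, f_min=70.0, f_max=400.0):
    """Estimate f0 as fs / lag, where lag maximises the normalised cross-correlation.

    A reference frame at the start of the signal is compared with frames of the
    same length shifted by each candidate lag in [fs / f_max, fs / f_min].
    """
    lag_min = int(fs / f_max)
    lag_max = int(fs / f_min)
    win = lag_max  # frame length: one period of the lowest candidate frequency
    if len(x) < win + lag_max:
        raise ValueError("signal too short for the requested frequency range")

    ref = x[:win]
    ref_energy = np.dot(ref, ref)
    best_lag, best_score = lag_min, -np.inf
    for lag in range(lag_min, lag_max + 1):
        seg = x[lag:lag + win]
        denom = np.sqrt(ref_energy * np.dot(seg, seg))
        score = np.dot(ref, seg) / denom if denom > 0 else 0.0
        if score > best_score:  # keep the first (shortest) lag on exact ties
            best_lag, best_score = lag, score
    return fs / best_lag


if __name__ == "__main__":
    fs = 16000
    clean = synth_vowel(125.0, fs, duration=0.1)
    print("clean estimate (Hz):", estimate_f0_ncc(clean, fs))
    noisy = synth_vowel(125.0, fs, duration=0.1, snr_db=5.0)
    print("noisy estimate (Hz):", estimate_f0_ncc(noisy, fs))
```

For the clean signal the estimate should land close to the 125 Hz used to synthesise it. A basic estimator of this form has integer-lag resolution, can lock onto a multiple of the true period when the search range includes one, and becomes less stable as noise is added, which is the sort of weakness a more robust method would need to address.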
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Praat; MDVP; Speech; Acoustic; HNR; SNR; Shimmer; Jitter; Fundamental frequency (f(o)) |
Divisions: | Faculty of Science and Health; Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 26 Jan 2023 21:16 |
Last Modified: | 30 Oct 2024 19:15 |
URI: | http://repository.essex.ac.uk/id/eprint/32172 |
Available files
Filename: BSPC-D-21-01207_R1-2.pdf
Licence: Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0