Ross, Alice and Markl, Nina and Lai, Catherine and Hall-Lew, Lauren A (2026) The Sound of Silencing: Identities and Ideologies in Commercial Text-To-Speech. In: 2026 CHI Conference on Human Factors in Computing Systems, 2026-04-13 - 2026-04-17, Barcelona.
Abstract
Text-to-speech (TTS) technology allows the synthesis of speech that is frequently described as highly ‘natural’ and, in some contexts, indistinguishable from human speech. Voice interfaces using such synthesised speech are increasingly encountered in a wide range of contexts. Recognising that listeners are likely to hear human-like voices as belonging to different demographic/social groups, and that these social judgments exist within ideological frameworks, we note a lack of diversity in popularly used English-speaking TTS voices, and caution that decisions taken in the design and deployment of voice interfaces risk perpetuating, or even exacerbating, existing social biases. Drawing upon sociolinguistic theory, we carry out a novel experiment to investigate these issues in a leading commercial TTS system, concluding that the system’s output disproportionately reproduces white, male, US-accented speech when prompted to convey competence. This work aims to encourage further research applying sociolinguistic knowledge to the study of human-computer interaction with speech technology.
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Uncontrolled Keywords: | voice user interfaces, speech synthesis, voice AI, language ideology, diversity and inclusion |
| Divisions: | Faculty of Social Sciences; Faculty of Social Sciences > Language and Linguistics, Department of |
| SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
| Depositing User: | Unnamed user with email elements@essex.ac.uk |
| Date Deposited: | 21 Apr 2026 10:18 |
| Last Modified: | 21 Apr 2026 10:18 |
| URI: | http://repository.essex.ac.uk/id/eprint/43153 |
Available files
Filename: 3772363.3798657.pdf
Licence: Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0