Racial disparities in automated speech recognition

A Koenecke, A Nam, E Lake, J Nudell… - Proceedings of the …, 2020 - National Acad Sciences
Automated speech recognition (ASR) systems, which use sophisticated machine-learning
algorithms to convert spoken language to text, have become increasingly widespread …

The listening talker: A review of human and algorithmic context-induced modifications of speech

M Cooke, S King, M Garnier, V Aubanel - Computer Speech & Language, 2014 - Elsevier
Speech output technology is finding widespread application, including in scenarios where
intelligibility might be compromised–at least for some listeners–by adverse conditions …

Gender and dialect bias in YouTube's automatic captions

R Tatman - Proceedings of the first ACL workshop on ethics in …, 2017 - aclanthology.org
This project evaluates the accuracy of YouTube's automatically-generated captions across
two genders and five dialect groups. Speakers' dialect and gender was controlled for by …

Quantifying bias in automatic speech recognition

S Feng, O Kudina, BM Halpern… - arXiv preprint arXiv …, 2021 - arxiv.org
Automatic speech recognition (ASR) systems promise to deliver objective interpretation of
human speech. Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs …

[图书][B] Teaching and researching: Listening

M Rost - 2013 - taylorfrancis.com
Teaching and Researching Listening provides a focused, state-of-the-art treatment of the
linguistic, psycholinguistic and pragmatic processes that are involved in oral language use …

Deconstructing comprehensibility: Identifying the linguistic influences on listeners' L2 comprehensibility ratings

T Isaacs, P Trofimovich - Studies in Second Language Acquisition, 2012 - cambridge.org
Comprehensibility, a major concept in second language (L2) pronunciation research that
denotes listeners' perceptions of how easily they understand L2 speech, is central to …

[HTML][HTML] Towards inclusive automatic speech recognition

S Feng, BM Halpern, O Kudina… - Computer Speech & …, 2024 - Elsevier
Practice and recent evidence show that state-of-the-art (SotA) automatic speech recognition
(ASR) systems do not perform equally well for all speaker groups. Many factors can cause …

[PDF][PDF] Effects of Talker Dialect, Gender & Race on Accuracy of Bing Speech and YouTube Automatic Captions.

R Tatman, C Kasten - Interspeech, 2017 - drive.google.com
This project compares the accuracy of two automatic speech recognition (ASR) systems–
Bing Speech and YouTube's automatic captions–across gender, race and four dialects of …

[PDF][PDF] Lexicon-free conversational speech recognition with neural networks

A Maas, Z Xie, D Jurafsky, AY Ng - … of the 2015 Conference of the …, 2015 - aclanthology.org
We present an approach to speech recognition that uses only a neural network to map
acoustic input to characters, a character-level language model, and a beam search …

Understanding automatic speech recognition

D O'Shaughnessy - Computer Speech & Language, 2023 - Elsevier
This paper discusses how automatic speech recognition systems are and could be
designed, in order to best exploit the discriminative information encoded in human speech …