Racial disparities in automated speech recognition
Automated speech recognition (ASR) systems, which use sophisticated machine-learning
algorithms to convert spoken language to text, have become increasingly widespread …
algorithms to convert spoken language to text, have become increasingly widespread …
The listening talker: A review of human and algorithmic context-induced modifications of speech
Speech output technology is finding widespread application, including in scenarios where
intelligibility might be compromised–at least for some listeners–by adverse conditions …
intelligibility might be compromised–at least for some listeners–by adverse conditions …
Gender and dialect bias in YouTube's automatic captions
R Tatman - Proceedings of the first ACL workshop on ethics in …, 2017 - aclanthology.org
This project evaluates the accuracy of YouTube's automatically-generated captions across
two genders and five dialect groups. Speakers' dialect and gender was controlled for by …
two genders and five dialect groups. Speakers' dialect and gender was controlled for by …
Quantifying bias in automatic speech recognition
Automatic speech recognition (ASR) systems promise to deliver objective interpretation of
human speech. Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs …
human speech. Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs …
[图书][B] Teaching and researching: Listening
M Rost - 2013 - taylorfrancis.com
Teaching and Researching Listening provides a focused, state-of-the-art treatment of the
linguistic, psycholinguistic and pragmatic processes that are involved in oral language use …
linguistic, psycholinguistic and pragmatic processes that are involved in oral language use …
Deconstructing comprehensibility: Identifying the linguistic influences on listeners' L2 comprehensibility ratings
T Isaacs, P Trofimovich - Studies in Second Language Acquisition, 2012 - cambridge.org
Comprehensibility, a major concept in second language (L2) pronunciation research that
denotes listeners' perceptions of how easily they understand L2 speech, is central to …
denotes listeners' perceptions of how easily they understand L2 speech, is central to …
[HTML][HTML] Towards inclusive automatic speech recognition
Practice and recent evidence show that state-of-the-art (SotA) automatic speech recognition
(ASR) systems do not perform equally well for all speaker groups. Many factors can cause …
(ASR) systems do not perform equally well for all speaker groups. Many factors can cause …
[PDF][PDF] Effects of Talker Dialect, Gender & Race on Accuracy of Bing Speech and YouTube Automatic Captions.
R Tatman, C Kasten - Interspeech, 2017 - drive.google.com
This project compares the accuracy of two automatic speech recognition (ASR) systems–
Bing Speech and YouTube's automatic captions–across gender, race and four dialects of …
Bing Speech and YouTube's automatic captions–across gender, race and four dialects of …
[PDF][PDF] Lexicon-free conversational speech recognition with neural networks
We present an approach to speech recognition that uses only a neural network to map
acoustic input to characters, a character-level language model, and a beam search …
acoustic input to characters, a character-level language model, and a beam search …
Understanding automatic speech recognition
D O'Shaughnessy - Computer Speech & Language, 2023 - Elsevier
This paper discusses how automatic speech recognition systems are and could be
designed, in order to best exploit the discriminative information encoded in human speech …
designed, in order to best exploit the discriminative information encoded in human speech …