[PDF][PDF] Cheap, fast and good enough: Automatic speech recognition with non-expert transcription

S Novotney, C Callison-Burch - … of the North American Chapter of …, 2010 - aclanthology.org
Deploying an automatic speech recognition system with reasonable performance requires
expensive and time-consuming in-domain transcription. Previous work demonstrated that …

[PDF][PDF] Tools for collecting speech corpora via Mechanical-Turk

I Lane, M Eck, K Rottmann… - Proceedings of the NAACL …, 2010 - aclanthology.org
To rapidly port speech applications to new languages one of the most difficult tasks is the
initial collection of sufficient speech corpora. State-of-the-art automatic speech recognition …

Integrating automatic transcription into the language documentation workflow: Experiments with Na data and the Persephone toolkit

A Michaud, O Adams, TA Cohn, G Neubig… - 2018 - scholarspace.manoa.hawaii.edu
Automatic speech recognition tools have potential for facilitating language documentation,
but in practice these tools remain little-used by linguists for a variety of reasons, such as that …

[PDF][PDF] Using Amazon Mechanical Turk for transcription of non-native speech

K Evanini, D Higgins, K Zechner - … of the naacl hlt 2010 workshop …, 2010 - aclanthology.org
This study investigates the use of Amazon Mechanical Turk for the transcription of nonnative
speech. Multiple transcriptions were obtained from several distinct MTurk workers and were …

The effects of automatic speech recognition quality on human transcription latency

Y Gaur, WS Lasecki, F Metze, JP Bigham - Proceedings of the 13th …, 2016 - dl.acm.org
Transcription makes speech accessible to deaf and hard of hearing people. This conversion
of speech to text is still done manually by humans, despite high cost, because the quality of …

Automatic speech recognition for supporting endangered language documentation

E Prud'hommeaux, R Jimerson, R Hatcher… - 2021 - scholarspace.manoa.hawaii.edu
Generating accurate word-level transcripts of recorded speech for language documentation
is difficult and time-consuming, even for skilled speakers of the target language. Automatic …

[PDF][PDF] Fast transcription of unstructured audio recordings

BC Roy, DK Roy - 2009 - dspace.mit.edu
We introduce a new method for human-machine collaborative speech transcription that is
significantly faster than existing transcription methods. In this approach, automatic audio …

Toward better crowdsourced transcription: Transcription of a year of the let's go bus information system data

G Parent, M Eskenazi - 2010 IEEE Spoken Language …, 2010 - ieeexplore.ieee.org
Transcription is typically a long and expensive process. In the last year, crowdsourcing
through Amazon Mechanical Turk (MTurk) has emerged as a way to transcribe large …

Advances in speech transcription at IBM under the DARPA EARS program

SF Chen, B Kingsbury, L Mangu… - … on Audio, Speech …, 2006 - ieeexplore.ieee.org
This paper describes the technical and system building advances made in IBM's speech
recognition technology over the course of the Defense Advanced Research Projects Agency …

[PDF][PDF] A self-transcribing speech corpus: collecting continuous speech with an online educational game

A Gruenstein, I McGraw… - International Workshop on …, 2009 - groups.csail.mit.edu
We describe a novel approach to collecting orthographically transcribed continuous speech
data through the use of an online educational game called Voice Scatter, in which players …