Automatic speech recognition for supporting endangered language documentation

E Prud'hommeaux, R Jimerson, R Hatcher… - 2021 - scholarspace.manoa.hawaii.edu
Generating accurate word-level transcripts of recorded speech for language documentation
is difficult and time-consuming, even for skilled speakers of the target language. Automatic …

Sparse transcription

S Bird - Computational Linguistics, 2021 - direct.mit.edu
The transcription bottleneck is often cited as a major obstacle for efforts to document the
world's endangered languages and supply them with language technologies. One solution …

The effects of automatic speech recognition quality on human transcription latency

Y Gaur, WS Lasecki, F Metze, JP Bigham - Proceedings of the 13th …, 2016 - dl.acm.org
Transcription makes speech accessible to deaf and hard of hearing people. This conversion
of speech to text is still done manually by humans, despite high cost, because the quality of …

[PDF][PDF] ASR for documenting acutely under-resourced indigenous languages

R Jimerson, E Prud'Hommeaux - Proceedings of the Eleventh …, 2018 - aclanthology.org
Despite its potential utility for facilitating the transcription of speech recordings, automatic
speech recognition (ASR) has not been widely explored as a tool for documenting …

Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion

J Tejedor, DT Toledano, P Lopez-Otero… - EURASIP Journal on …, 2015 - Springer
Spoken term detection (STD) aims at retrieving data from a speech repository given a textual
representation of the search term. Nowadays, it is receiving much interest due to the large …

Domain robust feature extraction for rapid low resource asr development

S Dalmia, X Li, F Metze… - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org
Developing a practical speech recognizer for a low resource language is challenging, not
only because of the (potentially unknown) properties of the language, but also because test …

[PDF][PDF] Very Low Resource Radio Browsing for Agile Developmental and Humanitarian Monitoring.

A Saeb, R Menon, H Cameron, W Kibira, JA Quinn… - …, 2017 - academia.edu
We present a radio browsing system developed on a very small corpus of annotated speech
by using semi-supervised training of multilingual DNN/HMM acoustic models. This system is …

Wav2Gloss: Generating Interlinear Glossed Text from Speech

T He, K Choi, L Tjuatja, NR Robinson, J Shi… - arXiv preprint arXiv …, 2024 - arxiv.org
Thousands of the world's languages are in danger of extinction--a tremendous threat to
cultural identities and human language diversity. Interlinear Glossed Text (IGT) is a form of …

[PDF][PDF] Robust speech recognition for low-resource languages

A Romanenko - 2022 - oparu.uni-ulm.de
Process of human-machine interaction is an integral part of everyday human life in a modern
world. The various interfaces are intended to facilitate this interaction and provide maximum …

Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages

J Kim, M Kumar, D Gowda, A Garg… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
In this paper, we propose a three-stage training methodology to improve the speech
recognition accuracy of low-resource languages. We explore and propose an effective …