Semi-supervised training in low-resource ASR and KWS

E Prud'hommeaux, R Jimerson, R Hatcher… - 2021 - scholarspace.manoa.hawaii.edu

Generating accurate word-level transcripts of recorded speech for language documentation
is difficult and time-consuming, even for skilled speakers of the target language. Automatic …

被引用次数：30 相关文章所有 5 个版本

[PDF] mit.edu

Sparse transcription

S Bird - Computational Linguistics, 2021 - direct.mit.edu

The transcription bottleneck is often cited as a major obstacle for efforts to document the
world's endangered languages and supply them with language technologies. One solution …

被引用次数：44 相关文章所有 8 个版本

[PDF] umich.edu

The effects of automatic speech recognition quality on human transcription latency

Y Gaur, WS Lasecki, F Metze, JP Bigham - Proceedings of the 13th …, 2016 - dl.acm.org

Transcription makes speech accessible to deaf and hard of hearing people. This conversion
of speech to text is still done manually by humans, despite high cost, because the quality of …

被引用次数：61 相关文章所有 11 个版本

[PDF] aclanthology.org

[PDF][PDF] ASR for documenting acutely under-resourced indigenous languages

R Jimerson, E Prud'Hommeaux - Proceedings of the Eleventh …, 2018 - aclanthology.org

Despite its potential utility for facilitating the transcription of speech recordings, automatic
speech recognition (ASR) has not been widely explored as a tool for documenting …

被引用次数：43 相关文章所有 2 个版本

[PDF] springer.com

Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion

J Tejedor, DT Toledano, P Lopez-Otero… - EURASIP Journal on …, 2015 - Springer

Spoken term detection (STD) aims at retrieving data from a speech repository given a textual
representation of the search term. Nowadays, it is receiving much interest due to the large …

被引用次数：18 相关文章所有 12 个版本

[PDF] arxiv.org

Domain robust feature extraction for rapid low resource asr development

S Dalmia, X Li, F Metze… - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org

Developing a practical speech recognizer for a low resource language is challenging, not
only because of the (potentially unknown) properties of the language, but also because test …

被引用次数：19 相关文章所有 6 个版本

[PDF] academia.edu

[PDF][PDF] Very Low Resource Radio Browsing for Agile Developmental and Humanitarian Monitoring.

A Saeb, R Menon, H Cameron, W Kibira, JA Quinn… - …, 2017 - academia.edu

We present a radio browsing system developed on a very small corpus of annotated speech
by using semi-supervised training of multilingual DNN/HMM acoustic models. This system is …

被引用次数：21 相关文章所有 9 个版本

[PDF] arxiv.org

Wav2Gloss: Generating Interlinear Glossed Text from Speech

T He, K Choi, L Tjuatja, NR Robinson, J Shi… - arXiv preprint arXiv …, 2024 - arxiv.org

Thousands of the world's languages are in danger of extinction--a tremendous threat to
cultural identities and human language diversity. Interlinear Glossed Text (IGT) is a form of …

[PDF][PDF] Robust speech recognition for low-resource languages

A Romanenko - 2022 - oparu.uni-ulm.de

Process of human-machine interaction is an integral part of everyday human life in a modern
world. The various interfaces are intended to facilitate this interaction and provide maximum …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages

J Kim, M Kumar, D Gowda, A Garg… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org

In this paper, we propose a three-stage training methodology to improve the speech
recognition accuracy of low-resource languages. We explore and propose an effective …

被引用次数：2 相关文章所有 5 个版本