ASR for documenting acutely under-resourced indigenous languages

R Jimerson, E Prud'Hommeaux - Proceedings of the Eleventh …, 2018 - aclanthology.org
Despite its potential utility for facilitating the transcription of speech recordings, automatic
speech recognition (ASR) has not been widely explored as a tool for documenting …

Multilingual graphemic hybrid ASR with massive data augmentation

C Liu, Q Zhang, X Zhang, K Singh, Y Saraf… - arXiv preprint arXiv …, 2019 - arxiv.org
Towards developing high-performing ASR for low-resource languages, approaches to
address the lack of resources are to make use of data from multiple languages, and to …

Large-scale semi-supervised training in deep learning acoustic model for ASR

Y Long, Y Li, S Wei, Q Zhang, C Yang - IEEE Access, 2019 - ieeexplore.ieee.org
This study investigated large-scale semi-supervised training (SST) to improve acoustic
models for automatic speech recognition. The conventional self-training, the recently …

ASR for under-resourced languages from probabilistic transcription

MA Hasegawa-Johnson, P Jyothi… - … on Audio, Speech …, 2016 - ieeexplore.ieee.org
In many under-resourced languages it is possible to find text, and it is possible to find
speech, but transcribed speech suitable for training automatic speech recognition (ASR) is …

Multilingual techniques for low resource automatic speech recognition

E Chuangsuwanich - 2016 - dspace.mit.edu
Out of the approximately 7000 languages spoken around the world, there are only about
100 languages with Automatic Speech Recognition (ASR) capability. This is due to the fact …

Improving ASR output for endangered language documentation

R Jimerson, K Simha, R Ptucha… - The 6th intl. workshop …, 2018 - par.nsf.gov
Documenting endangered languages supports the historical preservation of diverse
cultures. Automatic speech recognition (ASR), while potentially very useful for this task, has …

Computational tools for endangered language documentation

A Anastasopoulos - 2019 - search.proquest.com
Computational Tools for Endangered Language Documentation. A dissertation submitted
to the Graduate School of the University of N …

A case study on using speech-to-translation alignments for language documentation

A Anastasopoulos, D Chiang - arXiv preprint arXiv:1702.04372, 2017 - arxiv.org
For many low-resource or endangered languages, spoken language resources are more
likely to be annotated with translations than with transcriptions. Recent work exploits such …

Multitask learning for phone recognition of underresourced languages using mismatched transcription

NF Chen, BP Lim… - IEEE/ACM Transactions …, 2017 - ieeexplore.ieee.org
It is challenging to obtain large amounts of native (matched) labels for speech audio in
underresourced languages. This challenge is often due to a lack of literate speakers of the …

Multi-object classification via crowdsourcing with a reject option

Q Li, A Vempaty, LR Varshney… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Consider designing an effective crowdsourcing system for M-ary classification where crowd
workers complete simple binary microtasks, which are aggregated to give the final result. We …