- 学术资源搜索

Transfer learning for speech and language processing

D Wang, TF Zheng - 2015 Asia-Pacific Signal and Information …, 2015 - ieeexplore.ieee.org

Transfer learning is a vital technique that generalizes models trained for one setting or task
to other settings or tasks. For example in speech recognition, an acoustic model trained for …

被引用次数：262 相关文章所有 12 个版本

[PDF] arxiv.org

Unsupervised speech representation learning using wavenet autoencoders

J Chorowski, RJ Weiss, S Bengio… - … /ACM transactions on …, 2019 - ieeexplore.ieee.org

We consider the task of unsupervised extraction of meaningful latent representations of
speech by applying autoencoding neural networks to speech waveforms. The goal is to …

被引用次数：404 相关文章所有 11 个版本

[PDF] arxiv.org

Unsupervised pretraining transfers well across languages

M Riviere, A Joulin, PE Mazaré… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

Cross-lingual and multi-lingual training of Automatic Speech Recognition (ASR) has been
extensively investigated in the supervised setting. This assumes the existence of a parallel …

被引用次数：228 相关文章所有 7 个版本

[PDF] shu.ac.uk

A survey on automatic speech recognition systems for Portuguese language and its variations

TA de Lima, M Da Costa-Abreu - Computer Speech & Language, 2020 - Elsevier

Communication has been an essential part of being human and living in society. There are
several different languages and variations of them, so you can speak English in one place …

被引用次数：56 相关文章所有 4 个版本

[PDF] nwu.ac.za

Automatic speech recognition for under-resourced languages: A survey

L Besacier, E Barnard, A Karpov, T Schultz - Speech communication, 2014 - Elsevier

Speech processing for under-resourced languages is an active field of research, which has
experienced significant progress during the past decade. We propose, in this paper, a …

被引用次数：658 相关文章所有 16 个版本

[PDF] arxiv.org

Libri-light: A benchmark for asr with limited or no supervision

J Kahn, M Riviere, W Zheng… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

We introduce a new collection of spoken English audio suitable for training speech
recognition systems under limited or no supervision. It is derived from open-source audio …

被引用次数：694 相关文章所有 13 个版本

[PDF] merl.com

Language independent end-to-end architecture for joint language identification and speech recognition

S Watanabe, T Hori, JR Hershey - 2017 IEEE Automatic Speech …, 2017 - ieeexplore.ieee.org

End-to-end automatic speech recognition (ASR) can significantly reduce the burden of
developing ASR systems for new languages, by eliminating the need for linguistic …

被引用次数：181 相关文章所有 7 个版本

[PDF] arxiv.org

Multilingual end-to-end speech translation

H Inaguma, K Duh, T Kawahara… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org

In this paper, we propose a simple yet effective framework for multilingual end-to-end
speech translation (ST), in which speech utterances in source languages are directly …

被引用次数：98 相关文章所有 11 个版本

[PDF] whiterose.ac.uk

Data augmentation for low resource languages

A Ragni, KM Knill, SP Rath… - … 2014: 15th annual …, 2014 - eprints.whiterose.ac.uk

Recently there has been interest in the approaches for training speech recognition systems
for languages with limited resources. Under the IARPA Babel program such resources have …

被引用次数：178 相关文章所有 12 个版本

[PDF] arxiv.org

Sequence-based multi-lingual low resource speech recognition

S Dalmia, R Sanabria, F Metze… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org

Techniques for multi-lingual and cross-lingual speech recognition can help in low resource
scenarios, to bootstrap systems and enable analysis of new languages and domains. End-to …

被引用次数：116 相关文章所有 10 个版本