Improved models for Mandarin speech-to-text transcription

Large, pruned or continuous space language models on a GPU for statistical machine translation

H Schwenk, A Rousseau, M Attik - … We Ever Really Replace the N …, 2012 - aclanthology.org

Abstract Language models play an important role in large vocabulary speech recognition
and statistical machine translation systems. The dominant approach since several decades …

Cited by: 142

Structured output layer neural network language models for speech recognition

HS Le, I Oparin, A Allauzen… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org

This paper extends a novel neural network language model (NNLM) which relies on word
clustering to structure the output vocabulary: Structured OUtput Layer (SOUL) NNLM. This …

Cited by: 90

Models of tone for tonal and non-tonal languages

F Metze, ZAW Sheikh, A Waibel… - … IEEE Workshop on …, 2013 - ieeexplore.ieee.org

Conventional wisdom in automatic speech recognition asserts that pitch information is not
helpful in building speech recognizers for non-tonal languages and contributes only …

Cited by: 62

Mandarin lexical tone duration: Impact of speech style, word length, syllable position and prosodic position

Y Wu, M Adda-Decker, L Lamel - Speech Communication, 2023 - Elsevier

This study aims to increase our knowledge of Mandarin lexical tone duration in continuous
Mandarin speech. Related variation factors such as the number of syllable(s) in word, the …

Cited by: 5

Two-pass decoding and cross-adaptation based system combination of end-to-end Conformer and hybrid TDNN ASR systems

M Cui, J Deng, S Hu, X Xie, T Wang, S Hu… - arXiv preprint arXiv …, 2022 - arxiv.org

Fundamental modelling differences between hybrid and end-to-end (E2E) automatic speech
recognition (ASR) systems create large diversity and complementarity among them. This …

Cited by: 9

Multi-domain neural network language model

T Alumäe - INTERSPEECH, 2013 - isca-archive.org

The paper describes a neural network language model that jointly models language in many
related domains. In addition to the traditional layers of a neural network language model, the …

Cited by: 31

Language model cross adaptation for LVCSR system combination

X Liu, MJF Gales, PC Woodland - Computer Speech & Language, 2013 - Elsevier

State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often
combine outputs from multiple sub-systems that may even be developed at different sites …

Cited by: 33

Automatic understanding of unwritten languages

O Adams - 2017 - minerva-access.unimelb.edu.au

Many of the world's languages are falling out of use without a written record and minimal
linguistic documentation. Language documentation is a slow process and there are an …

Cited by: 17

Developing STT and KWS systems using limited language resources

VB Le, L Lamel, A Messaoudi, W Hartmann… - … Annual Conference of …, 2014 - vocapia.com

This paper presents recent progress in developing speech-to-text (STT) and keyword
spotting (KWS) systems for the 2014 IARPA-Babel evaluation. Systems have been …

Cited by: 27

Phonotactic Language Recognition Using MLP Features

MF BenZeghiba, JL Gauvain, L Lamel - Interspeech, 2012 - isca-archive.org

This paper describes a very efficient Parallel Phone Recognizers followed by Language
Modeling (PPRLM) system in terms of both performance and processing speed. The system …

Cited by: 20