HMM-free encoder pre-training for streaming RNN transducer

文章

学术资源搜索

获得 3 条结果（用时0.02秒）

我的图书馆

HMM-free encoder pre-training for streaming RNN transducer

在引用文章中搜索

[PDF] arxiv.org

Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition

X Chen, YY Lin, K Wang, Y He, Z Ma - arXiv preprint arXiv:2306.07949, 2023 - arxiv.org

End-to-end (E2E) systems have shown comparable performance to hybrid systems for
automatic speech recognition (ASR). Word timings, as a by-product of ASR, are essential in …

被引用次数：5 相关文章所有 4 个版本

[PDF] isca-archive.org

[PDF][PDF] Bring dialogue-context into RNN-T for streaming ASR.

J Hou, J Chen, W Li, Y Tang, J Zhang, Z Ma - INTERSPEECH, 2022 - isca-archive.org

Recently the conversational end-to-end (E2E) automatic speech recognition (ASR) models,
which directly integrate dialogue-context such as historical utterances into E2E models …

被引用次数：7 相关文章所有 2 个版本

[PDF] isca-archive.org

[PDF][PDF] A Neural Time Alignment Module for End-to-End Automatic Speech Recognition

D Jiang, C Zhang, PC Woodland - isca-archive.org

End-to-end trainable (E2E) automatic speech recognition (ASR) systems have low word
error rates, but they do not model timings or silence by default unlike hidden Markov model …

被引用次数：1 相关文章所有 2 个版本