Streaming intended query detection using e2e modeling for continued conversation

文章

学术资源搜索

获得 2 条结果（用时0.02秒）

Streaming intended query detection using e2e modeling for continued conversation

Multi-output RNN-T joint networks for multi-task learning of ASR and auxiliary tasks

W Wang, D Zhao, S Ding, H Zhang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

We propose a multi-output joint network architecture for RNN-T transducer, for multi-task
modeling of ASR and auxiliary tasks that rely on ASR outputs. Each output of the joint …

被引用次数：3 相关文章

[PDF] arxiv.org

Text Injection for Capitalization and Turn-Taking Prediction in Speech Models

S Bijwadia, S Chang, W Wang, Z Meng… - arXiv preprint arXiv …, 2023 - arxiv.org

Text injection for automatic speech recognition (ASR), wherein unpaired text-only data is
used to supplement paired audio-text data, has shown promising improvements for word …

被引用次数：1 相关文章所有 5 个版本