Boundary and context aware training for cif-based non-autoregressive end-to-end asr

Y Xiao, L Wu, J Guo, J Li, M Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Non-autoregressive (NAR) generation, which is first proposed in neural machine translation
(NMT) to speed up inference, has attracted much attention in both machine learning and …

被引用次数：76 相关文章所有 8 个版本

[PDF] arxiv.org

A comparative study on non-autoregressive modelings for speech-to-text generation

Y Higuchi, N Chen, Y Fujita, H Inaguma… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org

Non-autoregressive (NAR) models simultaneously generate multiple outputs in a sequence,
which significantly reduces the inference speed at the cost of accuracy drop compared to …

被引用次数：47 相关文章所有 6 个版本

Non-autoregressive asr modeling using pre-trained language models for chinese speech recognition

FH Yu, KY Chen, KH Lu - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org

Transformer-based models have led to significant innovation in various classic and practical
subjects, including speech processing, natural language processing, and computer vision …

被引用次数：19 相关文章所有 2 个版本

[PDF] arxiv.org

A ctc alignment-based non-autoregressive transformer for end-to-end automatic speech recognition

R Fan, W Chu, P Chang, A Alwan - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org

Recently, end-to-end models have been widely used in automatic speech recognition (ASR)
systems. Two of the most representative approaches are connectionist temporal …

被引用次数：10 相关文章所有 5 个版本

[PDF] arxiv.org

SeACo-Paraformer: A non-autoregressive ASR system with flexible and effective hotword customization ability

X Shi, Y Yang, Z Li, Y Chen, Z Gao… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

Hotword customization is one of the concerned issues remained in ASR field-it is of value to
enable users of ASR systems to customize names of entities, persons and other phrases to …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition

P Chen, F Yu, Y Liang, H Xue, X Wan… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

Mixture-of-experts based models, which use language experts to extract language-specific
representations effectively, have been well applied in code-switching automatic speech …

被引用次数：3 相关文章所有 3 个版本

[PDF] arxiv.org

Decoupling recognition and transcription in mandarin asr

J Yuan, X Cai, D Gao, R Zheng… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org

Much of the recent literature on automatic speech recognition (ASR) is taking an end-to-end
approach. Unlike English where the writing system is closely related to sound, Chinese …

被引用次数：10 相关文章所有 3 个版本

[PDF] arxiv.org

被引用次数：4 相关文章所有 3 个版本