Wenet: Production oriented streaming and non-streaming end-to-end speech recognition toolkit Z Yao, D Wu, X Wang, B Zhang, F Yu, C Yang, Z Peng, X Chen, L Xie, ... arXiv preprint arXiv:2102.01547, 2021 | 229 | 2021 |
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 87* | 2022 |
The accented english speech recognition challenge 2020: open datasets, tracks, baselines, results and methods X Shi, F Yu, Y Lu, Y Liang, Q Feng, D Wang, Y Qian, L Xie ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 69 | 2021 |
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge F Yu, S Zhang, P Guo, Y Fu, Z Du, S Zheng, W Huang, L Xie, ZH Tan, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 26 | 2022 |
The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines F Yu, Z Yao, X Wang, K An, L Xie, Z Ou, B Liu, X Li, G Miao 2021 IEEE Spoken Language Technology Workshop (SLT), 1117-1123, 2021 | 19 | 2021 |
Boundary and context aware training for cif-based non-autoregressive end-to-end asr F Yu, H Luo, P Guo, Y Liang, Z Yao, L Xie, Y Gao, L Hou, S Zhang 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 17 | 2021 |
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario F Yu, S Zhang, P Guo, Y Liang, Z Du, Y Lin, L Xie 2022 IEEE Spoken Language Technology Workshop (SLT), 144-151, 2023 | 11 | 2023 |
A comparative study on speaker-attributed automatic speech recognition in multi-party meetings F Yu, Z Du, S Zhang, Y Lin, L Xie arXiv preprint arXiv:2203.16834, 2022 | 10 | 2022 |
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ... arXiv preprint arXiv:2402.08846, 2024 | 8 | 2024 |
Ba-sot: Boundary-aware serialized output training for multi-talker asr Y Liang, F Yu, Y Li, P Guo, S Zhang, Q Chen, L Xie arXiv preprint arXiv:2305.13716, 2023 | 5 | 2023 |
The iscslp 2022 intelligent cockpit speech recognition challenge (icsrc): Dataset, tracks, baseline and results A Zhang, F Yu, K Huang, L Xie, L Wang, ES Chng, H Bu, B Zhang, ... 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 5 | 2022 |
SlideSpeech: A Large Scale Slide-Enriched Audio-Visual Corpus H Wang, F Yu, X Shi, Y Wang, S Zhang, M Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition P Chen, F Yu, Y Liang, H Xue, X Wan, N Zheng, H Zhou, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 4 | 2023 |
A comparative study on multichannel speaker-attributed automatic speech recognition in multi-party meetings M Shi, J Zhang, Z Du, F Yu, Q Chen, S Zhang, LR Dai 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | 4 | 2023 |
Casa-asr: Context-aware speaker-attributed asr M Shi, Z Du, Q Chen, F Yu, Y Li, S Zhang, J Zhang, LR Dai arXiv preprint arXiv:2305.12459, 2023 | 3 | 2023 |
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR Y Liang, M Shi, F Yu, Y Li, S Zhang, Z Du, Q Chen, L Xie, Y Qian, J Wu, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 2 | 2023 |
MaLa-ASR: Multimedia-Assisted LLM-Based ASR G Yang, Z Ma, F Yu, Z Gao, S Zhang, X Chen arXiv preprint arXiv:2406.05839, 2024 | 1 | 2024 |
LCB-Net: Long-Context Biasing for Audio-Visual Speech Recognition F Yu, H Wang, X Shi, S Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Hourglass-AVSR: Down-Up Sampling-Based Computational Efficiency Model for Audio-Visual Speech Recognition F Yu, H Wang, Z Ma, S Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Separate-to-recognize: Joint multi-target speech separation and speech recognition for speaker-attributed ASR Y Lin, Z Du, S Zhang, F Yu, Z Zhao, F Wu 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 1 | 2022 |