Fan Yu
Title
Cited by
Year
WeNet: Production oriented streaming and non-streaming end-to-end speech recognition toolkit
Z Yao, D Wu, X Wang, B Zhang, F Yu, C Yang, Z Peng, X Chen, L Xie, ...
arXiv preprint arXiv:2102.01547, 2021
Cited by: 226; Year: 2021
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge
F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 85*; Year: 2022
The accented English speech recognition challenge 2020: open datasets, tracks, baselines, results and methods
X Shi, F Yu, Y Lu, Y Liang, Q Feng, D Wang, Y Qian, L Xie
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 68; Year: 2021
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge
F Yu, S Zhang, P Guo, Y Fu, Z Du, S Zheng, W Huang, L Xie, ZH Tan, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 25; Year: 2022
The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines
F Yu, Z Yao, X Wang, K An, L Xie, Z Ou, B Liu, X Li, G Miao
2021 IEEE Spoken Language Technology Workshop (SLT), 1117-1123, 2021
Cited by: 19; Year: 2021
Boundary and context aware training for CIF-based non-autoregressive end-to-end ASR
F Yu, H Luo, P Guo, Y Liang, Z Yao, L Xie, Y Gao, L Hou, S Zhang
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Cited by: 17; Year: 2021
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
F Yu, S Zhang, P Guo, Y Liang, Z Du, Y Lin, L Xie
2022 IEEE Spoken Language Technology Workshop (SLT), 144-151, 2023
Cited by: 11; Year: 2023
A comparative study on speaker-attributed automatic speech recognition in multi-party meetings
F Yu, Z Du, S Zhang, Y Lin, L Xie
arXiv preprint arXiv:2203.16834, 2022
Cited by: 10; Year: 2022
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity
Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ...
arXiv preprint arXiv:2402.08846, 2024
Cited by: 7; Year: 2024
BA-SOT: Boundary-aware serialized output training for multi-talker ASR
Y Liang, F Yu, Y Li, P Guo, S Zhang, Q Chen, L Xie
arXiv preprint arXiv:2305.13716, 2023
Cited by: 5; Year: 2023
SlideSpeech: A Large Scale Slide-Enriched Audio-Visual Corpus
H Wang, F Yu, X Shi, Y Wang, S Zhang, M Li
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 4; Year: 2024
A comparative study on multichannel speaker-attributed automatic speech recognition in multi-party meetings
M Shi, J Zhang, Z Du, F Yu, Q Chen, S Zhang, LR Dai
2023 Asia Pacific Signal and Information Processing Association Annual …, 2023
Cited by: 4; Year: 2023
BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
P Chen, F Yu, Y Liang, H Xue, X Wan, N Zheng, H Zhou, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
Cited by: 3; Year: 2023
CASA-ASR: Context-aware speaker-attributed ASR
M Shi, Z Du, Q Chen, F Yu, Y Li, S Zhang, J Zhang, LR Dai
arXiv preprint arXiv:2305.12459, 2023
Cited by: 3; Year: 2023
The ISCSLP 2022 intelligent cockpit speech recognition challenge (ICSRC): Dataset, tracks, baseline and results
A Zhang, F Yu, K Huang, L Xie, L Wang, ES Chng, H Bu, B Zhang, ...
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by: 3; Year: 2022
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR
Y Liang, M Shi, F Yu, Y Li, S Zhang, Z Du, Q Chen, L Xie, Y Qian, J Wu, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by: 2; Year: 2023
MaLa-ASR: Multimedia-Assisted LLM-Based ASR
G Yang, Z Ma, F Yu, Z Gao, S Zhang, X Chen
arXiv preprint arXiv:2406.05839, 2024
Cited by: 1; Year: 2024
LCB-Net: Long-Context Biasing for Audio-Visual Speech Recognition
F Yu, H Wang, X Shi, S Zhang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 1; Year: 2024
Hourglass-AVSR: Down-Up Sampling-Based Computational Efficiency Model for Audio-Visual Speech Recognition
F Yu, H Wang, Z Ma, S Zhang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 1; Year: 2024
Separate-to-recognize: Joint multi-target speech separation and speech recognition for speaker-attributed ASR
Y Lin, Z Du, S Zhang, F Yu, Z Zhao, F Wu
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by: 1; Year: 2022