关注
Kaixun Huang
Kaixun Huang
在 mail.nwpu.edu.cn 的电子邮件经过验证
标题
引用次数
引用次数
年份
Contextualized end-to-end speech recognition with contextual phrase prediction network
K Huang, A Zhang, Z Yang, P Guo, B Mu, T Xu, L Xie
arXiv preprint arXiv:2305.12493, 2023
152023
Adaptive contextual biasing for transducer based streaming speech recognition
T Xu, Z Yang, K Huang, P Guo, A Zhang, B Li, C Chen, C Li, L Xie
arXiv preprint arXiv:2306.00804, 2023
112023
The iscslp 2022 intelligent cockpit speech recognition challenge (icsrc): Dataset, tracks, baseline and results
A Zhang, F Yu, K Huang, L Xie, L Wang, ES Chng, H Bu, B Zhang, ...
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
52022
Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition
K Huang, A Zhang, B Zhang, T Xu, X Song, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
42023
SSHR: Leveraging self-supervised hierarchical representations for multilingual automatic speech recognition
H Xue, Q Shao, K Huang, P Chen, J Liu, L Xie
2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024
22024
The NPU-TEA System Report for the CHiME-8 MMCSG Challenge
K Huang, W Rao, Y Li, H Wang, S Huang, Y Wang, L Xie
CHiME Workshop on Speech Processing in Everyday Environments, 2024
12024
U2-KWS: Unified Two-Pass Open-Vocabulary Keyword Spotting with Keyword Bias
A Zhang, P Zhou, K Huang, Y Zou, M Liu, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
12023
Leveraging Synthetic Speech for CIF-Based Customized Keyword Spotting
S Liu, A Zhang, K Huang, L Xie
National Conference on Man-Machine Speech Communication, 354-365, 2023
12023
Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper
T Xu, K Huang, P Guo, Y Zhou, L Huang, H Xue, L Xie
arXiv preprint arXiv:2408.10680, 2024
2024
The NPU-TEA System for the CHiME-8 NOTSOFAR-1 Challenge
K Huang, Y Li, Z Wang, H Wang, W Rao, Z Sun, Z Tang, S Huang, ...
Proc. CHiME 2024, 45-48, 2024
2024
SEQ-former: A context-enhanced and efficient automatic speech recognition framework
Q Meng, M Liu, K Huang, K Wei, L Xie, Z Quan, W Deng, Q Lu, N Jiang, ...
Proc. Interspeech 2024, 212-216, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–11