Contextualized end-to-end speech recognition with contextual phrase prediction network K Huang, A Zhang, Z Yang, P Guo, B Mu, T Xu, L Xie arXiv preprint arXiv:2305.12493, 2023 | 15 | 2023 |
Adaptive contextual biasing for transducer based streaming speech recognition T Xu, Z Yang, K Huang, P Guo, A Zhang, B Li, C Chen, C Li, L Xie arXiv preprint arXiv:2306.00804, 2023 | 11 | 2023 |
The iscslp 2022 intelligent cockpit speech recognition challenge (icsrc): Dataset, tracks, baseline and results A Zhang, F Yu, K Huang, L Xie, L Wang, ES Chng, H Bu, B Zhang, ... 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 5 | 2022 |
Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition K Huang, A Zhang, B Zhang, T Xu, X Song, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 4 | 2023 |
SSHR: Leveraging self-supervised hierarchical representations for multilingual automatic speech recognition H Xue, Q Shao, K Huang, P Chen, J Liu, L Xie 2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024 | 2 | 2024 |
The NPU-TEA System Report for the CHiME-8 MMCSG Challenge K Huang, W Rao, Y Li, H Wang, S Huang, Y Wang, L Xie CHiME Workshop on Speech Processing in Everyday Environments, 2024 | 1 | 2024 |
U2-KWS: Unified Two-Pass Open-Vocabulary Keyword Spotting with Keyword Bias A Zhang, P Zhou, K Huang, Y Zou, M Liu, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 1 | 2023 |
Leveraging Synthetic Speech for CIF-Based Customized Keyword Spotting S Liu, A Zhang, K Huang, L Xie National Conference on Man-Machine Speech Communication, 354-365, 2023 | 1 | 2023 |
Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper T Xu, K Huang, P Guo, Y Zhou, L Huang, H Xue, L Xie arXiv preprint arXiv:2408.10680, 2024 | | 2024 |
The NPU-TEA System for the CHiME-8 NOTSOFAR-1 Challenge K Huang, Y Li, Z Wang, H Wang, W Rao, Z Sun, Z Tang, S Huang, ... Proc. CHiME 2024, 45-48, 2024 | | 2024 |
SEQ-former: A context-enhanced and efficient automatic speech recognition framework Q Meng, M Liu, K Huang, K Wei, L Xie, Z Quan, W Deng, Q Lu, N Jiang, ... Proc. Interspeech 2024, 212-216, 2024 | | 2024 |