Contextualized end-to-end speech recognition with contextual phrase prediction network K Huang, A Zhang, Z Yang, P Guo, B Mu, T Xu, L Xie Proc. Interspeech, 2023 | 15 | 2023 |
The NPU-ASLP system for audio-visual speech recognition in MISP 2022 challenge P Guo, H Wang, B Mu, A Zhang, P Chen ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets X Geng, T Xu, K Wei, B Mu, H Xue, H Wang, Y Li, P Guo, Y Dai, L Li, ... Proc. ISCSLP 2024, 2024 | 4 | 2024 |
MMGER: Multi-Modal and Multi-Granularity Generative Error Correction With LLM for Joint Accent and Speech Recognition B Mu, X Wan, N Zheng, H Zhou, L Xie IEEE Signal Processing Letters 31, 1940-1944, 2024 | 2 | 2024 |
Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies B Mu, P Guo, D Guo, P Zhou, W Chen, L Xie ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
E-chat: Emotion-sensitive Spoken Dialogue System with Large Language Models H Xue, Y Liang, B Mu, S Zhang, Q Chen, L Xie Proc. ISCSLP 2024, 2023 | 1 | 2023 |
The npu-aslp system for audio-visual speech recognition in misp challenge 2022 P Guo, H Wang, B Mu, A Zhang, P Chen Proc. ICASSP, 2023 | 1 | 2023 |
HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models B Mu, K Wei, Q Shao, Y Xu, L Xie arXiv preprint arXiv:2409.19878, 2024 | | 2024 |
The NPU System for DASR Task of CHiME-7 Challenge B Mu, P Guo, H Wang, Y Li, Y Li, P Zhou, W Chen, L Xie Proc. CHiME 2023, 63-66, 2023 | | 2023 |