Using synthetic audio to improve the recognition of out-of-vocabulary words in end-to-end ASR systems X Zheng, Y Liu, D Gunceler, D Willett ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 82 | 2021 |
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition X Zheng, C Zhang, PC Woodland 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 56 | 2021 |
Multi-turn RNN-T for streaming recognition of multi-party speech I Sklyar, A Piunova, X Zheng, Y Liu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 21 | 2022 |
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription X Zheng, C Zhang, PC Woodland Interspeech 2022, 2022 | 14 | 2022 |
Can Contextual Biasing Remain Effective with Whisper and GPT-2? G Sun, X Zheng, C Zhang, PC Woodland arXiv preprint arXiv:2306.01942, 2023 | 13 | 2023 |
Self-Supervised Learning-Based Source Separation for Meeting Data Y Li, X Zheng, PC Woodland ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Conditional Diffusion Model for Target Speaker Extraction T Nguyen, G Sun, X Zheng, C Zhang, PC Woodland arXiv preprint arXiv:2310.04791, 2023 | 1 | 2023 |
SOT Triggered Neural Clustering for Speaker Attributed ASR X Zheng, G Sun, C Zhang, PC Woodland Interspeech 2024, 2024 | | 2024 |
The University of Cambridge System for the CHiME-7 DASR Task K Deng, X Zheng, PC Woodland Proc. CHiME 2023, 73-76, 2023 | | 2023 |
Combining Diverse Neural Network Language Models for Speech Recognition X Zheng | | |