Wavlm: Large-scale self-supervised pre-training for full stack speech processing S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022 | 1341 | 2022 |
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement Y Hu, Y Liu, S Lv, M Xing, S Zhang, Y Fu, J Wu, B Zhang, L Xie Proc. Interspeech 2020, 2472--2476, 2020 | 632 | 2020 |
Continuous Speech Separation: Dataset and Analysis Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 210 | 2020 |
Continuous speech separation with conformer S Chen, Y Wu, Z Chen, J Wu, J Li, T Yoshioka, C Wang, S Liu, M Zhou ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 134 | 2021 |
Time Domain Audio Visual Speech Separation J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 125 | 2019 |
Audio-visual Recognition of Overlapped Speech for the LRS2 dataset J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 102 | 2020 |
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu Interspeech 2019, 4574--4578, 2019 | 91 | 2019 |
End-to-end multi-channel speech separation R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu arXiv preprint arXiv:1905.06286, 2019 | 82 | 2019 |
Unispeech-sat: Universal speech representation learning with speaker aware pre-training S Chen, Y Wu, C Wang, Z Chen, Z Chen, S Liu, J Wu, Y Qian, F Wei, J Li, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 80 | 2022 |
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario Y Fu, L Cheng, S Lv, Y Jv, Y Kong, Z Chen, Y Hu, L Xie, J Wu, H Bu, X Xu, ... Proc. Interspeech 2021, 3665--3669, 2021 | 68 | 2021 |
On decoder-only architecture for speech-to-text and large language model integration J Wu, Y Gaur, Z Chen, L Zhou, Y Zhu, T Wang, J Li, S Liu, B Ren, L Liu, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 55 | 2023 |
Streaming multi-talker ASR with token-level serialized output training N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ... Proc. Interspeech 2022, 521--525, 2022 | 48 | 2022 |
Channel-Wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music H Liu, L Xie, J Wu, G Yang Proc. Interspeech 2020, 1241--1245, 2020 | 28 | 2020 |
Desnet: A multi-channel network for simultaneous speech dereverberation, enhancement and separation Y Fu, J Wu, Y Hu, M Xing, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 857-864, 2021 | 25 | 2021 |
An End-to-end Architecture of Online Multi-channel Speech Separation J Wu, Z Chen, J Li, T Yoshioka, Z Tan, E Lin, Y Luo, L Xie Proc. Interspeech 2020, 3066--3070, 2020 | 24 | 2020 |
Streaming speaker-attributed ASR with token-level speaker embeddings N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ... Proc. Interspeech 2022, 521--525, 2022 | 18 | 2022 |
Investigation of Practical Aspects of Single Channel Speech Separation for ASR J Wu, Z Chen, S Chen, Y Wu, T Yoshioka, N Kanda, S Liu, J Li Proc. Interspeech 2021, 3066--3070, 2021 | 16 | 2021 |
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge L Zhang, J Wu, L Xie Proc. Interspeech 2020, 3471--3475, 2020 | 14 | 2020 |
VarArray meets t-SOT: Advancing the state of the art of streaming distant conversational speech recognition N Kanda, J Wu, X Wang, Z Chen, J Li, T Yoshioka ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 13 | 2023 |
Ultra Fast Speech Separation Model with Teacher Student Learning S Chen, Y Wu, Z Chen, J Wu, T Yoshioka, S Liu, J Li, X Yu Proc. Interspeech 2021, 3026--3030, 2022 | 13 | 2022 |