An overview of deep-learning-based audio-visual speech enhancement and separation D Michelsanti, ZH Tan, SX Zhang, Y Xu, M Yu, D Yu, J Jensen IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1368-1396, 2021 | 242 | 2021 |
ADL-MVDR: All deep learning MVDR beamformer for target speech separation Z Zhang, Y Xu, M Yu, SX Zhang, L Chen, D Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 120 | 2021 |
Time domain audio visual speech separation J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 120 | 2019 |
DurIAN: Duration Informed Attention Network for Speech Synthesis. C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ... Interspeech, 2027-2031, 2020 | 104 | 2020 |
Deep extractor network for target speaker recovery from single channel speech mixtures J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu arXiv preprint arXiv:1807.08974, 2018 | 103 | 2018 |
DurIAN: Duration informed attention network for multimodal synthesis C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ... arXiv preprint arXiv:1909.01700, 2019 | 98 | 2019 |
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information. R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu Interspeech, 4290-4294, 2019 | 95 | 2019 |
A comprehensive study of speech separation: spectrogram vs waveform separation F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu arXiv preprint arXiv:1905.07497, 2019 | 90 | 2019 |
End-to-end multi-channel speech separation R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu arXiv preprint arXiv:1905.06286, 2019 | 80 | 2019 |
Self-supervised text-independent speaker verification using prototypical momentum contrastive learning W Xia, C Zhang, C Weng, M Yu, D Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 74 | 2021 |
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition AS Subramanian, C Weng, S Watanabe, M Yu, D Yu Computer Speech & Language 75, 101360, 2022 | 68 | 2022 |
Enhancing end-to-end multi-channel speech separation via spatial feature learning R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 62 | 2020 |
Audio-visual speech separation and dereverberation with a two-stage multimodal network K Tan, Y Xu, SX Zhang, M Yu, D Yu IEEE Journal of Selected Topics in Signal Processing 14 (3), 542-553, 2020 | 55 | 2020 |
Seq2seq attentional siamese neural networks for text-dependent speaker verification Y Zhang, M Yu, N Li, C Yu, J Cui, D Yu ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 50 | 2019 |
FAST-RIR: Fast neural diffuse room impulse response generator A Ratnarajah, SX Zhang, M Yu, Z Tang, D Manocha, D Yu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 48 | 2022 |
Far-field location guided target speech extraction using end-to-end speech recognition objectives AS Subramanian, C Weng, M Yu, SX Zhang, Y Xu, S Watanabe, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 42 | 2020 |
Neural spatio-temporal beamformer for target speech separation Y Xu, M Yu, SX Zhang, L Chen, C Weng, J Liu, D Yu arXiv preprint arXiv:2005.03889, 2020 | 41 | 2020 |
Joint training of complex ratio mask based beamformer and acoustic model for noise robust ASR Y Xu, C Weng, L Hui, J Liu, M Yu, D Su, D Yu ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 41 | 2019 |
Speaker-aware target speaker enhancement by jointly learning with speaker embedding extraction X Ji, M Yu, C Zhang, D Su, T Yu, X Liu, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 38 | 2020 |
Generalized spatio-temporal RNN beamformer for target speech separation Y Xu, Z Zhang, M Yu, SX Zhang, D Yu arXiv preprint arXiv:2101.01280, 2021 | 37 | 2021 |