关注
Wangyou Zhang
Wangyou Zhang
Ph.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong University
在 sjtu.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
A comparative study on Transformer vs RNN in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
7862019
Recent Developments on ESPnet Toolkit Boosted by Conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2722021
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition
X Chang, W Zhang, Y Qian, JL Roux, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
1162019
End-To-End Multi-Speaker Speech Recognition With Transformer
X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1012020
ESPnet-SE: End-to-End Speech Enhancement and Separation Toolkit Designed for ASR Integration
C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ...
IEEE Spoken Language Technology Workshop (SLT), 785–792, 2021
752021
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ...
2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021
532021
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
W Zhang, C Boeddeker, S Watanabe, T Nakatani, M Delcroix, K Kinoshita, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
342021
Improving End-to-End Single-Channel Multi-Talker Speech Recognition
W Zhang, X Chang, Y Qian, S Watanabe
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1385-1394, 2020
322020
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming
W Zhang, AS Subramanian, X Chang, S Watanabe, Y Qian
Proc. Interspeech 2020, 324-328, 2020
292020
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation
C Boeddeker, W Zhang, T Nakatani, K Kinoshita, T Ochiai, M Delcroix, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
262021
Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking
W Zhang, Y Zhou, Y Qian
Proc. Interspeech 2019, 2703-2707, 2019
262019
Towards Low-Distortion Multi-Channel Speech Enhancement: The ESPnet-SE Submission to the L3DAS22 Challenge
YJ Lu, S Cornell, X Chang, W Zhang, C Li, Z Ni, ZQ Wang, S Watanabe
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
242022
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions
W Zhang, J Shi, C Li, S Watanabe, Y Qian
2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021
222021
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
YJ Lu, X Chang, C Li, W Zhang, S Cornell, Z Ni, Y Masuyama, B Yan, ...
Proc. Interspeech 2022, 5458-5462, 2022
202022
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data
Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
182023
End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party
W Zhang, X Chang, C Boeddeker, T Nakatani, S Watanabe, Y Qian
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 3173-3188, 2022
152022
Toward Universal Speech Enhancement For Diverse Input Conditions
W Zhang, K Saijo, ZQ Wang, S Watanabe, Y Qian
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-6, 2023
72023
The SJTU System For Multimodal Information Based Speech Processing Challenge 2021
W Wang, X Gong, Y Wu, Z Zhou, C Li, W Zhang, B Han, Y Qian
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
72022
Joint prediction and denoising for large-scale multilingual self-supervised learning
W Chen, J Shi, B Yan, D Berrebbi, W Zhang, Y Peng, X Chang, S Maiti, ...
arXiv preprint arXiv:2309.15317, 2023
62023
Separating Long-Form Speech with Group-Wise Permutation Invariant Training
W Zhang, Z Chen, N Kanda, S Liu, J Li, SE Eskimez, T Yoshioka, X Xiao, ...
Proc. Interspeech 2022, 5383–5387, 2022
62022
系统目前无法执行此操作,请稍后再试。
文章 1–20