Wangyou Zhang: Academic Profile

Citations

	All	Since 2019
Citations	1710	1710
h-index	15	15
i10-index	16	16

Citations per year

Year	Citations
2019	13
2020	159
2021	387
2022	402
2023	511
2024	235

Open-access publications

14 articles

4 articles

Available

Not available

Based on funders' open-access mandates

Co-authors

Shinji Watanabe, Carnegie Mellon University (verified email at cmu.edu)
Yanmin Qian, Professor, Shanghai Jiao Tong University (verified email at sjtu.edu.cn)
Xuankai Chang, Carnegie Mellon University, Student (verified email at andrew.cmu.edu)
Chenda Li, Shanghai Jiao Tong University (verified email at sjtu.edu.cn)
Jing Shi, Institute of Automation, Chinese Academy of Sciences (verified email at ia.ac.cn)
Christoph Boeddeker, Paderborn University (verified email at mail.upb.de)
Aswin Shanmugam Subramanian, Microsoft (verified email at microsoft.com)

Wangyou Zhang

Ph.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong University

Verified email at sjtu.edu.cn

Research interests: Signal Processing, Speech Separation, Speech Enhancement, Robust Speech Recognition


Title	Authors	Venue	Cited by	Year
A comparative study on Transformer vs RNN in speech applications	S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...	2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	786	2019
Recent Developments on ESPnet Toolkit Boosted by Conformer	P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	272	2021
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition	X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe	2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	116	2019
End-To-End Multi-Speaker Speech Recognition With Transformer	X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	101	2020
ESPnet-SE: End-to-End Speech Enhancement and Separation Toolkit Designed for ASR Integration	C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ...	IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021	75	2021
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans	S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ...	2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021	53	2021
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend	W Zhang, C Boeddeker, S Watanabe, T Nakatani, M Delcroix, K Kinoshita, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	34	2021
Improving End-to-End Single-Channel Multi-Talker Speech Recognition	W Zhang, X Chang, Y Qian, S Watanabe	IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1385-1394, 2020	32	2020
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming	W Zhang, AS Subramanian, X Chang, S Watanabe, Y Qian	Proc. Interspeech 2020, 324-328, 2020	29	2020
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation	C Boeddeker, W Zhang, T Nakatani, K Kinoshita, T Ochiai, M Delcroix, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	26	2021
Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking	W Zhang, Y Zhou, Y Qian	Proc. Interspeech 2019, 2703-2707, 2019	26	2019
Towards Low-Distortion Multi-Channel Speech Enhancement: The ESPnet-SE Submission to the L3DAS22 Challenge	YJ Lu, S Cornell, X Chang, W Zhang, C Li, Z Ni, ZQ Wang, S Watanabe	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	24	2022
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions	W Zhang, J Shi, C Li, S Watanabe, Y Qian	2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021	22	2021
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding	YJ Lu, X Chang, C Li, W Zhang, S Cornell, Z Ni, Y Masuyama, B Yan, ...	Proc. Interspeech 2022, 5458-5462, 2022	20	2022
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data	Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...	2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	18	2023
End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party	W Zhang, X Chang, C Boeddeker, T Nakatani, S Watanabe, Y Qian	IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 3173-3188, 2022	15	2022
Toward Universal Speech Enhancement For Diverse Input Conditions	W Zhang, K Saijo, ZQ Wang, S Watanabe, Y Qian	2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-6, 2023	7	2023
The SJTU System For Multimodal Information Based Speech Processing Challenge 2021	W Wang, X Gong, Y Wu, Z Zhou, C Li, W Zhang, B Han, Y Qian	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	7	2022
Joint prediction and denoising for large-scale multilingual self-supervised learning	W Chen, J Shi, B Yan, D Berrebbi, W Zhang, Y Peng, X Chang, S Maiti, ...	arXiv preprint arXiv:2309.15317, 2023	6	2023
Separating Long-Form Speech with Group-Wise Permutation Invariant Training	W Zhang, Z Chen, N Kanda, S Liu, J Li, SE Eskimez, T Yoshioka, X Xiao, ...	Proc. Interspeech 2022, 5383-5387, 2022	6	2022
