Shi-Xiong (Austin) Zhang 个人学术档案

引用次数

	总计	2019 年至今
引用	2669	2295
h 指数	28	27
i10 指数	50	45

660

330

165

495

20092010201120122013201420152016201720182019202020212022202320247 7 11 18 15 50 44 55 62 94 138 242 377 554 648 330

开放获取的出版物数量

查看全部

7 篇文章

1 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Meng YUTencent AI Lab在 tencent.com 的电子邮件经过验证
Yong XuPrincipal Researcher, Tencent America, Bellevue, USA在 tencent.com 的电子邮件经过验证
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA Fellow在 global.tencent.com 的电子邮件经过验证
Rongzhi GuTencent AI Lab在 pku.edu.cn 的电子邮件经过验证
Yifan GongPrincipal Science Manager, Microsoft Corp.在 microsoft.com 的电子邮件经过验证
Mark GalesCambridge University在 eng.cam.ac.uk 的电子邮件经过验证
Jinyu LiPartner Applied Science Manager, Microsoft在 microsoft.com 的电子邮件经过验证
Shinji WatanabeCarnegie Mellon University在 cmu.edu 的电子邮件经过验证
M.W. MakThe Hong Kong Polytechnic University在 polyu.edu.hk 的电子邮件经过验证
Xunying LiuChinese University of Hong Kong在 se.cuhk.edu.hk 的电子邮件经过验证
Yong ZhaoMicrosoft Corporation在 microsoft.com 的电子邮件经过验证
Kaisheng YaoGoogle在 google.com 的电子邮件经过验证
Fahimeh BahmaninezhadMicrosoft在 microsoft.com 的电子邮件经过验证
Jianwei YuTencent AI lab在 tencent.com 的电子邮件经过验证
Kate KnillUniversity of Cambridge在 eng.cam.ac.uk 的电子邮件经过验证
Philip WoodlandProfessor of Information Engineering, Cambridge University Engineering Department在 eng.cam.ac.uk 的电子邮件经过验证
Yajie MiaoCarnegie Mellon University在 cs.cmu.edu 的电子邮件经过验证
Rui Zhaomicrosoft在 microsoft.com 的电子邮件经过验证
Rogier van DalenSamsung AI Center在 samsung.com 的电子邮件经过验证

关注

Shi-Xiong (Austin) Zhang

其他姓名Shi-Xiong Zhang, Shixiong Zhang

Sr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhD

在 capitalone.com 的电子邮件经过验证

Multi-modal Foundation Models ASR Speech Processing NLP


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
An overview of deep-learning-based audio-visual speech enhancement and separation D Michelsanti, ZH Tan, SX Zhang, Y Xu, M Yu, D Yu, J Jensen IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1368-1396, 2021	237	2021
End-to-end attention based text-dependent speaker verification SX Zhang, Z Chen, Y Zhao, J Li, Y Gong 2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016	204	2016
Time Domain Audio Visual Speech Separation J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu Automatic Speech Recognition and Understanding Workshop, ASRU 2019,, 2019	119	2019
ADL-MVDR: All deep learning MVDR beamformer for target speech separation Z Zhang, Y Xu, M Yu, SX Zhang, L Chen, D Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	118	2021
Computerized intelligent assistant for conferences A Diamant, KM Ben-Dor, E Krupka, R Halaly, Y Smolin, I Gurvich, ... US Patent 10,867,610, 2020	105	2020
Multi-modal multi-channel target speech separation R Gu, SX Zhang, Y Xu, L Chen, Y Zou, D Yu IEEE Journal of Selected Topics in Signal Processing 14 (3), 530-541, 2020	105	2020
Investigation of Multilingual Deep Neural Networks for Spoken Term Detection K Knill, MJF Gales, S Rath, P Woodland, SX Zhang ASRU, 2013	102	2013
SIMPLIFYING LONG SHORT-TERM MEMORY ACOUSTIC MODELS FOR FAST TRAINING AND DECODING Y Miao, J Li, Y Wang, S Zhang, Y Gong ICASSP, 2016	100	2016
Audio-visual Recognition of Overlapped speech for the LRS2 dataset J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	99	2020
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu	94	2019
A comprehensive study of speech separation: spectrogram vs waveform separation F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu arXiv preprint arXiv:1905.07497, 2019	90	2019
End-to-end multi-channel speech separation R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu arXiv preprint arXiv:1905.06286, 2019	86	2019
New era for robust speech recognition: exploiting deep learning S Watanabe, M Delcroix, F Metze, JR Hershey, et al. Springer, 2017	64*	2017
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	62	2020
Audio-visual speech separation and dereverberation with a two-stage multimodal network K Tan, Y Xu, SX Zhang, M Yu, D Yu IEEE Journal of Selected Topics in Signal Processing 14 (3), 542-553, 2020	53	2020
Structured SVMs for automatic speech recognition SX Zhang, MJF Gales IEEE Transactions on Audio, Speech, and Language Processing 21 (3), 544-555, 2012	50	2012
FAST-RIR: Fast neural diffuse room impulse response generator A Ratnarajah, SX Zhang, M Yu, Z Tang, D Manocha, D Yu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	48	2022
DEEP NEURAL SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION SX Zhang, C Liu, K Yao, Y Gong ICASSP 2015, 2015	46	2015
Far-Field Location Guided Target Speech Extraction Using End-to-End Speech Recognition Objectives AS Subramanian, C Weng, M Yu, SX Zhang, Y Xu, S Watanabe, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	42	2020
Neural Spatio-Temporal Beamformer for Target Speech Separation Y Xu, M Yu, SX Zhang, L Chen, C Weng, J Liu, D Yu arXiv preprint arXiv:2005.03889, 2020	40	2020

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用