Pan Zexu 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	506	506
h 指数	10	10
i10 指数	10	10

200

100

150

202120222023202421 111 197 172

开放获取的出版物数量

查看全部

7 篇文章

1 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Haizhou LiThe Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China; NUS, Singapore在 u.nus.edu 的电子邮件经过验证
Tao RuijieResearch Fellow, National University of Singapore在 u.nus.edu 的电子邮件经过验证
Xinyuan QianAssociate Professor, University of Science and Technology Beijing, China在 nus.edu.sg 的电子邮件经过验证
Meng GeTianjin University; CUHK-Shenzhen; National University of Singapore在 nus.edu.sg 的电子邮件经过验证
Jonathan Le RouxMERL在 merl.com 的电子邮件经过验证
Chenglin XuKuaishou Technology, China在 kuaishou.com 的电子邮件经过验证
Zhaojie LuoOsaka University Assistant Professor在 irl.sys.es.osaka-u.ac.jp 的电子邮件经过验证

关注

Pan Zexu

Alibaba; MERL; National University of Singapore

在 u.nus.edu 的电子邮件经过验证 - 首页

Multi-media Speaker extraction


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Is someone speaking? exploring long-term temporal features for audio-visual active speaker detection R Tao, Z Pan, RK Das, X Qian, MZ Shou, H Li Proceedings of the 29th ACM international conference on multimedia, 3927-3935, 2021	159	2021
Multi-modal Attention for Speech Emotion Recognition Z Pan, Z Luo, J Yang, H Li Proc. Interspeech 2020, 364--368, 2020	81	2020
Muse: Multi-modal target speaker extraction with visual cues Z Pan, R Tao, C Xu, H Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	44	2021
Selective listening by synchronizing speech with lips Z Pan, R Tao, C Xu, H Li IEEE/ACM Transactions on Audio, Speech and Language Processing 30, 1650 - 1664, 2022	41	2022
Multi-target DoA estimation with an audio-visual fusion mechanism X Qian, M Madhavi, Z Pan, J Wang, H Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	39	2021
USEV: Universal speaker extraction with visual cue Z Pan, M Ge, H Li IEEE/ACM Transactions on Audio, Speech and Language Processing 30, 3032 - 3045, 2022	36	2022
Speaker Extraction with Co-Speech Gestures Cue Z Pan, X Qian, H Li IEEE Signal Processing Letters 29, 1467 - 1471, 2022	23	2022
Target active speaker detection with audio-visual cues Y Jiang, R Tao, Z Pan, H Li arXiv preprint arXiv:2305.12831, 2023	13	2023
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction Z Pan, M Ge, H Li Proc. Interspeech 2022, 2022	11	2022
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network J Li, M Ge, Z Pan, L Wang, J Dang Proc. Interspeech 2022, 906-910, 2022	10	2022
Time-domain speech separation networks with graph encoding auxiliary T Wang, Z Pan, M Ge, Z Yang, H Li IEEE Signal Processing Letters 30, 110-114, 2023	7	2023
NeuroHeed: Neuro-steered speaker extraction using EEG signals Z Pan, M Borsdorf, S Cai, T Schultz, H Li arXiv preprint arXiv:2307.14303, 2023	6	2023
Is someone speaking R Tao, Z Pan, RK Das, X Qian, MZ Shou, H Li Proceedings of the 29th ACM International Conference on Multimedia, Oct, 2021	6	2021
Rethinking the visual cues in audio-visual speaker extraction J Li, M Ge, Z Pan, R Cao, L Wang, J Dang, S Zhang arXiv preprint arXiv:2306.02625, 2023	5	2023
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting Z Pan, W Wang, M Borsdorf, H Li ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2022	4	2022
NeuroHeed+: Improving neuro-steered speaker extraction with joint auditory attention detection Z Pan, G Wichern, FG Germain, S Khurana, J Le Roux ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	3	2024
Generation or Replication: Auscultating Audio Latent Diffusion Models D Bralios, G Wichern, FG Germain, Z Pan, S Khurana, C Hori, J Le Roux ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	3	2024
Restoring Speaking Lips from Occlusion for Audio-Visual Speech Recognition J Wang, Z Pan, M Zhang, RT Tan, H Li Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 19144 …, 2024	3	2024
Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction Z Pan, G Wichern, Y Masuyama, FG Germain, S Khurana, C Hori, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	3	2023
Towards end-to-end speaker diarization in the wild Z Pan, G Wichern, FG Germain, A Subramanian, J Le Roux arXiv preprint arXiv: 2211.01299, 2022	3	2022

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用