Arsha Nagrani: Scholar Profile

Citations

	All	Since 2019
Citations	11089	10924
h-index	33	33
i10-index	47	47

Citations per year
Year	2018	2019	2020	2021	2022	2023	2024
Citations	135	487	1240	1632	2404	3106	2026

Public access

21 articles available
0 articles not available

Based on funding mandates

Co-authors

Andrew Zisserman, University of Oxford (verified email at robots.ox.ac.uk)
Cordelia Schmid, Research director INRIA (verified email at inria.fr)
Joon Son Chung, KAIST (verified email at kaist.ac.kr)
Chen Sun, Assistant Professor, Brown University (verified email at brown.edu)
Andrea Vedaldi, University of Oxford (verified email at robots.ox.ac.uk)
Dima Damen, Professor, University of Bristol and Google DeepMind (verified email at bristol.ac.uk)
Evangelos Kazakos, Czech Technical University in Prague (verified email at cvut.cz)
Rahul Sukthankar, Google Research (verified email at google.com)
Samuel Albanie, Assistant Professor, University of Cambridge (verified email at cam.ac.uk)


Arsha Nagrani

Research Scientist, Google

Verified email at google.com

Machine learning, Computer Vision, Speech Technology, Deep Learning


Title	Authors	Venue	Cited by	Year
Voxceleb: a large-scale speaker identification dataset	A Nagrani, JS Chung, A Zisserman	arXiv preprint arXiv:1706.08612, 2017	2493	2017
Voxceleb2: Deep speaker recognition	JS Chung, A Nagrani, A Zisserman	arXiv preprint arXiv:1806.05622, 2018	2352	2018
Frozen in time: A joint video and image encoder for end-to-end retrieval	M Bain, A Nagrani, G Varol, A Zisserman	Proceedings of the IEEE/CVF international conference on computer vision …, 2021	841	2021
Voxceleb: Large-scale speaker verification in the wild	A Nagrani, JS Chung, W Xie, A Zisserman	Computer Speech & Language 60, 101027, 2020	662	2020
Attention bottlenecks for multimodal fusion	A Nagrani, S Yang, A Arnab, A Jansen, C Schmid, C Sun	Advances in neural information processing systems 34, 14200-14213, 2021	512	2021
Use what you have: Video retrieval using representations from collaborative experts	Y Liu, S Albanie, A Nagrani, A Zisserman	arXiv preprint arXiv:1907.13487, 2019	414	2019
Utterance-level aggregation for speaker recognition in the wild	W Xie, A Nagrani, JS Chung, A Zisserman	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	400	2019
Epic-fusion: Audio-visual temporal binding for egocentric action recognition	E Kazakos, A Nagrani, A Zisserman, D Damen	Proceedings of the IEEE/CVF international conference on computer vision …, 2019	373	2019
Emotion recognition in speech using cross-modal transfer in the wild	S Albanie, A Nagrani, A Vedaldi, A Zisserman	Proceedings of the 26th ACM international conference on Multimedia, 292-301, 2018	312	2018
Seeing voices and hearing faces: Cross-modal biometric matching	A Nagrani, S Albanie, A Zisserman	Proceedings of the IEEE conference on computer vision and pattern …, 2018	236	2018
Chimpanzee face recognition from videos in the wild using deep learning	D Schofield, A Nagrani, A Zisserman, M Hayashi, T Matsuzawa, D Biro, ...	Science advances 5 (9), eaaw0736, 2019	193	2019
Localizing visual sounds the hard way	H Chen, W Xie, T Afouras, A Nagrani, A Vedaldi, A Zisserman	Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	172	2021
End-to-end generative pretraining for multimodal video captioning	PH Seo, A Nagrani, A Arnab, C Schmid	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	162	2022
Learnable pins: Cross-modal embeddings for person identity	A Nagrani, S Albanie, A Zisserman	Proceedings of the European conference on computer vision (ECCV), 71-88, 2018	148	2018
Spot the conversation: speaker diarisation in the wild	JS Chung, J Huh, A Nagrani, T Afouras, A Zisserman	arXiv preprint arXiv:2007.01216, 2020	144	2020
Vid2seq: Large-scale pretraining of a visual language model for dense video captioning	A Yang, A Nagrani, PH Seo, A Miech, J Pont-Tuset, I Laptev, J Sivic, ...	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	138	2023
Cough against covid: Evidence of covid-19 signature in cough sounds	P Bagad, A Dalmia, J Doshi, A Nagrani, P Bhamare, A Mahale, S Rane, ...	arXiv preprint arXiv:2009.08790, 2020	132	2020
Disentangled speech embeddings using cross-modal self-supervision	A Nagrani, JS Chung, S Albanie, A Zisserman	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	100	2020
Pali-x: On scaling up a multilingual vision and language model	X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...	arXiv preprint arXiv:2305.18565, 2023	99	2023
Condensed movies: Story based retrieval with contextual embeddings	M Bain, A Nagrani, A Brown, A Zisserman	Proceedings of the Asian Conference on Computer Vision, 2020	86	2020
