HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips | A Miech, D Zhukov, JB Alayrac, M Tapaswi, I Laptev, J Sivic | International Conference on Computer Vision (ICCV), 2019 | 1142 | 2019 |
MovieQA: Understanding Stories in Movies through Question-Answering | M Tapaswi, Y Zhu, R Stiefelhagen, A Torralba, R Urtasun, S Fidler | Conference on Computer Vision and Pattern Recognition (CVPR), 2016 | 809 | 2016 |
“Knock! Knock! Who is it?” probabilistic person identification in TV-series | M Tapaswi, M Baeuml, R Stiefelhagen | Conference on Computer Vision and Pattern Recognition (CVPR), 2012 | 168 | 2012 |
Situation Recognition with Graph Neural Networks | R Li, M Tapaswi, R Liao, J Jia, R Urtasun, S Fidler | International Conference on Computer Vision (ICCV), 2017 | 164 | 2017 |
MovieGraphs: Towards Understanding Human-Centric Situations from Videos | P Vicol, M Tapaswi, L Castrejon, S Fidler | Conference on Computer Vision and Pattern Recognition (CVPR), 2018 | 159 | 2018 |
Semi-supervised Learning with Constraints for Person Identification in Multimedia Data | M Bäuml, M Tapaswi, R Stiefelhagen | Conference on Computer Vision and Pattern Recognition (CVPR), 2013 | 155 | 2013 |
Airbert: In-domain Pretraining for Vision-and-Language Navigation | PL Guhur, M Tapaswi, S Chen, I Laptev, C Schmid | International Conference on Computer Vision (ICCV), 2021 | 125 | 2021 |
Recovering the Missing Link: Predicting Class-Attribute Associations for Unsupervised Zero-Shot Learning | Z Al-Halah, M Tapaswi, R Stiefelhagen | Conference on Computer Vision and Pattern Recognition (CVPR), 2016 | 122 | 2016 |
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation | S Chen, PL Guhur, M Tapaswi, C Schmid, I Laptev | Conference on Computer Vision and Pattern Recognition (CVPR), 2022 | 118 | 2022 |
Book2Movie: Aligning Video Scenes With Book Chapters | M Tapaswi, M Bäuml, R Stiefelhagen | Conference on Computer Vision and Pattern Recognition (CVPR), 2015 | 106 | 2015 |
StoryGraphs: Visualizing Character Interactions as a Timeline | M Tapaswi, M Bäuml, R Stiefelhagen | Conference on Computer Vision and Pattern Recognition (CVPR), 2014 | 96 | 2014 |
Instruction-driven History-aware Policies for Robotic Manipulations | PL Guhur, S Chen, R Garcia, M Tapaswi, I Laptev, C Schmid | Conference on Robot Learning (CoRL), 2022 | 82 | 2022 |
Video Face Clustering with Unknown Number of Clusters | M Tapaswi, MT Law, S Fidler | International Conference on Computer Vision (ICCV), 2019 | 77 | 2019 |
Learning Interactions and Relationships between Movie Characters | A Kukleva, M Tapaswi, I Laptev | Conference on Computer Vision and Pattern Recognition (CVPR), 2020 | 66 | 2020 |
Self-Supervised Learning of Face Representations for Video Face Clustering | V Sharma, M Tapaswi, MS Sarfraz, R Stiefelhagen | IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2019 | 55 | 2019 |
Clustering based Contrastive Learning for Improving Face Representations | V Sharma, M Tapaswi, MS Sarfraz, R Stiefelhagen | IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2020 | 52 | 2020 |
Total Cluster: A person agnostic clustering method for broadcast videos | M Tapaswi, OM Parkhi, E Rahtu, E Sommerlade, R Stiefelhagen, ... | Indian Conference on Computer Vision Graphics and Image Processing (ICVGIP), 2014 | 49 | 2014 |
Aligning plot synopses to videos for story-based retrieval | M Tapaswi, M Bäuml, R Stiefelhagen | International Journal of Multimedia Information Retrieval 4, 3-16, 2015 | 47 | 2015 |
Naming TV characters by watching and analyzing dialogs | ML Haurilet, M Tapaswi, Z Al-Halah, R Stiefelhagen | IEEE Winter Conference on Applications of Computer Vision (WACV), 2016 | 38 | 2016 |
Language Conditioned Spatial Relation Reasoning for 3D Object Grounding | S Chen, PL Guhur, M Tapaswi, C Schmid, I Laptev | Neural Information Processing Systems (NeurIPS), 2022 | 34 | 2022 |