Movienet: A holistic dataset for movie understanding

Q Huang, Y Xiong, A Rao, J Wang, D Lin - Computer Vision–ECCV 2020 …, 2020 - Springer
Recent years have seen remarkable advances in visual understanding. However, how to
understand a story-based long video with artistic styles, eg movie, remains challenging. In …

Video face clustering with unknown number of clusters

M Tapaswi, MT Law, S Fidler - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Understanding videos such as TV series and movies requires analyzing who the characters
are and what they are doing. We address the challenging problem of clustering face tracks …

The amazing mysteries of the gutter: Drawing inferences between panels in comic book narratives

M Iyyer, V Manjunatha, A Guha… - Proceedings of the …, 2017 - openaccess.thecvf.com
Visual narrative is often a combination of explicit information and judicious omissions,
relying on the viewer to supply missing details. In comics, most movements in time and …

The typed λ-calculus is not elementary recursive

R Statman - 18th Annual Symposium on Foundations of …, 1977 - ieeexplore.ieee.org
Historically, the principal interest in the typed λ-calculus is in connection with Godel's
functional (" Dialectica") interpretation'of intuitionistic arithmetic. However, since the early …

Clustering based contrastive learning for improving face representations

V Sharma, M Tapaswi, MS Sarfraz… - 2020 15th IEEE …, 2020 - ieeexplore.ieee.org
A good clustering algorithm can discover natural groupings in data. These groupings, if used
wisely, provide a form of weak supervision for learning representations. In this work, we …

Face, body, voice: Video person-clustering with multiple modalities

A Brown, V Kalogeiton… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
The objective of this work is person-clustering in videos--grouping characters according to
their identity. Previous methods focus on the narrower task of face-clustering, and for the …

Self-supervised learning of face representations for video face clustering

V Sharma, M Tapaswi, MS Sarfraz… - 2019 14th IEEE …, 2019 - ieeexplore.ieee.org
Analyzing the story behind TV series and movies often requires understanding who the
characters are and what they are doing. With improving deep face models, this may seem …

Online multi-modal person search in videos

J Xia, A Rao, Q Huang, L Xu, J Wen, D Lin - Computer Vision–ECCV 2020 …, 2020 - Springer
The task of searching certain people in videos has seen increasing potential in real-world
applications, such as video organization and editing. Most existing approaches are devised …

End-to-end face detection and cast grouping in movies using erdos-renyi clustering

SY Jin, H Su, C Stauffer… - Proceedings of the …, 2017 - openaccess.thecvf.com
We present an end-to-end system for detecting and clustering faces by identity in full-length
movies. Unlike works that start with a predefined set of detected faces, we consider the end …

Previously on... From Recaps to Story Summarization

AK Singh, D Srivastava… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We introduce multimodal story summarization by leveraging TV episode recaps-short video
sequences interweaving key story moments from previous episodes to bring viewers up to …