Movienet: A holistic dataset for movie understanding
Recent years have seen remarkable advances in visual understanding. However, how to
understand a story-based long video with artistic styles, eg movie, remains challenging. In …
understand a story-based long video with artistic styles, eg movie, remains challenging. In …
Video face clustering with unknown number of clusters
Understanding videos such as TV series and movies requires analyzing who the characters
are and what they are doing. We address the challenging problem of clustering face tracks …
are and what they are doing. We address the challenging problem of clustering face tracks …
The amazing mysteries of the gutter: Drawing inferences between panels in comic book narratives
Visual narrative is often a combination of explicit information and judicious omissions,
relying on the viewer to supply missing details. In comics, most movements in time and …
relying on the viewer to supply missing details. In comics, most movements in time and …
The typed λ-calculus is not elementary recursive
R Statman - 18th Annual Symposium on Foundations of …, 1977 - ieeexplore.ieee.org
Historically, the principal interest in the typed λ-calculus is in connection with Godel's
functional (" Dialectica") interpretation'of intuitionistic arithmetic. However, since the early …
functional (" Dialectica") interpretation'of intuitionistic arithmetic. However, since the early …
Clustering based contrastive learning for improving face representations
A good clustering algorithm can discover natural groupings in data. These groupings, if used
wisely, provide a form of weak supervision for learning representations. In this work, we …
wisely, provide a form of weak supervision for learning representations. In this work, we …
Face, body, voice: Video person-clustering with multiple modalities
A Brown, V Kalogeiton… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
The objective of this work is person-clustering in videos--grouping characters according to
their identity. Previous methods focus on the narrower task of face-clustering, and for the …
their identity. Previous methods focus on the narrower task of face-clustering, and for the …
Self-supervised learning of face representations for video face clustering
Analyzing the story behind TV series and movies often requires understanding who the
characters are and what they are doing. With improving deep face models, this may seem …
characters are and what they are doing. With improving deep face models, this may seem …
Online multi-modal person search in videos
The task of searching certain people in videos has seen increasing potential in real-world
applications, such as video organization and editing. Most existing approaches are devised …
applications, such as video organization and editing. Most existing approaches are devised …
End-to-end face detection and cast grouping in movies using erdos-renyi clustering
We present an end-to-end system for detecting and clustering faces by identity in full-length
movies. Unlike works that start with a predefined set of detected faces, we consider the end …
movies. Unlike works that start with a predefined set of detected faces, we consider the end …
Previously on... From Recaps to Story Summarization
AK Singh, D Srivastava… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We introduce multimodal story summarization by leveraging TV episode recaps-short video
sequences interweaving key story moments from previous episodes to bring viewers up to …
sequences interweaving key story moments from previous episodes to bring viewers up to …