Naming TV characters by watching and analyzing dialogs

Q Huang, Y Xiong, A Rao, J Wang, D Lin - Computer Vision–ECCV 2020 …, 2020 - Springer

Recent years have seen remarkable advances in visual understanding. However, how to
understand a story-based long video with artistic styles, e.g., a movie, remains challenging. In …

Cited by 221 · Related articles · All 4 versions

Video face clustering with unknown number of clusters

M Tapaswi, MT Law, S Fidler - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com

Understanding videos such as TV series and movies requires analyzing who the characters
are and what they are doing. We address the challenging problem of clustering face tracks …

Cited by 77 · Related articles · All 12 versions

The amazing mysteries of the gutter: Drawing inferences between panels in comic book narratives

M Iyyer, V Manjunatha, A Guha… - Proceedings of the …, 2017 - openaccess.thecvf.com

Visual narrative is often a combination of explicit information and judicious omissions,
relying on the viewer to supply missing details. In comics, most movements in time and …

Cited by 112 · Related articles · All 9 versions

The typed λ-calculus is not elementary recursive

R Statman - 18th Annual Symposium on Foundations of …, 1977 - ieeexplore.ieee.org

Historically, the principal interest in the typed λ-calculus is in connection with Gödel's
functional ("Dialectica") interpretation of intuitionistic arithmetic. However, since the early …

Cited by 241 · Related articles · All 11 versions

Clustering based contrastive learning for improving face representations

V Sharma, M Tapaswi, MS Sarfraz… - 2020 15th IEEE …, 2020 - ieeexplore.ieee.org

A good clustering algorithm can discover natural groupings in data. These groupings, if used
wisely, provide a form of weak supervision for learning representations. In this work, we …

Cited by 52 · Related articles · All 8 versions

Face, body, voice: Video person-clustering with multiple modalities

A Brown, V Kalogeiton… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

The objective of this work is person-clustering in videos--grouping characters according to
their identity. Previous methods focus on the narrower task of face-clustering, and for the …

Cited by 29 · Related articles · All 17 versions

Self-supervised learning of face representations for video face clustering

V Sharma, M Tapaswi, MS Sarfraz… - 2019 14th IEEE …, 2019 - ieeexplore.ieee.org

Analyzing the story behind TV series and movies often requires understanding who the
characters are and what they are doing. With improving deep face models, this may seem …

Cited by 55 · Related articles · All 10 versions

Online multi-modal person search in videos

J Xia, A Rao, Q Huang, L Xu, J Wen, D Lin - Computer Vision–ECCV 2020 …, 2020 - Springer

The task of searching certain people in videos has seen increasing potential in real-world
applications, such as video organization and editing. Most existing approaches are devised …

Cited by 32 · Related articles · All 4 versions

End-to-end face detection and cast grouping in movies using Erdős-Rényi clustering

SY Jin, H Su, C Stauffer… - Proceedings of the …, 2017 - openaccess.thecvf.com

We present an end-to-end system for detecting and clustering faces by identity in full-length
movies. Unlike works that start with a predefined set of detected faces, we consider the end …

Cited by 54 · Related articles · All 7 versions

Previously on... From Recaps to Story Summarization

AK Singh, D Srivastava… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

We introduce multimodal story summarization by leveraging TV episode recaps, short video
sequences interweaving key story moments from previous episodes to bring viewers up to …