查看文章

thecvf.com 中的 [PDF]

TGIF: A new dataset and benchmark on animated GIF description

作者

Yuncheng Li, Yale Song, Liangliang Cao, Joel Tetreault, Larry Goldberg, Alejandro Jaimes, Jiebo Luo

发表日期

2016

研讨会论文

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

页码范围

4641-4650

简介

With the recent popularity of animated GIFs on social media, there is need for ways to index them with rich metadata. To advance research on animated GIF understanding, we collected a new dataset, Tumblr GIF (TGIF), with 100K animated GIFs from Tumblr and 120K natural language descriptions obtained via crowdsourcing. The motivation for this work is to develop a testbed for image sequence description systems, where the task is to generate natural language descriptions for animated GIFs or video clips. To ensure a high quality dataset, we developed a series of novel quality controls to validate free-form text input from crowdworkers. We show that there is unambiguous association between visual content and natural language descriptions in our dataset, making it an ideal benchmark for the visual content captioning task. We perform extensive statistical analyses to compare our dataset to existing image and video description datasets. Next, we provide baseline results on the animated GIF description task, using three representative techniques: nearest neighbor, statistical machine translation, and recurrent neural networks. Finally, we show that models fine-tuned from our animated GIF description dataset can be helpful for automatic movie description.

引用总数

被引用次数：286

2016201720182019202020212022202320245 17 16 35 28 49 50 37 47

学术搜索中的文章

TGIF: A new dataset and benchmark on animated GIF description

Y Li, Y Song, L Cao, J Tetreault, L Goldberg, A Jaimes… - Proceedings of the IEEE Conference on Computer …, 2016

被引用次数：286 相关文章所有 9 个版本