Image captioning with semantic attention

Q You, H Jin, Z Wang, C Fang… - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
Automatically generating a natural language description of an image has attracted interests
recently both because of its importance in practical applications and because it connects two …

Towards diverse and natural image descriptions via a conditional gan

B Dai, S Fidler, R Urtasun, D Lin - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Despite the substantial progress in recent years, the problem of image captioning remains
far from being satisfactorily tackled. Sentences produced by existing methods, eg those …

Microsoft coco captions: Data collection and evaluation server

X Chen, H Fang, TY Lin, R Vedantam, S Gupta… - arXiv preprint arXiv …, 2015 - arxiv.org
In this paper we describe the Microsoft COCO Caption dataset and evaluation server. When
completed, the dataset will contain over one and a half million captions describing over …

A comprehensive survey on image captioning: from handcrafted to deep learning-based techniques, a taxonomy and open research issues

H Sharma, D Padha - Artificial Intelligence Review, 2023 - Springer
Image captioning is a pretty modern area of the convergence of computer vision and natural
language processing and is widely used in a range of applications such as multi-modal …

Deep reinforcement learning-based image captioning with embedding reward

Z Ren, X Wang, N Zhang, X Lv… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Image captioning is a challenging problem owing to the complexity in understanding the
image content and diverse ways of describing it in natural language. Recent advances in …

Are you talking to a machine? dataset and methods for multilingual image question

H Gao, J Mao, J Zhou, Z Huang… - Advances in neural …, 2015 - proceedings.neurips.cc
In this paper, we present the mQA model, which is able to answer questions about the
content of an image. The answer can be a sentence, a phrase or a single word. Our model …

From methods to datasets: A survey on Image-Caption Generators

L Agarwal, B Verma - Multimedia Tools and Applications, 2024 - Springer
Abstract Image-Caption Generator is a popular Artificial Intelligence research tool that works
with image comprehension and language definition. Creating well-structured sentences …

Exploring nearest neighbor approaches for image captioning

J Devlin, S Gupta, R Girshick, M Mitchell… - arXiv preprint arXiv …, 2015 - arxiv.org
We explore a variety of nearest neighbor baseline approaches for image captioning. These
approaches find a set of nearest neighbor images in the training set from which a caption …

Learning like a child: Fast novel visual concept learning from sentence descriptions of images

J Mao, X Wei, Y Yang, J Wang… - Proceedings of the …, 2015 - openaccess.thecvf.com
In this paper, we address the task of learning novel visual concepts, and their interactions
with other concepts, from a few images with sentence descriptions. Using linguistic context …

A survey on deep neural network-based image captioning

X Liu, Q Xu, N Wang - The Visual Computer, 2019 - Springer
Image captioning is a hot topic of image understanding, and it is composed of two natural
parts (“look” and “language expression”) which correspond to the two most important fields …