A comprehensive survey of deep learning for image captioning

MDZ Hossain, F Sohel, MF Shiratuddin… - ACM Computing Surveys …, 2019 - dl.acm.org
Generating a description of an image is called image captioning. Image captioning requires
recognizing the important objects, their attributes, and their relationships in an image. It also …

Vehicle detection from UAV imagery with deep learning: A review

A Bouguettaya, H Zarzour, A Kechida… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Vehicle detection from unmanned aerial vehicle (UAV) imagery is one of the most important
tasks in a large number of computer vision-based applications. This crucial task needed to …

A survey on automatic image caption generation

S Bai, S An - Neurocomputing, 2018 - Elsevier
Image captioning means automatically generating a caption for an image. As a recently
emerged research area, it is attracting more and more attention. To achieve the goal of …

Visuals to text: A comprehensive review on automatic image captioning

Y Ming, N Hu, C Fan, F Feng… - IEEE/CAA Journal of …, 2022 - researchportal.port.ac.uk
Image captioning refers to automatic generation of descriptive texts according to the visual
content of images. It is a technique integrating multiple disciplines including the computer …

Where to put the image in an image caption generator

M Tanti, A Gatt, KP Camilleri - Natural Language Engineering, 2018 - cambridge.org
When a recurrent neural network (RNN) language model is used for caption generation, the
image information can be fed to the neural network either by directly incorporating it in the …

Evolution of visual data captioning Methods, Datasets, and evaluation Metrics: A comprehensive survey

D Sharma, C Dhiman, D Kumar - Expert Systems with Applications, 2023 - Elsevier
Automatic Visual Captioning (AVC) generates syntactically and semantically correct
sentences by describing important objects, attributes, and their relationships with each other …

[BOOK][B] Computational methods for deep learning: theory, algorithms, and implementations

WQ Yan - 2023 - books.google.com
The first edition of this textbook was published in 2021. Over the past two years, we have
invested in enhancing all aspects of deep learning methods to ensure the book is …

Task-driven dynamic fusion: Reducing ambiguity in video description

X Zhang, K Gao, Y Zhang, D Zhang… - Proceedings of the …, 2017 - openaccess.thecvf.com
Integrating complementary features from multiple channels is expected to solve the
description ambiguity problem in video captioning, whereas inappropriate fusion strategies …

An encoder-decoder based framework for Hindi image caption generation

A Singh, TD Singh, S Bandyopadhyay - Multimedia tools and applications, 2021 - Springer
In recent times, research activity on image caption generation has attracted several
researchers. The present work attempts to address the problem of Hindi image caption …

Image captioning for cultural artworks: a case study on ceramics

B Zheng, F Liu, M Zhang, T Zhou, S Cui, Y Ye… - Multimedia Systems, 2023 - Springer
When viewing ancient artworks, people try to build connections with them to 'read' the correct
messages from the past. A proper descriptive caption is essential for viewers to attain …