A parallel-fusion RNN-LSTM architecture for image caption generation

MDZ Hossain, F Sohel, MF Shiratuddin… - ACM Computing Surveys …, 2019 - dl.acm.org

Generating a description of an image is called image captioning. Image captioning requires
recognizing the important objects, their attributes, and their relationships in an image. It also …

被引用次数：922 相关文章所有 8 个版本

Vehicle detection from UAV imagery with deep learning: A review

A Bouguettaya, H Zarzour, A Kechida… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Vehicle detection from unmanned aerial vehicle (UAV) imagery is one of the most important
tasks in a large number of computer vision-based applications. This crucial task needed to …

被引用次数：111 相关文章所有 3 个版本

[PDF] researchgate.net

A survey on automatic image caption generation

S Bai, S An - Neurocomputing, 2018 - Elsevier

Image captioning means automatically generating a caption for an image. As a recently
emerged research area, it is attracting more and more attention. To achieve the goal of …

被引用次数：268 相关文章所有 4 个版本

[PDF] port.ac.uk

Visuals to text: A comprehensive review on automatic image captioning

Y Ming, N Hu, C Fan, F Feng… - IEEE/CAA Journal of …, 2022 - researchportal.port.ac.uk

Image captioning refers to automatic generation of descriptive texts according to the visual
content of images. It is a technique integrating multiple disciplines including the computer …

被引用次数：39 相关文章所有 6 个版本

[PDF] arxiv.org

Where to put the image in an image caption generator

M Tanti, A Gatt, KP Camilleri - Natural Language Engineering, 2018 - cambridge.org

When a recurrent neural network (RNN) language model is used for caption generation, the
image information can be fed to the neural network either by directly incorporating it in the …

被引用次数：126 相关文章所有 15 个版本

Evolution of visual data captioning Methods, Datasets, and evaluation Metrics: A comprehensive survey

D Sharma, C Dhiman, D Kumar - Expert Systems with Applications, 2023 - Elsevier

Abstract Automatic Visual Captioning (AVC) generates syntactically and semantically correct
sentences by describing important objects, attributes, and their relationships with each other …

被引用次数：11 相关文章所有 2 个版本

[PDF] aut.ac.nz

[图书][B] Computational methods for deep learning: theory, algorithms, and implementations

WQ Yan - 2023 - books.google.com

The first edition of this textbook was published in 2021. Over the past two years, we have
invested in enhancing all aspects of deep learning methods to ensure the book is …

被引用次数：16 相关文章所有 5 个版本

[PDF] thecvf.com

Task-driven dynamic fusion: Reducing ambiguity in video description

X Zhang, K Gao, Y Zhang, D Zhang… - Proceedings of the …, 2017 - openaccess.thecvf.com

Integrating complementary features from multiple channels is expected to solve the
description ambiguity problem in video captioning, whereas inappropriate fusion strategies …

被引用次数：83 相关文章所有 3 个版本

[PDF] academia.edu

An encoder-decoder based framework for hindi image caption generation

A Singh, TD Singh, S Bandyopadhyay - Multimedia tools and applications, 2021 - Springer

In recent times, research activity on image caption generation has attracted several
researchers. The present work attempt to address the problem of Hindi image caption …

被引用次数：25 相关文章所有 5 个版本

Image captioning for cultural artworks: a case study on ceramics

B Zheng, F Liu, M Zhang, T Zhou, S Cui, Y Ye… - Multimedia Systems, 2023 - Springer

When viewing ancient artworks, people try to build connections with them to 'read'the correct
messages from the past. A proper descriptive caption is essential for viewers to attain …

被引用次数：3 相关文章所有 2 个版本