A comprehensive review of image caption generation

O Arshi, P Dadure - Multimedia Tools and Applications, 2024 - Springer
Image Caption generation is the process of generating textual descriptions of the images by
using natural language processing and computer vision. This review explores the …

CNN-Transformer based Encoder-Decoder Model for Nepali Image Captioning

B Subedi, BK Bal - … of the 19th International Conference on …, 2022 - aclanthology.org
Many image captioning tasks have been carried out in recent years, the majority of the work
being for the English language. A few research works have also been carried out for Hindi …

Enhancing image caption generation through context-aware attention mechanism

A Bhuiyan, E Hossain, MM Hoque, MAA Dewan - Heliyon, 2024 - cell.com
Image captioning, the process of generating natural language descriptions based on image
content, has garnered attention in AI research for its implications in scene understanding …

Note: Towards Devising an Efficient VQA in the Bengali Language

SMS Islam, RA Auntor, M Islam, MYH Anik… - Proceedings of the 5th …, 2022 - dl.acm.org
Designing and implementing visual question answering tasks using Bengali datasets and
native VQA based smart systems are important, as a huge number of people speak in …

Nature-based Bengali Picture Captioning using Global Attention with GRU

FT Zohora, S Biswas, AK Bairagi… - 2024 IEEE 34th …, 2024 - ieeexplore.ieee.org
Automatic picture captioning is a prominent research area of artificial intelligence
technology (AI). Its ability to enhance AI models by translating observed data into human …

Attention Based Encoder Decoder Model for Video Captioning in Nepali (2023)

K Parajuli, SR Joshi - arXiv preprint arXiv:2312.07418, 2023 - arxiv.org
Video captioning in Nepali, a language written in the Devanagari script, presents a unique
challenge due to the lack of existing academic work in this domain. This work develops a …

CapNet: An Encoder-Decoder based Neural Network Model for Automatic Bangla Image Caption Generation

R Rahman, H Murad, NN Rahman… - … Journal of Advanced …, 2022 - researchgate.net
Automatic caption generation from images has become an active research topic in the field
of Computer Vision (CV) and Natural Language Processing (NLP). Machine generated …

Bengali Image Captioning Using Vision Encoder-Decoder Model

TI Ishan, A Al Noman, R Rokib… - … on Computer and …, 2023 - ieeexplore.ieee.org
Our research focuses on Bangla Image Captioning which involves generating descriptive
captions for the images. To address this task, we propose a new approach using the Vision …

Automatic Bengali Image Captioning using EfficientNet-Transformer Network

MK Kabir, A Labonno, S Amin, F Tahsin… - 2023 22nd …, 2023 - ieeexplore.ieee.org
The task of image captioning is a complex process that involves generating textual
descriptions for images. Much of the research done in this particular domain, especially …

Bangla Handwritten Character and Words Recognition-based on the YOLOv5 Algorithm

P Haque, U Salma, R Chowdhury - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Today, handwritten character recognition (HCR) is a significant research project issue in the
Bangla language. One of the most well-known fundamental issues in Artificial Intelligence …