A comprehensive review of image caption generation
Image Caption generation is the process of generating textual descriptions of the images by
using natural language processing and computer vision. This review explores the …
CNN-Transformer based Encoder-Decoder Model for Nepali Image Captioning
Many image captioning tasks have been carried out in recent years, the majority of the work
being for the English language. A few research works have also been carried out for Hindi …
Enhancing image caption generation through context-aware attention mechanism
Image captioning, the process of generating natural language descriptions based on image
content, has garnered attention in AI research for its implications in scene understanding …
Note: Towards devising an efficient vqa in the bengali language
Designing and implementing visual question answering tasks using Bengali datasets and
native VQA based smart systems are important, as a huge number of people speak in …
Nature-based Bengali Picture Captioning using Global Attention with GRU
Automatic picture captioning is a prominent research area of artificial intelligence
technology (AI). Its ability to enhance AI models by translating observed data into human …
Attention Based Encoder Decoder Model for Video Captioning in Nepali (2023)
K Parajuli, SR Joshi - arXiv preprint arXiv:2312.07418, 2023 - arxiv.org
Video captioning in Nepali, a language written in the Devanagari script, presents a unique
challenge due to the lack of existing academic work in this domain. This work develops a …
CapNet: An Encoder-Decoder based Neural Network Model for Automatic Bangla Image Caption Generation
Automatic caption generation from images has become an active research topic in the field
of Computer Vision (CV) and Natural Language Processing (NLP). Machine generated …
Bengali Image Captioning Using Vision Encoder-Decoder Model
TI Ishan, A Al Noman, R Rokib… - … on Computer and …, 2023 - ieeexplore.ieee.org
Our research focuses on Bangla Image Captioning which involves generating descriptive
captions for the images. To address this task, we propose a new approach using the Vision …
Automatic Bengali Image Captioning using EfficientNet-Transformer Network
The task of image captioning is a complex process that involves generating textual
descriptions for images. Much of the research done in this particular domain, especially …
Bangla Handwritten Character and Words Recognition-based on the YOLOv5 Algorithm
P Haque, U Salma, R Chowdhury - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Today, handwritten character recognition (HCR) is a significant research project issue in the
Bangla language. One of the most well-known fundamental issues in Artificial Intelligence …