Deep Learning Approaches for Image Captioning: Opportunities, Challenges and Future Potential

A Jamil, K Mahmood, MG Villar, T Prola… - IEEE …, 2024 - ieeexplore.ieee.org
Generative intelligence relies heavily on the integration of vision and language. Much of the
research has focused on image captioning, which involves describing images with …

Toward textual transform coding

T Weissman - IEEE BITS the Information Theory Magazine, 2023 - ieeexplore.ieee.org
Inspired by recent work on compression with and for humans, the success of transform-
based approaches to information processing, and the rise of powerful language-based AI …

Novel concept-based image captioning models using LSTM and multi-encoder transformer architecture

AAE Osman, MAW Shalaby, MM Soliman… - Scientific Reports, 2024 - nature.com
Captioning an image involves using a combination of vision and language models to
describe the image in an expressive and concise sentence. Successful captioning task …

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

W Li, J Li, R Ramos, R Tang, D Elliott - arXiv preprint arXiv:2406.02265, 2024 - arxiv.org
Recent advancements in retrieval-augmented models for image captioning highlight the
significance of retrieving related captions for efficient, lightweight models with strong domain …

Nature-based Bengali Picture Captioning using Global Attention with GRU

FT Zohora, S Biswas, AK Bairagi… - 2024 IEEE 34th …, 2024 - ieeexplore.ieee.org
Automatic picture captioning is a prominent re-search area of artificial intelligence
technology (AI). Its ability to enhance AI models by translating observed data into human …

KNN and the CNN for Handwritten Digit Recognition: A comparative study

TM El-Sahhar, MAW Shalaby - 2023 5th Novel Intelligent and …, 2023 - ieeexplore.ieee.org
The automatic identification of handwritten digits by computers or other devices is referred to
as" handwritten digit recognition technology," and it has a wider range of potential …

Hearing Beyond Sight: Image Captioning and Translation with Speech Synthesis for the Visually Impaired

G Saraf, M Kulkarni, R Sabane, D Oswal… - 2024 Asia Pacific …, 2024 - ieeexplore.ieee.org
The ability to receive and comprehend visual content is crucial for many aspects of human
existence, including education, entertainment, and communication. However, millions of …

[PDF][PDF] Ar-CM-ViMETA: Arabic Image Captioning based on Concept Model and Vision-based Multi-Encoder Transformer Architecture

A Osman, M Shalaby, M Soliman, K Elsayed - researchgate.net
Image captioning is a major artificial intelligence research field that involves visual
interpretation and linguistic description of a corresponding image. Successful image …

[PDF][PDF] Revealing AI-Driven Chest X-Ray Image Captioning Using Blip Transformer

AK Aggarwal - researchgate.net
Misdiagnoses, delays in the preparation of medical diagnosis results, and a lack of
experience in clearly identifying insights from medical scans, such as X-rays or MRI, present …

[PDF][PDF] ПОБУДОВА МОДЕЛІ ОПИСУ ЗОБРАЖЕНЬ ДЛЯ ЗАДАЧІ РОЗПІЗНАВАННЯ ДОРОГОЦІННОСТЕЙ

АС Коваленко - The X International Scientific and Practical Conference … - researchgate.net
Image Captioning—це динамічна галузь, яка поєднує комп'ютерний зір і обробку
природної мови для автоматичного створення текстових описів зображень. Її головне …