Storytelling with image data: A systematic review and comparative analysis of methods and tools

F Lotfi, A Beheshti, H Farhood, M Pooshideh… - Algorithms, 2023 - mdpi.com
In our digital age, data are generated constantly from public and private sources, social
media platforms, and the Internet of Things. A significant portion of this information comes in …

Image captioning using inception V3 transfer learning model

S Degadwala, D Vyas, H Biswas… - 2021 6th …, 2021 - ieeexplore.ieee.org
As artificial intelligence has grown rapidly in recent years, picture captioning has attracted
the interest of numerous experts, which has become a fascinating and challenging …

Image captioning with novel topics guidance and retrieval-based topics re-weighting

M Al-Qatf, X Wang, A Hawbani… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Topic modelling (TM) has shown significant progress in boosting the effectiveness of image
captioning in the last few years. Although important improvements have been shown in …

Optimal transformers based image captioning using beam search

A Shetty, Y Kale, Y Patil, R Patil, S Sharma - Multimedia Tools and …, 2024 - Springer
Image Captioning is the process of generating textual descriptions of given images. It
encompasses two major fields of deep learning, computer vision, and natural language …

Topic-guided abstractive multimodal summarization with multimodal output

S Rafi, R Das - Neural Computing and Applications, 2023 - Springer
Summarization is a technique that produces condensed text from large text documents by
using different deep-learning techniques. Over the past few years, abstractive …

NumCap: a number-controlled multi-caption image captioning network

A Abdussalam, Z Ye, A Hawbani, M Al-Qatf… - ACM Transactions on …, 2023 - dl.acm.org
Image captioning is a promising task that attracted researchers in the last few years. Existing
image captioning models are primarily trained to generate one caption per image. However …

Novel concept-based image captioning models using LSTM and multi-encoder transformer architecture

AAE Osman, MAW Shalaby, MM Soliman… - Scientific Reports, 2024 - nature.com
Captioning an image involves using a combination of vision and language models to
describe the image in an expressive and concise sentence. Successful captioning task …

基于Se-ResNet50 特征编码器的公共环境图像描述生成.

唐渔, 何志琴, 周宇辉, 吴钦木… - Application Research of …, 2023 - search.ebscohost.com
针对传统公共环境图像描述模型中编码器—解码器结构在编码过程中特征提取能力不足以及
解码过程中上下文信息丢失严重的问题, 提出了一种基于se Resnet50 与M LsTM …

Image caption generation using deep neural networks

J Sudhakar, VV Iyer, ST Sharmila - … International Conference for …, 2022 - ieeexplore.ieee.org
In recent years, computer vision has made significant progress, primarily in the field of image
classification and object detection and recognition. Describing the image content …

A linear sub-structure with co-variance shift for image captioning

S Rafi, R Das - 2021 8th International Conference on Soft …, 2021 - ieeexplore.ieee.org
Automatic description of image has attracted many researchers in the field of computer
vision for captioning the image in artificial intelligence which connects with Natural …