Topic-based image caption generation

F Lotfi, A Beheshti, H Farhood, M Pooshideh… - Algorithms, 2023 - mdpi.com

In our digital age, data are generated constantly from public and private sources, social
media platforms, and the Internet of Things. A significant portion of this information comes in …

被引用次数：23 相关文章所有 7 个版本

Image captioning using inception V3 transfer learning model

S Degadwala, D Vyas, H Biswas… - 2021 6th …, 2021 - ieeexplore.ieee.org

As artificial intelligence has grown rapidly in recent years, picture captioning has attracted
the interest of numerous experts, which has become a fascinating and challenging …

被引用次数：45 相关文章

Image captioning with novel topics guidance and retrieval-based topics re-weighting

M Al-Qatf, X Wang, A Hawbani… - IEEE Transactions …, 2022 - ieeexplore.ieee.org

Topic modelling (TM) has shown significant progress in boosting the effectiveness of image
captioning in the last few years. Although important improvements have been shown in …

被引用次数：14 相关文章所有 2 个版本

Optimal transformers based image captioning using beam search

A Shetty, Y Kale, Y Patil, R Patil, S Sharma - Multimedia Tools and …, 2024 - Springer

Image Captioning is the process of generating textual descriptions of given images. It
encompasses two major fields of deep learning, computer vision, and natural language …

被引用次数：3 相关文章

Topic-guided abstractive multimodal summarization with multimodal output

S Rafi, R Das - Neural Computing and Applications, 2023 - Springer

Summarization is a technique that produces condensed text from large text documents by
using different deep-learning techniques. Over the past few years, abstractive …

被引用次数：5 相关文章

NumCap: a number-controlled multi-caption image captioning network

A Abdussalam, Z Ye, A Hawbani, M Al-Qatf… - ACM Transactions on …, 2023 - dl.acm.org

Image captioning is a promising task that attracted researchers in the last few years. Existing
image captioning models are primarily trained to generate one caption per image. However …

被引用次数：10 相关文章

[PDF] nature.com

Novel concept-based image captioning models using LSTM and multi-encoder transformer architecture

AAE Osman, MAW Shalaby, MM Soliman… - Scientific Reports, 2024 - nature.com

Captioning an image involves using a combination of vision and language models to
describe the image in an expressive and concise sentence. Successful captioning task …

基于Se-ResNet50 特征编码器的公共环境图像描述生成.

唐渔，何志琴，周宇辉，吴钦木… - Application Research of …, 2023 - search.ebscohost.com

针对传统公共环境图像描述模型中编码器—解码器结构在编码过程中特征提取能力不足以及
解码过程中上下文信息丢失严重的问题, 提出了一种基于se Resnet50 与M LsTM …

被引用次数：2 相关文章所有 2 个版本

Image caption generation using deep neural networks

J Sudhakar, VV Iyer, ST Sharmila - … International Conference for …, 2022 - ieeexplore.ieee.org

In recent years, computer vision has made significant progress, primarily in the field of image
classification and object detection and recognition. Describing the image content …

被引用次数：10 相关文章

A linear sub-structure with co-variance shift for image captioning

S Rafi, R Das - 2021 8th International Conference on Soft …, 2021 - ieeexplore.ieee.org

Automatic description of image has attracted many researchers in the field of computer
vision for captioning the image in artificial intelligence which connects with Natural …

被引用次数：5 相关文章