From show to tell: A survey on deep learning-based image captioning
Connecting Vision and Language plays an essential role in Generative Intelligence. For this
reason, large research efforts have been devoted to image captioning, i.e., describing images …
A shared task on multimodal machine translation and crosslingual image description
This paper introduces and summarises the findings of a new shared task at the intersection
of Natural Language Processing and Computer Vision: the generation of image descriptions …
Visual news: Benchmark and challenges in news image captioning
We propose Visual News Captioner, an entity-aware model for the task of news image
captioning. We also introduce Visual News, a large-scale benchmark consisting of more …
Good news, everyone! context driven entity-aware captioning for news images
Current image captioning systems perform at a merely descriptive level, essentially
enumerating the objects in the scene and their relations. Humans, on the contrary, interpret …
Semantic interdisciplinary evaluation of image captioning models
U Sirisha, B Sai Chandana - Cogent Engineering, 2022 - Taylor & Francis
In our day-to-day life, synchronizing vision and language aspects plays a crucial role in
solving various real-time challenges. Image captioning is one of them, and it aims to …
Transform and tell: Entity-aware news image captioning
We propose an end-to-end model which generates captions for images embedded in news
articles. News images present two key challenges: they rely on real-world knowledge …
Boosting entity-aware image captioning with multi-modal knowledge graph
W Zhao, X Wu - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Entity-aware image captioning aims to describe named entities and events related to the
image by utilizing the background knowledge in the associated article. This task remains …
Predicting economic development using geolocated Wikipedia articles
Progress on the UN Sustainable Development Goals (SDGs) is hampered by a persistent
lack of data regarding key social, environmental, and economic indicators, particularly in …
Journalistic guidelines aware news image captioning
The task of news article image captioning aims to generate descriptive and informative
captions for news article images. Unlike conventional image captions that simply describe …
Automatic and intelligent content visualization system based on deep learning and genetic algorithm
M Ince - Neural Computing and Applications, 2022 - Springer
Increasing demand in distance education, e-learning, web-based learning, and other digital
sectors (e.g., entertainment) has led to excessive amounts of e-content. Learning objects …