LCM-Captioner: A lightweight text-based image captioning method with collaborative mechanism between vision and text
Text-based image captioning (TextCap) aims to remedy the shortcomings of existing image
captioning tasks that ignore text content when describing images. Instead, it requires models …
captioning tasks that ignore text content when describing images. Instead, it requires models …
Cross-region feature fusion with geometrical relationship for OCR-based image captioning
J Zhou, C Yang, Y Zhu, Y Zhang - Neurocomputing, 2024 - Elsevier
Automatically generating a readable sentence that describes the text-contained image is a
challenging task. Compared to traditional image captioning algorithms, OCR-based image …
challenging task. Compared to traditional image captioning algorithms, OCR-based image …
Transformer with multi-level grid features and depth pooling for image captioning
Image captioning is an exciting yet challenging problem in both computer vision and natural
language processing research. In recent years, this problem has been addressed by …
language processing research. In recent years, this problem has been addressed by …
AFSDCGN: Adaptive Feature Scaling and Dynamic Contextual Graph Networks for image captioning with unseen relationship detection
YA Thakare, KH Walse, M Atique - Multimedia Tools and Applications, 2024 - Springer
Automated image captioning systems play a crucial role in various applications such as
assistive technologies, content indexing, and robotics. However, current frameworks face …
assistive technologies, content indexing, and robotics. However, current frameworks face …
Accurate and Complete Captions for Question-controlled Text-aware Image Captioning
Y Wang, J Hu, L Shang - 2023 IEEE International Conference …, 2023 - ieeexplore.ieee.org
Question-controlled Text-aware Image Captioning (Qc-TextCap), is the task of generating a
distinctive scene text aware caption according to several personalized questions when …
distinctive scene text aware caption according to several personalized questions when …
Improving Human-Object Interaction Detection via Streamlining Matching Procedure and Enhancing Interaction Query
The domain of Human-Object Interaction (HOI) detection has captured considerable
attention within the computer vision research community, given its focus on grasping the …
attention within the computer vision research community, given its focus on grasping the …