LCM-Captioner: A lightweight text-based image captioning method with collaborative mechanism between vision and text

Q Wang, H Deng, X Wu, Z Yang, Y Liu, Y Wang, G Hao - Neural Networks, 2023 - Elsevier
Text-based image captioning (TextCap) aims to remedy the shortcomings of existing image
captioning tasks that ignore text content when describing images. Instead, it requires models …

Cross-region feature fusion with geometrical relationship for OCR-based image captioning

J Zhou, C Yang, Y Zhu, Y Zhang - Neurocomputing, 2024 - Elsevier
Automatically generating a readable sentence that describes the text-contained image is a
challenging task. Compared to traditional image captioning algorithms, OCR-based image …

Transformer with multi-level grid features and depth pooling for image captioning

DC Bui, TV Nguyen, K Nguyen - Machine Vision and Applications, 2024 - Springer
Image captioning is an exciting yet challenging problem in both computer vision and natural
language processing research. In recent years, this problem has been addressed by …

AFSDCGN: Adaptive Feature Scaling and Dynamic Contextual Graph Networks for image captioning with unseen relationship detection

YA Thakare, KH Walse, M Atique - Multimedia Tools and Applications, 2024 - Springer
Automated image captioning systems play a crucial role in various applications such as
assistive technologies, content indexing, and robotics. However, current frameworks face …

Accurate and Complete Captions for Question-controlled Text-aware Image Captioning

Y Wang, J Hu, L Shang - 2023 IEEE International Conference …, 2023 - ieeexplore.ieee.org
Question-controlled Text-aware Image Captioning (Qc-TextCap), is the task of generating a
distinctive scene text aware caption according to several personalized questions when …

Improving Human-Object Interaction Detection via Streamlining Matching Procedure and Enhancing Interaction Query

TT Pham, VL Bui, TV Le, DC Bui… - … and Electronics (ICCE), 2024 - ieeexplore.ieee.org
The domain of Human-Object Interaction (HOI) detection has captured considerable
attention within the computer vision research community, given its focus on grasping the …