S3c: Semi-supervised vqa natural language explanation via self-critical learning

W Suo, M Sun, W Liu, Y Gao, P Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract VQA Natural Language Explanation (VQA-NLE) task aims to explain the decision-
making process of VQA models in natural language. Unlike traditional attention or gradient …

The current and future role of visual question answering in eXplainable artificial intelligence.

M Caro-Martinez, A Wijekoon, B Diaz-Agudo… - 2023 - rgu-repository.worktribe.com
Over the last few years, we have seen how the interest of the computer science research
community on eXplainable Artificial Intelligence has grown in leaps and bounds. The reason …

MSGeN: Multimodal Selective Generation Network for Grounded Explanations

D Li, W Chen, X Lin - Electronics, 2023 - mdpi.com
Modern models have shown impressive capabilities in visual reasoning tasks. However, the
interpretability of their decision-making processes remains a challenge, causing uncertainty …

A Review on VQA: Methods, Tools and Datasets

M Agrawal, AS Jalal, H Sharma - … International Conference on …, 2023 - ieeexplore.ieee.org
An new area called “visual question answering”(VQA) seeks to integrate CV with NLP. In
order to get correct results, it entails creating models that can comprehend both textual …

II-MMR: Identifying and improving multi-modal multi-hop reasoning in visual question answering

J Kil, F Tavazoee, D Kang, JK Kim - arXiv preprint arXiv:2402.11058, 2024 - arxiv.org
Visual Question Answering (VQA) often involves diverse reasoning scenarios across Vision
and Language (V&L). Most prior VQA studies, however, have merely focused on assessing …

VLIB: Unveiling insights through Visual and Linguistic Integration of Biorxiv data relevant to cancer via Multimodal Large Language Model

V Prabhakar, K Liu - bioRxiv, 2023 - biorxiv.org
The field of cancer research has greatly benefited from the wealth of new knowledge
provided by research articles and preprints on platforms like Biorxiv. This study investigates …