Quantifying and alleviating the language prior problem in visual question answering

Y Guo, Z Cheng, L Nie, Y Liu, Y Wang… - Proceedings of the 42nd …, 2019 - dl.acm.org
Benefiting from the advancement of computer vision, natural language processing and
information retrieval techniques, visual question answering (VQA), which aims to answer …

Regulating Balance Degree for More Reasonable Visual Question Answering Benchmark

K Lin, A Mao, J Liu - 2022 International Joint Conference on …, 2022 - ieeexplore.ieee.org
Superficial linguistic correlations are a critical issue for Visual Question Answering (VQA),
where models can achieve high performance by exploiting the connection between question …

Loss re-scaling VQA: Revisiting the language prior problem from a class-imbalance view

Y Guo, L Nie, Z Cheng, Q Tian… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Recent studies have pointed out that many well-developed Visual Question Answering
(VQA) models are heavily affected by the language prior problem. It refers to making …

VQA-BC: Robust visual question answering via bidirectional chaining

M Lao, Y Guo, W Chen, N Pu… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Current VQA models are suffering from the problem of overdependence on language bias,
which severely reduces their robustness in real-world scenarios. In this paper, we analyze …

Digging out discrimination information from generated samples for robust visual question answering

Z Wen, Y Wang, M Tan, Q Wu, Q Wu - Findings of the Association …, 2023 - aclanthology.org
Visual Question Answering (VQA) aims to answer a textual question based on a
given image. Nevertheless, recent studies have shown that VQA models tend to capture the …

Debiased Visual Question Answering via the perspective of question types

T Huai, S Yang, J Zhang, J Zhao, L He - Pattern Recognition Letters, 2024 - Elsevier
Visual Question Answering (VQA) aims to answer questions according to the given
image. However, current VQA models tend to rely solely on textual information from the …

Greedy gradient ensemble for robust visual question answering

X Han, S Wang, C Su, Q Huang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Language bias is a critical issue in Visual Question Answering (VQA), where
models often exploit dataset biases for the final decision without considering the image …

Simple contrastive learning in a self-supervised manner for robust visual question answering

S Yang, L Xiao, X Wu, J Xu, L Wang, L He - Computer Vision and Image …, 2024 - Elsevier
Recent observations have revealed that Visual Question Answering models are susceptible
to learning the spurious correlations formed by dataset biases, i.e., the language priors …

A Multi-modal Debiasing Model with Dynamical Constraint for Robust Visual Question Answering

Y Li, B Hu, F Zhang, Y Yu, J Liu… - Findings of the …, 2023 - aclanthology.org
Recent studies have pointed out that many well-developed Visual Question Answering
(VQA) systems suffer from bias problem. Despite the remarkable performance gained on In …

Fair Attention Network for Robust Visual Question Answering

Y Bi, H Jiang, Y Hu, Y Sun, B Yin - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
As a prevailing cross-modal reasoning task, Visual Question Answering (VQA) has achieved
impressive progress in the last few years, where the language bias is widely studied to learn …