Related articles - Academic resource search

Quantifying and alleviating the language prior problem in visual question answering

Y Guo, Z Cheng, L Nie, Y Liu, Y Wang… - Proceedings of the 42nd …, 2019 - dl.acm.org

Benefiting from the advancement of computer vision, natural language processing and
information retrieval techniques, visual question answering (VQA), which aims to answer …

Cited by 42 - Related articles - All 4 versions

Regulating Balance Degree for More Reasonable Visual Question Answering Benchmark

K Lin, A Mao, J Liu - 2022 International Joint Conference on …, 2022 - ieeexplore.ieee.org

Superficial linguistic correlations are a critical issue for Visual Question Answering (VQA),
where models can achieve high performance by exploiting the connection between question …

Cited by 1 - Related articles

Loss re-scaling VQA: Revisiting the language prior problem from a class-imbalance view

Y Guo, L Nie, Z Cheng, Q Tian… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Recent studies have pointed out that many well-developed Visual Question Answering
(VQA) models are heavily affected by the language prior problem. It refers to making …

Cited by 55 - Related articles - All 8 versions

VQA-BC: Robust visual question answering via bidirectional chaining

M Lao, Y Guo, W Chen, N Pu… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

Current VQA models are suffering from the problem of overdependence on language bias,
which severely reduces their robustness in real-world scenarios. In this paper, we analyze …

Cited by 4 - Related articles

Digging out discrimination information from generated samples for robust visual question answering

Z Wen, Y Wang, M Tan, Q Wu, Q Wu - Findings of the Association …, 2023 - aclanthology.org

Visual Question Answering (VQA) aims to answer a textual question based on a
given image. Nevertheless, recent studies have shown that VQA models tend to capture the …

Cited by 5 - Related articles - All 2 versions

Debiased Visual Question Answering via the perspective of question types

T Huai, S Yang, J Zhang, J Zhao, L He - Pattern Recognition Letters, 2024 - Elsevier

Visual Question Answering (VQA) aims to answer questions according to the given
image. However, current VQA models tend to rely solely on textual information from the …

Cited by 2 - Related articles - All 3 versions

Greedy gradient ensemble for robust visual question answering

X Han, S Wang, C Su, Q Huang… - Proceedings of the …, 2021 - openaccess.thecvf.com

Language bias is a critical issue in Visual Question Answering (VQA), where
models often exploit dataset biases for the final decision without considering the image …

Cited by 65 - Related articles - All 6 versions

Simple contrastive learning in a self-supervised manner for robust visual question answering

S Yang, L Xiao, X Wu, J Xu, L Wang, L He - Computer Vision and Image …, 2024 - Elsevier

Recent observations have revealed that Visual Question Answering models are susceptible
to learning the spurious correlations formed by dataset biases, i.e., the language priors …

A Multi-modal Debiasing Model with Dynamical Constraint for Robust Visual Question Answering

Y Li, B Hu, F Zhang, Y Yu, J Liu… - Findings of the …, 2023 - aclanthology.org

Recent studies have pointed out that many well-developed Visual Question Answering
(VQA) systems suffer from bias problem. Despite the remarkable performance gained on In …

Cited by 3 - Related articles - All 2 versions

Fair Attention Network for Robust Visual Question Answering

Y Bi, H Jiang, Y Hu, Y Sun, B Yin - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

As a prevailing cross-modal reasoning task, Visual Question Answering (VQA) has achieved
impressive progress in the last few years, where the language bias is widely studied to learn …

Cited by 1 - Related articles