Overcoming language priors via shuffling language bias for robust visual question answering

J Zhao, Z Yu, X Zhang, Y Yang - IEEE Access, 2023 - ieeexplore.ieee.org
Recent research has revealed the notorious language prior problem in visual question
answering (VQA) tasks based on visual-textual interaction, which indicates that well …
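
The snippet cuts off before the method is described, but the "shuffling" idea in the title can be illustrated generically: permute the word order of a question so that only its bag-of-words prior survives, yielding a bias probe one could regularize against. The helper below is a minimal PyTorch sketch under that assumption, not the paper's actual procedure; the tensor layout and `pad_id` are hypothetical.

```python
import torch

def shuffle_question_tokens(q_tokens: torch.Tensor, pad_id: int = 0) -> torch.Tensor:
    """Randomly permute the non-pad tokens of each question in a batch.

    q_tokens: (batch, seq_len) integer token ids. A hypothetical helper;
    the cited paper's shuffling scheme may differ.
    """
    shuffled = q_tokens.clone()
    for i in range(q_tokens.size(0)):
        idx = (q_tokens[i] != pad_id).nonzero(as_tuple=True)[0]  # real-token positions
        perm = idx[torch.randperm(idx.numel())]                  # random reordering
        shuffled[i, idx] = q_tokens[i, perm]
    return shuffled
```

A training loop could, for instance, penalize confident predictions on the shuffled copy while supervising the original normally, so answers cannot be read off word statistics alone.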

Overcoming language priors for visual question answering via loss rebalancing label and global context

R Cao, Z Li - Uncertainty in Artificial Intelligence, 2023 - proceedings.mlr.press
Despite the advances in Visual Question Answering (VQA), many VQA models currently
suffer from language priors (i.e., generating answers directly from questions without using …

A language prior based focal loss for visual question answering

M Lao, Y Guo, Y Liu, MS Lew - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
According to current research, one of the major challenges in Visual Question Answering
(VQA) models is the overdependence on language priors (and neglect of the visual …
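
Only the problem statement survives in the snippet, so the following is a generic "prior-weighted" focal loss in PyTorch rather than Lao et al.'s exact formulation: a question-only branch scores how easily the language prior alone answers each sample, and prior-easy samples are down-weighted, focal-loss style. The `prior_logits` branch and `gamma` are assumptions.

```python
import torch
import torch.nn.functional as F

def prior_focal_loss(logits, prior_logits, target, gamma=2.0):
    """Focal-style VQA loss modulated by a question-only ("prior") branch.

    logits:       (batch, n_answers) from the full VQA model
    prior_logits: (batch, n_answers) from a question-only branch
    target:       (batch,) ground-truth answer indices
    """
    with torch.no_grad():
        p_prior = F.softmax(prior_logits, dim=-1)
        p_t = p_prior.gather(1, target.unsqueeze(1)).squeeze(1)
        weight = (1.0 - p_t) ** gamma  # prior-easy (biased) samples get small weight
    ce = F.cross_entropy(logits, target, reduction="none")
    return (weight * ce).mean()
```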

VQA-BC: Robust visual question answering via bidirectional chaining

M Lao, Y Guo, W Chen, N Pu… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Current VQA models suffer from overdependence on language bias, which severely
reduces their robustness in real-world scenarios. In this paper, we analyze …

Overcoming language priors with self-supervised learning for visual question answering

X Zhu, Z Mao, C Liu, P Zhang, B Wang… - arXiv preprint arXiv …, 2020 - arxiv.org
Most Visual Question Answering (VQA) models suffer from the language prior problem,
which is caused by inherent data biases. Specifically, VQA models tend to answer questions …
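
The truncated abstract points at self-supervision against data bias; a common instantiation of that idea (a sketch of the general recipe, not necessarily Zhu et al.'s losses) is to fabricate irrelevant image-question pairs by shuffling images within a batch and penalize confident answers on them. A `model(images, questions)` returning answer logits and the weight `alpha` are assumed interfaces.

```python
import torch
import torch.nn.functional as F

def ssl_debias_loss(model, images, questions, targets, alpha=1.0):
    """Supervised VQA loss plus a self-supervised term on mismatched pairs."""
    # standard supervised loss on matched (image, question) pairs
    logits = model(images, questions)
    vqa_loss = F.cross_entropy(logits, targets)

    # roll the image batch to fabricate irrelevant pairs "for free"
    wrong_images = torch.roll(images, shifts=1, dims=0)
    wrong_logits = model(wrong_images, questions)

    # with an irrelevant image, confidence in the original answer
    # can only come from the language prior: penalize it
    p_wrong = F.softmax(wrong_logits, dim=-1)
    ss_loss = p_wrong.gather(1, targets.unsqueeze(1)).mean()

    return vqa_loss + alpha * ss_loss
```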

Suppressing biased samples for robust VQA

N Ouyang, Q Huang, P Li, Y Cai, B Liu… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Most existing visual question answering (VQA) models strongly rely on language bias to
answer questions, i.e., they tend to fit question-answer pairs on the train split and …

Learning from lexical perturbations for consistent visual question answering

S Whitehead, H Wu, YR Fung, H Ji, R Feris… - arXiv preprint arXiv …, 2020 - arxiv.org
Existing Visual Question Answering (VQA) models are often fragile and sensitive to input
variations. In this paper, we propose a novel approach to address this issue based on …
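
The snippet only says the approach builds on lexical perturbations; a standard way to exploit them (a sketch of the general consistency-training idea, not Whitehead et al.'s exact objective) is a symmetric-KL term that ties the answer distribution on a question to the distribution on a meaning-preserving rewording of it. The upstream perturbation generator is assumed to exist.

```python
import torch.nn.functional as F

def consistency_loss(logits_orig, logits_pert):
    """Symmetric KL between predictions on an original question and a
    meaning-preserving lexical perturbation (e.g. a synonym swap)."""
    log_p = F.log_softmax(logits_orig, dim=-1)
    log_q = F.log_softmax(logits_pert, dim=-1)
    kl_pq = F.kl_div(log_q, log_p.exp(), reduction="batchmean")  # KL(p || q)
    kl_qp = F.kl_div(log_p, log_q.exp(), reduction="batchmean")  # KL(q || p)
    return 0.5 * (kl_pq + kl_qp)
```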

Fair Attention Network for Robust Visual Question Answering

Y Bi, H Jiang, Y Hu, Y Sun, B Yin - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
As a prevailing cross-modal reasoning task, Visual Question Answering (VQA) has achieved
impressive progress in the last few years, where language bias is widely studied to learn …

Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA

A Vosoughi, S Deng, S Zhang, Y Tian… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
To increase the generalization capability of VQA systems, many recent studies have tried to
de-bias spurious language or vision associations that shortcut the question or image to the …

Counterfactual samples synthesizing for robust visual question answering

L Chen, X Yan, J Xiao, H Zhang… - Proceedings of the …, 2020 - openaccess.thecvf.com
Although Visual Question Answering (VQA) has achieved impressive progress over
the last few years, today's VQA models tend to capture superficial linguistic correlations in …
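
The snippet names counterfactual sample synthesis; one recognizable ingredient of that family of methods, sketched generically here rather than as the CVPR paper's full pipeline (which also builds image-side counterfactuals and reassigns answer labels), is masking the most influential question words to create a counterfactual input. The attribution tensor `importance` and the `mask_id` token are assumed to come from elsewhere.

```python
import torch

def mask_critical_words(q_tokens, importance, mask_id, k=1):
    """Build counterfactual questions by masking the k most influential tokens.

    q_tokens:   (batch, seq_len) integer token ids
    importance: (batch, seq_len) attribution scores (e.g. gradient * input),
                computed by some upstream method
    """
    cf = q_tokens.clone()
    topk = importance.topk(k, dim=-1).indices  # (batch, k) positions to mask
    cf.scatter_(1, topk, mask_id)
    return cf
```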