LPF: A language-prior feedback objective function for de-biased visual question answering

Z Liang, H Hu, J Zhu - Proceedings of the 44th international ACM SIGIR …, 2021 - dl.acm.org
Most existing Visual Question Answering (VQA) systems tend to overly rely on the language
bias and hence fail to reason from the visual clue. To address this issue, we propose a novel …

Quantifying and alleviating the language prior problem in visual question answering

Y Guo, Z Cheng, L Nie, Y Liu, Y Wang… - Proceedings of the 42nd …, 2019 - dl.acm.org
Benefiting from the advancement of computer vision, natural language processing and
information retrieval techniques, visual question answering (VQA), which aims to answer …

Debiased Visual Question Answering via the perspective of question types

T Huai, S Yang, J Zhang, J Zhao, L He - Pattern Recognition Letters, 2024 - Elsevier
Visual Question Answering (VQA) aims to answer questions according to the given
image. However, current VQA models tend to rely solely on textual information from the …

Overcoming language priors for visual question answering via loss rebalancing label and global context

R Cao, Z Li - Uncertainty in Artificial Intelligence, 2023 - proceedings.mlr.press
Despite the advances in Visual Question Answering (VQA), many VQA models currently
suffer from language priors (i.e., generating answers directly from questions without using …

Show, ask, attend, and answer: A strong baseline for visual question answering

V Kazemi, A Elqursh - arXiv preprint arXiv:1704.03162, 2017 - arxiv.org
This paper presents a new baseline for the visual question answering task. Given an image and
a question in natural language, our model produces accurate answers according to the …

Debiased visual question answering from feature and sample perspectives

Z Wen, G Xu, M Tan, Q Wu… - Advances in Neural …, 2021 - proceedings.neurips.cc
Visual question answering (VQA) is designed to examine the visual-textual reasoning ability
of an intelligent agent. However, recent observations show that many VQA models may only …

Overcoming language priors with self-supervised learning for visual question answering

X Zhu, Z Mao, C Liu, P Zhang, B Wang… - arXiv preprint arXiv …, 2020 - arxiv.org
Most Visual Question Answering (VQA) models suffer from the language prior problem,
which is caused by inherent data biases. Specifically, VQA models tend to answer questions …

RUBi: Reducing unimodal biases for visual question answering

R Cadene, C Dancette, M Cord… - Advances in neural …, 2019 - proceedings.neurips.cc
Visual Question Answering (VQA) is the task of answering questions about an
image. Some VQA models often exploit unimodal biases to provide the correct answer …

Overcoming language priors via shuffling language bias for robust visual question answering

J Zhao, Z Yu, X Zhang, Y Yang - IEEE Access, 2023 - ieeexplore.ieee.org
Recent research has revealed the notorious language prior problem in visual question
answering (VQA) tasks based on visual-textual interaction, which indicates that well …

From superficial to deep: Language bias driven curriculum learning for visual question answering

M Lao, Y Guo, Y Liu, W Chen, N Pu… - Proceedings of the 29th …, 2021 - dl.acm.org
Most Visual Question Answering (VQA) models face language bias when learning
to answer a given question, thereby failing to understand multimodal knowledge …