RUBi: Reducing Unimodal Biases in Visual Question Answering

Authors

Remi Cadene, Corentin Dancette, Hedi Ben-younes, Matthieu Cord, Devi Parikh

Publication date

2019/6/24

Conference

NeurIPS 2019 -- arXiv preprint arXiv:1906.10169

Pages

arXiv preprint arXiv:1906.10169

Description

Visual Question Answering (VQA) is the task of answering questions about an image. Some VQA models often exploit unimodal biases to provide the correct answer without using the image information. As a result, they suffer from a huge drop in performance when evaluated on data outside their training set distribution. This critical issue makes them unsuitable for real-world settings.

Total citations

Cited by 381

Citations per year: 2019: 3, 2020: 36, 2021: 83, 2022: 104, 2023: 114, 2024: 40

Scholar articles

Rubi: Reducing unimodal biases for visual question answering

R Cadene, C Dancette, M Cord, D Parikh - Advances in neural information processing systems, 2019

Cited by 381 · All 12 versions