Authors
Remi Cadene, Corentin Dancette, Hedi Ben-younes, Matthieu Cord, Devi Parikh
Publication date
2019/6/24
Conference
NeurIPS 2019 -- arXiv preprint arXiv:1906.10169
Description
Visual Question Answering (VQA) is the task of answering questions about an image. VQA models often exploit unimodal biases to provide the correct answer without using the image information. As a result, they suffer a large drop in performance when evaluated on data outside their training set distribution. This critical issue makes them unsuitable for real-world settings.
Total citations
[Bar chart: citations per year, 2019 to 2024]
Scholar articles
R Cadene, C Dancette, M Cord, D Parikh - Advances in neural information processing systems, 2019