Id and ood performance are sometimes inversely correlated on real-world datasets
Several studies have compared the in-distribution (ID) and out-of-distribution (OOD)
performance of models in computer vision and NLP. They report a frequent positive …
performance of models in computer vision and NLP. They report a frequent positive …
Towards logiglue: A brief survey and a benchmark for analyzing logical reasoning capabilities of language models
Logical reasoning is fundamental for humans yet presents a substantial challenge in the
domain of Artificial Intelligence. Initially, researchers used Knowledge Representation and …
domain of Artificial Intelligence. Initially, researchers used Knowledge Representation and …
AI robustness: a human-centered perspective on technological challenges and opportunities
Despite the impressive performance of Artificial Intelligence (AI) systems, their robustness
remains elusive and constitutes a key issue that impedes large-scale adoption. Besides …
remains elusive and constitutes a key issue that impedes large-scale adoption. Besides …
On the adversarial robustness of out-of-distribution generalization models
Abstract Out-of-distribution (OOD) generalization has attracted increasing research attention
in recent years, due to its promising experimental results in real-world applications …
in recent years, due to its promising experimental results in real-world applications …
A survey on out-of-distribution evaluation of neural nlp models
Adversarial robustness, domain generalization and dataset biases are three active lines of
research contributing to out-of-distribution (OOD) evaluation on neural NLP models …
research contributing to out-of-distribution (OOD) evaluation on neural NLP models …
Adversarial Bayesian augmentation for single-source domain generalization
Generalizing to unseen image domains is a challenging problem primarily due to the lack of
diverse training data, inaccessible target data, and the large domain shift that may exist in …
diverse training data, inaccessible target data, and the large domain shift that may exist in …
Choose your qa model wisely: A systematic study of generative and extractive readers for question answering
While both extractive and generative readers have been successfully applied to the
Question Answering (QA) task, little attention has been paid toward the systematic …
Question Answering (QA) task, little attention has been paid toward the systematic …
Biotabqa: Instruction learning for biomedical table question answering
Table Question Answering (TQA) is an important but under-explored task. Most of the
existing QA datasets are in unstructured text format and only few of them use tables as the …
existing QA datasets are in unstructured text format and only few of them use tables as the …
Evaluating human-ai collaboration: A review and methodological framework
The use of artificial intelligence (AI) in working environments with individuals, known as
Human-AI Collaboration (HAIC), has become essential in a variety of domains, boosting …
Human-AI Collaboration (HAIC), has become essential in a variety of domains, boosting …
Robust source-free domain adaptation for fundus image segmentation
Abstract Unsupervised Domain Adaptation (UDA) is a learning technique that transfers
knowledge learned in the source domain from labelled training data to the target domain …
knowledge learned in the source domain from labelled training data to the target domain …