Active learning for BERT: an empirical study

LE Dor, A Halfon, A Gera, E Shnarch… - Proceedings of the …, 2020 - aclanthology.org
Real world scenarios present a challenge for text classification, since labels are usually
expensive and the data is often characterized by class imbalance. Active Learning (AL) is a …

Plex: Towards reliability using pretrained large model extensions

D Tran, J Liu, MW Dusenberry, D Phan… - arXiv preprint arXiv …, 2022 - arxiv.org
A recent trend in artificial intelligence is the use of pretrained models for language and
vision tasks, which have achieved extraordinary performance but also puzzling failures …

Simulation intelligence: Towards a new generation of scientific methods

A Lavin, D Krakauer, H Zenil, J Gottschlich… - arXiv preprint arXiv …, 2021 - arxiv.org
The original" Seven Motifs" set forth a roadmap of essential methods for the field of scientific
computing, where a motif is an algorithmic method that captures a pattern of computation …

Deep bayesian active learning for natural language processing: Results of a large-scale empirical study

A Siddhant, ZC Lipton - arXiv preprint arXiv:1808.05697, 2018 - arxiv.org
Several recent papers investigate Active Learning (AL) for mitigating the data dependence
of deep learning for natural language processing. However, the applicability of AL to real …

On statistical bias in active learning: How and when to fix it

S Farquhar, Y Gal, T Rainforth - arXiv preprint arXiv:2101.11665, 2021 - arxiv.org
Active learning is a powerful tool when labelling data is expensive, but it introduces a bias
because the training data no longer follows the population distribution. We formalize this …

Adaptive graph guided disambiguation for partial label learning

DB Wang, L Li, ML Zhang - Proceedings of the 25th ACM SIGKDD …, 2019 - dl.acm.org
Partial label learning aims to induce a multi-class classifier from training examples where
each of them is associated with a set of candidate labels, among which only one is the …

Active learning for imbalanced datasets

U Aggarwal, A Popescu… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Active learning increases the effectiveness of labeling when only subsets of unlabeled
datasets can be processed manually. To our knowledge, existing algorithms are designed …

A survey on cost types, interaction schemes, and annotator performance models in selection algorithms for active learning in classification

M Herde, D Huseljic, B Sick, A Calma - IEEE Access, 2021 - ieeexplore.ieee.org
Pool-based active learning (AL) aims to optimize the annotation process (ie, labeling) as the
acquisition of annotations is often time-consuming and therefore expensive. For this …

Importance of self-consistency in active learning for semantic segmentation

SA Golestaneh, KM Kitani - arXiv preprint arXiv:2008.01860, 2020 - arxiv.org
We address the task of active learning in the context of semantic segmentation and show
that self-consistency can be a powerful source of self-supervision to greatly improve the …

Learning rare category classifiers on a tight labeling budget

RT Mullapudi, F Poms, WR Mark… - Proceedings of the …, 2021 - openaccess.thecvf.com
Many real-world ML deployments face the challenge of training a rare category model with a
small labeling bud-get. In these settings, there is often access to large amounts of unlabeled …