A survey on curriculum learning
Curriculum learning (CL) is a training strategy that trains a machine learning model from
easier data to harder data, which imitates the meaningful learning order in human curricula …
easier data to harder data, which imitates the meaningful learning order in human curricula …
Recent advances in convolutional neural networks
In the last few years, deep learning has led to very good performance on a variety of
problems, such as visual recognition, speech recognition and natural language processing …
problems, such as visual recognition, speech recognition and natural language processing …
Data selection for language models via importance resampling
Selecting a suitable pretraining dataset is crucial for both general-domain (eg, GPT-3) and
domain-specific (eg, Codex) language models (LMs). We formalize this problem as selecting …
domain-specific (eg, Codex) language models (LMs). We formalize this problem as selecting …
Meta-learning in neural networks: A survey
The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent
years. Contrary to conventional approaches to AI where tasks are solved from scratch using …
years. Contrary to conventional approaches to AI where tasks are solved from scratch using …
Dataset cartography: Mapping and diagnosing datasets with training dynamics
Large datasets have become commonplace in NLP research. However, the increased
emphasis on data quantity has made it challenging to assess the quality of data. We …
emphasis on data quantity has made it challenging to assess the quality of data. We …
Distributed prioritized experience replay
We propose a distributed architecture for deep reinforcement learning at scale, that enables
agents to learn effectively from orders of magnitude more data than previously possible. The …
agents to learn effectively from orders of magnitude more data than previously possible. The …
Prioritized training on points that are learnable, worth learning, and not yet learnt
S Mindermann, JM Brauner… - International …, 2022 - proceedings.mlr.press
Training on web-scale data can take months. But much computation and time is wasted on
redundant and noisy points that are already learnt or not learnable. To accelerate training …
redundant and noisy points that are already learnt or not learnable. To accelerate training …
Not all samples are created equal: Deep learning with importance sampling
A Katharopoulos, F Fleuret - International conference on …, 2018 - proceedings.mlr.press
Abstract Deep Neural Network training spends most of the computation on examples that
are properly handled, and could be ignored. We propose to mitigate this phenomenon with a …
are properly handled, and could be ignored. We propose to mitigate this phenomenon with a …
Training region-based object detectors with online hard example mining
The field of object detection has made significant advances riding on the wave of region-
based ConvNets, but their training procedure still includes many heuristics and …
based ConvNets, but their training procedure still includes many heuristics and …
A-fast-rcnn: Hard positive generation via adversary for object detection
How do we learn an object detector that is invariant to occlusions and deformations? Our
current solution is to use a data-driven strategy--collect large-scale datasets which have …
current solution is to use a data-driven strategy--collect large-scale datasets which have …