- 学术资源搜索

A comprehensive survey on pretrained foundation models: A history from bert to chatgpt

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks with different data modalities. A PFM (eg, BERT, ChatGPT, and GPT-4) is …

被引用次数：473 相关文章所有 2 个版本

[PDF] springer.com

A survey on data‐efficient algorithms in big data era

A Adadi - Journal of Big Data, 2021 - Springer

The leading approaches in Machine Learning are notoriously data-hungry. Unfortunately,
many application domains do not have access to big data because acquiring data involves a …

被引用次数：255 相关文章所有 11 个版本

[PDF] arxiv.org

Mastering diverse domains through world models

D Hafner, J Pasukonis, J Ba, T Lillicrap - arXiv preprint arXiv:2301.04104, 2023 - arxiv.org

Developing a general algorithm that learns to solve tasks across a wide range of
applications has been a fundamental challenge in artificial intelligence. Although current …

被引用次数：383 相关文章所有 2 个版本

[PDF] neurips.cc

Deep reinforcement learning at the edge of the statistical precipice

R Agarwal, M Schwarzer, PS Castro… - Advances in neural …, 2021 - proceedings.neurips.cc

Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing
their relative performance on a large suite of tasks. Most published results on deep RL …

被引用次数：616 相关文章所有 8 个版本

[PDF] mlr.press

The primacy bias in deep reinforcement learning

E Nikishin, M Schwarzer, P D'Oro… - International …, 2022 - proceedings.mlr.press

This work identifies a common flaw of deep reinforcement learning (RL) algorithms: a
tendency to rely on early interactions and ignore useful evidence encountered later …

被引用次数：142 相关文章所有 5 个版本

[PDF] mlr.press

Bigger, better, faster: Human-level atari with human-level efficiency

M Schwarzer, JSO Ceron, A Courville… - International …, 2023 - proceedings.mlr.press

We introduce a value-based RL agent, which we call BBF, that achieves super-human
performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used …

被引用次数：56 相关文章所有 8 个版本

[PDF] arxiv.org

Gaia-1: A generative world model for autonomous driving

A Hu, L Russell, H Yeo, Z Murez, G Fedoseev… - arXiv preprint arXiv …, 2023 - arxiv.org

Autonomous driving promises transformative improvements to transportation, but building
systems capable of safely navigating the unstructured complexity of real-world scenarios …

被引用次数：102 相关文章所有 2 个版本

[PDF] arxiv.org

Mastering visual continuous control: Improved data-augmented reinforcement learning

D Yarats, R Fergus, A Lazaric, L Pinto - arXiv preprint arXiv:2107.09645, 2021 - arxiv.org

We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual
continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach that uses data …

被引用次数：284 相关文章所有 4 个版本

[PDF] mlr.press

Masked world models for visual control

Y Seo, D Hafner, H Liu, F Liu, S James… - … on Robot Learning, 2023 - proceedings.mlr.press

Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient
robot learning from visual observations. Yet the current approaches typically train a single …

被引用次数：106 相关文章所有 6 个版本

[PDF] mlr.press

Reinforcement learning with action-free pre-training from videos

Y Seo, K Lee, SL James… - … Conference on Machine …, 2022 - proceedings.mlr.press

Recent unsupervised pre-training methods have shown to be effective on language and
vision domains by learning useful representations for multiple downstream tasks. In this …

被引用次数：100 相关文章所有 5 个版本