A brief overview of ChatGPT: The history, status quo and potential future development

T Wu, S He, J Liu, S Sun, K Liu… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
ChatGPT, an artificial intelligence generated content (AIGC) model developed by OpenAI,
has attracted world-wide attention for its capability of dealing with challenging language …

Exploration in deep reinforcement learning: A survey

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

Champion-level drone racing using deep reinforcement learning

E Kaufmann, L Bauersfeld, A Loquercio, M Müller… - Nature, 2023 - nature.com
First-person view (FPV) drone racing is a televised sport in which professional competitors
pilot high-speed aircraft through a 3D circuit. Each pilot sees the environment from the …

Video pretraining (vpt): Learning to act by watching unlabeled online videos

B Baker, I Akkaya, P Zhokov… - Advances in …, 2022 - proceedings.neurips.cc
Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for
training models with broad, general capabilities for text, images, and other modalities …

Jump-start reinforcement learning

I Uchendu, T Xiao, Y Lu, B Zhu, M Yan… - International …, 2023 - proceedings.mlr.press
Reinforcement learning (RL) provides a theoretical framework for continuously improving an
agent's behavior via trial and error. However, efficiently learning policies from scratch can be …

Urlb: Unsupervised reinforcement learning benchmark

M Laskin, D Yarats, H Liu, K Lee, A Zhan, K Lu… - arXiv preprint arXiv …, 2021 - arxiv.org
Deep Reinforcement Learning (RL) has emerged as a powerful paradigm to solve a range
of complex yet specific control tasks. Yet training generalist agents that can quickly adapt to …

Byol-explore: Exploration by bootstrapped prediction

Z Guo, S Thakoor, M Pîslar… - Advances in neural …, 2022 - proceedings.neurips.cc
We present BYOL-Explore, a conceptually simple yet general approach for curiosity-driven
exploration in visually complex environments. BYOL-Explore learns the world …

Rorl: Robust offline reinforcement learning via conservative smoothing

R Yang, C Bai, X Ma, Z Wang… - Advances in neural …, 2022 - proceedings.neurips.cc
Offline reinforcement learning (RL) provides a promising direction to exploit massive amount
of offline data for complex decision-making tasks. Due to the distribution shift issue, current …

Reinforcement learning for optimization of variational quantum circuit architectures

M Ostaszewski, LM Trenkwalder… - Advances in …, 2021 - proceedings.neurips.cc
Abstract The study of Variational Quantum Eigensolvers (VQEs) has been in the spotlight in
recent times as they may lead to real-world applications of near-term quantum devices …

Semantic exploration from language abstractions and pretrained representations

A Tam, N Rabinowitz, A Lampinen… - Advances in neural …, 2022 - proceedings.neurips.cc
Effective exploration is a challenge in reinforcement learning (RL). Novelty-based
exploration methods can suffer in high-dimensional state spaces, such as continuous …