A brief overview of ChatGPT: The history, status quo and potential future development
ChatGPT, an artificial intelligence generated content (AIGC) model developed by OpenAI,
has attracted world-wide attention for its capability of dealing with challenging language …
has attracted world-wide attention for its capability of dealing with challenging language …
Exploration in deep reinforcement learning: A survey
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …
techniques are of primary importance when solving sparse reward problems. In sparse …
Champion-level drone racing using deep reinforcement learning
First-person view (FPV) drone racing is a televised sport in which professional competitors
pilot high-speed aircraft through a 3D circuit. Each pilot sees the environment from the …
pilot high-speed aircraft through a 3D circuit. Each pilot sees the environment from the …
Video pretraining (vpt): Learning to act by watching unlabeled online videos
Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for
training models with broad, general capabilities for text, images, and other modalities …
training models with broad, general capabilities for text, images, and other modalities …
Jump-start reinforcement learning
Reinforcement learning (RL) provides a theoretical framework for continuously improving an
agent's behavior via trial and error. However, efficiently learning policies from scratch can be …
agent's behavior via trial and error. However, efficiently learning policies from scratch can be …
Urlb: Unsupervised reinforcement learning benchmark
Deep Reinforcement Learning (RL) has emerged as a powerful paradigm to solve a range
of complex yet specific control tasks. Yet training generalist agents that can quickly adapt to …
of complex yet specific control tasks. Yet training generalist agents that can quickly adapt to …
Byol-explore: Exploration by bootstrapped prediction
We present BYOL-Explore, a conceptually simple yet general approach for curiosity-driven
exploration in visually complex environments. BYOL-Explore learns the world …
exploration in visually complex environments. BYOL-Explore learns the world …
Rorl: Robust offline reinforcement learning via conservative smoothing
Offline reinforcement learning (RL) provides a promising direction to exploit massive amount
of offline data for complex decision-making tasks. Due to the distribution shift issue, current …
of offline data for complex decision-making tasks. Due to the distribution shift issue, current …
Reinforcement learning for optimization of variational quantum circuit architectures
M Ostaszewski, LM Trenkwalder… - Advances in …, 2021 - proceedings.neurips.cc
Abstract The study of Variational Quantum Eigensolvers (VQEs) has been in the spotlight in
recent times as they may lead to real-world applications of near-term quantum devices …
recent times as they may lead to real-world applications of near-term quantum devices …
Semantic exploration from language abstractions and pretrained representations
Effective exploration is a challenge in reinforcement learning (RL). Novelty-based
exploration methods can suffer in high-dimensional state spaces, such as continuous …
exploration methods can suffer in high-dimensional state spaces, such as continuous …