Rvs: What is essential for offline rl via supervised learning?
Recent work has shown that supervised learning alone, without temporal difference (TD)
learning, can be remarkably effective for offline RL. When does this hold true, and which …
learning, can be remarkably effective for offline RL. When does this hold true, and which …
Waypoint transformer: Reinforcement learning via supervised learning with intermediate targets
Despite the recent advancements in offline reinforcement learning via supervised learning
(RvS) and the success of the decision transformer (DT) architecture in various domains, DTs …
(RvS) and the success of the decision transformer (DT) architecture in various domains, DTs …
Model-based offline policy optimization with adversarial network
Abstract Model-based offline reinforcement learning (RL), which builds a supervised
transition model with logging dataset to avoid costly interactions with the online …
transition model with logging dataset to avoid costly interactions with the online …
Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning
Adaptive interfaces can help users perform sequential decision-making tasks like robotic
teleoperation given noisy, high-dimensional command signals (eg, from a brain-computer …
teleoperation given noisy, high-dimensional command signals (eg, from a brain-computer …
State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning
Pessimism is of great importance in offline reinforcement learning (RL). One broad category
of offline RL algorithms fulfills pessimism by explicit or implicit behavior regularization …
of offline RL algorithms fulfills pessimism by explicit or implicit behavior regularization …