A review on deep reinforcement learning for fluid mechanics: An update
J Viquerat, P Meliga, A Larcher, E Hachem - Physics of Fluids, 2022 - pubs.aip.org
In the past couple of years, the interest of the fluid mechanics community for deep
reinforcement learning techniques has increased at fast pace, leading to a growing …
reinforcement learning techniques has increased at fast pace, leading to a growing …
A survey on policy search algorithms for learning robot controllers in a handful of trials
K Chatzilygeroudis, V Vassiliades… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Most policy search (PS) algorithms require thousands of training episodes to find an
effective policy, which is often infeasible with a physical robot. This survey article focuses on …
effective policy, which is often infeasible with a physical robot. This survey article focuses on …
Conflict-averse gradient descent for multi-task learning
The goal of multi-task learning is to enable more efficient learning than single task learning
by sharing model structures for a diverse set of tasks. A standard multi-task learning …
by sharing model structures for a diverse set of tasks. A standard multi-task learning …
Gradient surgery for multi-task learning
While deep learning and deep reinforcement learning (RL) systems have demonstrated
impressive results in domains such as image classification, game playing, and robotic …
impressive results in domains such as image classification, game playing, and robotic …
Unidexgrasp++: Improving dexterous grasping policy learning via geometry-aware curriculum and iterative generalist-specialist learning
We propose a novel, object-agnostic method for learning a universal policy for dexterous
object grasping from realistic point cloud observations and proprioceptive information under …
object grasping from realistic point cloud observations and proprioceptive information under …
Progress & compress: A scalable framework for continual learning
We introduce a conceptually simple and scalable framework for continual learning domains
where tasks are learned sequentially. Our method is constant in the number of parameters …
where tasks are learned sequentially. Our method is constant in the number of parameters …
Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning
We present relay policy learning, a method for imitation and reinforcement learning that can
solve multi-stage, long-horizon robotic tasks. This general and universally-applicable, two …
solve multi-stage, long-horizon robotic tasks. This general and universally-applicable, two …
Mt-opt: Continuous multi-task robotic reinforcement learning at scale
General-purpose robotic systems must master a large repertoire of diverse skills to be useful
in a range of daily tasks. While reinforcement learning provides a powerful framework for …
in a range of daily tasks. While reinforcement learning provides a powerful framework for …
Learning by playing solving sparse reward tasks from scratch
Abstract We propose Scheduled Auxiliary Control (SAC-X), a new learning paradigm in the
context of Reinforcement Learning (RL). SAC-X enables learning of complex behaviors-from …
context of Reinforcement Learning (RL). SAC-X enables learning of complex behaviors-from …
Self-distillation amplifies regularization in hilbert space
H Mobahi, M Farajtabar… - Advances in Neural …, 2020 - proceedings.neurips.cc
Abstract Knowledge distillation introduced in the deep learning context is a method to
transfer knowledge from one architecture to another. In particular, when the architectures are …
transfer knowledge from one architecture to another. In particular, when the architectures are …