Reinforcement learning for robot research: A comprehensive review and open issues
T Zhang, H Mo - International Journal of Advanced Robotic …, 2021 - journals.sagepub.com
Applying the learning mechanism of natural living beings to endow intelligent robots with
humanoid perception and decision-making wisdom becomes an important force to promote …
humanoid perception and decision-making wisdom becomes an important force to promote …
[图书][B] Algorithms for decision making
A broad introduction to algorithms for decision making under uncertainty, introducing the
underlying mathematical problem formulations and the algorithms for solving them …
underlying mathematical problem formulations and the algorithms for solving them …
An introduction to deep reinforcement learning
Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …
learning. This field of research has been able to solve a wide range of complex …
Impala: Scalable distributed deep-rl with importance weighted actor-learner architectures
In this work we aim to solve a large collection of tasks using a single reinforcement learning
agent with a single set of parameters. A key challenge is to handle the increased amount of …
agent with a single set of parameters. A key challenge is to handle the increased amount of …
A review of machine learning for new generation smart dispatch in power systems
L Yin, Q Gao, L Zhao, B Zhang, T Wang, S Li… - … Applications of Artificial …, 2020 - Elsevier
This paper analyzes the characteristics and challenges of the new generation smart
dispatch systems, and proposes the framework of smart dispatch. Secondly, the …
dispatch systems, and proposes the framework of smart dispatch. Secondly, the …
A distributional perspective on reinforcement learning
MG Bellemare, W Dabney… - … conference on machine …, 2017 - proceedings.mlr.press
In this paper we argue for the fundamental importance of the value distribution: the
distribution of the random return received by a reinforcement learning agent. This is in …
distribution of the random return received by a reinforcement learning agent. This is in …
Revisiting the arcade learning environment: Evaluation protocols and open problems for general agents
The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge
of building AI agents with general competency across dozens of Atari 2600 games. It …
of building AI agents with general competency across dozens of Atari 2600 games. It …
Sample efficient actor-critic with experience replay
This paper presents an actor-critic deep reinforcement learning agent with experience
replay that is stable, sample efficient, and performs remarkably well on challenging …
replay that is stable, sample efficient, and performs remarkably well on challenging …
Safe and efficient off-policy reinforcement learning
R Munos, T Stepleton… - Advances in neural …, 2016 - proceedings.neurips.cc
In this work, we take a fresh look at some old and new algorithms for off-policy, return-based
reinforcement learning. Expressing these in a common form, we derive a novel algorithm …
reinforcement learning. Expressing these in a common form, we derive a novel algorithm …
Meta-gradient reinforcement learning
The goal of reinforcement learning algorithms is to estimate and/or optimise the value
function. However, unlike supervised learning, no teacher or oracle is available to provide …
function. However, unlike supervised learning, no teacher or oracle is available to provide …