A review of safe reinforcement learning: Methods, theory and applications
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …
making tasks. However, safety concerns are raised during deploying RL in real-world …
Loss of plasticity in continual deep reinforcement learning
In this paper, we characterize the behavior of canonical value-based deep reinforcement
learning (RL) approaches under varying degrees of non-stationarity. In particular, we …
learning (RL) approaches under varying degrees of non-stationarity. In particular, we …
Adversarial training for high-stakes reliability
In the future, powerful AI systems may be deployed in high-stakes settings, where a single
failure could be catastrophic. One technique for improving AI safety in high-stakes settings is …
failure could be catastrophic. One technique for improving AI safety in high-stakes settings is …
Beyond games: a systematic review of neural Monte Carlo tree search applications
The advent of AlphaGo and its successors marked the beginning of a new paradigm in
playing games using artificial intelligence. This was achieved by combining Monte Carlo …
playing games using artificial intelligence. This was achieved by combining Monte Carlo …
Last-iterate convergent policy gradient primal-dual methods for constrained mdps
We study the problem of computing an optimal policy of an infinite-horizon discounted
constrained Markov decision process (constrained MDP). Despite the popularity of …
constrained Markov decision process (constrained MDP). Despite the popularity of …
High-accuracy model-based reinforcement learning, a survey
Deep reinforcement learning has shown remarkable success in the past few years. Highly
complex sequential decision making problems from game playing and robotics have been …
complex sequential decision making problems from game playing and robotics have been …
GVFs in the real world: making predictions online for water treatment
In this paper we investigate the use of reinforcement-learning based prediction approaches
for a real drinking-water treatment plant. Developing such a prediction system is a critical …
for a real drinking-water treatment plant. Developing such a prediction system is a critical …
Deep Video Codec Control for Vision Models
Standardized lossy video coding is at the core of almost all real-world video processing
pipelines. Rate control is used to enable standard codecs to adapt to different network …
pipelines. Rate control is used to enable standard codecs to adapt to different network …
A Perspective on Deep Vision Performance with Standard Image and Video Codecs
Resource-constrained hardware such as edge devices or cell phones often rely on cloud
servers to provide the required computational resources for inference in deep vision models …
servers to provide the required computational resources for inference in deep vision models …
Hybrid search for efficient planning with completeness guarantees
Solving complex planning problems has been a long-standing challenge in computer
science. Learning-based subgoal search methods have shown promise in tackling these …
science. Learning-based subgoal search methods have shown promise in tackling these …