A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Loss of plasticity in continual deep reinforcement learning

Z Abbas, R Zhao, J Modayil, A White… - … on Lifelong Learning …, 2023 - proceedings.mlr.press
In this paper, we characterize the behavior of canonical value-based deep reinforcement
learning (RL) approaches under varying degrees of non-stationarity. In particular, we …

Adversarial training for high-stakes reliability

D Ziegler, S Nix, L Chan, T Bauman… - Advances in …, 2022 - proceedings.neurips.cc
In the future, powerful AI systems may be deployed in high-stakes settings, where a single
failure could be catastrophic. One technique for improving AI safety in high-stakes settings is …

Beyond games: a systematic review of neural Monte Carlo tree search applications

M Kemmerling, D Lütticke, RH Schmitt - Applied Intelligence, 2024 - Springer
The advent of AlphaGo and its successors marked the beginning of a new paradigm in
playing games using artificial intelligence. This was achieved by combining Monte Carlo …

Last-iterate convergent policy gradient primal-dual methods for constrained mdps

D Ding, CY Wei, K Zhang… - Advances in Neural …, 2024 - proceedings.neurips.cc
We study the problem of computing an optimal policy of an infinite-horizon discounted
constrained Markov decision process (constrained MDP). Despite the popularity of …

High-accuracy model-based reinforcement learning, a survey

A Plaat, W Kosters, M Preuss - Artificial Intelligence Review, 2023 - Springer
Deep reinforcement learning has shown remarkable success in the past few years. Highly
complex sequential decision making problems from game playing and robotics have been …

GVFs in the real world: making predictions online for water treatment

MK Janjua, H Shah, M White, E Miahi, MC Machado… - Machine Learning, 2023 - Springer
In this paper we investigate the use of reinforcement-learning based prediction approaches
for a real drinking-water treatment plant. Developing such a prediction system is a critical …

Deep Video Codec Control for Vision Models

C Reich, B Debnath, D Patel… - Proceedings of the …, 2024 - openaccess.thecvf.com
Standardized lossy video coding is at the core of almost all real-world video processing
pipelines. Rate control is used to enable standard codecs to adapt to different network …

A Perspective on Deep Vision Performance with Standard Image and Video Codecs

C Reich, O Hahn, D Cremers… - Proceedings of the …, 2024 - openaccess.thecvf.com
Resource-constrained hardware such as edge devices or cell phones often rely on cloud
servers to provide the required computational resources for inference in deep vision models …

Hybrid search for efficient planning with completeness guarantees

K Kujanpää, J Pajarinen, A Ilin - Advances in Neural …, 2024 - proceedings.neurips.cc
Solving complex planning problems has been a long-standing challenge in computer
science. Learning-based subgoal search methods have shown promise in tackling these …