Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Reinforcement learning for selective key applications in power systems: Recent advances and future challenges

X Chen, G Qu, Y Tang, S Low… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With large-scale integration of renewable generation and distributed energy resources,
modern power systems are confronted with new operational challenges, such as growing …

Deep reinforcement learning for task offloading in mobile edge computing systems

M Tang, VWS Wong - IEEE Transactions on Mobile Computing, 2020 - ieeexplore.ieee.org
In mobile edge computing systems, an edge node may have a high load when a large
number of mobile devices offload their tasks to it. Those offloaded tasks may experience …

A survey of reinforcement learning algorithms for dynamically varying environments

S Padakandla - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Reinforcement learning (RL) algorithms find applications in inventory control, recommender
systems, vehicular traffic management, cloud computing, and robotics. The real-world …

Trustworthy reinforcement learning against intrinsic vulnerabilities: Robustness, safety, and generalizability

M Xu, Z Liu, P Huang, W Ding, Z Cen, B Li… - arXiv preprint arXiv …, 2022 - arxiv.org
A trustworthy reinforcement learning algorithm should be competent in solving challenging
real-world problems, including {robustly} handling uncertainties, satisfying {safety} …

[PDF][PDF] Reinforcement learning for decision-making and control in power systems: Tutorial, review, and vision

X Chen, G Qu, Y Tang, S Low… - arXiv preprint arXiv …, 2021 - authors.library.caltech.edu
With large-scale integration of renewable generation and distributed energy resources
(DERs), modern power systems are confronted with new operational challenges, such as …

Wasserstein robust reinforcement learning

MA Abdullah, H Ren, HB Ammar, V Milenkovic… - arXiv preprint arXiv …, 2019 - arxiv.org
Reinforcement learning algorithms, though successful, tend to over-fit to training
environments hampering their application to the real-world. This paper proposes $\text …

Optimizing for the future in non-stationary mdps

Y Chandak, G Theocharous… - International …, 2020 - proceedings.mlr.press
Most reinforcement learning methods are based upon the key assumption that the transition
dynamics and reward functions are fixed, that is, the underlying Markov decision process is …

Architecting efficient multi-modal aiot systems

X Hou, J Liu, X Tang, C Li, J Chen, L Liang… - Proceedings of the 50th …, 2023 - dl.acm.org
Multi-modal computing (M 2 C) has recently exhibited impressive accuracy improvements in
numerous autonomous artificial intelligence of things (AIoT) systems. However, this …

Deep reinforcement learning control for non-stationary building energy management

A Naug, M Quinones-Grueiro, G Biswas - Energy and Buildings, 2022 - Elsevier
Developing an optimal supervisory control policy for building energy management is a
complex problem because the system exhibits non-stationary behaviors, and the target …