Recent advances in reinforcement learning in finance

B Hambly, R Xu, H Yang - Mathematical Finance, 2023 - Wiley Online Library
The rapid changes in the finance industry due to the increasing amount of data have
revolutionized the techniques on data processing and data analysis and brought new …

Reinforcement learning for predictive maintenance: A systematic technical review

R Siraskar, S Kumar, S Patil, A Bongale… - Artificial Intelligence …, 2023 - Springer
The manufacturing world is subject to ever-increasing cost optimization pressures.
Maintenance adds to cost and disrupts production; optimized maintenance is therefore of …

Recovery rl: Safe reinforcement learning with learned recovery zones

B Thananjeyan, A Balakrishna, S Nair… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
Safety remains a central obstacle preventing widespread use of RL in the real world:
learning new tasks in uncertain environments requires extensive exploration, but safety …

Safe reinforcement learning with model uncertainty estimates

B Lütjens, M Everett, JP How - 2019 International Conference …, 2019 - ieeexplore.ieee.org
Many current autonomous systems are being designed with a strong reliance on black box
predictions from deep neural networks (DNNs). However, DNNs tend to be overconfident in …

Exponential bellman equation and improved regret bounds for risk-sensitive reinforcement learning

Y Fei, Z Yang, Y Chen, Z Wang - Advances in neural …, 2021 - proceedings.neurips.cc
We study risk-sensitive reinforcement learning (RL) based on the entropic risk measure.
Although existing works have established non-asymptotic regret guarantees for this …

Addressing optimism bias in sequence modeling for reinforcement learning

AR Villaflor, Z Huang, S Pande… - international …, 2022 - proceedings.mlr.press
Impressive results in natural language processing (NLP) based on the Transformer neural
network architecture have inspired researchers to explore viewing offline reinforcement …

Dsac: Distributional soft actor critic for risk-sensitive reinforcement learning

X Ma, L Xia, Z Zhou, J Yang, Q Zhao - arXiv preprint arXiv:2004.14547, 2020 - arxiv.org
In this paper, we present a new reinforcement learning (RL) algorithm called Distributional
Soft Actor Critic (DSAC), which exploits the distributional information of accumulated …

Risk-sensitive reinforcement learning with function approximation: A debiasing approach

Y Fei, Z Yang, Z Wang - International Conference on …, 2021 - proceedings.mlr.press
We study function approximation for episodic reinforcement learning with entropic risk
measure. We first propose an algorithm with linear function approximation. Compared to …

Risk-sensitive reinforcement learning: Near-optimal risk-sample tradeoff in regret

Y Fei, Z Yang, Y Chen, Z Wang… - Advances in Neural …, 2020 - proceedings.neurips.cc
We study risk-sensitive reinforcement learning in episodic Markov decision processes with
unknown transition kernels, where the goal is to optimize the total reward under the risk …

Safe exploration algorithms for reinforcement learning controllers

T Mannucci, EJ van Kampen… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Self-learning approaches, such as reinforcement learning, offer new possibilities for
autonomous control of uncertain or time-varying systems. However, exploring an unknown …