Risk-sensitive reinforcement learning

Recent advances in reinforcement learning in finance

B Hambly, R Xu, H Yang - Mathematical Finance, 2023 - Wiley Online Library

The rapid changes in the finance industry due to the increasing amount of data have
revolutionized the techniques on data processing and data analysis and brought new …

Cited by 125 · Related articles · All 13 versions

Reinforcement learning for predictive maintenance: A systematic technical review

R Siraskar, S Kumar, S Patil, A Bongale… - Artificial Intelligence …, 2023 - Springer

The manufacturing world is subject to ever-increasing cost optimization pressures.
Maintenance adds to cost and disrupts production; optimized maintenance is therefore of …

Cited by 18 · Related articles · All 2 versions

[PDF] arxiv.org

Recovery RL: Safe reinforcement learning with learned recovery zones

B Thananjeyan, A Balakrishna, S Nair… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org

Safety remains a central obstacle preventing widespread use of RL in the real world:
learning new tasks in uncertain environments requires extensive exploration, but safety …

Cited by 213 · Related articles · All 6 versions

[PDF] arxiv.org

Safe reinforcement learning with model uncertainty estimates

B Lütjens, M Everett, JP How - 2019 International Conference …, 2019 - ieeexplore.ieee.org

Many current autonomous systems are being designed with a strong reliance on black box
predictions from deep neural networks (DNNs). However, DNNs tend to be overconfident in …

Cited by 182 · Related articles · All 9 versions

[PDF] neurips.cc

Exponential Bellman equation and improved regret bounds for risk-sensitive reinforcement learning

Y Fei, Z Yang, Y Chen, Z Wang - Advances in neural …, 2021 - proceedings.neurips.cc

We study risk-sensitive reinforcement learning (RL) based on the entropic risk measure.
Although existing works have established non-asymptotic regret guarantees for this …

Cited by 53 · Related articles · All 9 versions

[PDF] mlr.press

Addressing optimism bias in sequence modeling for reinforcement learning

AR Villaflor, Z Huang, S Pande… - international …, 2022 - proceedings.mlr.press

Impressive results in natural language processing (NLP) based on the Transformer neural
network architecture have inspired researchers to explore viewing offline reinforcement …

Cited by 28 · Related articles · All 5 versions

[PDF] arxiv.org

DSAC: Distributional soft actor critic for risk-sensitive reinforcement learning

X Ma, L Xia, Z Zhou, J Yang, Q Zhao - arXiv preprint arXiv:2004.14547, 2020 - arxiv.org

In this paper, we present a new reinforcement learning (RL) algorithm called Distributional
Soft Actor Critic (DSAC), which exploits the distributional information of accumulated …

Cited by 80 · Related articles · All 4 versions

[PDF] mlr.press

Risk-sensitive reinforcement learning with function approximation: A debiasing approach

Y Fei, Z Yang, Z Wang - International Conference on …, 2021 - proceedings.mlr.press

We study function approximation for episodic reinforcement learning with entropic risk
measure. We first propose an algorithm with linear function approximation. Compared to …

Cited by 41 · Related articles · All 4 versions

[PDF] neurips.cc

Risk-sensitive reinforcement learning: Near-optimal risk-sample tradeoff in regret

Y Fei, Z Yang, Y Chen, Z Wang… - Advances in Neural …, 2020 - proceedings.neurips.cc

We study risk-sensitive reinforcement learning in episodic Markov decision processes with
unknown transition kernels, where the goal is to optimize the total reward under the risk …

Cited by 65 · Related articles · All 8 versions

Safe exploration algorithms for reinforcement learning controllers

T Mannucci, EJ van Kampen… - IEEE transactions on …, 2017 - ieeexplore.ieee.org

Self-learning approaches, such as reinforcement learning, offer new possibilities for
autonomous control of uncertain or time-varying systems. However, exploring an unknown …

Cited by 105 · Related articles · All 5 versions

