Uncertainty-aware Constraint Inference in Inverse Constrained Reinforcement Learning

S Xu, G Liu - The Twelfth International Conference on Learning …, 2023 - openreview.net
Aiming for safe control, Inverse Constrained Reinforcement Learning (ICRL) considers
inferring the constraints respected by expert agents from their demonstrations and learning …

Last-iterate global convergence of policy gradients for constrained reinforcement learning

A Montenegro, M Mussi, M Papini… - arXiv preprint arXiv …, 2024 - arxiv.org
Constrained Reinforcement Learning (CRL) tackles sequential decision-making problems
where agents are required to achieve goals by maximizing the expected return while …

A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization

Y Luo, Y Pan, H Wang, P Torr, P Poupart - arXiv preprint arXiv:2403.11062, 2024 - arxiv.org
Reinforcement learning algorithms utilizing policy gradients (PG) to optimize Conditional
Value at Risk (CVaR) face significant challenges with sample inefficiency, hindering their …

EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning

P Malekzadeh, Z Poulos, J Chen, Z Wang… - Proceedings of the 5th …, 2024 - dl.acm.org
Recent advancements in Distributional Reinforcement Learning (DRL) for modeling loss
distributions have shown promise in developing hedging strategies in derivatives markets. A …

Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis

JL Hau, E Delage, E Derman, M Ghavamzadeh… - arXiv preprint arXiv …, 2024 - arxiv.org
In Markov decision processes (MDPs), quantile risk measures such as Value-at-Risk are a
standard metric for modeling RL agents' preferences for certain outcomes. This paper …

Extreme Risk Mitigation in Reinforcement Learning using Extreme Value Theory

KS NS, Y Wang, M Schram, J Drgona… - arXiv preprint arXiv …, 2023 - arxiv.org
Risk-sensitive reinforcement learning (RL) has garnered significant attention in recent years
due to the growing interest in deploying RL agents in real-world scenarios. A critical aspect …

[PDF][PDF] Policy Learning under Uncertainty and Risk

Y Luo - 2024 - uwspace.uwaterloo.ca
Recent years have seen a rapid growth of reinforcement learning (RL) research. In year
2015, deep RL achieved superhuman performance in Atari video games (Mnih et al., 2015) …

[PDF][PDF] 11. Risk, Reward, and Reinforcement Learning in Ice Hockey Analytics

S Xu, O Schulte, Y Luo, P Poupart, G Liu - cs.sfu.ca
What makes many decisions in sports difficult is that they involve a trade-off between risk
and reward. Actions such as taking a three-point shot, carrying a puck, or dribbling with a …

[PDF][PDF] 11. Risiko, Belohnung und Verstärkungslernen in der Eishockey-Analytik

S Xu, O Schulte, Y Luo, P Poupart, G Liu - cs.sfu.ca
Abstrakt Viele Entscheidungen im Sport sind deshalb so schwierig, weil sie eine Abwägung
zwischen Risiko und Belohnung beinhalten. Aktionen wie ein Drei-Punkte-Schuss, das …