Uncertainty-aware Constraint Inference in Inverse Constrained Reinforcement Learning
Aiming for safe control, Inverse Constrained Reinforcement Learning (ICRL) considers
inferring the constraints respected by expert agents from their demonstrations and learning …
inferring the constraints respected by expert agents from their demonstrations and learning …
Last-iterate global convergence of policy gradients for constrained reinforcement learning
Constrained Reinforcement Learning (CRL) tackles sequential decision-making problems
where agents are required to achieve goals by maximizing the expected return while …
where agents are required to achieve goals by maximizing the expected return while …
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Reinforcement learning algorithms utilizing policy gradients (PG) to optimize Conditional
Value at Risk (CVaR) face significant challenges with sample inefficiency, hindering their …
Value at Risk (CVaR) face significant challenges with sample inefficiency, hindering their …
EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning
P Malekzadeh, Z Poulos, J Chen, Z Wang… - Proceedings of the 5th …, 2024 - dl.acm.org
Recent advancements in Distributional Reinforcement Learning (DRL) for modeling loss
distributions have shown promise in developing hedging strategies in derivatives markets. A …
distributions have shown promise in developing hedging strategies in derivatives markets. A …
Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis
In Markov decision processes (MDPs), quantile risk measures such as Value-at-Risk are a
standard metric for modeling RL agents' preferences for certain outcomes. This paper …
standard metric for modeling RL agents' preferences for certain outcomes. This paper …
Extreme Risk Mitigation in Reinforcement Learning using Extreme Value Theory
Risk-sensitive reinforcement learning (RL) has garnered significant attention in recent years
due to the growing interest in deploying RL agents in real-world scenarios. A critical aspect …
due to the growing interest in deploying RL agents in real-world scenarios. A critical aspect …
[PDF][PDF] Policy Learning under Uncertainty and Risk
Y Luo - 2024 - uwspace.uwaterloo.ca
Recent years have seen a rapid growth of reinforcement learning (RL) research. In year
2015, deep RL achieved superhuman performance in Atari video games (Mnih et al., 2015) …
2015, deep RL achieved superhuman performance in Atari video games (Mnih et al., 2015) …
[PDF][PDF] 11. Risk, Reward, and Reinforcement Learning in Ice Hockey Analytics
What makes many decisions in sports difficult is that they involve a trade-off between risk
and reward. Actions such as taking a three-point shot, carrying a puck, or dribbling with a …
and reward. Actions such as taking a three-point shot, carrying a puck, or dribbling with a …
[PDF][PDF] 11. Risiko, Belohnung und Verstärkungslernen in der Eishockey-Analytik
Abstrakt Viele Entscheidungen im Sport sind deshalb so schwierig, weil sie eine Abwägung
zwischen Risiko und Belohnung beinhalten. Aktionen wie ein Drei-Punkte-Schuss, das …
zwischen Risiko und Belohnung beinhalten. Aktionen wie ein Drei-Punkte-Schuss, das …