The fundamental risk quadrangle in risk management, optimization and statistical estimation

RT Rockafellar, S Uryasev - Surveys in Operations Research and …, 2013 - Elsevier
Random variables that stand for cost, loss or damage must be confronted in numerous
situations. Dealing with them systematically for purposes in risk management, optimization …

Distributionally robust stochastic programming

A Shapiro - SIAM Journal on Optimization, 2017 - SIAM
In this paper we study distributionally robust stochastic programming in a setting where there
is a specified reference probability measure and the uncertainty set of probability measures …

On tilted losses in machine learning: Theory and applications

T Li, A Beirami, M Sanjabi, V Smith - Journal of Machine Learning …, 2023 - jmlr.org
Exponential tilting is a technique commonly used in fields such as statistics, probability,
information theory, and optimization to create parametric distribution shifts. Despite its …

Cascaded gaps: Towards logarithmic regret for risk-sensitive reinforcement learning

Y Fei, R Xu - International Conference on Machine Learning, 2022 - proceedings.mlr.press
In this paper, we study gap-dependent regret guarantees for risk-sensitive reinforcement
learning based on the entropic risk measure. We propose a novel definition of sub-optimality …

Policy gradient bayesian robust optimization for imitation learning

Z Javed, DS Brown, S Sharma, J Zhu… - International …, 2021 - proceedings.mlr.press
The difficulty in specifying rewards for many real-world problems has led to an increased
focus on learning rewards from human feedback, such as demonstrations. However, there …

Game-theoretic planning for risk-aware interactive agents

M Wang, N Mehr, A Gaidon… - 2020 IEEE/RSJ …, 2020 - ieeexplore.ieee.org
Modeling the stochastic behavior of interacting agents is key for safe motion planning. In this
paper, we study the interaction of risk-aware agents in a game-theoretical framework. Under …

Entropy based risk measures

A Pichler, R Schlotter - European Journal of Operational Research, 2020 - Elsevier
Entropy is a measure of self-information which is used to quantify information losses.
Entropy was developed in thermodynamics, but is also used to compare probabilities based …

Efficient probabilistic performance bounds for inverse reinforcement learning

D Brown, S Niekum - Proceedings of the AAAI conference on artificial …, 2018 - ojs.aaai.org
In the field of reinforcement learning there has been recent progress towards safety and high-
confidence bounds on policy performance. However, to our knowledge, no practical …

Entropic risk measure in policy search

D Nass, B Belousov, J Peters - 2019 IEEE/RSJ International …, 2019 - ieeexplore.ieee.org
With the increasing pace of automation, modern robotic systems need to act in stochastic,
non-stationary, partially observable environments. A range of algorithms for finding …

[PDF][PDF] Risk-aware reinforcement learning with coherent risk measures and non-linear function approximation

T Lam, A Verma, BKH Low, P Jaillet - The Eleventh International …, 2022 - drive.google.com
We study the risk-aware reinforcement learning (RL) problem in the episodic finite-horizon
Markov decision process with unknown transition and reward functions. In contrast to the risk …