Entropic risk measures: Coherence vs. convexity, model ambiguity and robust large deviations

RT Rockafellar, S Uryasev - Surveys in Operations Research and …, 2013 - Elsevier

Random variables that stand for cost, loss or damage must be confronted in numerous
situations. Dealing with them systematically for purposes in risk management, optimization …

被引用次数：304 相关文章所有 9 个版本

[PDF] nsf.gov

Distributionally robust stochastic programming

A Shapiro - SIAM Journal on Optimization, 2017 - SIAM

In this paper we study distributionally robust stochastic programming in a setting where there
is a specified reference probability measure and the uncertainty set of probability measures …

被引用次数：217 相关文章所有 6 个版本

[PDF] jmlr.org

On tilted losses in machine learning: Theory and applications

T Li, A Beirami, M Sanjabi, V Smith - Journal of Machine Learning …, 2023 - jmlr.org

Exponential tilting is a technique commonly used in fields such as statistics, probability,
information theory, and optimization to create parametric distribution shifts. Despite its …

被引用次数：43 相关文章所有 4 个版本

[PDF] mlr.press

Cascaded gaps: Towards logarithmic regret for risk-sensitive reinforcement learning

Y Fei, R Xu - International Conference on Machine Learning, 2022 - proceedings.mlr.press

In this paper, we study gap-dependent regret guarantees for risk-sensitive reinforcement
learning based on the entropic risk measure. We propose a novel definition of sub-optimality …

被引用次数：14 相关文章所有 2 个版本

[PDF] mlr.press

Policy gradient bayesian robust optimization for imitation learning

Z Javed, DS Brown, S Sharma, J Zhu… - International …, 2021 - proceedings.mlr.press

The difficulty in specifying rewards for many real-world problems has led to an increased
focus on learning rewards from human feedback, such as demonstrations. However, there …

被引用次数：25 相关文章所有 6 个版本

[PDF] stanford.edu

Game-theoretic planning for risk-aware interactive agents

M Wang, N Mehr, A Gaidon… - 2020 IEEE/RSJ …, 2020 - ieeexplore.ieee.org

Modeling the stochastic behavior of interacting agents is key for safe motion planning. In this
paper, we study the interaction of risk-aware agents in a game-theoretical framework. Under …

被引用次数：36 相关文章所有 6 个版本

[PDF] arxiv.org

Entropy based risk measures

A Pichler, R Schlotter - European Journal of Operational Research, 2020 - Elsevier

Entropy is a measure of self-information which is used to quantify information losses.
Entropy was developed in thermodynamics, but is also used to compare probabilities based …

被引用次数：49 相关文章所有 5 个版本

[PDF] aaai.org

Efficient probabilistic performance bounds for inverse reinforcement learning

D Brown, S Niekum - Proceedings of the AAAI conference on artificial …, 2018 - ojs.aaai.org

In the field of reinforcement learning there has been recent progress towards safety and high-
confidence bounds on policy performance. However, to our knowledge, no practical …

被引用次数：55 相关文章所有 13 个版本

[PDF] arxiv.org

Entropic risk measure in policy search

D Nass, B Belousov, J Peters - 2019 IEEE/RSJ International …, 2019 - ieeexplore.ieee.org

With the increasing pace of automation, modern robotic systems need to act in stochastic,
non-stationary, partially observable environments. A range of algorithms for finding …

被引用次数：43 相关文章所有 13 个版本

[PDF] google.com

[PDF][PDF] Risk-aware reinforcement learning with coherent risk measures and non-linear function approximation

T Lam, A Verma, BKH Low, P Jaillet - The Eleventh International …, 2022 - drive.google.com

We study the risk-aware reinforcement learning (RL) problem in the episodic finite-horizon
Markov decision process with unknown transition and reward functions. In contrast to the risk …

被引用次数：13 相关文章所有 4 个版本