Recursive optimization of convex risk measures: Mean-semideviation models

DS Kalogerias, WB Powell - arXiv preprint arXiv:1804.00636, 2018 - arxiv.org
We develop recursive, data-driven, stochastic subgradient methods for optimizing a new,
versatile, and application-driven class of convex risk measures, termed here as mean …

On Monte-Carlo methods in convex stochastic optimization

D Bartl, S Mendelson - The Annals of Applied Probability, 2022 - projecteuclid.org
We develop a novel procedure for estimating the optimizer of general convex stochastic
optimization problems of the form min x∈ XE [F (x, ξ)], when the given data is a finite …

An online algorithm for the risk-aware restless bandit

J Xu, L Chen, O Tang - European Journal of Operational Research, 2021 - Elsevier
The multi-armed bandit (MAB) is a classical model for the exploration vs. exploitation trade-
off. Among existing MAB models, the restless bandit model is of increasing interest because …

Moderate deviations for minimax stochastic programs

M Gao - Optimization Letters, 2024 - Springer
In this paper, we study moderate deviations for minimax stochastic programs. We first
establish a minimax delta theorem in large deviations, and then apply the minimax delta …

Model and reinforcement learning for markov games with risk preferences

W Huang, VH Pham, WB Haskell - … of the AAAI Conference on Artificial …, 2020 - ojs.aaai.org
We motivate and propose a new model for non-cooperative Markov game which considers
the interactions of risk-aware players. This model characterizes the time-consistent dynamic …

[PDF][PDF] Scenario reduction for risk-averse stochastic programs

S Arpón, T Homem-de Mello… - … Online http://www …, 2018 - optimization-online.org
In this paper we discuss scenario reduction methods for risk-averse stochastic optimization
problems. Scenario reduction techniques have received some attention in the literature and …

[PDF][PDF] Model and algorithm for time-consistent risk-aware Markov games

W Huang, PV Hai, WB Haskell - arXiv preprint arXiv:1901.04882, 2019 - academia.edu
In this paper, we propose a model for non-cooperative Markov games with time-consistent
risk-aware players. In particular, our model characterizes the risk arising from both the …

Asymptotic behaviors of semidefinite programming with a covariance perturbation

MJ Gao, KFC Yiu - Optimization Letters, 2019 - Springer
In this paper, we study asymptotic behaviors of semidefinite programming with a covariance
perturbation. We obtain some moderate deviations, Cramér-type moderate deviations and a …

[图书][B] Kernel Smoothing in Sample-based Optimization

Y Lin - 2020 - search.proquest.com
The area of stochastic optimization (stochastic programming) has gained interests and
attention by its relevance to many problems arising in other areas. The multitude of papers in …

[PDF][PDF] RISK-AWARE PREFERENCE ELICITATION, ESTIMATION AND OPTIMIZATION

H WENJIE - 2019 - researchgate.net
Decision-making is the cognitive process of identifying the decision maker's preference and
finding a corresponding favorite solution. This thesis focus on decision maker's risk …