Index policy for multiarmed bandit problem with dynamic risk measures

M Malekipirbazari, Ö Çavuş - European Journal of Operational Research, 2024 - Elsevier
The multiarmed bandit problem (MAB) is a classic problem in which a finite amount of
resources must be allocated among competing choices with the aim of identifying a policy …

Risk-averse allocation indices for multiarmed bandit problem

M Malekipirbazari, Ö Çavuş - IEEE Transactions on Automatic …, 2021 - ieeexplore.ieee.org
In classical multiarmed bandit problem, the aim is to find a policy maximizing the expected
total reward, implicitly assuming that the decision-maker is risk-neutral. On the other hand …

Risk-averse flexible policy on ambulance allocation in humanitarian operations under uncertainty

G Yu, A Liu, H Sun - International Journal of Production Research, 2021 - Taylor & Francis
Proactive ambulance management is constructive to improve the response efficiency for
emergency medical service (EMS) systems under uncertainty. In this paper, we present a …

Risk-Averse Multi-Armed Bandit Problem with Multiple Plays

S Dahlgren, N Marriott - 2023 - gupea.ub.gu.se
This study aims to construct an efficient heuristic, referred to as RA, for a riskaverse
Markovian multi-armed bandit problem (MAB) with multiple plays. The RA incorporates risk …

Dynamic Ambulance Redeployment via Multi-armed Bandits

V Yücesoy - 2019 27th Signal Processing and …, 2019 - ieeexplore.ieee.org
Improving a country's emergency medical services results in serving more calls on time and
saving more lives in return. The ambulance redeployment problem, which is a part of the …

Ambulance Redeployment via Reinforcement Learning

Ü Şahin, V YÜcesoy - 2020 28th Signal Processing and …, 2020 - ieeexplore.ieee.org
In this study, ambulance redeployment is performed by using reinforcement learning
methods. The objective in the ambulance redeployment problem is to redeploy the limited …