A Bayesian approach to online learning for contextual restless bandits with applications to public health

B Liang, L Xu, A Taneja, M Tambe, L Janson - arXiv preprint arXiv …, 2024 - arxiv.org
Restless multi-armed bandits (RMABs) are used to model sequential resource allocation in
public health intervention programs. In these settings, the underlying transition dynamics are …

A Bayesian Approach to Online Learning for Contextual Restless Bandits with Applications to Public Health

B Liang, L Xu, A Taneja, M Tambe, L Janson - arXiv e-prints, 2024 - ui.adsabs.harvard.edu
Restless multi-armed bandits (RMABs) are used to model sequential resource allocation in
public health intervention programs. In these settings, the underlying transition dynamics are …