Global Rewards in Restless Multi-Armed Bandits

N Raman, R Shi, F Fang - arXiv preprint arXiv:2406.00738, 2024 - arxiv.org
Restless multi-armed bandits (RMAB) extend multi-armed bandits so pulling an arm impacts
future states. Despite the success of RMABs, a key limiting assumption is the separability of …

Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making

X Chen, I Hou - arXiv preprint arXiv:2403.15640, 2024 - arxiv.org
This paper introduces a novel multi-armed bandits framework, termed Contextual Restless
Bandits (CRB), for complex online decision-making. This CRB framework incorporates the …