Refined analysis of fpl for adversarial markov decision processes

文章

学术资源搜索

获得 2 条结果（用时0.02秒）

我的图书馆

Refined analysis of fpl for adversarial markov decision processes

在引用文章中搜索

[PDF] neurips.cc

Follow-the-perturbed-leader for adversarial markov decision processes with bandit feedback

Y Dai, H Luo, L Chen - Advances in Neural Information …, 2022 - proceedings.neurips.cc

We consider regret minimization for Adversarial Markov Decision Processes (AMDPs),
where the loss functions are changing over time and adversarially chosen, and the learner …

被引用次数：16 相关文章所有 7 个版本

[PDF] openreview.net

Follow-the-Perturbed-Leader for Adversarial Bandits: Heavy Tails, Robustness, and Privacy

D Cheng, X Zhou, B Ji - openreview.net

We study adversarial bandit problems with potentially heavy-tailed losses. Unlike standard
settings with non-negative and bounded losses, managing negative and unbounded losses …