An optimal algorithm for bandit convex optimization with strongly-convex and smooth loss

T Lattimore, A Gyorgy - Conference on Learning Theory, 2021 - proceedings.mlr.press

Improved Regret for Zeroth-Order Stochastic Convex Bandits. Proceedings of Machine
Learning Research vol 134:1–27, 2021. 34th Annual Conference on Learning Theory. Improved …

Cited by 27 · Related articles · All 3 versions

Bandit convex optimisation

T Lattimore - arXiv preprint arXiv:2402.06535, 2024 - arxiv.org

Bandit convex optimisation is a fundamental framework for studying zeroth-order convex
optimisation. These notes cover the many tools used for this problem, including cutting plane …

Cited by 8 · Related articles · All 3 versions

Efficient bandit convex optimization: Beyond linear losses

AS Suggala, P Ravikumar… - Conference on Learning …, 2021 - proceedings.mlr.press

We study the problem of online learning with bandit feedback, where a learner aims to
minimize a sequence of adversarially generated loss functions, while only observing the …

Cited by 13 · Related articles · All 3 versions

Improved Regret for Bandit Convex Optimization with Delayed Feedback

Y Wan, C Yao, M Song, L Zhang - arXiv preprint arXiv:2402.09152, 2024 - arxiv.org

We investigate bandit convex optimization (BCO) with delayed feedback, where only the
loss value of the action is revealed under an arbitrary delay. Previous studies have …

Cited by 2 · Related articles · All 2 versions

Adaptive bandit convex optimization with heterogeneous curvature

H Luo, M Zhang, P Zhao - Conference on Learning Theory, 2022 - proceedings.mlr.press

We consider the problem of adversarial bandit convex optimization, that is, online learning
over a sequence of arbitrary convex loss functions with only one function evaluation for each …

Cited by 5 · Related articles · All 4 versions

Provably correct SGD-based exploration for generalized stochastic bandit problem

J Dong, J Wang, LF Yang - 2024 International Conference on …, 2024 - ieeexplore.ieee.org

Bandit problems have been widely used in wireless communication systems which involve
generalized reward models and may suffer high computational complexity. Despite the …

Cited by 1 · Related articles · All 2 versions

Contextual Continuum Bandits: Static Versus Dynamic Regret

A Akhavan, K Lounici, M Pontil… - arXiv preprint arXiv …, 2024 - arxiv.org

We study the contextual continuum bandits problem, where the learner sequentially receives
a side information vector and has to choose an action in a convex set, minimizing a function …

Cited by 1 · Related articles · All 2 versions

CONGO: Compressive Online Gradient Optimization

J Carleton, P Vijaykumar, D Saxena… - arXiv preprint arXiv …, 2024 - arxiv.org

We address the challenge of zeroth-order online convex optimization where the objective
function's gradient exhibits sparsity, indicating that only a small number of dimensions …

Projection-Free Bandit Convex Optimization over Strongly Convex Sets

C Zhang, Y Wang, P Tian, X Cheng, Y Wan… - Pacific-Asia Conference …, 2024 - Springer

Projection-free algorithms for bandit convex optimization have received increasing attention,
due to the ability to deal with the bandit feedback and complicated constraints …

Learning Time-Varying Convexifications of Multiple Fairness Measures

Q Zhou, J Marecek, RN Shorten - openreview.net

There is an increasing appreciation that one may need to consider multiple measures of
fairness, e.g., considering multiple group and individual fairness notions. The relative weights …