Combinatorial pure exploration of multi-armed bandits

T Lanciano, A Miyauchi, A Fazzone, F Bonchi - ACM Computing Surveys, 2024 - dl.acm.org

The Densest Subgraph Problem requires us to find, in a given graph, a subset of vertices
whose induced subgraph maximizes a measure of density. The problem has received a …

被引用次数：40 相关文章所有 6 个版本

[PDF] tor-lattimore.com

[图书][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com

Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

被引用次数：3229 相关文章所有 9 个版本

[PDF] acm.org

Robust influence maximization

W Chen, T Lin, Z Tan, M Zhao, X Zhou - Proceedings of the 22nd ACM …, 2016 - dl.acm.org

In this paper, we address the important issue of uncertainty in the edge influence probability
estimates for the well studied influence maximization problem---the task of finding k seed …

被引用次数：245 相关文章所有 17 个版本

[PDF] mlr.press

Gamification of pure exploration for linear bandits

R Degenne, P Ménard, X Shang… - … on Machine Learning, 2020 - proceedings.mlr.press

We investigate an active\emph {pure-exploration} setting, that includes\emph {best-arm
identification}, in the context of\emph {linear stochastic bandits}. While asymptotically optimal …

被引用次数：100 相关文章所有 12 个版本

[PDF] neurips.cc

Sequential experimental design for transductive linear bandits

T Fiez, L Jain, KG Jamieson… - Advances in neural …, 2019 - proceedings.neurips.cc

In this paper we introduce the pure exploration transductive linear bandit problem: given a
set of measurement vectors $\mathcal {X}\subset\mathbb {R}^ d $, a set of items $\mathcal …

被引用次数：139 相关文章所有 8 个版本

[PDF] jmlr.org

Mixture martingales revisited with applications to sequential tests and confidence intervals

E Kaufmann, WM Koolen - Journal of Machine Learning Research, 2021 - jmlr.org

This paper presents new deviation inequalities that are valid uniformly in time under
adaptive sampling in a multi-armed bandit model. The deviations are measured using the …

被引用次数：137 相关文章所有 12 个版本

[PDF] mlr.press

An optimal algorithm for the thresholding bandit problem

A Locatelli, M Gutzeit… - … Conference on Machine …, 2016 - proceedings.mlr.press

We study a specific combinatorial pure exploration stochastic bandit problem where the
learner aims at finding the set of arms whose means are above a given threshold, up to a …

被引用次数：173 相关文章所有 13 个版本

[PDF] mlr.press

Tight (lower) bounds for the fixed budget best arm identification bandit problem

A Carpentier, A Locatelli - Conference on Learning Theory, 2016 - proceedings.mlr.press

We consider the problem of\textitbest arm identification with a\textitfixed budget T, in the K-
armed stochastic bandit setting, with arms distribution defined on [0, 1]. We prove that any …

被引用次数：158 相关文章所有 7 个版本

[PDF] neurips.cc

Combinatorial multi-armed bandit with general reward functions

W Chen, W Hu, F Li, J Li, Y Liu… - Advances in Neural …, 2016 - proceedings.neurips.cc

In this paper, we study the stochastic combinatorial multi-armed bandit (CMAB) framework
that allows a general nonlinear reward function, whose expected value may not depend only …

被引用次数：158 相关文章所有 15 个版本

[PDF] mlr.press

Contextual combinatorial cascading bandits

S Li, B Wang, S Zhang, W Chen - … conference on machine …, 2016 - proceedings.mlr.press

We propose the contextual combinatorial cascading bandits, a combinatorial online learning
game, where at each time step a learning agent is given a set of contextual information, then …

被引用次数：154 相关文章所有 17 个版本