A survey on the densest subgraph problem and its variants
The Densest Subgraph Problem requires us to find, in a given graph, a subset of vertices
whose induced subgraph maximizes a measure of density. The problem has received a …
whose induced subgraph maximizes a measure of density. The problem has received a …
[图书][B] Bandit algorithms
T Lattimore, C Szepesvári - 2020 - books.google.com
Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …
and the multi-armed bandit model is a commonly used framework to address it. This …
Robust influence maximization
In this paper, we address the important issue of uncertainty in the edge influence probability
estimates for the well studied influence maximization problem---the task of finding k seed …
estimates for the well studied influence maximization problem---the task of finding k seed …
Gamification of pure exploration for linear bandits
We investigate an active\emph {pure-exploration} setting, that includes\emph {best-arm
identification}, in the context of\emph {linear stochastic bandits}. While asymptotically optimal …
identification}, in the context of\emph {linear stochastic bandits}. While asymptotically optimal …
Sequential experimental design for transductive linear bandits
In this paper we introduce the pure exploration transductive linear bandit problem: given a
set of measurement vectors $\mathcal {X}\subset\mathbb {R}^ d $, a set of items $\mathcal …
set of measurement vectors $\mathcal {X}\subset\mathbb {R}^ d $, a set of items $\mathcal …
Mixture martingales revisited with applications to sequential tests and confidence intervals
E Kaufmann, WM Koolen - Journal of Machine Learning Research, 2021 - jmlr.org
This paper presents new deviation inequalities that are valid uniformly in time under
adaptive sampling in a multi-armed bandit model. The deviations are measured using the …
adaptive sampling in a multi-armed bandit model. The deviations are measured using the …
An optimal algorithm for the thresholding bandit problem
A Locatelli, M Gutzeit… - … Conference on Machine …, 2016 - proceedings.mlr.press
We study a specific combinatorial pure exploration stochastic bandit problem where the
learner aims at finding the set of arms whose means are above a given threshold, up to a …
learner aims at finding the set of arms whose means are above a given threshold, up to a …
Tight (lower) bounds for the fixed budget best arm identification bandit problem
A Carpentier, A Locatelli - Conference on Learning Theory, 2016 - proceedings.mlr.press
We consider the problem of\textitbest arm identification with a\textitfixed budget T, in the K-
armed stochastic bandit setting, with arms distribution defined on [0, 1]. We prove that any …
armed stochastic bandit setting, with arms distribution defined on [0, 1]. We prove that any …
Combinatorial multi-armed bandit with general reward functions
In this paper, we study the stochastic combinatorial multi-armed bandit (CMAB) framework
that allows a general nonlinear reward function, whose expected value may not depend only …
that allows a general nonlinear reward function, whose expected value may not depend only …
Contextual combinatorial cascading bandits
We propose the contextual combinatorial cascading bandits, a combinatorial online learning
game, where at each time step a learning agent is given a set of contextual information, then …
game, where at each time step a learning agent is given a set of contextual information, then …