Best arm identification with fixed budget: A large deviation perspective
We consider the problem of identifying the best arm in stochastic Multi-Armed Bandits
(MABs) using a fixed sampling budget. Characterizing the minimal instance-specific error …
(MABs) using a fixed sampling budget. Characterizing the minimal instance-specific error …
On the existence of a complexity in fixed budget bandit identification
R Degenne - The Thirty Sixth Annual Conference on …, 2023 - proceedings.mlr.press
In fixed budget bandit identification, an algorithm sequentially observes samples from
several distributions up to a given final time. It then answers a query about the set of …
several distributions up to a given final time. It then answers a query about the set of …
Best arm identification for prompt learning under a limited budget
The remarkable instruction-following capability of large language models (LLMs) has
sparked a growing interest in automatically learning suitable prompts. However, while many …
sparked a growing interest in automatically learning suitable prompts. However, while many …
Experimental designs for heteroskedastic variance
Most linear experimental design problems assume homogeneous variance, while the
presence of heteroskedastic noise is present in many realistic settings. Let a learner have …
presence of heteroskedastic noise is present in many realistic settings. Let a learner have …
Fixed-Budget Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
S Nakamura, M Sugiyama - International Conference on …, 2024 - proceedings.mlr.press
We study the real-valued combinatorial pure exploration of the multi-armed bandit in the
fixed-budget setting. We first introduce an algorithm named the Combinatorial Successive …
fixed-budget setting. We first introduce an algorithm named the Combinatorial Successive …
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
We investigate the fixed-budget best-arm identification (BAI) problem for linear bandits in a
potentially non-stationary environment. Given a finite arm set $\mathcal {X}\subset\mathbb …
potentially non-stationary environment. Given a finite arm set $\mathcal {X}\subset\mathbb …
[PDF][PDF] Optimal clustering with bandit feedback
This paper considers the problem of online clustering with bandit feedback. A set of arms (or
items) can be partitioned into various groups that are unknown. Within each group, the …
items) can be partitioned into various groups that are unknown. Within each group, the …
Efficient prompt optimization through the lens of best arm identification
The remarkable instruction-following capability of large language models (LLMs) has
sparked a growing interest in automatically finding good prompts, ie, prompt optimization …
sparked a growing interest in automatically finding good prompts, ie, prompt optimization …
Model-Based Best Arm Identification for Decreasing Bandits
S Takemori, Y Umeda… - … Conference on Artificial …, 2024 - proceedings.mlr.press
We study the problem of reliably identifying the best (lowest loss) arm in a stochastic multi-
armed bandit when the expected loss of each arm is monotone decreasing as a function of …
armed bandit when the expected loss of each arm is monotone decreasing as a function of …
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
We study the problem of Bayesian fixed-budget best-arm identification (BAI) in structured
bandits. We propose an algorithm that uses fixed allocations based on the prior information …
bandits. We propose an algorithm that uses fixed allocations based on the prior information …