An -Best-Arm Identification Algorithm for Fixed-Confidence and Beyond
We propose EB-TC $\varepsilon $, a novel sampling rule for $\varepsilon $-best arm
identification in stochastic bandits. It is the first instance of Top Two algorithm analyzed for …
identification in stochastic bandits. It is the first instance of Top Two algorithm analyzed for …
Non-asymptotic analysis of a ucb-based top two algorithm
A Top Two sampling rule for bandit identification is a method which selects the next arm to
sample from among two candidate arms, a leader and a challenger. Due to their simplicity …
sample from among two candidate arms, a leader and a challenger. Due to their simplicity …
Bandit Pareto Set Identification: the Fixed Budget Setting
C Kone, E Kaufmann, L Richert - … Conference on Artificial …, 2024 - proceedings.mlr.press
We study a multi-objective pure exploration problem in a multi-armed bandit model. Each
arm is associated to an unknown multi-variate distribution and the goal is to identify the …
arm is associated to an unknown multi-variate distribution and the goal is to identify the …
Asymptotically Optimal Sampling Policy for Selecting Top-m Alternatives
We consider selecting the top-m alternatives from a finite number of alternatives via Monte
Carlo simulation. Under a Bayesian framework, we formulate the sampling decision as a …
Carlo simulation. Under a Bayesian framework, we formulate the sampling decision as a …
Marginal improvement procedures for top-m selection
Given a fixed simulation budget, the problem of selecting the best and top-m alternatives
among a finite set of alternatives have been studied separately in simulation optimization …
among a finite set of alternatives have been studied separately in simulation optimization …
Differentially Private Best-Arm Identification
Best Arm Identification (BAI) problems are progressively used for data-sensitive applications,
such as designing adaptive clinical trials, tuning hyper-parameters, and conducting user …
such as designing adaptive clinical trials, tuning hyper-parameters, and conducting user …
Optimal Top-Two Method for Best Arm Identification and Fluid Analysis
Top-$2 $ methods have become popular in solving the best arm identification (BAI) problem.
The best arm, or the arm with the largest mean amongst finitely many, is identified through …
The best arm, or the arm with the largest mean amongst finitely many, is identified through …
Top-Two Thompson Sampling for Contextual Top-mc Selection Problems
We aim to efficiently allocate a fixed simulation budget to identify the top-mc designs for
each context among a finite number of contexts. The performance of each design under a …
each context among a finite number of contexts. The performance of each design under a …
On Experimentation With Heterogeneous Subgroups: An Asymptotic Optimal δ-Weighted-PAC Design
D Simchi-Levi, C Wang, J Xu - Available at SSRN 4721755, 2024 - papers.ssrn.com
We consider the experimental design with the objective of identifying the potentially
heterogeneous optimal treatment for each subgroup, utilizing minimal experimental units …
heterogeneous optimal treatment for each subgroup, utilizing minimal experimental units …
[HTML][HTML] Fundamental Limits in Stochastic Bandits
PA Wang - 2024 - diva-portal.org
This thesis contributes to the field of stochastic bandits by exploring the fundamental limits
(information-theoretic lower bounds) of three prevalent objects in various reinforcement …
(information-theoretic lower bounds) of three prevalent objects in various reinforcement …