An -Best-Arm Identification Algorithm for Fixed-Confidence and Beyond

M Jourdan, R Degenne… - Advances in Neural …, 2023 - proceedings.neurips.cc
We propose EB-TC $\varepsilon $, a novel sampling rule for $\varepsilon $-best arm
identification in stochastic bandits. It is the first instance of Top Two algorithm analyzed for …

Non-asymptotic analysis of a ucb-based top two algorithm

M Jourdan, R Degenne - Advances in Neural Information …, 2024 - proceedings.neurips.cc
A Top Two sampling rule for bandit identification is a method which selects the next arm to
sample from among two candidate arms, a leader and a challenger. Due to their simplicity …

Bandit Pareto Set Identification: the Fixed Budget Setting

C Kone, E Kaufmann, L Richert - … Conference on Artificial …, 2024 - proceedings.mlr.press
We study a multi-objective pure exploration problem in a multi-armed bandit model. Each
arm is associated to an unknown multi-variate distribution and the goal is to identify the …

Asymptotically Optimal Sampling Policy for Selecting Top-m Alternatives

G Zhang, Y Peng, J Zhang… - INFORMS Journal on …, 2023 - pubsonline.informs.org
We consider selecting the top-m alternatives from a finite number of alternatives via Monte
Carlo simulation. Under a Bayesian framework, we formulate the sampling decision as a …

Marginal improvement procedures for top-m selection

H Liu, Z Xing, H Xiao, EP Chew - Automatica, 2024 - Elsevier
Given a fixed simulation budget, the problem of selecting the best and top-m alternatives
among a finite set of alternatives have been studied separately in simulation optimization …

Differentially Private Best-Arm Identification

A Azize, M Jourdan, AA Marjani, D Basu - arXiv preprint arXiv:2406.06408, 2024 - arxiv.org
Best Arm Identification (BAI) problems are progressively used for data-sensitive applications,
such as designing adaptive clinical trials, tuning hyper-parameters, and conducting user …

Optimal Top-Two Method for Best Arm Identification and Fluid Analysis

A Bandyopadhyay, S Juneja, S Agrawal - arXiv preprint arXiv:2403.09123, 2024 - arxiv.org
Top-$2 $ methods have become popular in solving the best arm identification (BAI) problem.
The best arm, or the arm with the largest mean amongst finitely many, is identified through …

Top-Two Thompson Sampling for Contextual Top-mc Selection Problems

X Shi, Y Peng, G Zhang - arXiv preprint arXiv:2306.17704, 2023 - arxiv.org
We aim to efficiently allocate a fixed simulation budget to identify the top-mc designs for
each context among a finite number of contexts. The performance of each design under a …

On Experimentation With Heterogeneous Subgroups: An Asymptotic Optimal δ-Weighted-PAC Design

D Simchi-Levi, C Wang, J Xu - Available at SSRN 4721755, 2024 - papers.ssrn.com
We consider the experimental design with the objective of identifying the potentially
heterogeneous optimal treatment for each subgroup, utilizing minimal experimental units …

[HTML][HTML] Fundamental Limits in Stochastic Bandits

PA Wang - 2024 - diva-portal.org
This thesis contributes to the field of stochastic bandits by exploring the fundamental limits
(information-theoretic lower bounds) of three prevalent objects in various reinforcement …