Best arm identification for prompt learning under a limited budget

C Shi, K Yang, J Yang, C Shen - arXiv preprint arXiv:2402.09723, 2024 - openreview.net
The remarkable instruction-following capability of large language models (LLMs) has
sparked a growing interest in automatically learning suitable prompts. However, while many …

Efficient prompt optimization through the lens of best arm identification

C Shi, K Yang, Z Chen, J Li, J Yang… - The Thirty-eighth Annual …, 2024 - openreview.net
The remarkable instruction-following capability of large language models (LLMs) has
sparked a growing interest in automatically finding good prompts, ie, prompt optimization …

Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits

N Nguyen, I Aouali, A György, C Vernade - arXiv preprint arXiv …, 2024 - arxiv.org
We study the problem of Bayesian fixed-budget best-arm identification (BAI) in structured
bandits. We propose an algorithm that uses fixed allocations based on the prior information …

Adaptive Generalized Neyman Allocation: Local Asymptotic Minimax Optimal Best Arm Identification

M Kato - arXiv preprint arXiv:2405.19317, 2024 - arxiv.org
This study investigates a local asymptotic minimax optimal strategy for fixed-budget best arm
identification (BAI). We propose the Adaptive Generalized Neyman Allocation (AGNA) …

Fixed Confidence Best Arm Identification in the Bayesian Setting

K Jang, J Komiyama, K Yamazaki - arXiv preprint arXiv:2402.10429, 2024 - arxiv.org
We consider the fixed-confidence best arm identification (FC-BAI) problem in the Bayesian
Setting. This problem aims to find the arm of the largest mean with a fixed confidence level …

Best Arm Identification in Batched Multi-armed Bandit Problems

S Cao, S He, R Jiang, J Xu, H Yuan - arXiv preprint arXiv:2312.13875, 2023 - arxiv.org
Recently multi-armed bandit problem arises in many real-life scenarios where arms must be
sampled in batches, due to limited time the agent can wait for the feedback. Such …

UCB Exploration for Fixed-Budget Bayesian Best Arm Identification

RJB Zhu, Y Qiu - arXiv preprint arXiv:2408.04869, 2024 - arxiv.org
We study best-arm identification (BAI) in the fixed-budget setting. Adaptive allocations based
on upper confidence bounds (UCBs), such as UCBE, are known to work well in BAI …

[HTML][HTML] Inference and Online Learning in Structured Stochastic Systems

K Ariu - 2023 - diva-portal.org
This thesis contributes to the field of stochastic online learning problems, with a collection of
six papers each addressing unique aspects of online learning and inference problems …

Adaptive Experimental Design for Policy Learning: Contextual Best Arm Identification

M Kato, K Okumura, T Ishihara, T Kitagawa - ICML 2024 Workshop … - openreview.net
This study investigates the contextual best arm identification (BAI) problem, aiming to design
an adaptive experiment to identify the best treatment arm conditioned on contextual …

Fixed Budget Bayesian Best Arm Identification in Unimodal Bandits

D Ghosh, MK Hanawal, N Zlatanov - Authorea Preprints - techrxiv.org
In high-speed vehicular communications, eg, a high-speed train system, the beam between
the base station and user is frequently misaligned, resulting in a huge amount of associated …