Best arm identification for prompt learning under a limited budget
The remarkable instruction-following capability of large language models (LLMs) has
sparked a growing interest in automatically learning suitable prompts. However, while many …
sparked a growing interest in automatically learning suitable prompts. However, while many …
Efficient prompt optimization through the lens of best arm identification
The remarkable instruction-following capability of large language models (LLMs) has
sparked a growing interest in automatically finding good prompts, ie, prompt optimization …
sparked a growing interest in automatically finding good prompts, ie, prompt optimization …
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
We study the problem of Bayesian fixed-budget best-arm identification (BAI) in structured
bandits. We propose an algorithm that uses fixed allocations based on the prior information …
bandits. We propose an algorithm that uses fixed allocations based on the prior information …
Adaptive Generalized Neyman Allocation: Local Asymptotic Minimax Optimal Best Arm Identification
M Kato - arXiv preprint arXiv:2405.19317, 2024 - arxiv.org
This study investigates a local asymptotic minimax optimal strategy for fixed-budget best arm
identification (BAI). We propose the Adaptive Generalized Neyman Allocation (AGNA) …
identification (BAI). We propose the Adaptive Generalized Neyman Allocation (AGNA) …
Fixed Confidence Best Arm Identification in the Bayesian Setting
We consider the fixed-confidence best arm identification (FC-BAI) problem in the Bayesian
Setting. This problem aims to find the arm of the largest mean with a fixed confidence level …
Setting. This problem aims to find the arm of the largest mean with a fixed confidence level …
Best Arm Identification in Batched Multi-armed Bandit Problems
S Cao, S He, R Jiang, J Xu, H Yuan - arXiv preprint arXiv:2312.13875, 2023 - arxiv.org
Recently multi-armed bandit problem arises in many real-life scenarios where arms must be
sampled in batches, due to limited time the agent can wait for the feedback. Such …
sampled in batches, due to limited time the agent can wait for the feedback. Such …
UCB Exploration for Fixed-Budget Bayesian Best Arm Identification
RJB Zhu, Y Qiu - arXiv preprint arXiv:2408.04869, 2024 - arxiv.org
We study best-arm identification (BAI) in the fixed-budget setting. Adaptive allocations based
on upper confidence bounds (UCBs), such as UCBE, are known to work well in BAI …
on upper confidence bounds (UCBs), such as UCBE, are known to work well in BAI …
[HTML][HTML] Inference and Online Learning in Structured Stochastic Systems
K Ariu - 2023 - diva-portal.org
This thesis contributes to the field of stochastic online learning problems, with a collection of
six papers each addressing unique aspects of online learning and inference problems …
six papers each addressing unique aspects of online learning and inference problems …
Adaptive Experimental Design for Policy Learning: Contextual Best Arm Identification
This study investigates the contextual best arm identification (BAI) problem, aiming to design
an adaptive experiment to identify the best treatment arm conditioned on contextual …
an adaptive experiment to identify the best treatment arm conditioned on contextual …
Fixed Budget Bayesian Best Arm Identification in Unimodal Bandits
In high-speed vehicular communications, eg, a high-speed train system, the beam between
the base station and user is frequently misaligned, resulting in a huge amount of associated …
the base station and user is frequently misaligned, resulting in a huge amount of associated …