Prompt Optimization with Human Feedback

X Lin, Z Dai, A Verma, SK Ng, P Jaillet… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have demonstrated remarkable performances in various
tasks. However, the performance of LLMs heavily depends on the input prompt, which has …

A Tutorial on Multi-Armed Bandit Applications for Large Language Models

D Bouneffouf, R Féraud - Proceedings of the 30th ACM SIGKDD …, 2024 - dl.acm.org
This tutorial offers a comprehensive guide on using multi-armed bandit (MAB) algorithms to
improve Large Language Models (LLMs). As Natural Language Processing (NLP) tasks …

Prompt Exploration with Prompt Regression

M Feffer, R Xu, Y Sun, M Yurochkin - arXiv preprint arXiv:2405.11083, 2024 - arxiv.org
In the advent of democratized usage of large language models (LLMs), there is a growing
desire to systematize LLM prompt creation and selection processes beyond iterative trial …

Efficient multi-prompt evaluation of LLMs

FM Polo, R Xu, L Weber, M Silva, O Bhardwaj… - arXiv preprint arXiv …, 2024 - arxiv.org
Most popular benchmarks for comparing LLMs rely on a limited set of prompt templates,
which may not fully capture the LLMs' abilities and can affect the reproducibility of results on …