How to Train Data-Efficient LLMs

N. Sachdeva, B. Coleman, W.-C. Kang, J. Ni, L. Hong, E. H. Chi, J. Caverlee, J. McAuley, and D. Z. Cheng. arXiv preprint arXiv:2402.09668, 2024.
The training of large language models (LLMs) is expensive. In this paper, we study data-efficient approaches for pre-training LLMs, i.e., techniques that aim to optimize the Pareto frontier of model quality and training resource/data consumption. We seek to understand the tradeoffs associated with data selection routines based on (i) expensive-to-compute data-quality estimates, and (ii) maximization of coverage and diversity-based measures in the feature space. Our first technique, Ask-LLM, leverages the zero-shot reasoning capabilities …
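The abstract's first family of methods scores candidate training examples with a proxy language model. The sketch below is an illustration of that general idea only: it prompts a small causal LM to answer whether a passage is useful pre-training data and uses the probability of "yes" as the selection score. The proxy model, prompt wording, and keep fraction are assumptions made for this sketch, not the paper's exact recipe.

```python
# Illustrative sketch of LLM-based quality scoring for pre-training data selection.
# The proxy model ("gpt2"), prompt wording, and keep ratio are assumptions for this
# example; they are not taken from the paper.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "gpt2"  # stand-in proxy scorer; any instruction-tuned LM could be used
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME).eval()

PROMPT = (
    "###\n{example}\n###\n"
    "Is the text above informative enough to be used for training a language "
    "model? Answer yes or no.\nAnswer:"
)

@torch.no_grad()
def quality_score(example: str) -> float:
    """Return P('yes') among {'yes', 'no'} for the next token as a quality proxy."""
    inputs = tokenizer(
        PROMPT.format(example=example),
        return_tensors="pt",
        truncation=True,
        max_length=1024,
    )
    next_token_logits = model(**inputs).logits[0, -1]
    yes_id = tokenizer(" yes", add_special_tokens=False).input_ids[0]
    no_id = tokenizer(" no", add_special_tokens=False).input_ids[0]
    p_yes, _ = torch.softmax(next_token_logits[[yes_id, no_id]], dim=-1)
    return p_yes.item()

# Rank a candidate pool by score and keep the highest-scoring half.
pool = [
    "The mitochondria produce most of a cell's ATP via oxidative phosphorylation.",
    "click here click here click here best deals!!!",
]
ranked = sorted(pool, key=quality_score, reverse=True)
kept = ranked[: max(1, len(ranked) // 2)]
```

Scoring every candidate with a proxy LLM is expensive per example, which is precisely the quality-versus-cost tradeoff the abstract highlights; the second family of methods it mentions instead maximizes coverage and diversity in a feature space, and is not covered by this sketch.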