Language model behavior: A comprehensive survey

TA Chang, BK Bergen - Computational Linguistics, 2024 - direct.mit.edu
Transformer language models have received widespread public attention, yet their
generated text is often surprising even to NLP researchers. In this survey, we discuss over …

ChatGPT is not enough: Enhancing large language models with knowledge graphs for fact-aware language modeling

L Yang, H Chen, Z Li, X Ding, X Wu - arXiv preprint arXiv:2306.11489, 2023 - arxiv.org
Recently, ChatGPT, a representative large language model (LLM), has gained considerable
attention due to its powerful emergent abilities. Some researchers suggest that LLMs could …

Pythia: A suite for analyzing large language models across training and scaling

S Biderman, H Schoelkopf… - International …, 2023 - proceedings.mlr.press
How do large language models (LLMs) develop and evolve over the course of training?
How do these patterns change as models scale? To answer these questions, we introduce …

Large language models struggle to learn long-tail knowledge

N Kandpal, H Deng, A Roberts… - International …, 2023 - proceedings.mlr.press
The Internet contains a wealth of knowledge—from the birthdays of historical figures to
tutorials on how to code—all of which may be learned by language models. However, while …

Impact of pretraining term frequencies on few-shot reasoning

Y Razeghi, RL Logan IV, M Gardner… - arXiv preprint arXiv …, 2022 - arxiv.org
Pretrained Language Models (LMs) have demonstrated ability to perform numerical
reasoning by extrapolating from a few examples in few-shot settings. However, the extent to …

Trustworthy LLMs: A survey and guideline for evaluating large language models' alignment

Y Liu, Y Yao, JF Ton, X Zhang, RGH Cheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Ensuring alignment, which refers to making models behave in accordance with human
intentions [1, 2], has become a critical task before deploying large language models (LLMs) …

Interpretability at scale: Identifying causal mechanisms in Alpaca

Z Wu, A Geiger, T Icard, C Potts… - Advances in Neural …, 2024 - proceedings.neurips.cc
Obtaining human-interpretable explanations of large, general-purpose language models is
an urgent goal for AI safety. However, it is just as important that our interpretability methods …

Speak, memory: An archaeology of books known to ChatGPT/GPT-4

KK Chang, M Cramer, S Soni, D Bamman - arXiv preprint arXiv …, 2023 - arxiv.org
In this work, we carry out a data archaeology to infer books that are known to ChatGPT and
GPT-4 using a name cloze membership inference query. We find that OpenAI models have …

Counterfactual memorization in neural language models

C Zhang, D Ippolito, K Lee… - Advances in …, 2023 - proceedings.neurips.cc
Modern neural language models that are widely used in various NLP tasks risk memorizing
sensitive information from their training data. Understanding this memorization is important …

Embers of autoregression: Understanding large language models through the problem they are trained to solve

RT McCoy, S Yao, D Friedman, M Hardy… - arXiv preprint arXiv …, 2023 - arxiv.org
The widespread adoption of large language models (LLMs) makes it important to recognize
their strengths and limitations. We argue that in order to develop a holistic understanding of …