Language model behavior: A comprehensive survey

TA Chang, BK Bergen - Computational Linguistics, 2024 - direct.mit.edu
Transformer language models have received widespread public attention, yet their
generated text is often surprising even to NLP researchers. In this survey, we discuss over …

Revisiting out-of-distribution robustness in NLP: Benchmarks, analysis, and LLMs evaluations

L Yuan, Y Chen, G Cui, H Gao, F Zou… - Advances in …, 2023 - proceedings.neurips.cc
This paper reexamines the research on out-of-distribution (OOD) robustness in the field of
NLP. We find that the distribution shift settings in previous studies commonly lack adequate …

Embers of autoregression: Understanding large language models through the problem they are trained to solve

RT McCoy, S Yao, D Friedman, M Hardy… - arXiv preprint arXiv …, 2023 - arxiv.org
The widespread adoption of large language models (LLMs) makes it important to recognize
their strengths and limitations. We argue that in order to develop a holistic understanding of …

GLUE-X: Evaluating natural language understanding models from an out-of-distribution generalization perspective

L Yang, S Zhang, L Qin, Y Li, Y Wang, H Liu… - arXiv preprint arXiv …, 2022 - arxiv.org
Pre-trained language models (PLMs) are known to improve the generalization performance
of natural language understanding models by leveraging large amounts of data during the …

Uncertainty in natural language generation: From theory to applications

J Baan, N Daheim, E Ilia, D Ulmer, HS Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advances in powerful Language Models have allowed Natural Language
Generation (NLG) to emerge as an important technology that can not only perform traditional …

Cross-lingual consistency of factual knowledge in multilingual language models

J Qi, R Fernández, A Bisazza - arXiv preprint arXiv:2310.10378, 2023 - arxiv.org
Multilingual large-scale Pretrained Language Models (PLMs) have been shown to store
considerable amounts of factual knowledge, but large variations are observed across …

Mind the instructions: A holistic evaluation of consistency and interactions in prompt-based learning

L Weber, E Bruni, D Hupkes - arXiv preprint arXiv:2310.13486, 2023 - arxiv.org
Finding the best way of adapting pre-trained language models to a task is a big challenge in
current NLP. Just like the previous generation of task-tuned models (TT), models that are …

Characterizing mechanisms for factual recall in language models

Q Yu, J Merullo, E Pavlick - arXiv preprint arXiv:2310.15910, 2023 - arxiv.org
Language Models (LMs) often must integrate facts they memorized in pretraining with new
information that appears in a given context. These two sources can disagree, causing …

Out-of-distribution generalization in natural language processing: Past, present, and future

L Yang, Y Song, X Ren, C Lyu, Y Wang… - Proceedings of the …, 2023 - aclanthology.org
Machine learning (ML) systems in natural language processing (NLP) face
significant challenges in generalizing to out-of-distribution (OOD) data, where the test …

KG-GPT: A general framework for reasoning on knowledge graphs using large language models

J Kim, Y Kwon, Y Jo, E Choi - arXiv preprint arXiv:2310.11220, 2023 - arxiv.org
While large language models (LLMs) have made considerable advancements in
understanding and generating unstructured text, their application in structured data remains …