Language in brains, minds, and machines
G Tuckute, N Kanwisher… - Annual Review of …, 2024 - annualreviews.org
It has long been argued that only humans could produce and understand language. But
now, for the first time, artificial language models (LMs) achieve this feat. Here we survey the …
now, for the first time, artificial language models (LMs) achieve this feat. Here we survey the …
[HTML][HTML] Modern language models refute Chomsky's approach to language
ST Piantadosi - From fieldwork to linguistic theory: A tribute to …, 2023 - books.google.com
Modern machine learning has subverted and bypassed the theoretical framework of
Chomsky's generative approach to linguistics, including its core claims to particular insights …
Chomsky's generative approach to linguistics, including its core claims to particular insights …
[PDF][PDF] Findings of the BabyLM Challenge: Sample-efficient pretraining on developmentally plausible corpora
Children can acquire language from less than 100 million words of input. Large language
models are far less data-efficient: they typically require 3 or 4 orders of magnitude more data …
models are far less data-efficient: they typically require 3 or 4 orders of magnitude more data …
Benchmarking large language models as ai research agents
Scientific experimentation involves an iterative process of creating hypotheses, designing
experiments, running experiments, and analyzing the results. Can we build AI research …
experiments, running experiments, and analyzing the results. Can we build AI research …
Language models as models of language
R Millière - arXiv preprint arXiv:2408.07144, 2024 - arxiv.org
This chapter critically examines the potential contributions of modern language models to
theoretical linguistics. Despite their focus on engineering goals, these models' ability to …
theoretical linguistics. Despite their focus on engineering goals, these models' ability to …
Efficient benchmarking (of language models)
The increasing versatility of language models LMs has given rise to a new class of
benchmarks that comprehensively assess a broad range of capabilities. Such benchmarks …
benchmarks that comprehensively assess a broad range of capabilities. Such benchmarks …
From form (s) to meaning: Probing the semantic depths of language models using multisense consistency
The staggering pace with which the capabilities of large language models (LLMs) are
increasing, as measured by a range of commonly used natural language understanding …
increasing, as measured by a range of commonly used natural language understanding …
Baby llama: knowledge distillation from an ensemble of teachers trained on a small dataset with no performance penalty
I Timiryasov, JL Tastet - arXiv preprint arXiv:2308.02019, 2023 - arxiv.org
We present our proposed solution to the BabyLM challenge [arXiv: 2301.11796], whose goal
was to improve the sample efficiency of language models. We trained an ensemble …
was to improve the sample efficiency of language models. We trained an ensemble …
On the unexpected abilities of large language models
S Nolfi - Adaptive Behavior, 2024 - journals.sagepub.com
Large Language Models (LLMs) are capable of displaying a wide range of abilities that are
not directly connected with the task for which they are trained: predicting the next words of …
not directly connected with the task for which they are trained: predicting the next words of …
Mission: Impossible language models
Chomsky and others have very directly claimed that large language models (LLMs) are
equally capable of learning languages that are possible and impossible for humans to learn …
equally capable of learning languages that are possible and impossible for humans to learn …