[PDF][PDF] Modern language models refute Chomsky's approach to language
S Piantadosi - Lingbuzz Preprint, lingbuzz, 2023 - lingbuzz.net
The rise and success of large language models undermines virtually every strong claim for
the innateness of language that has been proposed by generative linguistics. Modern …
the innateness of language that has been proposed by generative linguistics. Modern …
Language varieties of Italy: Technology challenges and opportunities
A Ramponi - Transactions of the Association for Computational …, 2024 - direct.mit.edu
Italy is characterized by a one-of-a-kind linguistic diversity landscape in Europe, which
implicitly encodes local knowledge, cultural traditions, artistic expressions, and history of its …
implicitly encodes local knowledge, cultural traditions, artistic expressions, and history of its …
The ParlaMint corpora of parliamentary proceedings
This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17
European national parliaments with half a billion words. The corpora are uniformly encoded …
European national parliaments with half a billion words. The corpora are uniformly encoded …
Schrödinger's tree—On syntax and neural language models
A Kulmizev, J Nivre - Frontiers in Artificial Intelligence, 2022 - frontiersin.org
In the last half-decade, the field of natural language processing (NLP) has undergone two
major transitions: the switch to neural networks as the primary modeling paradigm and the …
major transitions: the switch to neural networks as the primary modeling paradigm and the …
A survey on narrative extraction from textual data
Narratives are present in many forms of human expression and can be understood as a
fundamental way of communication between people. Computational understanding of the …
fundamental way of communication between people. Computational understanding of the …
The SIGMORPHON 2022 shared task on morpheme segmentation
The SIGMORPHON 2022 shared task on morpheme segmentation challenged systems to
decompose a word into a sequence of morphemes and covered most types of morphology …
decompose a word into a sequence of morphemes and covered most types of morphology …
HuSpaCy: an industrial-strength Hungarian natural language processing toolkit
Although there are a couple of open-source language processing pipelines available for
Hungarian, none of them satisfies the requirements of today's NLP applications. A language …
Hungarian, none of them satisfies the requirements of today's NLP applications. A language …
[HTML][HTML] GENA: a knowledge graph for nutrition and mental health
LD Dang, UTP Phan, NTH Nguyen - Journal of Biomedical Informatics, 2023 - Elsevier
While a large number of knowledge graphs have previously been developed by
automatically extracting and structuring knowledge from literature, there is currently no such …
automatically extracting and structuring knowledge from literature, there is currently no such …
Construction grammar provides unique insight into neural language models
Construction Grammar (CxG) has recently been used as the basis for probing studies that
have investigated the performance of large pretrained language models (PLMs) with respect …
have investigated the performance of large pretrained language models (PLMs) with respect …
Large language models implicitly learn to straighten neural sentence trajectories to construct a predictive representation of natural language.
E Hosseini, E Fedorenko - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Predicting upcoming events is critical to our ability to effectively interact with ourenvironment
and conspecifics. In natural language processing, transformer models, which are trained on …
and conspecifics. In natural language processing, transformer models, which are trained on …