[PDF][PDF] Modern language models refute Chomsky's approach to language

S Piantadosi - Lingbuzz Preprint, lingbuzz, 2023 - lingbuzz.net
The rise and success of large language models undermines virtually every strong claim for
the innateness of language that has been proposed by generative linguistics. Modern …

Language varieties of Italy: Technology challenges and opportunities

A Ramponi - Transactions of the Association for Computational …, 2024 - direct.mit.edu
Italy is characterized by a one-of-a-kind linguistic diversity landscape in Europe, which
implicitly encodes local knowledge, cultural traditions, artistic expressions, and history of its …

The ParlaMint corpora of parliamentary proceedings

T Erjavec, M Ogrodniczuk, P Osenova… - Language resources …, 2023 - Springer
This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17
European national parliaments with half a billion words. The corpora are uniformly encoded …

Schrödinger's tree—On syntax and neural language models

A Kulmizev, J Nivre - Frontiers in Artificial Intelligence, 2022 - frontiersin.org
In the last half-decade, the field of natural language processing (NLP) has undergone two
major transitions: the switch to neural networks as the primary modeling paradigm and the …

A survey on narrative extraction from textual data

B Santana, R Campos, E Amorim, A Jorge… - Artificial Intelligence …, 2023 - Springer
Narratives are present in many forms of human expression and can be understood as a
fundamental way of communication between people. Computational understanding of the …

The SIGMORPHON 2022 shared task on morpheme segmentation

K Batsuren, G Bella, A Arora, V Martinović… - arXiv preprint arXiv …, 2022 - arxiv.org
The SIGMORPHON 2022 shared task on morpheme segmentation challenged systems to
decompose a word into a sequence of morphemes and covered most types of morphology …

HuSpaCy: an industrial-strength Hungarian natural language processing toolkit

G Orosz, Z Szántó, P Berkecz, G Szabó… - arXiv preprint arXiv …, 2022 - arxiv.org
Although there are a couple of open-source language processing pipelines available for
Hungarian, none of them satisfies the requirements of today's NLP applications. A language …

[HTML][HTML] GENA: a knowledge graph for nutrition and mental health

LD Dang, UTP Phan, NTH Nguyen - Journal of Biomedical Informatics, 2023 - Elsevier
While a large number of knowledge graphs have previously been developed by
automatically extracting and structuring knowledge from literature, there is currently no such …

Construction grammar provides unique insight into neural language models

L Weissweiler, T He, N Otani, DR Mortensen… - arXiv preprint arXiv …, 2023 - arxiv.org
Construction Grammar (CxG) has recently been used as the basis for probing studies that
have investigated the performance of large pretrained language models (PLMs) with respect …

Large language models implicitly learn to straighten neural sentence trajectories to construct a predictive representation of natural language.

E Hosseini, E Fedorenko - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Predicting upcoming events is critical to our ability to effectively interact with ourenvironment
and conspecifics. In natural language processing, transformer models, which are trained on …