Shared computational principles for language processing in humans and deep language models

A Goldstein, Z Zada, E Buchnik, M Schain, A Price… - Nature …, 2022 - nature.com
Departing from traditional linguistic models, advances in deep learning have resulted in a
new type of predictive (autoregressive) deep language models (DLMs). Using a self …

Expanding horizons of cross-linguistic research on reading: The Multilingual Eye-movement Corpus (MECO)

N Siegelman, S Schroeder, C Acartürk, HD Ahn… - Behavior research …, 2022 - Springer
Scientific studies of language behavior need to grapple with a large diversity of languages in
the world and, for reading, a further variability in writing systems. Yet, the ability to form …

CogniVal: A framework for cognitive word embedding evaluation

N Hollenstein, A de la Torre, N Langer… - arXiv preprint arXiv …, 2019 - arxiv.org
An interesting method of evaluating word representations is by how much they reflect the
semantic representations in the human brain. However, most, if not all, previous works only …

Multilingual language models predict human reading behavior

N Hollenstein, F Pirovano, C Zhang, L Jäger… - arXiv preprint arXiv …, 2021 - arxiv.org
We analyze if large language models are able to predict patterns of human reading
behavior. We compare the performance of language-specific and multilingual pretrained …

The Beijing Sentence Corpus: A Chinese sentence corpus with eye movement data and predictability norms

J Pan, M Yan, EM Richter, H Shu, R Kliegl - Behavior Research Methods, 2021 - Springer
This report introduces the Beijing Sentence Corpus (BSC). This is a Chinese sentence
corpus of eye-tracking data with relatively clear word boundaries. In addition, we report …

Thinking ahead: spontaneous prediction in context as a keystone of language in humans and machines

A Goldstein, Z Zada, E Buchnik, M Schain, A Price… - BioRxiv, 2020 - biorxiv.org
Departing from classical rule-based linguistic models, advances in deep learning have led
to the development of a new family of self-supervised deep language models (DLMs). These …

CMCL 2021 shared task on eye-tracking prediction

N Hollenstein, E Chersoni, CL Jacobs… - Proceedings of the …, 2021 - aclanthology.org
Eye-tracking data from reading represent an important resource for both linguistics and
natural language processing. The ability to accurately model gaze features is crucial to …

GECO-CN: Ghent Eye-tracking COrpus of sentence reading for Chinese-English bilinguals

L Sui, N Dirix, E Woumans, W Duyck - Behavior Research Methods, 2023 - Springer
The current work presents the very first eye-tracking corpus of natural reading by Chinese-
English bilinguals, whose two languages entail different writing systems and orthographies …

TURead: An eye movement dataset of Turkish reading

C Acartürk, A Özkan, TN Pekçetin, Z Ormanoğlu… - Behavior Research …, 2024 - Springer
In this study, we present TURead, an eye movement dataset of silent and oral sentence
reading in Turkish, an agglutinative language with a shallow orthography understudied in …

Relative importance in sentence processing

N Hollenstein, L Beinborn - arXiv preprint arXiv:2106.03471, 2021 - arxiv.org
Determining the relative importance of the elements in a sentence is a key factor for
effortless natural language understanding. For human language processing, we can …