Shared computational principles for language processing in humans and deep language models
Departing from traditional linguistic models, advances in deep learning have resulted in a
new type of predictive (autoregressive) deep language models (DLMs). Using a self …
new type of predictive (autoregressive) deep language models (DLMs). Using a self …
Expanding horizons of cross-linguistic research on reading: The Multilingual Eye-movement Corpus (MECO)
Scientific studies of language behavior need to grapple with a large diversity of languages in
the world and, for reading, a further variability in writing systems. Yet, the ability to form …
the world and, for reading, a further variability in writing systems. Yet, the ability to form …
CogniVal: A framework for cognitive word embedding evaluation
N Hollenstein, A de la Torre, N Langer… - arXiv preprint arXiv …, 2019 - arxiv.org
An interesting method of evaluating word representations is by how much they reflect the
semantic representations in the human brain. However, most, if not all, previous works only …
semantic representations in the human brain. However, most, if not all, previous works only …
Multilingual language models predict human reading behavior
We analyze if large language models are able to predict patterns of human reading
behavior. We compare the performance of language-specific and multilingual pretrained …
behavior. We compare the performance of language-specific and multilingual pretrained …
The Beijing Sentence Corpus: A Chinese sentence corpus with eye movement data and predictability norms
This report introduces the Beijing Sentence Corpus (BSC). This is a Chinese sentence
corpus of eye-tracking data with relatively clear word boundaries. In addition, we report …
corpus of eye-tracking data with relatively clear word boundaries. In addition, we report …
Thinking ahead: spontaneous prediction in context as a keystone of language in humans and machines
Departing from classical rule-based linguistic models, advances in deep learning have led
to the development of a new family of self-supervised deep language models (DLMs). These …
to the development of a new family of self-supervised deep language models (DLMs). These …
CMCL 2021 shared task on eye-tracking prediction
Eye-tracking data from reading represent an important resource for both linguistics and
natural language processing. The ability to accurately model gaze features is crucial to …
natural language processing. The ability to accurately model gaze features is crucial to …
GECO-CN: Ghent Eye-tracking COrpus of sentence reading for Chinese-English bilinguals
The current work presents the very first eye-tracking corpus of natural reading by Chinese-
English bilinguals, whose two languages entail different writing systems and orthographies …
English bilinguals, whose two languages entail different writing systems and orthographies …
TURead: An eye movement dataset of Turkish reading
In this study, we present TURead, an eye movement dataset of silent and oral sentence
reading in Turkish, an agglutinative language with a shallow orthography understudied in …
reading in Turkish, an agglutinative language with a shallow orthography understudied in …
Relative importance in sentence processing
N Hollenstein, L Beinborn - arXiv preprint arXiv:2106.03471, 2021 - arxiv.org
Determining the relative importance of the elements in a sentence is a key factor for
effortless natural language understanding. For human language processing, we can …
effortless natural language understanding. For human language processing, we can …