On the proper treatment of tokenization in psycholinguistics
Language models are widely used in computational psycholinguistics to test theories that
relate the negative log probability (the surprisal) of a region of interest (a substring of …
Mouse Tracking for Reading (MoTR): A new naturalistic incremental processing measurement tool
We introduce Mouse Tracking for Reading (MoTR), a new incremental processing
measurement tool that can be used to collect word-by-word reading times. In a MoTR trial …
EMTeC: A Corpus of Eye Movements on Machine-Generated Texts
The Eye Movements on Machine-Generated Texts Corpus (EMTeC) is a naturalistic eye-
movements-while-reading corpus of 107 native English speakers reading machine …