Decoding speech perception from non-invasive brain recordings

A Défossez, C Caucheteux, J Rapin, O Kabeli… - Nature Machine …, 2023 - nature.com
Decoding speech from brain activity is a long-awaited goal in both healthcare and
neuroscience. Invasive devices have recently led to major milestones in this regard: deep …

On the linguistic representational power of neural machine translation models

Y Belinkov, N Durrani, F Dalvi, H Sajjad… - Computational …, 2020 - direct.mit.edu
Despite the recent success of deep neural networks in natural language processing and
other spheres of artificial intelligence, their interpretability remains a challenge. We analyze …

The GUM corpus: Creating multilayer resources in the classroom

A Zeldes - Language Resources and Evaluation, 2017 - Springer
This paper presents the methodology, design principles and detailed evaluation of a new
freely available multilayer corpus, collected and edited via classroom annotation using …

Corpora annotated with negation: An overview

SM Jiménez-Zafra, R Morante… - Computational …, 2020 - aclanthology.org
Negation is a universal linguistic phenomenon with a great qualitative impact on natural
language processing applications. The availability of corpora annotated with negation is …

Emobank: Studying the impact of annotation perspective and representation format on dimensional emotion analysis

S Buechel, U Hahn - arXiv preprint arXiv:2205.01996, 2022 - arxiv.org
We describe EmoBank, a corpus of 10k English sentences balancing multiple genres, which
we annotated with dimensional emotion metadata in the Valence-Arousal-Dominance (VAD) …

[图书][B] Corpus linguistics: Method, theory and practice

T McEnery, A Hardie - 2011 - books.google.com
Corpus linguistics is the study of language data on a large scale-the computer-aided
analysis of very extensive collections of transcribed utterances or written texts. This textbook …

The groningen meaning bank

J Bos, V Basile, K Evang, NJ Venhuizen… - Handbook of linguistic …, 2017 - Springer
The goal of the Groningen Meaning Bank (GMB) is to obtain a large corpus of English texts
annotated with formal meaning representations. Since manually annotating a …

Corpus linguistics and linguistically annotated corpora

S Kübler, H Zinsmeister - 2014 - torrossa.com
The idea for this textbook emerged when Sandra was teaching corpus linguistics to
linguistics and computational linguistics students at Indiana University. One of the goals of …

[HTML][HTML] BioC: a minimalist approach to interoperability for biomedical text processing

DC Comeau, R Islamaj Doğan, P Ciccarese… - Database, 2013 - academic.oup.com
A vast amount of scientific information is encoded in natural language text, and the quantity
of such text has become so great that it is no longer economically feasible to have a human …

From word types to tokens and back: A survey of approaches to word meaning representation and interpretation

M Apidianaki - Computational Linguistics, 2023 - direct.mit.edu
Vector-based word representation paradigms situate lexical meaning at different levels of
abstraction. Distributional and static embedding models generate a single vector per word …