The GUM corpus: Creating multilayer resources in the classroom
A Zeldes - Language Resources and Evaluation, 2017 - Springer
This paper presents the methodology, design principles and detailed evaluation of a new
freely available multilayer corpus, collected and edited via classroom annotation using …
Cross-lingual RST discourse parsing
Discourse parsing is an integral part of understanding information flow and argumentative
structure in documents. Most previous research has focused on inducing and evaluating …
DisCut and DiscReT: MELODI at DISRPT 2023
This paper presents the results obtained by the MELODI team for the three tasks proposed
within the DISRPT 2023 shared task on discourse: segmentation, connective identification …
The DISRPT 2023 shared task on elementary discourse unit segmentation, connective detection, and relation classification
In 2023, the third iteration of the DISRPT Shared Task (Discourse Relation Parsing and
Treebanking) was held, dedicated to the underlying units used in discourse parsing across …
HITS at DISRPT 2023: Discourse segmentation, connective detection, and relation classification
HITS participated in the Discourse Segmentation (DS, Task 1) and Connective Detection
(CD, Task 2) tasks at the DISRPT 2023. Task 1 focuses on segmenting the text into …
RST Signalling Corpus: A corpus of signals of coherence relations
We present the RST Signalling Corpus (Das et al. in RST signalling corpus, LDC2015T10.
https://catalog.ldc.upenn.edu/LDC2015T10, 2015), a corpus annotated for signals of …
Why can't discourse parsing generalize? A thorough investigation of the impact of data diversity
Recent advances in discourse parsing performance create the impression that, as in other
NLP tasks, performance for high-resource languages such as English is finally becoming …
ToNy: Contextual embeddings for accurate multilingual discourse segmentation of full documents
Segmentation is the first step in building practical discourse parsers, and is often neglected
in discourse parsing studies. The goal is to identify the minimal spans of text to be linked by …
GCDT: A Chinese RST treebank for multigenre and multilingual discourse parsing
A lack of large-scale human-annotated data has hampered the hierarchical discourse
parsing of Chinese. In this paper, we present GCDT, the largest hierarchical discourse …
Surprise! Uniform Information Density Isn't the Whole Story: Predicting Surprisal Contours in Long-form Discourse
The Uniform Information Density (UID) hypothesis posits that speakers tend to distribute
information evenly across linguistic units to achieve efficient communication. Of course …