Computational models of anaphora
Interpreting anaphoric references is a fundamental aspect of our language competence that
has long attracted the attention of computational linguists. The appearance of ever-larger …
has long attracted the attention of computational linguists. The appearance of ever-larger …
A survey on narrative extraction from textual data
Narratives are present in many forms of human expression and can be understood as a
fundamental way of communication between people. Computational understanding of the …
fundamental way of communication between people. Computational understanding of the …
Corpus Linguistics Use in Vocabulary Teaching Principle and Technique Application: A Study of Indonesian Language for Foreign Speakers
Indonesian Language for Foreign Speakers (BIPA) is Indonesian language learning
intended for foreigners. The aim of this research was to examine the vocabulary …
intended for foreigners. The aim of this research was to examine the vocabulary …
DisCoDisCo at the DISRPT2021 shared task: A system for discourse segmentation, classification, and connective detection
This paper describes our submission to the DISRPT2021 Shared Task on Discourse Unit
Segmentation, Connective Detection, and Relation Classification. Our system, called …
Segmentation, Connective Detection, and Relation Classification. Our system, called …
Why can't discourse parsing generalize? A thorough investigation of the impact of data diversity
Recent advances in discourse parsing performance create the impression that, as in other
NLP tasks, performance for high-resource languages such as English is finally becoming …
NLP tasks, performance for high-resource languages such as English is finally becoming …
Opinion Piece: Can we Fix the Scope for Coreference? Problems and Solutions for Benchmarks beyond OntoNotes
A Zeldes - Dialogue & Discourse, 2022 - journals.uic.edu
Current work on automatic coreference resolution has focused on the OntoNotes benchmark
dataset, due to both its size and consistency. However many aspects of the OntoNotes …
dataset, due to both its size and consistency. However many aspects of the OntoNotes …
Aggregating crowdsourced and automatic judgments to scale up a corpus of anaphoric reference for fiction and Wikipedia texts
Although several datasets annotated for anaphoric reference/coreference exist, even the
largest such datasets have limitations in terms of size, range of domains, coverage of …
largest such datasets have limitations in terms of size, range of domains, coverage of …
Developing a multilayer semantic annotation scheme based on ISO standards for the visualization of a newswire corpus
In this paper, we describe the process of developing a multilayer semantic annotation
scheme designed for extracting information from a European Portuguese corpus of news …
scheme designed for extracting information from a European Portuguese corpus of news …
Corpus annotation
J Newman, C Cox - A practical handbook of corpus linguistics, 2021 - Springer
In this chapter, we provide an overview of the main concepts relating to corpus annotation,
along with some discussion of the practical aspects of creating annotated texts and working …
along with some discussion of the practical aspects of creating annotated texts and working …
Exploring a Multi-Layered Cross-Genre Corpus of Document-Level Semantic Relations
This paper introduces a multi-layered cross-genre corpus, annotated for coreference
resolution, causal relations, and temporal relations, comprising a variety of genres, from …
resolution, causal relations, and temporal relations, comprising a variety of genres, from …