Computational models of anaphora

M Poesio, J Yu, S Paun, A Aloraini, P Lu… - Annual Review of …, 2023 - annualreviews.org
Interpreting anaphoric references is a fundamental aspect of our language competence that
has long attracted the attention of computational linguists. The appearance of ever-larger …

A survey on narrative extraction from textual data

B Santana, R Campos, E Amorim, A Jorge… - Artificial Intelligence …, 2023 - Springer
Narratives are present in many forms of human expression and can be understood as a
fundamental way of communication between people. Computational understanding of the …

Corpus Linguistics Use in Vocabulary Teaching Principle and‎ Technique Application: A Study of Indonesian Language for‎ Foreign Speakers

K Saddhono, M Rohmadi, B Setiawan, R Suhita… - International Journal of …, 2023 - ijscl.com
Indonesian Language for Foreign Speakers (BIPA) is Indonesian language learning
intended for foreigners. The aim of this research was to examine the vocabulary …

DisCoDisCo at the DISRPT2021 shared task: A system for discourse segmentation, classification, and connective detection

L Gessler, S Behzad, YJ Liu, S Peng, Y Zhu… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper describes our submission to the DISRPT2021 Shared Task on Discourse Unit
Segmentation, Connective Detection, and Relation Classification. Our system, called …

Why can't discourse parsing generalize? A thorough investigation of the impact of data diversity

YJ Liu, A Zeldes - arXiv preprint arXiv:2302.06488, 2023 - arxiv.org
Recent advances in discourse parsing performance create the impression that, as in other
NLP tasks, performance for high-resource languages such as English is finally becoming …

Opinion Piece: Can we Fix the Scope for Coreference? Problems and Solutions for Benchmarks beyond OntoNotes

A Zeldes - Dialogue & Discourse, 2022 - journals.uic.edu
Current work on automatic coreference resolution has focused on the OntoNotes benchmark
dataset, due to both its size and consistency. However many aspects of the OntoNotes …

Aggregating crowdsourced and automatic judgments to scale up a corpus of anaphoric reference for fiction and Wikipedia texts

J Yu, S Paun, M Camilleri, PC Garcia… - arXiv preprint arXiv …, 2022 - arxiv.org
Although several datasets annotated for anaphoric reference/coreference exist, even the
largest such datasets have limitations in terms of size, range of domains, coverage of …

Developing a multilayer semantic annotation scheme based on ISO standards for the visualization of a newswire corpus

P Silvano, A Leal, F Silva, I Cantante… - Proceedings of the …, 2021 - aclanthology.org
In this paper, we describe the process of developing a multilayer semantic annotation
scheme designed for extracting information from a European Portuguese corpus of news …

Corpus annotation

J Newman, C Cox - A practical handbook of corpus linguistics, 2021 - Springer
In this chapter, we provide an overview of the main concepts relating to corpus annotation,
along with some discussion of the practical aspects of creating annotated texts and working …

Exploring a Multi-Layered Cross-Genre Corpus of Document-Level Semantic Relations

G Williamson, A Cao, Y Chen, Y Ji, L Xu, JD Choi - Information, 2023 - mdpi.com
This paper introduces a multi-layered cross-genre corpus, annotated for coreference
resolution, causal relations, and temporal relations, comprising a variety of genres, from …