State-of-the-art generalisation research in NLP: a taxonomy and review

D Hupkes, M Giulianelli, V Dankers, M Artetxe… - arXiv preprint arXiv …, 2022 - arxiv.org
The ability to generalise well is one of the primary desiderata of natural language
processing (NLP). Yet, what'good generalisation'entails and how it should be evaluated is …

Measuring conversational uptake: A case study on student-teacher interactions

D Demszky, J Liu, Z Mancenido, J Cohen, H Hill… - arXiv preprint arXiv …, 2021 - arxiv.org
In conversation, uptake happens when a speaker builds on the contribution of their
interlocutor by, for example, acknowledging, repeating or reformulating what they have said …

PLANET: Dynamic content planning in autoregressive transformers for long-form text generation

Z Hu, HP Chan, J Liu, X Xiao, H Wu… - arXiv preprint arXiv …, 2022 - arxiv.org
Despite recent progress of pre-trained language models on generating fluent text, existing
methods still suffer from incoherence problems in long-form text generation tasks that …

Improving unsupervised dialogue topic segmentation with utterance-pair coherence scoring

L Xing, G Carenini - arXiv preprint arXiv:2106.06719, 2021 - arxiv.org
Dialogue topic segmentation is critical in several dialogue modeling problems. However,
popular unsupervised approaches only exploit surface features in assessing topical …

Mawseo: Adversarial wiki search poisoning for illicit online promotion

Z Lin, Z Li, X Liao, XF Wang… - 2024 IEEE Symposium on …, 2024 - ieeexplore.ieee.org
As a prominent instance of vandalism edits, Wiki search poisoning for illicit promotion is a
cybercrime in which the adversary aims at editing Wiki articles to promote illicit businesses …

Revisiting cross-lingual summarization: A corpus-based study and a new benchmark with improved annotation

Y Chen, H Zhang, Y Zhou, X Bai, Y Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Most existing cross-lingual summarization (CLS) work constructs CLS corpora by simply and
directly translating pre-annotated summaries from one language to another, which can …

BERT-enhanced relational sentence ordering network

B Cui, Y Li, Z Zhang - Proceedings of the 2020 conference on …, 2020 - aclanthology.org
In this paper, we introduce a novel BERT-enhanced Relational Sentence Ordering Network
(referred to as BRSON) by leveraging BERT for capturing better dependency relationship …

Representation learning in discourse parsing: A survey

W Song, LZ Liu - Science China Technological Sciences, 2020 - Springer
Neural network based deep learning methods aim to learn representations of data and have
produced state-of-the-art results in many natural language processing (NLP) tasks …

Evaluating document coherence modeling

A Shen, M Mistica, B Salehi, H Li… - Transactions of the …, 2021 - direct.mit.edu
While pretrained language models (LMs) have driven impressive gains over morpho-
syntactic and semantic tasks, their ability to model discourse and pragmatic phenomena is …

D-score: Holistic dialogue evaluation without reference

C Zhang, G Lee, LF D'Haro, H Li - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
In artistic gymnastics, difficulty score or D-score is used for judging performance. Starting
from zero, an athlete earns points from different aspects such as composition requirement …