Mathdial: A dialogue tutoring dataset with rich pedagogical properties grounded in math reasoning problems

J Macina, N Daheim, SP Chowdhury, T Sinha… - arXiv preprint arXiv …, 2023 - arxiv.org
While automatic dialogue tutors hold great potential in making education personalized and
more accessible, research on such systems has been hampered by a lack of sufficiently …

Impressions: Visual semiotics and aesthetic impact understanding

J Kruk, C Ziems, D Yang - … of the 2023 Conference on Empirical …, 2023 - aclanthology.org
Is aesthetic impact different from beauty? Is visual salience a reflection of its capacity for
effective communication? We present Impressions, a novel dataset through which to …

Impressions: Understanding Visual Semiotics and Aesthetic Impact

J Kruk, C Ziems, D Yang - arXiv preprint arXiv:2310.17887, 2023 - arxiv.org
Is aesthetic impact different from beauty? Is visual salience a reflection of its capacity for
effective communication? We present Impressions, a novel dataset through which to …

ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications

S Takeshita, T Green, I Reinig, K Eckert… - arXiv preprint arXiv …, 2024 - arxiv.org
Extensive efforts in the past have been directed toward the development of summarization
datasets. However, a predominant number of these resources have been (semi) …

Reproduction of human evaluations in:“it's not rocket science: Interpreting figurative language in narratives”

S Mahamood - Proceedings of the 3rd Workshop on Human …, 2023 - aclanthology.org
We describe in this paper an attempt to reproduce some of the human of evaluation results
from the paper “It's not Rocket Science: Interpreting Figurative Language in Narratives”. In …

The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels

E Fleisig, SL Blodgett, D Klein, Z Talat - arXiv preprint arXiv:2405.05860, 2024 - arxiv.org
Longstanding data labeling practices in machine learning involve collecting and
aggregating labels from multiple annotators. But what should we do when annotators …

From Random to Informed Data Selection: A Diversity-Based Approach to Optimize Human Annotation and Few-Shot Learning

A Alcoforado, TP Ferraz, LH Okamura, IC Fama… - arXiv preprint arXiv …, 2024 - arxiv.org
A major challenge in Natural Language Processing is obtaining annotated data for
supervised learning. An option is the use of crowdsourcing platforms for data annotation …