Learning from the worst: Dynamically generated datasets to improve online hate detection

B Vidgen, T Thrush, Z Waseem, D Kiela - arXiv preprint arXiv:2012.15761, 2020 - arxiv.org
We present a human-and-model-in-the-loop process for dynamically generating datasets
and training better performing and more robust hate detection models. We provide a new …

Two contrasting data annotation paradigms for subjective NLP tasks

P Röttger, B Vidgen, D Hovy… - arXiv preprint arXiv …, 2021 - arxiv.org
Labelled data is the foundation of most natural language processing tasks. However,
labelling data is difficult and there often are diverse valid beliefs about what the correct data …

Confronting abusive language online: A survey from the ethical and human rights perspective

S Kiritchenko, I Nejadgholi, KC Fraser - Journal of Artificial Intelligence …, 2021 - jair.org
The pervasiveness of abusive content on the internet can lead to severe psychological and
physical harm. Significant effort in Natural Language Processing (NLP) research has been …

KOLD: Korean offensive language dataset

Y Jeong, J Oh, J Ahn, J Lee, J Moon, S Park… - arXiv preprint arXiv …, 2022 - arxiv.org
Recent directions for offensive language detection are hierarchical modeling, identifying the
type and the target of offensive language, and interpretability with offensive span annotation …

Hatemoji: A test suite and adversarially-generated dataset for benchmarking and detecting emoji-based hate

HR Kirk, B Vidgen, P Röttger, T Thrush… - arXiv preprint arXiv …, 2021 - arxiv.org
Detecting online hate is a complex task, and low-performing models have harmful
consequences when used for sensitive applications such as content moderation. Emoji …

Introducing MBIB-the first media bias identification benchmark task and dataset collection

M Wessel, T Horych, T Ruas, A Aizawa, B Gipp… - Proceedings of the 46th …, 2023 - dl.acm.org
Although media bias detection is a complex multi-task problem, there is, to date, no unified
benchmark grouping these evaluation tasks. We introduce the Media Bias Identification …

Cobra frames: Contextual reasoning about effects and harms of offensive statements

X Zhou, H Zhu, A Yerukola, T Davidson… - arXiv preprint arXiv …, 2023 - arxiv.org
Warning: This paper contains content that may be offensive or upsetting. Understanding the
harms and offensiveness of statements requires reasoning about the social and situational …

Hate speech and counter speech detection: Conversational context does matter

X Yu, E Blanco, L Hong - arXiv preprint arXiv:2206.06423, 2022 - arxiv.org
Hate speech is plaguing the cyberspace along with user-generated content. This paper
investigates the role of conversational context in the annotation and detection of online hate …

SOLD: Sinhala offensive language dataset

T Ranasinghe, I Anuradha, D Premasiri, K Silva… - Language Resources …, 2024 - Springer
The widespread of offensive content online, such as hate speech and cyber-bullying, is a
global phenomenon. This has sparked interest in the artificial intelligence (AI) and natural …

[HTML][HTML] Hidden behind the obvious: Misleading keywords and implicitly abusive language on social media

W Yin, A Zubiaga - Online Social Networks and Media, 2022 - Elsevier
While social media offers freedom of self-expression, abusive language carry significant
negative social impact. Driven by the importance of the issue, research in the automated …