Introducing CAD: the contextual abuse dataset

B Vidgen, T Thrush, Z Waseem, D Kiela - arXiv preprint arXiv:2012.15761, 2020 - arxiv.org

We present a human-and-model-in-the-loop process for dynamically generating datasets
and training better performing and more robust hate detection models. We provide a new …

被引用次数：202 相关文章所有 4 个版本

[PDF] arxiv.org

Two contrasting data annotation paradigms for subjective NLP tasks

P Röttger, B Vidgen, D Hovy… - arXiv preprint arXiv …, 2021 - arxiv.org

Labelled data is the foundation of most natural language processing tasks. However,
labelling data is difficult and there often are diverse valid beliefs about what the correct data …

被引用次数：123 相关文章所有 8 个版本

[PDF] jair.org Full View

Confronting abusive language online: A survey from the ethical and human rights perspective

S Kiritchenko, I Nejadgholi, KC Fraser - Journal of Artificial Intelligence …, 2021 - jair.org

The pervasiveness of abusive content on the internet can lead to severe psychological and
physical harm. Significant effort in Natural Language Processing (NLP) research has been …

被引用次数：87 相关文章所有 13 个版本

[PDF] arxiv.org

KOLD: Korean offensive language dataset

Y Jeong, J Oh, J Ahn, J Lee, J Moon, S Park… - arXiv preprint arXiv …, 2022 - arxiv.org

Recent directions for offensive language detection are hierarchical modeling, identifying the
type and the target of offensive language, and interpretability with offensive span annotation …

被引用次数：34 相关文章所有 5 个版本

[PDF] arxiv.org

Hatemoji: A test suite and adversarially-generated dataset for benchmarking and detecting emoji-based hate

HR Kirk, B Vidgen, P Röttger, T Thrush… - arXiv preprint arXiv …, 2021 - arxiv.org

Detecting online hate is a complex task, and low-performing models have harmful
consequences when used for sensitive applications such as content moderation. Emoji …

被引用次数：52 相关文章所有 7 个版本

[PDF] acm.org

Introducing MBIB-the first media bias identification benchmark task and dataset collection

M Wessel, T Horych, T Ruas, A Aizawa, B Gipp… - Proceedings of the 46th …, 2023 - dl.acm.org

Although media bias detection is a complex multi-task problem, there is, to date, no unified
benchmark grouping these evaluation tasks. We introduce the Media Bias Identification …

被引用次数：21 相关文章所有 8 个版本

[PDF] arxiv.org

Cobra frames: Contextual reasoning about effects and harms of offensive statements

X Zhou, H Zhu, A Yerukola, T Davidson… - arXiv preprint arXiv …, 2023 - arxiv.org

Warning: This paper contains content that may be offensive or upsetting. Understanding the
harms and offensiveness of statements requires reasoning about the social and situational …

被引用次数：17 相关文章所有 10 个版本

[PDF] arxiv.org

Hate speech and counter speech detection: Conversational context does matter

X Yu, E Blanco, L Hong - arXiv preprint arXiv:2206.06423, 2022 - arxiv.org

Hate speech is plaguing the cyberspace along with user-generated content. This paper
investigates the role of conversational context in the annotation and detection of online hate …

被引用次数：36 相关文章所有 7 个版本

[PDF] springer.com

SOLD: Sinhala offensive language dataset

T Ranasinghe, I Anuradha, D Premasiri, K Silva… - Language Resources …, 2024 - Springer

The widespread of offensive content online, such as hate speech and cyber-bullying, is a
global phenomenon. This has sparked interest in the artificial intelligence (AI) and natural …

被引用次数：22 相关文章所有 6 个版本

[HTML] sciencedirect.com

[HTML][HTML] Hidden behind the obvious: Misleading keywords and implicitly abusive language on social media

W Yin, A Zubiaga - Online Social Networks and Media, 2022 - Elsevier

While social media offers freedom of self-expression, abusive language carry significant
negative social impact. Driven by the importance of the issue, research in the automated …

被引用次数：24 相关文章所有 4 个版本