Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services

D Hartmann, A Oueslati, D Staufer - arXiv preprint arXiv:2406.14154, 2024 - arxiv.org
Online platforms face the challenge of moderating an ever-increasing volume of content,
including harmful hate speech. In the absence of clear legal definitions and a lack of …

The Regulation of Content Moderation

F Galli, A Loreggia, G Sartor - … Conference on the Legal Challenges of the …, 2022 - Springer
Online platforms have become a key infrastructure for creating and sharing content, thus
representing a paramount context for the individual/collective exercise of fundamental rights …

[PDF][PDF] Toward Better Automated Content Moderation in Low-Resource Languages

G Nicholas, A Bhatia - Journal of Online Trust and Safety, 2023 - tsjournal.org
Social media companies have learned the hard way that poor moderation of content in
languages other than English can have grave consequences. Leaving harmful content up …

A Critical Reflection on the Use of Toxicity Detection Algorithms in Proactive Content Moderation Systems

M Warner, A Strohmayer, M Higgs… - arXiv preprint arXiv …, 2024 - arxiv.org
Toxicity detection algorithms, originally designed with reactive content moderation in mind,
are increasingly being deployed into proactive end-user interventions to moderate content …

Multilingual content moderation: A case study on Reddit

M Ye, K Sikka, K Atwell, S Hassan, A Divakaran… - arXiv preprint arXiv …, 2023 - arxiv.org
Content moderation is the process of flagging content based on pre-defined platform rules.
There has been a growing need for AI moderators to safeguard users as well as protect the …

[图书][B] An examination of the Algorithmic Accountability Act of 2019

M MacCarthy - 2020 - cdn.annenbergpublicpolicycenter …
The Algorithmic Accountability Act of 2019, sponsored by Senators Cory Booker (D-NJ) and
Ron Wyden (D-OR), with a House equivalent sponsored by Rep. Yvette Clarke (D-NY) …

Beyond Trial-and-Error: Predicting User Abandonment After a Moderation Intervention

B Tessa, L Cima, A Trujillo, M Avvenuti… - arXiv preprint arXiv …, 2024 - arxiv.org
Current content moderation practices follow the\textit {trial-and-error} approach, meaning
that moderators apply sequences of interventions until they obtain the desired outcome …

Comparing the perceived legitimacy of content moderation processes: Contractors, algorithms, expert panels, and digital juries

CA Pan, S Yakhmi, TP Iyer, E Strasnick… - Proceedings of the …, 2022 - dl.acm.org
While research continues to investigate and improve the accuracy, fairness, and normative
appropriateness of content moderation processes on large social media platforms, even the …

Resolving content moderation dilemmas between free speech and harmful misinformation

A Kozyreva, SM Herzog… - Proceedings of the …, 2023 - National Acad Sciences
In online content moderation, two key values may come into conflict: protecting freedom of
expression and preventing harm. Robust rules based in part on how citizens think about …

[PDF][PDF] On algorithmic content moderation

E Prem, B Krenn - Hannes Werthner· Carlo Ghezzi· Jeff Kramer …, 2024 - library.oapen.org
This chapter provides an overview of the challenges involved in algorithmic content
moderation. Content moderation is the organized practice of screening user-generated …