Beyond Trial-and-Error: Predicting User Abandonment After a Moderation Intervention
Current content moderation practices follow the trial-and-error approach, meaning
that moderators apply sequences of interventions until they obtain the desired outcome …
One of many: Assessing user-level effects of moderation interventions on r/The_Donald
A Trujillo, S Cresci - Proceedings of the 15th ACM Web Science …, 2023 - dl.acm.org
Evaluating the effects of moderation interventions is a task of paramount importance, as it
allows assessing the success of content moderation processes. So far, intervention effects …
Multilingual content moderation: A case study on Reddit
Content moderation is the process of flagging content based on pre-defined platform rules.
There has been a growing need for AI moderators to safeguard users as well as protect the …
To Act or React
H Habib, M Bin Musa, F Zaffar… - … Proactive Strategies for …, 2019 - academia.edu
Reddit, the self-proclaimed "front page of the Internet" with over 330M active users, has
found its communities playing a prominent role in originating and propagating sexist, racist …
Learning to defer in content moderation: The human-ai interplay
T Lykouris, W Weng - arXiv preprint arXiv:2402.12237, 2024 - arxiv.org
Successful content moderation in online platforms relies on a human-AI collaboration
approach. A typical heuristic estimates the expected harmfulness of a post and uses fixed …
Reliable decision from multiple subtasks through threshold optimization: Content moderation in the wild
Social media platforms struggle to protect users from harmful content through content
moderation. These platforms have recently leveraged machine learning models to cope with …
Does transparency in moderation really matter? User behavior after content removal explanations on reddit
When posts are removed on a social media platform, users may or may not receive an
explanation. What kinds of explanations are provided? Do those explanations matter? Using …
To act or react: Investigating proactive strategies for online community moderation
Reddit administrators have generally struggled to prevent or contain such discourse for
several reasons including: (1) the inability for a handful of human administrators to track and …
Bystanders of Online Moderation: Examining the Effects of Witnessing Post-Removal Explanations
Prior research on transparency in content moderation has demonstrated the benefits of
offering post-removal explanations to sanctioned users. In this paper, we examine whether …
Bandits for Online Calibration: An Application to Content Moderation on Social Media Platforms
We describe the current content moderation strategy employed by Meta to remove policy-
violating content from its platforms. Meta relies on both handcrafted and learned risk models …