Beyond Trial-and-Error: Predicting User Abandonment After a Moderation Intervention

B Tessa, L Cima, A Trujillo, M Avvenuti… - arXiv preprint arXiv …, 2024 - arxiv.org
Current content moderation practices follow the trial-and-error approach, meaning
that moderators apply sequences of interventions until they obtain the desired outcome …

One of many: Assessing user-level effects of moderation interventions on r/The_Donald

A Trujillo, S Cresci - Proceedings of the 15th ACM Web Science …, 2023 - dl.acm.org
Evaluating the effects of moderation interventions is a task of paramount importance, as it
allows assessing the success of content moderation processes. So far, intervention effects …

Multilingual content moderation: A case study on Reddit

M Ye, K Sikka, K Atwell, S Hassan, A Divakaran… - arXiv preprint arXiv …, 2023 - arxiv.org
Content moderation is the process of flagging content based on pre-defined platform rules.
There has been a growing need for AI moderators to safeguard users as well as protect the …

To Act or React

H Habib, M Bin Musa, F Zaffar… - … Proactive Strategies for …, 2019 - academia.edu
Reddit, the self-proclaimed “front page of the Internet” with over 330M active users, has
found its communities playing a prominent role in originating and propagating sexist, racist …

Learning to defer in content moderation: The human-ai interplay

T Lykouris, W Weng - arXiv preprint arXiv:2402.12237, 2024 - arxiv.org
Successful content moderation in online platforms relies on a human-AI collaboration
approach. A typical heuristic estimates the expected harmfulness of a post and uses fixed …

Reliable decision from multiple subtasks through threshold optimization: Content moderation in the wild

D Son, B Lew, K Choi, Y Baek, S Choi, B Shin… - Proceedings of the …, 2023 - dl.acm.org
Social media platforms struggle to protect users from harmful content through content
moderation. These platforms have recently leveraged machine learning models to cope with …

Does transparency in moderation really matter? User behavior after content removal explanations on reddit

S Jhaver, A Bruckman, E Gilbert - Proceedings of the ACM on Human …, 2019 - dl.acm.org
When posts are removed on a social media platform, users may or may not receive an
explanation. What kinds of explanations are provided? Do those explanations matter? Using …

To act or react: Investigating proactive strategies for online community moderation

H Habib, MB Musa, F Zaffar, R Nithyanand - arXiv preprint arXiv …, 2019 - arxiv.org
Reddit administrators have generally struggled to prevent or contain such discourse for
several reasons including: (1) the inability for a handful of human administrators to track and …

Bystanders of Online Moderation: Examining the Effects of Witnessing Post-Removal Explanations

S Jhaver, H Rathi, K Saha - Proceedings of the CHI Conference on …, 2024 - dl.acm.org
Prior research on transparency in content moderation has demonstrated the benefits of
offering post-removal explanations to sanctioned users. In this paper, we examine whether …

Bandits for Online Calibration: An Application to Content Moderation on Social Media Platforms

V Avadhanula, OA Baki, H Bastani, O Bastani… - arXiv preprint arXiv …, 2022 - arxiv.org
We describe the current content moderation strategy employed by Meta to remove policy-
violating content from its platforms. Meta relies on both handcrafted and learned risk models …