Legilimens: Practical and Unified Content Moderation for Large Language Model Services
Given the societal impact of unsafe content generated by large language models (LLMs),
ensuring that LLM services comply with safety standards is a crucial concern for LLM service …
ensuring that LLM services comply with safety standards is a crucial concern for LLM service …
Seeing like an ai: How llms apply (and misapply) wikipedia neutrality norms
Large language models (LLMs) are trained on broad corpora and then used in communities
with specialized norms. Is providing LLMs with community rules enough for models to follow …
with specialized norms. Is providing LLMs with community rules enough for models to follow …
AI Rules? Characterizing Reddit Community Policies Towards AI-Generated Content
How are Reddit communities responding to AI-generated content? We explored this
question through a large-scale analysis of subreddit community rules and their change over …
question through a large-scale analysis of subreddit community rules and their change over …
" They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations
Large language models (LLMs) have emerged as an integral part of modern societies,
powering user-facing applications such as personal assistants and enterprise applications …
powering user-facing applications such as personal assistants and enterprise applications …