Potential and Challenges of Model Editing for Social Debiasing

2 citing articles found

[PDF] arxiv.org

Can Editing LLMs Inject Harm?

C Chen, B Huang, Z Li, Z Chen, S Lai, X Xu… - arXiv preprint arXiv …, 2024 - arxiv.org

Knowledge editing techniques have been increasingly adopted to efficiently correct the false
or outdated knowledge in Large Language Models (LLMs), due to the high cost of retraining …

Cited by: 1; all 6 versions

[PDF] arxiv.org

Detoxifying Large Language Models via Knowledge Editing

M Wang, N Zhang, Z Xu, Z Xi, S Deng, Y Yao… - arXiv preprint arXiv …, 2024 - arxiv.org

This paper investigates using knowledge editing techniques to detoxify Large Language
Models (LLMs). We construct a benchmark, SafeEdit, which covers nine unsafe categories …

Cited by: 3; all 2 versions