Unified concept editing in diffusion models

R Gandikota, H Orgad, Y Belinkov… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-to-image models suffer from various safety issues that may limit their suitability for
deployment. Previous methods have separately addressed individual issues of bias …

Adversarial attacks and defenses on text-to-image diffusion models: A survey

C Zhang, M Hu, W Li, L Wang - Information Fusion, 2024 - Elsevier
Recently, the text-to-image diffusion model has gained considerable attention from the
community due to its exceptional image generation capability. A representative model …

Mace: Mass concept erasure in diffusion models

S Lu, Z Wang, L Li, Y Liu… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
The rapid expansion of large-scale text-to-image diffusion models has raised growing
concerns regarding their potential misuse in creating harmful or misleading content. In this …

T2ishield: Defending against backdoors on text-to-image diffusion models

Z Wang, J Zhang, S Shan, X Chen - arXiv preprint arXiv:2407.04215, 2024 - arxiv.org
While text-to-image diffusion models demonstrate impressive generation capabilities, they
also exhibit vulnerability to backdoor attacks, which involve the manipulation of model …

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

M Toker, H Orgad, M Ventura, D Arad… - arXiv preprint arXiv …, 2024 - arxiv.org
Text-to-image diffusion models (T2I) use a latent representation of a text prompt to guide the
image generation process. However, the process by which the encoder produces the text …

Eviledit: Backdooring text-to-image diffusion models in one second

H Wang, S Guo, J He, K Chen, S Zhang… - ACM Multimedia …, 2024 - openreview.net
Text-to-image (T2I) diffusion models enjoy great popularity and many individuals and
companies build their applications based on publicly released T2I diffusion models …

Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion

H Gu, K Zhou, Y Wang, R Wang, X Wang - arXiv preprint arXiv:2409.17928, 2024 - arxiv.org
During pre-training, the Text-to-Image (T2I) diffusion models encode factual knowledge into
their parameters. These parameterized facts enable realistic image generation, but they may …

Editing Massive Concepts in Text-to-Image Diffusion Models

T Xiong, Y Wu, E Xie, Z Li, X Liu - arXiv preprint arXiv:2403.13807, 2024 - arxiv.org
Text-to-image diffusion models suffer from the risk of generating outdated, copyrighted,
incorrect, and biased content. While previous methods have mitigated the issues on a small …

On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs

N Calderon, R Reichart - arXiv preprint arXiv:2407.19200, 2024 - arxiv.org
Recent advancements in NLP systems, particularly with the introduction of LLMs, have led to
widespread adoption of these systems by a broad spectrum of users across various …