Adversarial attacks and defenses on text-to-image diffusion models: A survey

C Zhang, M Hu, W Li, L Wang - Information Fusion, 2024 - Elsevier
Recently, the text-to-image diffusion model has gained considerable attention from the
community due to its exceptional image generation capability. A representative model …

Attacks and defenses for generative diffusion models: A comprehensive survey

VT Truong, LB Dang, LB Le - arXiv preprint arXiv:2408.03400, 2024 - arxiv.org
Diffusion models (DMs) have achieved state-of-the-art performance on various generative
tasks such as image synthesis, text-to-image, and text-guided image-to-image generation …

Jailbreaking prompt attack: A controllable adversarial attack against diffusion models

J Ma, A Cao, Z Xiao, Y Li, J Zhang, C Ye… - arXiv preprint arXiv …, 2024 - arxiv.org
Text-to-image (T2I) models can be maliciously used to generate harmful content such as
sexually explicit, unfaithful, and misleading or Not-Safe-for-Work (NSFW) images. Previous …

MMA-Diffusion: Multimodal attack on diffusion models

Y Yang, R Gao, X Wang, TY Ho… - Proceedings of the …, 2024 - openaccess.thecvf.com
In recent years, Text-to-Image (T2I) models have seen remarkable advancements, gaining
widespread adoption. However, this progress has inadvertently opened avenues for …

SafeGen: Mitigating Sexually Explicit Content Generation in Text-to-Image Models

X Li, Y Yang, J Deng, C Yan, Y Chen… - arXiv preprint arXiv …, 2024 - researchgate.net
Text-to-image (T2I) models, such as Stable Diffusion, have exhibited remarkable
performance in generating high-quality images from text descriptions in recent years …

ProTIP: Probabilistic robustness verification on text-to-image diffusion models against stochastic perturbation

Y Zhang, Y Tang, W Ruan, X Huang, S Khastgir… - … on Computer Vision, 2025 - Springer
Text-to-Image (T2I) Diffusion Models (DMs) excel at creating high-quality images
from text descriptions but, like many deep learning models, suffer from robustness issues …

Perception-guided jailbreak against text-to-image models

Y Huang, L Liang, T Li, X Jia, R Wang, W Miao… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, Text-to-Image (T2I) models have garnered significant attention due to their
remarkable advancements. However, security concerns have emerged due to their potential …

Meta-unlearning on diffusion models: Preventing relearning unlearned concepts

H Gao, T Pang, C Du, T Hu, Z Deng, M Lin - arXiv preprint arXiv …, 2024 - arxiv.org
With the rapid progress of diffusion-based content generation, significant efforts are being
made to unlearn harmful or copyrighted concepts from pretrained diffusion models (DMs) to …

Universal prompt optimizer for safe text-to-image generation

Z Wu, H Gao, Y Wang, X Zhang, S Wang - arXiv preprint arXiv:2402.10882, 2024 - arxiv.org
Text-to-Image (T2I) models have shown great performance in generating images based on
textual prompts. However, these models are vulnerable to unsafe input to generate unsafe …

Discovering universal semantic triggers for text-to-image synthesis

S Zhai, W Wang, J Li, Y Dong, H Su, Q Shen - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, text-to-image models have gained widespread attention in the community due to
their controllable and high-quality generation ability. However, the robustness of such …