Adversarial attacks and defenses on text-to-image diffusion models: A survey

C Zhang, M Hu, W Li, L Wang - Information Fusion, 2024 - Elsevier
Recently, the text-to-image diffusion model has gained considerable attention from the
community due to its exceptional image generation capability. A representative model …

Attacks and defenses for generative diffusion models: A comprehensive survey

VT Truong, LB Dang, LB Le - arXiv preprint arXiv:2408.03400, 2024 - arxiv.org
Diffusion models (DMs) have achieved state-of-the-art performance on various generative
tasks such as image synthesis, text-to-image, and text-guided image-to-image generation …

Vbench++: Comprehensive and versatile benchmark suite for video generative models

Z Huang, F Zhang, X Xu, Y He, J Yu, Z Dong… - arXiv preprint arXiv …, 2024 - arxiv.org
Video generation has witnessed significant advancements, yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …

Safree: Training-free and adaptive guard for safe text-to-image and video generation

J Yoon, S Yu, V Patil, H Yao, M Bansal - arXiv preprint arXiv:2410.12761, 2024 - arxiv.org
Recent advances in diffusion models have significantly enhanced their ability to generate
high-quality images and videos, but they have also increased the risk of producing unsafe …

Rt-attack: Jailbreaking text-to-image models via random token

S Gao, X Jia, Y Huang, R Duan, J Gu, Y Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, Text-to-Image (T2I) models have achieved remarkable success in image
generation and editing, yet these models still have many potential issues, particularly in …

Meta-unlearning on diffusion models: Preventing relearning unlearned concepts

H Gao, T Pang, C Du, T Hu, Z Deng, M Lin - arXiv preprint arXiv …, 2024 - arxiv.org
With the rapid progress of diffusion-based content generation, significant efforts are being
made to unlearn harmful or copyrighted concepts from pretrained diffusion models (DMs) to …

Navigating the risks: A survey of security, privacy, and ethics threats in llm-based agents

Y Gan, Y Yang, Z Ma, P He, R Zeng, Y Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
With the continuous development of large language models (LLMs), transformer-based
models have made groundbreaking advances in numerous natural language processing …

Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey

X Liu, X Cui, P Li, Z Li, H Huang, S Xia, M Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid evolution of multimodal foundation models has led to significant advancements in
cross-modal understanding and generation across diverse modalities, including text …

AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models

Y Wang, J Chen, Q Li, X Yang, S Ji - arXiv preprint arXiv:2412.18123, 2024 - arxiv.org
As text-to-image (T2I) models continue to advance and gain widespread adoption, their
associated safety issues are becoming increasingly prominent. Malicious users often exploit …

Espresso: Robust Concept Filtering in Text-to-Image Models

A Das, V Duddu, R Zhang, N Asokan - arXiv preprint arXiv:2404.19227, 2024 - arxiv.org
Diffusion-based text-to-image (T2I) models generate high-fidelity images for given textual
prompts. They are trained on large datasets scraped from the Internet, potentially containing …