Turboedit: Instant text-based image editing

Z Wu, N Kolkin, J Brandt, R Zhang… - European Conference on …, 2025 - Springer
We address the challenges of precise image inversion and disentangled image editing in
the context of few-step diffusion models. We introduce an encoder based iterative inversion …

Magic clothing: Controllable garment-driven image synthesis

W Chen, T Gu, Y Xu, A Chen - … of the 32nd ACM International Conference …, 2024 - dl.acm.org
We propose Magic Clothing, a latent diffusion model (LDM)-based network architecture for
an unexplored garment-driven image synthesis task. Aiming at generating customized …

Source prompt disentangled inversion for boosting image editability with diffusion models

R Li, R Li, S Guo, L Zhang - European Conference on Computer Vision, 2025 - Springer
Text-driven diffusion models have significantly advanced the image editing performance by
using text prompts as inputs. One crucial step in text-driven image editing is to invert the …

Doubly Abductive Counterfactual Inference for Text-based Image Editing

X Song, J Cui, H Zhang, J Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com
We study text-based image editing (TBIE) of a single image by counterfactual inference
because it is an elegant formulation to precisely address the requirement: the edited image …

Trame: Trajectory-anchored multi-view editing for text-guided 3d gaussian splatting manipulation

C Luo, D Di, X Yang, Y Ma, Z Xue, C Wei… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite significant strides in the field of 3D scene editing, current methods encounter
substantial challenge, particularly in preserving 3D consistency in multi-view editing …

A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models

X Shuai, H Ding, X Ma, R Tu, YG Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org
Image editing aims to edit the given synthetic or real image to meet the specific requirements
from users. It is widely studied in recent years as a promising and challenging field of …

Dit4edit: Diffusion transformer for image editing

K Feng, Y Ma, B Wang, C Qi, H Chen, Q Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite recent advances in UNet-based image editing, methods for shape-aware object
editing in high-resolution images are still lacking. Compared to UNet, Diffusion Transformers …

Image Inpainting Models are Effective Tools for Instruction-guided Image Editing

X Ju, J Zhuang, Z Zhang, Y Bian, Q Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
This is the technique report for the winning solution of the CVPR2024 GenAI Media
Generation Challenge Workshop's Instruction-guided Image Editing track. Instruction-guided …

Advances in text-guided 3D editing: a survey

L Lu, R Li, X Zhang, H Wei, G Du, B Wang - Artificial Intelligence Review, 2024 - Springer
Abstract In 3D Artificial Intelligence Generated Content (AIGC), compared with generating
3D assets from scratch, editing extant 3D assets satisfies user prompts, allowing the creation …

UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control

X Chen, T Xia, S Xu - arXiv preprint arXiv:2403.02332, 2024 - arxiv.org
Video Diffusion Models have been developed for video generation, usually integrating text
and image conditioning to enhance control over the generated content. Despite the …