Inversion-free image editing with natural language

Z Wu, N Kolkin, J Brandt, R Zhang… - European Conference on …, 2025 - Springer

We address the challenges of precise image inversion and disentangled image editing in
the context of few-step diffusion models. We introduce an encoder based iterative inversion …

被引用次数：4 相关文章所有 6 个版本

[PDF] arxiv.org

Magic clothing: Controllable garment-driven image synthesis

W Chen, T Gu, Y Xu, A Chen - … of the 32nd ACM International Conference …, 2024 - dl.acm.org

We propose Magic Clothing, a latent diffusion model (LDM)-based network architecture for
an unexplored garment-driven image synthesis task. Aiming at generating customized …

被引用次数：12 相关文章所有 2 个版本

[PDF] arxiv.org

Source prompt disentangled inversion for boosting image editability with diffusion models

R Li, R Li, S Guo, L Zhang - European Conference on Computer Vision, 2025 - Springer

Text-driven diffusion models have significantly advanced the image editing performance by
using text prompts as inputs. One crucial step in text-driven image editing is to invert the …

被引用次数：3 相关文章所有 2 个版本

[PDF] thecvf.com

Doubly Abductive Counterfactual Inference for Text-based Image Editing

X Song, J Cui, H Zhang, J Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com

We study text-based image editing (TBIE) of a single image by counterfactual inference
because it is an elegant formulation to precisely address the requirement: the edited image …

被引用次数：5 相关文章所有 3 个版本

[PDF] arxiv.org

Trame: Trajectory-anchored multi-view editing for text-guided 3d gaussian splatting manipulation

C Luo, D Di, X Yang, Y Ma, Z Xue, C Wei… - arXiv preprint arXiv …, 2024 - arxiv.org

Despite significant strides in the field of 3D scene editing, current methods encounter
substantial challenge, particularly in preserving 3D consistency in multi-view editing …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models

X Shuai, H Ding, X Ma, R Tu, YG Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org

Image editing aims to edit the given synthetic or real image to meet the specific requirements
from users. It is widely studied in recent years as a promising and challenging field of …

被引用次数：13 相关文章

[PDF] arxiv.org

Dit4edit: Diffusion transformer for image editing

K Feng, Y Ma, B Wang, C Qi, H Chen, Q Chen… - arXiv preprint arXiv …, 2024 - arxiv.org

Despite recent advances in UNet-based image editing, methods for shape-aware object
editing in high-resolution images are still lacking. Compared to UNet, Diffusion Transformers …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Image Inpainting Models are Effective Tools for Instruction-guided Image Editing

X Ju, J Zhuang, Z Zhang, Y Bian, Q Xu… - arXiv preprint arXiv …, 2024 - arxiv.org

This is the technique report for the winning solution of the CVPR2024 GenAI Media
Generation Challenge Workshop's Instruction-guided Image Editing track. Instruction-guided …

被引用次数：1 相关文章所有 2 个版本

[PDF] springer.com

Advances in text-guided 3D editing: a survey

L Lu, R Li, X Zhang, H Wei, G Du, B Wang - Artificial Intelligence Review, 2024 - Springer

Abstract In 3D Artificial Intelligence Generated Content (AIGC), compared with generating
3D assets from scratch, editing extant 3D assets satisfies user prompts, allowing the creation …

[PDF] arxiv.org

UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control

X Chen, T Xia, S Xu - arXiv preprint arXiv:2403.02332, 2024 - arxiv.org

Video Diffusion Models have been developed for video generation, usually integrating text
and image conditioning to enhance control over the generated content. Despite the …

被引用次数：4 相关文章所有 2 个版本