Expressive text-to-image generation with rich text

S Ge, S Nah, G Liu, T Poon, A Tao… - Proceedings of the …, 2023 - openaccess.thecvf.com

Despite tremendous progress in generating high-quality images using diffusion models,
synthesizing a sequence of animated frames that are both photorealistic and temporally …

Cited by 136 · Related articles · All 6 versions

Grounded text-to-image synthesis with attention refocusing

Q Phung, S Ge, JB Huang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Driven by the scalable diffusion models trained on large-scale datasets, text-to-image
synthesis methods have shown compelling results. However, these models still fail to …

Cited by 49 · Related articles · All 3 versions

Freecontrol: Training-free spatial control of any text-to-image diffusion model with any condition

S Mo, F Mu, KH Lin, Y Liu, B Guan… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recent approaches such as ControlNet offer users fine-grained spatial control over text-to-image
(T2I) diffusion models. However, auxiliary modules have to be trained for each spatial …

Cited by 12 · Related articles · All 4 versions

Portraitbooth: A versatile portrait model for fast identity-preserved personalization

X Peng, J Zhu, B Jiang, Y Tai, D Luo… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recent advancements in personalized image generation using diffusion models have been
noteworthy. However, existing methods suffer from inefficiencies due to the requirement for …

Cited by 10 · Related articles · All 3 versions

Cross-image attention for zero-shot appearance transfer

Y Alaluf, D Garibi, O Patashnik… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org

Recent advancements in text-to-image generative models have demonstrated a remarkable
ability to capture a deep semantic understanding of images. In this work, we leverage this …

Cited by 20 · Related articles · All 2 versions

It's All About Your Sketch: Democratising Sketch Control in Diffusion Models

S Koley, AK Bhunia, D Sekhri, A Sain… - Proceedings of the …, 2024 - openaccess.thecvf.com

This paper unravels the potential of sketches for diffusion models, addressing the deceptive
promise of direct sketch control in generative AI. We importantly democratise the process …

Cited by 5 · Related articles · All 4 versions

Text-guided synthesis of eulerian cinemagraphs

A Mahapatra, A Siarohin, HY Lee, S Tulyakov… - ACM Transactions on …, 2023 - dl.acm.org

We introduce Text2Cinemagraph, a fully automated method for creating cinemagraphs from
text descriptions, an especially challenging task when prompts feature imaginary elements …

Cited by 9 · Related articles

Alchemist: Parametric control of material properties with diffusion models

P Sharma, V Jampani, Y Li, X Jia… - Proceedings of the …, 2024 - openaccess.thecvf.com

We propose a method to control material attributes of objects like roughness, metallic, albedo,
and transparency in real images. Our method capitalizes on the generative prior of text-to …

Cited by 6 · Related articles · All 6 versions

The chosen one: Consistent characters in text-to-image diffusion models

O Avrahami, A Hertz, Y Vinker, M Arar… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org

Recent advances in text-to-image generation models have unlocked vast potential for visual
creativity. However, the users that use these models struggle with the generation of …

Cited by 8 · Related articles · All 2 versions

GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image

C Bao, Y Zhang, Y Li, X Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recently, we have witnessed the explosive growth of various volumetric representations in
modeling animatable head avatars. However, due to the diversity of frameworks, there is no …

Cited by 1 · Related articles · All 3 versions