Learning Continuous 3D Words for Text-to-Image Generation

TY Cheng, P Sharma, A Markham, N Trigoni… - … on Computer Vision, 2025 - Springer

We propose ZeST, a method for zero-shot material transfer to an object in the input image
given a material exemplar image. ZeST leverages existing diffusion adapters to extract …

Cited by 4 · Related articles · All 2 versions

Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models

J Burgess, KC Wang, S Yeung-Levy - European Conference on Computer …, 2025 - Springer

Text-to-image diffusion models generate impressive and realistic images, but do they learn
to represent the 3D world from only 2D supervision? We demonstrate that yes, certain 3D …

Cited by 1 · Related articles · All 2 versions

Customizing Text-to-Image Diffusion with Camera Viewpoint Control

N Kumari, G Su, R Zhang, T Park, E Shechtman… - arXiv preprint arXiv …, 2024 - arxiv.org

Model customization introduces new concepts to existing text-to-image models, enabling the
generation of the new concept in novel contexts. However, such methods lack accurate …

Cited by 4 · Related articles · All 2 versions

ParSEL: Parameterized shape editing with language

A Ganeshan, R Huang, X Xu, RK Jones… - ACM Transactions on …, 2024 - dl.acm.org

The ability to edit 3D assets with natural language presents a compelling paradigm to aid in
the democratization of 3D content creation. However, while natural language is often …

Cited by 2 · Related articles · All 2 versions

Customizing Text-to-Image Diffusion with Object Viewpoint Control

N Kumari, G Su, R Zhang, T Park… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org

Model customization introduces new concepts to existing text-to-image models, enabling the
generation of these new concepts/objects in novel contexts. However, such methods lack …
