Magic3d: High-resolution text-to-3d content creation

CH Lin, J Gao, L Tang, T Takikawa… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recently, DreamFusion demonstrated the utility of a pretrained text-to-image diffusion model
to optimize Neural Radiance Fields (NeRF), achieving remarkable text-to-3D synthesis …

Cited by 824 - Related articles - All 6 versions

Prolificdreamer: High-fidelity and diverse text-to-3d generation with variational score distillation

Z Wang, C Lu, Y Wang, F Bao, C Li… - Advances in Neural …, 2024 - proceedings.neurips.cc

Score distillation sampling (SDS) has shown great promise in text-to-3D generation by
distilling pretrained large-scale text-to-image diffusion models, but suffers from over …

Cited by 485 - Related articles - All 5 versions

Score jacobian chaining: Lifting pretrained 2d diffusion models for 3d generation

H Wang, X Du, J Li, RA Yeh… - Proceedings of the …, 2023 - openaccess.thecvf.com

A diffusion model learns to predict a vector field of gradients. We propose to apply chain rule
on the learned gradients, and back-propagate the score of a diffusion model through the …

Cited by 388 - Related articles - All 6 versions

Fantasia3d: Disentangling geometry and appearance for high-quality text-to-3d content creation

R Chen, Y Chen, N Jiao, K Jia - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Automatic 3D content creation has achieved rapid progress recently due to the availability of
pre-trained, large language models and image diffusion models, forming the emerging topic …

Cited by 377 - Related articles - All 5 versions

Latent-nerf for shape-guided generation of 3d shapes and textures

G Metzer, E Richardson, O Patashnik… - Proceedings of the …, 2023 - openaccess.thecvf.com

Text-guided image generation has progressed rapidly in recent years, inspiring major
breakthroughs in text-guided shape generation. Recently, it has been shown that using …

Cited by 322 - Related articles - All 5 versions

Realfusion: 360° reconstruction of any object from a single image

L Melas-Kyriazi, I Laina… - Proceedings of the …, 2023 - openaccess.thecvf.com

We consider the problem of reconstructing a full 360° photographic model of an object
from a single image of it. We do so by fitting a neural radiance field to the image, but find this …

Cited by 236 - Related articles - All 6 versions

Motiongpt: Human motion as a foreign language

B Jiang, X Chen, W Liu, J Yu, G Yu… - Advances in Neural …, 2023 - proceedings.neurips.cc

Though the advancement of pre-trained large language models unfolds, the exploration of
building a unified model for language and other multimodal data, such as motion, remains …

Cited by 135 - Related articles - All 5 versions

Emergent correspondence from image diffusion

L Tang, M Jia, Q Wang, CP Phoo… - Advances in Neural …, 2023 - proceedings.neurips.cc

Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …

Cited by 187 - Related articles - All 12 versions

Make-it-3d: High-fidelity 3d creation from a single image with diffusion prior

J Tang, T Wang, B Zhang, T Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this work, we investigate the problem of creating high-fidelity 3D content from only a single
image. This is inherently challenging: it essentially involves estimating the underlying 3D …

Cited by 202 - Related articles - All 7 versions

Shap-e: Generating conditional 3d implicit functions

H Jun, A Nichol - arXiv preprint arXiv:2305.02463, 2023 - arxiv.org

We present Shap-E, a conditional generative model for 3D assets. Unlike recent work on 3D
generative models which produce a single output representation, Shap-E directly generates …

Cited by 279 - Related articles - All 2 versions