Magic3d: High-resolution text-to-3d content creation

CH Lin, J Gao, L Tang, T Takikawa… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, DreamFusion demonstrated the utility of a pretrained text-to-image diffusion model
to optimize Neural Radiance Fields (NeRF), achieving remarkable text-to-3D synthesis …

Prolificdreamer: High-fidelity and diverse text-to-3d generation with variational score distillation

Z Wang, C Lu, Y Wang, F Bao, C Li… - Advances in Neural …, 2024 - proceedings.neurips.cc
Score distillation sampling (SDS) has shown great promise in text-to-3D generation by
distilling pretrained large-scale text-to-image diffusion models, but suffers from over …

Score jacobian chaining: Lifting pretrained 2d diffusion models for 3d generation

H Wang, X Du, J Li, RA Yeh… - Proceedings of the …, 2023 - openaccess.thecvf.com
A diffusion model learns to predict a vector field of gradients. We propose to apply chain rule
on the learned gradients, and back-propagate the score of a diffusion model through the …

Fantasia3d: Disentangling geometry and appearance for high-quality text-to-3d content creation

R Chen, Y Chen, N Jiao, K Jia - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Automatic 3D content creation has achieved rapid progress recently due to the availability of
pre-trained, large language models and image diffusion models, forming the emerging topic …

Latent-nerf for shape-guided generation of 3d shapes and textures

G Metzer, E Richardson, O Patashnik… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-guided image generation has progressed rapidly in recent years, inspiring major
breakthroughs in text-guided shape generation. Recently, it has been shown that using …

Realfusion: 360deg reconstruction of any object from a single image

L Melas-Kyriazi, I Laina… - Proceedings of the …, 2023 - openaccess.thecvf.com
We consider the problem of reconstructing a full 360deg photographic model of an object
from a single image of it. We do so by fitting a neural radiance field to the image, but find this …

Motiongpt: Human motion as a foreign language

B Jiang, X Chen, W Liu, J Yu, G Yu… - Advances in Neural …, 2023 - proceedings.neurips.cc
Though the advancement of pre-trained large language models unfolds, the exploration of
building a unified model for language and other multimodal data, such as motion, remains …

Emergent correspondence from image diffusion

L Tang, M Jia, Q Wang, CP Phoo… - Advances in Neural …, 2023 - proceedings.neurips.cc
Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …

Make-it-3d: High-fidelity 3d creation from a single image with diffusion prior

J Tang, T Wang, B Zhang, T Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this work, we investigate the problem of creating high-fidelity 3D content from only a single
image. This is inherently challenging: it essentially involves estimating the underlying 3D …

Shap-e: Generating conditional 3d implicit functions

H Jun, A Nichol - arXiv preprint arXiv:2305.02463, 2023 - arxiv.org
We present Shap-E, a conditional generative model for 3D assets. Unlike recent work on 3D
generative models which produce a single output representation, Shap-E directly generates …