Magic3D: High-resolution text-to-3D content creation
Recently, DreamFusion demonstrated the utility of a pretrained text-to-image diffusion model
to optimize Neural Radiance Fields (NeRF), achieving remarkable text-to-3D synthesis …
ProlificDreamer: High-fidelity and diverse text-to-3D generation with variational score distillation
Score distillation sampling (SDS) has shown great promise in text-to-3D generation by
distilling pretrained large-scale text-to-image diffusion models, but suffers from over …
Score Jacobian chaining: Lifting pretrained 2D diffusion models for 3D generation
A diffusion model learns to predict a vector field of gradients. We propose to apply chain rule
on the learned gradients, and back-propagate the score of a diffusion model through the …
Fantasia3D: Disentangling geometry and appearance for high-quality text-to-3D content creation
Automatic 3D content creation has achieved rapid progress recently due to the availability of
pre-trained, large language models and image diffusion models, forming the emerging topic …
Latent-NeRF for shape-guided generation of 3D shapes and textures
Text-guided image generation has progressed rapidly in recent years, inspiring major
breakthroughs in text-guided shape generation. Recently, it has been shown that using …
RealFusion: 360° reconstruction of any object from a single image
L Melas-Kyriazi, I Laina… - Proceedings of the …, 2023 - openaccess.thecvf.com
We consider the problem of reconstructing a full 360° photographic model of an object
from a single image of it. We do so by fitting a neural radiance field to the image, but find this …
MotionGPT: Human motion as a foreign language
Though the advancement of pre-trained large language models unfolds, the exploration of
building a unified model for language and other multimodal data, such as motion, remains …
Emergent correspondence from image diffusion
Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …
Make-It-3D: High-fidelity 3D creation from a single image with diffusion prior
In this work, we investigate the problem of creating high-fidelity 3D content from only a single
image. This is inherently challenging: it essentially involves estimating the underlying 3D …
Shap-E: Generating conditional 3D implicit functions
H Jun, A Nichol - arXiv preprint arXiv:2305.02463, 2023 - arxiv.org
We present Shap-E, a conditional generative model for 3D assets. Unlike recent work on 3D
generative models which produce a single output representation, Shap-E directly generates …