Tc4d: Trajectory-conditioned text-to-4d generation
Recent techniques for text-to-4D generation synthesize dynamic 3D scenes using
supervision from pre-trained text-to-video models. However, existing representations, such …
supervision from pre-trained text-to-video models. However, existing representations, such …
Vd3d: Taming large video diffusion transformers for 3d camera control
Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of
complex videos from a text description. However, most existing models lack fine-grained …
complex videos from a text description. However, most existing models lack fine-grained …
Animatabledreamer: Text-guided non-rigid 3d model generation and reconstruction with canonical score distillation
Advances in 3D generation have facilitated sequential 3D model generation (aka 4D
generation), yet its application for animatable objects with large motion remains scarce. Our …
generation), yet its application for animatable objects with large motion remains scarce. Our …
Alignment of diffusion models: Fundamentals, challenges, and future
Diffusion models have emerged as the leading paradigm in generative modeling, excelling
in various applications. Despite their success, these models often misalign with human …
in various applications. Despite their success, these models often misalign with human …
Vividdreamer: Invariant score distillation for hyper-realistic text-to-3d generation
Abstract This paper presents Invariant Score Distillation (ISD), a novel method for high-
fidelity text-to-3D generation. ISD aims to tackle the over-saturation and over-smoothing …
fidelity text-to-3D generation. ISD aims to tackle the over-saturation and over-smoothing …
Dimensionx: Create any 3d and 4d scenes from a single image with controllable video diffusion
In this paper, we introduce\textbf {DimensionX}, a framework designed to generate
photorealistic 3D and 4D scenes from just a single image with video diffusion. Our approach …
photorealistic 3D and 4D scenes from just a single image with video diffusion. Our approach …
fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction
Reconstructing 3D visuals from functional Magnetic Resonance Imaging (fMRI) data,
introduced as Recon3DMind in our conference work, is of significant interest to both …
introduced as Recon3DMind in our conference work, is of significant interest to both …
GradualReality: Enhancing Physical Object Interaction in Virtual Reality via Interaction State-Aware Blending
We present GradualReality, a novel interface enabling a Cross Reality experience that
includes gradual interaction with physical objects in a virtual environment and supports both …
includes gradual interaction with physical objects in a virtual environment and supports both …
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation
Existing feed-forward image-to-3D methods mainly rely on 2D multi-view diffusion models
that cannot guarantee 3D consistency. These methods easily collapse when changing the …
that cannot guarantee 3D consistency. These methods easily collapse when changing the …
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
Numerous works have recently integrated 3D camera control into foundational text-to-video
models, but the resulting camera control is often imprecise, and video generation quality …
models, but the resulting camera control is often imprecise, and video generation quality …