Generative ai meets 3d: A survey on text-to-3d in aigc era

C Li, C Zhang, J Cho, A Waghwase, LH Lee… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative AI has made significant progress in recent years, with text-guided content
generation being the most practical as it facilitates interaction between human instructions …

Vd3d: Taming large video diffusion transformers for 3d camera control

S Bahmani, I Skorokhodov, A Siarohin… - arXiv preprint arXiv …, 2024 - arxiv.org
Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of
complex videos from a text description. However, most existing models lack fine-grained …

Dreamscene4d: Dynamic multi-object scene generation from monocular videos

WH Chu, L Ke, K Fragkiadaki - arXiv preprint arXiv:2405.02280, 2024 - arxiv.org
Existing VLMs can track in-the-wild 2D video objects while current generative models
provide powerful visual priors for synthesizing novel views for the highly under-constrained …

Compositional 3d-aware video generation with llm director

H Zhu, T He, A Tang, J Guo, Z Chen, J Bian - arXiv preprint arXiv …, 2024 - arxiv.org
Significant progress has been made in text-to-video generation through the use of powerful
generative models and large-scale internet data. However, substantial challenges remain in …

Animate3d: Animating any 3d model with multi-view video diffusion

Y Jiang, C Yu, C Cao, F Wang, W Hu, J Gao - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advances in 4D generation mainly focus on generating 4D content by distilling pre-
trained text or single-view image-conditioned models. It is inconvenient for them to take …

Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis

B Zeng, L Yang, S Li, J Liu, Z Zhang, J Tian… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advances in diffusion models have demonstrated exceptional capabilities in image
and video generation, further improving the effectiveness of 4D synthesis. Existing 4D …

AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

Y Cao, L Pan, K Han, KYK Wong, Z Liu - arXiv preprint arXiv:2410.07164, 2024 - arxiv.org
Recent advancements in diffusion models have led to significant improvements in the
generation and animation of 4D full-body human-object interactions (HOI). Nevertheless …

4Dynamic: Text-to-4D Generation with Hybrid Priors

YJ Yuan, L Kobbelt, J Liu, Y Zhang, P Wan… - arXiv preprint arXiv …, 2024 - arxiv.org
Due to the fascinating generative performance of text-to-image diffusion models, growing
text-to-3D generation works explore distilling the 2D generative priors into 3D, using the …

ElastoGen: 4D Generative Elastodynamics

Y Feng, Y Shang, X Feng, L Lan, S Zhe, T Shao… - arXiv preprint arXiv …, 2024 - arxiv.org
We present ElastoGen, a knowledge-driven model that generates physically accurate and
coherent 4D elastodynamics. Instead of relying on petabyte-scale data-driven learning …

EG4D: Explicit Generation of 4D Object without Score Distillation

Q Sun, Z Guo, Z Wan, JN Yan, S Yin, W Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, the increasing demand for dynamic 3D assets in design and gaming
applications has given rise to powerful generative pipelines capable of synthesizing high …