DreamReward: Text-to-3D generation with human preference

S Bahmani, X Liu, W Yifan, I Skorokhodov… - … on Computer Vision, 2025 - Springer

Recent techniques for text-to-4D generation synthesize dynamic 3D scenes using
supervision from pre-trained text-to-video models. However, existing representations, such …

Cited by 19 - Related articles - All 3 versions

[PDF] arxiv.org

VD3D: Taming large video diffusion transformers for 3D camera control

S Bahmani, I Skorokhodov, A Siarohin… - arXiv preprint arXiv …, 2024 - arxiv.org

Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of
complex videos from a text description. However, most existing models lack fine-grained …

Cited by 19 - Related articles - All 3 versions

[PDF] arxiv.org

AnimatableDreamer: Text-guided non-rigid 3D model generation and reconstruction with canonical score distillation

X Wang, Y Wang, J Ye, F Sun, Z Wang, L Wang… - … on Computer Vision, 2025 - Springer

Advances in 3D generation have facilitated sequential 3D model generation (aka 4D
generation), yet its application for animatable objects with large motion remains scarce. Our …

Cited by 13 - Related articles - All 2 versions

[PDF] arxiv.org

Alignment of diffusion models: Fundamentals, challenges, and future

B Liu, S Shao, B Li, L Bai, Z Xu, H Xiong, J Kwok… - arXiv preprint arXiv …, 2024 - arxiv.org

Diffusion models have emerged as the leading paradigm in generative modeling, excelling
in various applications. Despite their success, these models often misalign with human …

Cited by 7 - Related articles - All 3 versions

[PDF] arxiv.org

VividDreamer: Invariant score distillation for hyper-realistic text-to-3D generation

W Zhuo, F Ma, H Fan, Y Yang - European Conference on Computer Vision, 2025 - Springer

This paper presents Invariant Score Distillation (ISD), a novel method for high-fidelity
text-to-3D generation. ISD aims to tackle the over-saturation and over-smoothing …

Cited by 1 - Related articles - All 3 versions

[PDF] arxiv.org

DimensionX: Create any 3D and 4D scenes from a single image with controllable video diffusion

W Sun, S Chen, F Liu, Z Chen, Y Duan, J Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

In this paper, we introduce DimensionX, a framework designed to generate
photorealistic 3D and 4D scenes from just a single image with video diffusion. Our approach …

Cited by 2 - Related articles - All 2 versions

[PDF] arxiv.org

fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction

J Gao, Y Fu, Y Wang, X Qian, J Feng, Y Fu - arXiv preprint arXiv …, 2024 - arxiv.org

Reconstructing 3D visuals from functional Magnetic Resonance Imaging (fMRI) data,
introduced as Recon3DMind in our conference work, is of significant interest to both …

Cited by 1 - Related articles - All 3 versions