Hexagen3d: Stablediffusion is just one step away from fast and diverse text-to-3d generation

V Voleti, CH Yao, M Boss, A Letts, D Pankratz… - … on Computer Vision, 2025 - Springer

Abstract We present Stable Video 3D (SV3D)—a latent video diffusion model for high-
resolution, image-to-multi-view generation of orbital videos around a 3D object. Recent …

被引用次数：103 相关文章所有 2 个版本

[PDF] arxiv.org

Crm: Single image to 3d textured mesh with convolutional reconstruction model

Z Wang, Y Wang, Y Chen, C Xiang, S Chen… - … on Computer Vision, 2025 - Springer

Feed-forward 3D generative models like the Large Reconstruction Model (LRM)[18] have
demonstrated exceptional generation speed. However, the transformer-based methods do …

被引用次数：89 相关文章所有 3 个版本

[PDF] arxiv.org

Vfusion3d: Learning scalable 3d generative models from video diffusion models

J Han, F Kokkinos, P Torr - European Conference on Computer Vision, 2025 - Springer

This paper presents a novel method for building scalable 3D generative models utilizing pre-
trained video diffusion models. The primary obstacle in developing foundation 3D …

被引用次数：21 相关文章所有 2 个版本

[PDF] arxiv.org

View selection for 3d captioning via diffusion ranking

T Luo, J Johnson, H Lee - European Conference on Computer Vision, 2025 - Springer

Scalable annotation approaches are crucial for constructing extensive 3D-text datasets,
facilitating a broader range of applications. However, existing methods sometimes lead to …

被引用次数：8 相关文章所有 2 个版本

[PDF] arxiv.org

Scaledreamer: Scalable text-to-3d synthesis with asynchronous score distillation

Z Ma, Y Wei, Y Zhang, X Zhu, Z Lei, L Zhang - European Conference on …, 2025 - Springer

By leveraging the text-to-image diffusion prior, score distillation can synthesize 3D contents
without paired text-3D training data. Instead of spending hours of online optimization per text …

被引用次数：7 相关文章所有 6 个版本

[PDF] arxiv.org

Compress3D: a compressed latent space for 3D generation from a single image

B Zhang, T Yang, Y Li, L Zhang, X Zhao - European Conference on …, 2025 - Springer

Abstract 3D generation has witnessed significant advancements, yet efficiently producing
high-quality 3D assets from a single image remains challenging. In this paper, we present a …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Collaborative control for geometry-conditioned PBR image generation

S Vainer, M Boss, M Parger, K Kutsy… - … on Computer Vision, 2025 - Springer

Graphics pipelines require physically-based rendering (PBR) materials, yet current 3D
content generation approaches are built on RGB models. We propose to model the PBR …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

FlexiDreamer: Single Image-to-3D Generation with FlexiCubes

R Zhao, Z Wang, Y Wang, Z Zhou, J Zhu - arXiv preprint arXiv:2404.00987, 2024 - arxiv.org

3D content generation from text prompts or single images has made remarkable progress in
quality and speed recently. One of its dominant paradigms involves generating consistent …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

A Survey on 3D Human Avatar Modeling--From Reconstruction to Generation

R Wang, Y Cao, K Han, KYK Wong - arXiv preprint arXiv:2406.04253, 2024 - arxiv.org

3D modeling has long been an important area in computer vision and computer graphics.
Recently, thanks to the breakthroughs in neural representations and generative models, we …

被引用次数：1 相关文章所有 2 个版本