SV3D: Novel multi-view synthesis and 3D generation from a single image using latent video diffusion

V Voleti, CH Yao, M Boss, A Letts, D Pankratz… - … on Computer Vision, 2025 - Springer
We present Stable Video 3D (SV3D), a latent video diffusion model for high-resolution,
image-to-multi-view generation of orbital videos around a 3D object. Recent …

CRM: Single image to 3D textured mesh with convolutional reconstruction model

Z Wang, Y Wang, Y Chen, C Xiang, S Chen… - … on Computer Vision, 2025 - Springer
Feed-forward 3D generative models like the Large Reconstruction Model (LRM)[18] have
demonstrated exceptional generation speed. However, the transformer-based methods do …

VFusion3D: Learning scalable 3D generative models from video diffusion models

J Han, F Kokkinos, P Torr - European Conference on Computer Vision, 2025 - Springer
This paper presents a novel method for building scalable 3D generative models utilizing
pre-trained video diffusion models. The primary obstacle in developing foundation 3D …

View selection for 3D captioning via diffusion ranking

T Luo, J Johnson, H Lee - European Conference on Computer Vision, 2025 - Springer
Scalable annotation approaches are crucial for constructing extensive 3D-text datasets,
facilitating a broader range of applications. However, existing methods sometimes lead to …

ScaleDreamer: Scalable text-to-3D synthesis with asynchronous score distillation

Z Ma, Y Wei, Y Zhang, X Zhu, Z Lei, L Zhang - European Conference on …, 2025 - Springer
By leveraging the text-to-image diffusion prior, score distillation can synthesize 3D contents
without paired text-3D training data. Instead of spending hours of online optimization per text …

Compress3D: a compressed latent space for 3D generation from a single image

B Zhang, T Yang, Y Li, L Zhang, X Zhao - European Conference on …, 2025 - Springer
3D generation has witnessed significant advancements, yet efficiently producing
high-quality 3D assets from a single image remains challenging. In this paper, we present a …

Collaborative control for geometry-conditioned PBR image generation

S Vainer, M Boss, M Parger, K Kutsy… - … on Computer Vision, 2025 - Springer
Graphics pipelines require physically-based rendering (PBR) materials, yet current 3D
content generation approaches are built on RGB models. We propose to model the PBR …

FlexiDreamer: Single Image-to-3D Generation with FlexiCubes

R Zhao, Z Wang, Y Wang, Z Zhou, J Zhu - arXiv preprint arXiv:2404.00987, 2024 - arxiv.org
3D content generation from text prompts or single images has made remarkable progress in
quality and speed recently. One of its dominant paradigms involves generating consistent …

A Survey on 3D Human Avatar Modeling--From Reconstruction to Generation

R Wang, Y Cao, K Han, KYK Wong - arXiv preprint arXiv:2406.04253, 2024 - arxiv.org
3D modeling has long been an important area in computer vision and computer graphics.
Recently, thanks to the breakthroughs in neural representations and generative models, we …