Bird's-Eye-View Scene Graph for Vision-Language Navigation
Abstract Vision-language navigation (VLN), which entails an agent to navigate 3D
environments following human instructions, has shown great advances. However, current …
environments following human instructions, has shown great advances. However, current …
Frankenstein: Generating semantic-compositional 3d scenes in one tri-plane
We present Frankenstein, a diffusion-based framework that can generate semantic-
compositional 3D scenes in a single pass. Unlike existing methods that output a single …
compositional 3D scenes in a single pass. Unlike existing methods that output a single …
Gem3d: Generative medial abstractions for 3d shape synthesis
We introduce GEM3D 1–a new deep, topology-aware generative model of 3D shapes. The
key ingredient of our method is a neural skeleton-based representation encoding …
key ingredient of our method is a neural skeleton-based representation encoding …
Instructscene: Instruction-driven 3d indoor scene synthesis with semantic graph prior
Comprehending natural language instructions is a charming property for 3D indoor scene
synthesis systems. Existing methods directly model object joint distributions and express …
synthesis systems. Existing methods directly model object joint distributions and express …
Forest2seq: Revitalizing order prior for sequential indoor scene synthesis
Synthesizing realistic 3D indoor scenes is a challenging task that traditionally relies on
manual arrangement and annotation by expert designers. Recent advances in …
manual arrangement and annotation by expert designers. Recent advances in …
Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation
Abstract We introduce Infinigen Indoors a Blender-based procedural generator of
photorealistic indoor scenes. It builds upon the existing Infinigen system which focuses on …
photorealistic indoor scenes. It builds upon the existing Infinigen system which focuses on …
LLM-enhanced Scene Graph Learning for Household Rearrangement
The household rearrangement task involves spotting misplaced objects in a scene and
accommodate them with proper places. It depends both on common-sense knowledge on …
accommodate them with proper places. It depends both on common-sense knowledge on …
SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation
We present SceneFactor, a diffusion-based approach for large-scale 3D scene generation
that enables controllable generation and effortless editing. SceneFactor enables text-guided …
that enables controllable generation and effortless editing. SceneFactor enables text-guided …
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
Comprehending natural language instructions is a charming property for both 2D and 3D
layout synthesis systems. Existing methods implicitly model object joint distributions and …
layout synthesis systems. Existing methods implicitly model object joint distributions and …
Crowd Data-driven Artwork Placement in Virtual Exhibitions for Visitor Density Distribution Planning
We propose a novel crowd data-driven optimization approach for artwork placement in
virtual exhibitions. With the emerging concept of Metaverse, a multitude of users can engage …
virtual exhibitions. With the emerging concept of Metaverse, a multitude of users can engage …