Kubric: A scalable dataset generator
Data is the driving force of machine learning, with the amount and quality of training data
often being more important for the performance of a system than architecture and training …
often being more important for the performance of a system than architecture and training …
Savi++: Towards end-to-end object-centric learning from real-world videos
G Elsayed, A Mahendran… - Advances in …, 2022 - proceedings.neurips.cc
The visual world can be parsimoniously characterized in terms of distinct entities with sparse
interactions. Discovering this compositional structure in dynamic visual scenes has proven …
interactions. Discovering this compositional structure in dynamic visual scenes has proven …
Conditional object-centric learning from video
Object-centric representations are a promising path toward more systematic generalization
by providing flexible abstractions upon which compositional world models can be built …
by providing flexible abstractions upon which compositional world models can be built …
Object scene representation transformer
MSM Sajjadi, D Duckworth… - Advances in neural …, 2022 - proceedings.neurips.cc
A compositional understanding of the world in terms of objects and their geometry in 3D
space is considered a cornerstone of human cognition. Facilitating the learning of such a …
space is considered a cornerstone of human cognition. Facilitating the learning of such a …
Simple unsupervised object-centric learning for complex and naturalistic videos
Unsupervised object-centric learning aims to represent the modular, compositional, and
causal structure of a scene as a set of object representations and thereby promises to …
causal structure of a scene as a set of object representations and thereby promises to …
Illiterate dall-e learns to compose
Although DALL-E has shown an impressive ability of composition-based systematic
generalization in image generation, it requires the dataset of text-image pairs and the …
generalization in image generation, it requires the dataset of text-image pairs and the …
Object discovery and representation networks
The promise of self-supervised learning (SSL) is to leverage large amounts of unlabeled
data to solve complex tasks. While there has been excellent progress with simple, image …
data to solve complex tasks. While there has been excellent progress with simple, image …
Towards unsupervised object detection from lidar point clouds
In this paper, we study the problem of unsupervised object detection from 3D point clouds in
self-driving scenes. We present a simple yet effective method that exploits (i) point clustering …
self-driving scenes. We present a simple yet effective method that exploits (i) point clustering …
Slotformer: Unsupervised visual dynamics simulation with object-centric models
Understanding dynamics from visual observations is a challenging problem that requires
disentangling individual objects from the scene and learning their interactions. While recent …
disentangling individual objects from the scene and learning their interactions. While recent …
Decomposing 3d scenes into objects via unsupervised volume segmentation
We present ObSuRF, a method which turns a single image of a scene into a 3D model
represented as a set of Neural Radiance Fields (NeRFs), with each NeRF corresponding to …
represented as a set of Neural Radiance Fields (NeRFs), with each NeRF corresponding to …