Bridging the gap to real-world object-centric learning
Humans naturally decompose their environment into entities at the appropriate level of
abstraction to act in the world. Allowing machine learning algorithms to derive this …
abstraction to act in the world. Allowing machine learning algorithms to derive this …
Provably learning object-centric representations
Learning structured representations of the visual world in terms of objects promises to
significantly improve the generalization abilities of current machine learning models. While …
significantly improve the generalization abilities of current machine learning models. While …
Ogc: Unsupervised 3d object segmentation from rigid dynamics of point clouds
In this paper, we study the problem of 3D object segmentation from raw point clouds. Unlike
all existing methods which usually require a large amount of human annotations for full …
all existing methods which usually require a large amount of human annotations for full …
Object-centric learning for real-world videos by predicting temporal feature similarities
A Zadaianchuk, M Seitzer… - Advances in Neural …, 2024 - proceedings.neurips.cc
Unsupervised video-based object-centric learning is a promising avenue to learn structured
representations from large, unlabeled video collections, but previous approaches have only …
representations from large, unlabeled video collections, but previous approaches have only …
Improving object-centric learning with query optimization
The ability to decompose complex natural scenes into meaningful object-centric abstractions
lies at the core of human perception and reasoning. In the recent culmination of …
lies at the core of human perception and reasoning. In the recent culmination of …
Compositional scene representation learning via reconstruction: A survey
Visual scenes are composed of visual concepts and have the property of combinatorial
explosion. An important reason for humans to efficiently learn from diverse visual scenes is …
explosion. An important reason for humans to efficiently learn from diverse visual scenes is …
SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
I Kakogeorgiou, S Gidaris… - Proceedings of the …, 2024 - openaccess.thecvf.com
Unsupervised object-centric learning aims to decompose scenes into interpretable object
entities termed slots. Slot-based auto-encoders stand out as a prominent method for this …
entities termed slots. Slot-based auto-encoders stand out as a prominent method for this …
Multi-object representation learning via feature connectivity and object-centric regularization
Discovering object-centric representations from images has the potential to greatly improve
the robustness, sample efficiency and interpretability of machine learning algorithms …
the robustness, sample efficiency and interpretability of machine learning algorithms …
Recasting Generic Pretrained Vision Transformers As Object-Centric Scene Encoders For Manipulation Policies
J Qian, A Panagopoulos, D Jayaraman - arXiv preprint arXiv:2405.15916, 2024 - arxiv.org
Generic re-usable pre-trained image representation encoders have become a standard
component of methods for many computer vision tasks. As visual representations for robots …
component of methods for many computer vision tasks. As visual representations for robots …
Unsupervised 3D Object Segmentation of Point Clouds by Geometry Consistency
In this paper, we study the problem of 3D object segmentation from raw point clouds. Unlike
existing methods which usually require a large amount of human annotations for full …
existing methods which usually require a large amount of human annotations for full …