Bridging the gap to real-world object-centric learning

M Seitzer, M Horn, A Zadaianchuk, D Zietlow… - arXiv preprint arXiv …, 2022 - arxiv.org
Humans naturally decompose their environment into entities at the appropriate level of
abstraction to act in the world. Allowing machine learning algorithms to derive this …

Provably learning object-centric representations

J Brady, RS Zimmermann, Y Sharma… - International …, 2023 - proceedings.mlr.press
Learning structured representations of the visual world in terms of objects promises to
significantly improve the generalization abilities of current machine learning models. While …

Ogc: Unsupervised 3d object segmentation from rigid dynamics of point clouds

Z Song, B Yang - Advances in Neural Information …, 2022 - proceedings.neurips.cc
In this paper, we study the problem of 3D object segmentation from raw point clouds. Unlike
all existing methods which usually require a large amount of human annotations for full …

Object-centric learning for real-world videos by predicting temporal feature similarities

A Zadaianchuk, M Seitzer… - Advances in Neural …, 2024 - proceedings.neurips.cc
Unsupervised video-based object-centric learning is a promising avenue to learn structured
representations from large, unlabeled video collections, but previous approaches have only …

Improving object-centric learning with query optimization

B Jia, Y Liu, S Huang - arXiv preprint arXiv:2210.08990, 2022 - arxiv.org
The ability to decompose complex natural scenes into meaningful object-centric abstractions
lies at the core of human perception and reasoning. In the recent culmination of …

Compositional scene representation learning via reconstruction: A survey

J Yuan, T Chen, B Li, X Xue - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org
Visual scenes are composed of visual concepts and have the property of combinatorial
explosion. An important reason for humans to efficiently learn from diverse visual scenes is …

SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

I Kakogeorgiou, S Gidaris… - Proceedings of the …, 2024 - openaccess.thecvf.com
Unsupervised object-centric learning aims to decompose scenes into interpretable object
entities termed slots. Slot-based auto-encoders stand out as a prominent method for this …

Multi-object representation learning via feature connectivity and object-centric regularization

A Foo, W Hsu, ML Lee - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Discovering object-centric representations from images has the potential to greatly improve
the robustness, sample efficiency and interpretability of machine learning algorithms …

Recasting Generic Pretrained Vision Transformers As Object-Centric Scene Encoders For Manipulation Policies

J Qian, A Panagopoulos, D Jayaraman - arXiv preprint arXiv:2405.15916, 2024 - arxiv.org
Generic re-usable pre-trained image representation encoders have become a standard
component of methods for many computer vision tasks. As visual representations for robots …

Unsupervised 3D Object Segmentation of Point Clouds by Geometry Consistency

Z Song, B Yang - IEEE Transactions on Pattern Analysis and …, 2024 - ieeexplore.ieee.org
In this paper, we study the problem of 3D object segmentation from raw point clouds. Unlike
existing methods which usually require a large amount of human annotations for full …