A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

[HTML][HTML] A survey on deep learning-based monocular spacecraft pose estimation: Current state, limitations and prospects

L Pauly, W Rharbaoui, C Shneider, A Rathinam… - Acta Astronautica, 2023 - Elsevier
Estimating the pose of an uncooperative spacecraft is an important computer vision problem
for enabling the deployment of automatic vision-based systems in orbit, with applications …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Neural feature fusion fields: 3d distillation of self-supervised 2d image representations

V Tschernezki, I Laina, D Larlus… - … Conference on 3D …, 2022 - ieeexplore.ieee.org
We present Neural Feature Fusion Fields (N3F),\a method that improves dense 2D image
feature extractors when the latter are applied to the analysis of multiple images …

[PDF][PDF] Deep vit features as dense visual descriptors

S Amir, Y Gandelsman, S Bagon… - arXiv preprint arXiv …, 2021 - dino-vit-features.github.io
We study the use of deep features extracted from a pretrained Vision Transformer (ViT) as
dense visual descriptors. We observe and empirically demonstrate that such features, when …

Bridging the gap to real-world object-centric learning

M Seitzer, M Horn, A Zadaianchuk, D Zietlow… - arXiv preprint arXiv …, 2022 - arxiv.org
Humans naturally decompose their environment into entities at the appropriate level of
abstraction to act in the world. Allowing machine learning algorithms to derive this …

Simple unsupervised object-centric learning for complex and naturalistic videos

G Singh, YF Wu, S Ahn - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Unsupervised object-centric learning aims to represent the modular, compositional, and
causal structure of a scene as a set of object representations and thereby promises to …

Reco: Retrieve and co-segment for zero-shot transfer

G Shin, W Xie, S Albanie - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Semantic segmentation has a broad range of applications, but its real-world impact has
been significantly limited by the prohibitive annotation costs necessary to enable …

Self-supervised object-centric learning for videos

G Aydemir, W Xie, F Guney - Advances in Neural …, 2023 - proceedings.neurips.cc
Unsupervised multi-object segmentation has shown impressive results on images by
utilizing powerful semantics learned from self-supervised pretraining. An additional modality …

Featurenerf: Learning generalizable nerfs by distilling foundation models

J Ye, N Wang, X Wang - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Recent works on generalizable NeRFs have shown promising results on novel view
synthesis from single or few images. However, such models have rarely been applied on …