A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

Diffusion probabilistic modeling for video generation

R Yang, P Srivastava, S Mandt - Entropy, 2023 - mdpi.com
Denoising diffusion probabilistic models are a promising new class of generative models
that mark a milestone in high-quality image generation. This paper showcases their ability to …

Generative image dynamics

Z Li, R Tucker, N Snavely… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present an approach to modeling an image-space prior on scene motion. Our prior is
learned from a collection of motion trajectories extracted from real video sequences …

Diffusion with forward models: Solving stochastic inverse problems without direct supervision

A Tewari, T Yin, G Cazenavette… - Advances in …, 2023 - proceedings.neurips.cc
Denoising diffusion models are a powerful type of generative models used to capture
complex distributions of real-world signals. However, their applicability is limited to …

Neat: Neural attention fields for end-to-end autonomous driving

K Chitta, A Prakash, A Geiger - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Efficient reasoning about the semantic, spatial, and temporal structure of a scene is a crucial
prerequisite for autonomous driving. We present NEural ATtention fields (NEAT), a novel …

Conditional object-centric learning from video

T Kipf, GF Elsayed, A Mahendran, A Stone… - arXiv preprint arXiv …, 2021 - arxiv.org
Object-centric representations are a promising path toward more systematic generalization
by providing flexible abstractions upon which compositional world models can be built …

Self-supervised learning for medical image analysis using image context restoration

L Chen, P Bentley, K Mori, K Misawa, M Fujiwara… - Medical image …, 2019 - Elsevier
Abstract Machine learning, particularly deep learning has boosted medical image analysis
over the past years. Training a good model based on deep learning requires large amount …

A review of predictive and contrastive self-supervised learning for medical images

WC Wang, E Ahn, D Feng, J Kim - Machine Intelligence Research, 2023 - Springer
Over the last decade, supervised deep learning on manually annotated big data has been
progressing significantly on computer vision tasks. But, the application of deep learning in …

Diffusion models for video prediction and infilling

T Höppe, A Mehrjou, S Bauer, D Nielsen… - arXiv preprint arXiv …, 2022 - arxiv.org
Predicting and anticipating future outcomes or reasoning about missing information in a
sequence are critical skills for agents to be able to make intelligent decisions. This requires …

Learning 3d human dynamics from video

A Kanazawa, JY Zhang, P Felsen… - Proceedings of the …, 2019 - openaccess.thecvf.com
From an image of a person in action, we can easily guess the 3D motion of the person in the
immediate past and future. This is because we have a mental model of 3D human dynamics …