Dense optical flow prediction from a static image

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

被引用次数：61 相关文章所有 3 个版本

[PDF] mdpi.com

Diffusion probabilistic modeling for video generation

R Yang, P Srivastava, S Mandt - Entropy, 2023 - mdpi.com

Denoising diffusion probabilistic models are a promising new class of generative models
that mark a milestone in high-quality image generation. This paper showcases their ability to …

被引用次数：203 相关文章所有 9 个版本

[PDF] thecvf.com

Generative image dynamics

Z Li, R Tucker, N Snavely… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

We present an approach to modeling an image-space prior on scene motion. Our prior is
learned from a collection of motion trajectories extracted from real video sequences …

被引用次数：37 相关文章所有 9 个版本

[PDF] neurips.cc

Diffusion with forward models: Solving stochastic inverse problems without direct supervision

A Tewari, T Yin, G Cazenavette… - Advances in …, 2023 - proceedings.neurips.cc

Denoising diffusion models are a powerful type of generative models used to capture
complex distributions of real-world signals. However, their applicability is limited to …

被引用次数：58 相关文章所有 7 个版本

[PDF] thecvf.com

Neat: Neural attention fields for end-to-end autonomous driving

K Chitta, A Prakash, A Geiger - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Efficient reasoning about the semantic, spatial, and temporal structure of a scene is a crucial
prerequisite for autonomous driving. We present NEural ATtention fields (NEAT), a novel …

被引用次数：204 相关文章所有 8 个版本

[PDF] arxiv.org

Conditional object-centric learning from video

T Kipf, GF Elsayed, A Mahendran, A Stone… - arXiv preprint arXiv …, 2021 - arxiv.org

Object-centric representations are a promising path toward more systematic generalization
by providing flexible abstractions upon which compositional world models can be built …

被引用次数：186 相关文章所有 3 个版本

[HTML] nih.gov

Self-supervised learning for medical image analysis using image context restoration

L Chen, P Bentley, K Mori, K Misawa, M Fujiwara… - Medical image …, 2019 - Elsevier

Abstract Machine learning, particularly deep learning has boosted medical image analysis
over the past years. Training a good model based on deep learning requires large amount …

被引用次数：542 相关文章所有 8 个版本

[PDF] springer.com

A review of predictive and contrastive self-supervised learning for medical images

WC Wang, E Ahn, D Feng, J Kim - Machine Intelligence Research, 2023 - Springer

Over the last decade, supervised deep learning on manually annotated big data has been
progressing significantly on computer vision tasks. But, the application of deep learning in …

被引用次数：22 相关文章所有 9 个版本

[PDF] arxiv.org

Diffusion models for video prediction and infilling

T Höppe, A Mehrjou, S Bauer, D Nielsen… - arXiv preprint arXiv …, 2022 - arxiv.org

Predicting and anticipating future outcomes or reasoning about missing information in a
sequence are critical skills for agents to be able to make intelligent decisions. This requires …

被引用次数：101 相关文章所有 5 个版本

[PDF] thecvf.com

Learning 3d human dynamics from video

A Kanazawa, JY Zhang, P Felsen… - Proceedings of the …, 2019 - openaccess.thecvf.com

From an image of a person in action, we can easily guess the 3D motion of the person in the
immediate past and future. This is because we have a mental model of 3D human dynamics …

被引用次数：557 相关文章所有 8 个版本