Deep learning from temporal coherence in video

D Chicco - Artificial neural networks, 2021 - Springer

Similarity has always been a key aspect in computer science and statistics. Any time two
element vectors are compared, many different similarity approaches can be used …

被引用次数：635 相关文章所有 7 个版本

[PDF] arxiv.org

Recent advances in deep learning for object detection

X Wu, D Sahoo, SCH Hoi - Neurocomputing, 2020 - Elsevier

Object detection is a fundamental visual recognition problem in computer vision and has
been widely studied in the past decades. Visual object detection aims to find objects of …

被引用次数：926 相关文章所有 9 个版本

[PDF] thecvf.com

A large-scale study on unsupervised spatiotemporal representation learning

C Feichtenhofer, H Fan, B Xiong… - Proceedings of the …, 2021 - openaccess.thecvf.com

We present a large-scale study on unsupervised spatiotemporal representation learning
from videos. With a unified perspective on four recent image-based frameworks, we study a …

被引用次数：263 相关文章所有 6 个版本

[PDF] arxiv.org

Contrastive multiview coding

Y Tian, D Krishnan, P Isola - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer

Humans view the world through many sensory channels, eg, the long-wavelength light
channel, viewed by the left eye, or the high-frequency vibrations channel, heard by the right …

被引用次数：2492 相关文章所有 11 个版本

[HTML] nih.gov

Self-supervised learning for medical image analysis using image context restoration

L Chen, P Bentley, K Mori, K Misawa, M Fujiwara… - Medical image …, 2019 - Elsevier

Abstract Machine learning, particularly deep learning has boosted medical image analysis
over the past years. Training a good model based on deep learning requires large amount …

被引用次数：513 相关文章所有 8 个版本

Slow down to go better: A survey on slow feature analysis

P Song, C Zhao - IEEE Transactions on Neural Networks and …, 2022 - ieeexplore.ieee.org

Temporal data contain a wealth of valuable information, playing an essential role in various
machine-learning tasks. Slow feature analysis (SFA), one of the most classic temporal …

被引用次数：79 相关文章所有 3 个版本

[PDF] thecvf.com

Scaling and benchmarking self-supervised visual representation learning

P Goyal, D Mahajan, A Gupta… - Proceedings of the ieee …, 2019 - openaccess.thecvf.com

Self-supervised learning aims to learn representations from the data itself without explicit
manual supervision. Existing efforts ignore a crucial aspect of self-supervised learning-the …

被引用次数：457 相关文章所有 10 个版本

[PDF] thecvf.com

Graph convolutional label noise cleaner: Train a plug-and-play action classifier for anomaly detection

JX Zhong, N Li, W Kong, S Liu… - Proceedings of the …, 2019 - openaccess.thecvf.com

Video anomaly detection under weak labels is formulated as a typical multiple-instance
learning problem in previous works. In this paper, we provide a new perspective, ie, a …

被引用次数：450 相关文章所有 6 个版本

[PDF] thecvf.com

Audio-visual instance discrimination with cross-modal agreement

P Morgado, N Vasconcelos… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

We present a self-supervised learning approach to learn audio-visual representations from
video and audio. Our method uses contrastive learning for cross-modal discrimination of …

被引用次数：275 相关文章所有 7 个版本

[PDF] thecvf.com

Multi-task self-supervised visual learning

C Doersch, A Zisserman - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

We investigate methods for combining multiple self-supervised tasks---ie, supervised tasks
where data can be collected without manual labeling---in order to train a single visual …

被引用次数：771 相关文章所有 8 个版本