Siamese neural networks: An overview

D Chicco - Artificial neural networks, 2021 - Springer
Similarity has always been a key aspect in computer science and statistics. Any time two
element vectors are compared, many different similarity approaches can be used …

Recent advances in deep learning for object detection

X Wu, D Sahoo, SCH Hoi - Neurocomputing, 2020 - Elsevier
Object detection is a fundamental visual recognition problem in computer vision and has
been widely studied in the past decades. Visual object detection aims to find objects of …

A large-scale study on unsupervised spatiotemporal representation learning

C Feichtenhofer, H Fan, B Xiong… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present a large-scale study on unsupervised spatiotemporal representation learning
from videos. With a unified perspective on four recent image-based frameworks, we study a …

Contrastive multiview coding

Y Tian, D Krishnan, P Isola - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Humans view the world through many sensory channels, eg, the long-wavelength light
channel, viewed by the left eye, or the high-frequency vibrations channel, heard by the right …

Self-supervised learning for medical image analysis using image context restoration

L Chen, P Bentley, K Mori, K Misawa, M Fujiwara… - Medical image …, 2019 - Elsevier
Abstract Machine learning, particularly deep learning has boosted medical image analysis
over the past years. Training a good model based on deep learning requires large amount …

Slow down to go better: A survey on slow feature analysis

P Song, C Zhao - IEEE Transactions on Neural Networks and …, 2022 - ieeexplore.ieee.org
Temporal data contain a wealth of valuable information, playing an essential role in various
machine-learning tasks. Slow feature analysis (SFA), one of the most classic temporal …

Scaling and benchmarking self-supervised visual representation learning

P Goyal, D Mahajan, A Gupta… - Proceedings of the ieee …, 2019 - openaccess.thecvf.com
Self-supervised learning aims to learn representations from the data itself without explicit
manual supervision. Existing efforts ignore a crucial aspect of self-supervised learning-the …

Graph convolutional label noise cleaner: Train a plug-and-play action classifier for anomaly detection

JX Zhong, N Li, W Kong, S Liu… - Proceedings of the …, 2019 - openaccess.thecvf.com
Video anomaly detection under weak labels is formulated as a typical multiple-instance
learning problem in previous works. In this paper, we provide a new perspective, ie, a …

Audio-visual instance discrimination with cross-modal agreement

P Morgado, N Vasconcelos… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We present a self-supervised learning approach to learn audio-visual representations from
video and audio. Our method uses contrastive learning for cross-modal discrimination of …

Multi-task self-supervised visual learning

C Doersch, A Zisserman - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
We investigate methods for combining multiple self-supervised tasks---ie, supervised tasks
where data can be collected without manual labeling---in order to train a single visual …