Deep learning for computer vision: A brief review

A Voulodimos, N Doulamis, A Doulamis… - Computational …, 2018 - Wiley Online Library
Over the last years deep learning methods have been shown to outperform previous state‐of‐
the‐art machine learning techniques in several fields, with computer vision being one of the …

A review of human activity recognition methods

M Vrigkas, C Nikou, IA Kakadiaris - Frontiers in Robotics and AI, 2015 - frontiersin.org
Recognizing human activities from video sequences or still images is a challenging task due
to problems, such as background clutter, partial occlusion, changes in scale, viewpoint …

On-device training under 256kb memory

J Lin, L Zhu, WM Chen, WC Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc
On-device training enables the model to adapt to new data collected from the sensors by
fine-tuning a pre-trained model. Users can benefit from customized AI models without having …

Visual saliency transformer

N Liu, N Zhang, K Wan, L Shao… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Existing state-of-the-art saliency detection methods heavily rely on CNN-based
architectures. Alternatively, we rethink this task from a convolution-free sequence-to …

Vision-based human activity recognition: a survey

DR Beddiar, B Nini, M Sabokrou, A Hadid - Multimedia Tools and …, 2020 - Springer
Human activity recognition (HAR) systems attempt to automatically identify and analyze
human activities using acquired information from various types of sensors. Although several …

Graph convolutional networks for temporal action localization

R Zeng, W Huang, M Tan, Y Rong… - Proceedings of the …, 2019 - openaccess.thecvf.com
Most state-of-the-art action localization systems process each action proposal individually,
without explicitly exploiting their relations during learning. However, the relations between …

Tsm: Temporal shift module for efficient video understanding

J Lin, C Gan, S Han - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
The explosive growth in video streaming gives rise to challenges on performing video
understanding at high accuracy and low computation cost. Conventional 2D CNNs are …

Group-aware label transfer for domain adaptive person re-identification

K Zheng, W Liu, L He, T Mei, J Luo… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Unsupervised Domain Adaptive (UDA) person re-identification (ReID) aims at
adapting the model trained on a labeled source-domain dataset to a target-domain dataset …

Tinytl: Reduce memory, not parameters for efficient on-device learning

H Cai, C Gan, L Zhu, S Han - Advances in Neural …, 2020 - proceedings.neurips.cc
Efficient on-device learning requires a small memory footprint at training time to fit the tight
memory constraint. Existing work solves this problem by reducing the number of trainable …

Grad-cam: Visual explanations from deep networks via gradient-based localization

RR Selvaraju, M Cogswell, A Das… - Proceedings of the …, 2017 - openaccess.thecvf.com
We propose a technique for producing'visual explanations' for decisions from a large class
of Convolutional Neural Network (CNN)-based models, making them more transparent. Our …