Devnet: A deep event network for multimedia event detection and evidence recounting

A Voulodimos, N Doulamis, A Doulamis… - Computational …, 2018 - Wiley Online Library

Over the last years deep learning methods have been shown to outperform previous state‐of‐
the‐art machine learning techniques in several fields, with computer vision being one of the …

被引用次数：3613 相关文章所有 15 个版本

[PDF] frontiersin.org

A review of human activity recognition methods

M Vrigkas, C Nikou, IA Kakadiaris - Frontiers in Robotics and AI, 2015 - frontiersin.org

Recognizing human activities from video sequences or still images is a challenging task due
to problems, such as background clutter, partial occlusion, changes in scale, viewpoint …

被引用次数：622 相关文章所有 8 个版本

[PDF] neurips.cc

On-device training under 256kb memory

J Lin, L Zhu, WM Chen, WC Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc

On-device training enables the model to adapt to new data collected from the sensors by
fine-tuning a pre-trained model. Users can benefit from customized AI models without having …

被引用次数：162 相关文章所有 8 个版本

[PDF] thecvf.com

Visual saliency transformer

N Liu, N Zhang, K Wan, L Shao… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Existing state-of-the-art saliency detection methods heavily rely on CNN-based
architectures. Alternatively, we rethink this task from a convolution-free sequence-to …

被引用次数：409 相关文章所有 7 个版本

[PDF] springer.com

Vision-based human activity recognition: a survey

DR Beddiar, B Nini, M Sabokrou, A Hadid - Multimedia Tools and …, 2020 - Springer

Human activity recognition (HAR) systems attempt to automatically identify and analyze
human activities using acquired information from various types of sensors. Although several …

被引用次数：401 相关文章所有 8 个版本

[PDF] thecvf.com

Graph convolutional networks for temporal action localization

R Zeng, W Huang, M Tan, Y Rong… - Proceedings of the …, 2019 - openaccess.thecvf.com

Most state-of-the-art action localization systems process each action proposal individually,
without explicitly exploiting their relations during learning. However, the relations between …

被引用次数：569 相关文章所有 8 个版本

[PDF] thecvf.com

Tsm: Temporal shift module for efficient video understanding

J Lin, C Gan, S Han - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com

The explosive growth in video streaming gives rise to challenges on performing video
understanding at high accuracy and low computation cost. Conventional 2D CNNs are …

被引用次数：2048 相关文章所有 15 个版本

[PDF] thecvf.com

Group-aware label transfer for domain adaptive person re-identification

K Zheng, W Liu, L He, T Mei, J Luo… - Proceedings of the …, 2021 - openaccess.thecvf.com

Abstract Unsupervised Domain Adaptive (UDA) person re-identification (ReID) aims at
adapting the model trained on a labeled source-domain dataset to a target-domain dataset …

被引用次数：194 相关文章所有 6 个版本

[PDF] neurips.cc

Tinytl: Reduce memory, not parameters for efficient on-device learning

H Cai, C Gan, L Zhu, S Han - Advances in Neural …, 2020 - proceedings.neurips.cc

Efficient on-device learning requires a small memory footprint at training time to fit the tight
memory constraint. Existing work solves this problem by reducing the number of trainable …

被引用次数：256 相关文章所有 6 个版本

[PDF] thecvf.com

Grad-cam: Visual explanations from deep networks via gradient-based localization

RR Selvaraju, M Cogswell, A Das… - Proceedings of the …, 2017 - openaccess.thecvf.com

We propose a technique for producing'visual explanations' for decisions from a large class
of Convolutional Neural Network (CNN)-based models, making them more transparent. Our …

被引用次数：18980 相关文章所有 11 个版本