Human action recognition from various data modalities: A review

Z Sun, Q Ke, H Rahmani, M Bennamoun… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …

A survey on deep learning: Algorithms, techniques, and applications

S Pouyanfar, S Sadiq, Y Yan, H Tian, Y Tao… - ACM computing …, 2018 - dl.acm.org
The field of machine learning is witnessing its golden era as deep learning slowly becomes
the leader in this domain. Deep learning uses multiple layers to represent the abstractions of …

Diffusion probabilistic modeling for video generation

R Yang, P Srivastava, S Mandt - Entropy, 2023 - mdpi.com
Denoising diffusion probabilistic models are a promising new class of generative models
that mark a milestone in high-quality image generation. This paper showcases their ability to …

Fiery: Future instance prediction in bird's-eye view from surround monocular cameras

A Hu, Z Murez, N Mohan, S Dudas… - Proceedings of the …, 2021 - openaccess.thecvf.com
Driving requires interacting with road agents and predicting their future behaviour in order to
navigate safely. We present FIERY: a probabilistic future prediction model in bird's-eye view …

Recurring the transformer for video action recognition

J Yang, X Dong, L Liu, C Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Existing video understanding approaches, such as 3D convolutional neural networks and
Transformer-Based methods, usually process the videos in a clip-wise manner. Hence huge …

Video object segmentation with episodic graph memory networks

X Lu, W Wang, M Danelljan, T Zhou, J Shen… - Computer Vision–ECCV …, 2020 - Springer
How to make a segmentation model efficiently adapt to a specific video as well as online
target appearance variations is a fundamental issue in the field of video object …

Learning to track with object permanence

P Tokmakov, J Li, W Burgard… - Proceedings of the …, 2021 - openaccess.thecvf.com
Tracking by detection, the dominant approach for online multi-object tracking, alternates
between localization and association steps. As a result, it strongly depends on the quality of …

Full-duplex strategy for video object segmentation

GP Ji, K Fu, Z Wu, DP Fan, J Shen… - Proceedings of the …, 2021 - openaccess.thecvf.com
Appearance and motion are two important sources of information in video object
segmentation (VOS). Previous methods mainly focus on using simplex solutions, lowering …

Robust high-resolution video matting with temporal guidance

S Lin, L Yang, I Saleemi… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We introduce a robust, real-time, high-resolution human video matting method that achieves
new state-of-the-art performance. Our method is much lighter than previous approaches and …

Know your surroundings: Exploiting scene information for object tracking

G Bhat, M Danelljan, L Van Gool, R Timofte - Computer Vision–ECCV …, 2020 - Springer
Current state-of-the-art trackers rely only on a target appearance model in order to localize
the object in each frame. Such approaches are however prone to fail in case of eg fast …