Human action recognition from various data modalities: A review
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …
each action. It has a wide range of applications, and therefore has been attracting increasing …
A survey on deep learning: Algorithms, techniques, and applications
The field of machine learning is witnessing its golden era as deep learning slowly becomes
the leader in this domain. Deep learning uses multiple layers to represent the abstractions of …
the leader in this domain. Deep learning uses multiple layers to represent the abstractions of …
Diffusion probabilistic modeling for video generation
Denoising diffusion probabilistic models are a promising new class of generative models
that mark a milestone in high-quality image generation. This paper showcases their ability to …
that mark a milestone in high-quality image generation. This paper showcases their ability to …
Fiery: Future instance prediction in bird's-eye view from surround monocular cameras
Driving requires interacting with road agents and predicting their future behaviour in order to
navigate safely. We present FIERY: a probabilistic future prediction model in bird's-eye view …
navigate safely. We present FIERY: a probabilistic future prediction model in bird's-eye view …
Recurring the transformer for video action recognition
Existing video understanding approaches, such as 3D convolutional neural networks and
Transformer-Based methods, usually process the videos in a clip-wise manner. Hence huge …
Transformer-Based methods, usually process the videos in a clip-wise manner. Hence huge …
Video object segmentation with episodic graph memory networks
How to make a segmentation model efficiently adapt to a specific video as well as online
target appearance variations is a fundamental issue in the field of video object …
target appearance variations is a fundamental issue in the field of video object …
Learning to track with object permanence
Tracking by detection, the dominant approach for online multi-object tracking, alternates
between localization and association steps. As a result, it strongly depends on the quality of …
between localization and association steps. As a result, it strongly depends on the quality of …
Full-duplex strategy for video object segmentation
Appearance and motion are two important sources of information in video object
segmentation (VOS). Previous methods mainly focus on using simplex solutions, lowering …
segmentation (VOS). Previous methods mainly focus on using simplex solutions, lowering …
Robust high-resolution video matting with temporal guidance
We introduce a robust, real-time, high-resolution human video matting method that achieves
new state-of-the-art performance. Our method is much lighter than previous approaches and …
new state-of-the-art performance. Our method is much lighter than previous approaches and …
Know your surroundings: Exploiting scene information for object tracking
Current state-of-the-art trackers rely only on a target appearance model in order to localize
the object in each frame. Such approaches are however prone to fail in case of eg fast …
the object in each frame. Such approaches are however prone to fail in case of eg fast …