ActionFormer: Localizing moments of actions with transformers
Self-attention based Transformer models have demonstrated impressive results for image
classification and object detection, and more recently for video understanding. Inspired by …
End-to-end temporal action detection with transformer
Temporal action detection (TAD) aims to determine the semantic label and the temporal
interval of every action instance in an untrimmed video. It is a fundamental and challenging …
PhysFormer: Facial video-based physiological measurement with temporal difference transformer
Remote photoplethysmography (rPPG), which aims at measuring heart activities and
physiological signals from facial video without any contact, has great potential in many …
PhysFormer++: Facial video-based physiological measurement with slowfast temporal difference transformer
Remote photoplethysmography (rPPG), which aims at measuring heart activities and
physiological signals from facial video without any contact, has great potential in many …
Proposal-free temporal action detection via global segmentation mask learning
Existing temporal action detection (TAD) methods rely on generating an overwhelmingly
large number of proposals per video. This leads to complex model designs due to proposal …
DiffTAD: Temporal action detection with proposal denoising diffusion
We propose a new formulation of temporal action detection (TAD) with denoising diffusion,
DiffTAD in short. Taking as input random temporal proposals, it can yield action proposals …
GateHUB: Gated history unit with background suppression for online action detection
Online action detection is the task of predicting the action as soon as it happens in a
streaming video. A major challenge is that the model does not have access to the future and …
TemporalMaxer: Maximize temporal context with only max pooling for temporal action localization
Temporal Action Localization (TAL) is a challenging task in video understanding that aims to
identify and localize actions within a video sequence. Recent studies have emphasized the …
Temporal action proposal generation with background constraint
Temporal action proposal generation (TAPG) is a challenging task that aims to locate action
instances in untrimmed videos with temporal boundaries. To evaluate the confidence of …
Good practices and a strong baseline for traffic anomaly detection
The detection of traffic anomalies is a critical component of the intelligent city transportation
management system. Previous works have proposed a variety of notable insights and taken …