Temporal action segmentation: An analysis of modern techniques

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

Fine-grained temporal contrastive learning for weakly-supervised temporal action localization

J Gao, M Chen, C Xu - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
We target at the task of weakly-supervised action localization (WSAL), where only video-
level action labels are available during model training. Despite the recent progress, existing …

Deep learning approaches for seizure video analysis: A review

D Ahmedt-Aristizabal, MA Armin, Z Hayder… - Epilepsy & Behavior, 2024 - Elsevier
Seizure events can manifest as transient disruptions in the control of movements which may
be organized in distinct behavioral sequences, accompanied or not by other observable …

Weakly-supervised action segmentation and unseen error detection in anomalous instructional videos

R Ghoddoosian, I Dwivedi… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a novel method for weakly-supervised action segmentation and unseen error
detection in anomalous instructional videos. In the absence of an appropriate dataset for this …

Stepformer: Self-supervised step discovery and localization in instructional videos

N Dvornik, I Hadji, R Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Instructional videos are an important resource to learn procedural tasks from human
demonstrations. However, the instruction steps in such videos are typically short and sparse …

Drop-dtw: Aligning common signal between sequences while dropping outliers

M Dvornik, I Hadji, KG Derpanis… - Advances in Neural …, 2021 - proceedings.neurips.cc
In this work, we consider the problem of sequence-to-sequence alignment for signals
containing outliers. Assuming the absence of outliers, the standard Dynamic Time Warping …

Video-text representation learning via differentiable weak temporal alignment

D Ko, J Choi, J Ko, S Noh, KW On… - Proceedings of the …, 2022 - openaccess.thecvf.com
Learning generic joint representations for video and text by a supervised method requires a
prohibitively substantial amount of manually annotated video datasets. As a practical …

Weakly-supervised online action segmentation in multi-view instructional videos

R Ghoddoosian, I Dwivedi, N Agarwal… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper addresses a new problem of weakly-supervised online action segmentation in
instructional videos. We present a framework to segment streaming videos online at test time …

Flow graph to video grounding for weakly-supervised multi-step localization

N Dvornik, I Hadji, H Pham, D Bhatt, B Martinez… - … on Computer Vision, 2022 - Springer
In this work, we consider the problem of weakly-supervised multi-step localization in
instructional videos. An established approach to this problem is to rely on a given list of …

Semi-weakly-supervised learning of complex actions from instructional task videos

Y Shen, E Elhamifar - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
We address the problem of action segmentation in instructional task videos with a small
number of weakly-labeled training videos and a large number of unlabeled videos, which …