Temporal action segmentation: An analysis of modern techniques

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

Error detection in egocentric procedural task videos

SP Lee, Z Lu, Z Zhang, M Hoai… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present a new egocentric procedural error dataset containing videos with various types
of errors as well as normal videos and propose a new framework for procedural error …

Fact: Frame-action cross-attention temporal modeling for efficient action segmentation

Z Lu, E Elhamifar - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We study supervised action segmentation whose goal is to predict framewise action labels
of a video. To capture temporal dependencies over long horizons prior works either improve …

Progress-aware online action segmentation for egocentric procedural task videos

Y Shen, E Elhamifar - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We address the problem of online action segmentation for egocentric procedural task
videos. While previous studies have mostly focused on offline action segmentation where …

Permutation-aware activity segmentation via unsupervised frame-to-segment alignment

QH Tran, A Mehmood, M Ahmed… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper presents an unsupervised transformer-based framework for temporal activity
segmentation which leverages not only frame-level cues but also segment-level cues. This …

Positive and Negative Set Designs in Contrastive Feature Learning for Temporal Action Segmentation

YC Chen, WT Chu - IEEE Transactions on Circuits and Systems …, 2024 - ieeexplore.ieee.org
When data labels are scarce, contrastive learning is often used to learn representations in a
weakly-supervised or unsupervised way. In contrastive learning, not only the learning …

Coherent Temporal Synthesis for Incremental Action Segmentation

G Ding, H Golong, A Yao - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Data replay is a successful incremental learning technique for images. It prevents
catastrophic forgetting by keeping a reservoir of previous data original or synthesized to …

Learning temporal sentence grounding from narrated egovideos

K Flanagan, D Damen, M Wray - arXiv preprint arXiv:2310.17395, 2023 - arxiv.org
The onset of long-form egocentric datasets such as Ego4D and EPIC-Kitchens presents a
new challenge for the task of Temporal Sentence Grounding (TSG). Compared to traditional …

Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment

A Xu, WS Zheng - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Weakly-supervised action segmentation is a task of learning to partition a long video into
several action segments where training videos are only accompanied by transcripts …

Timestamp-supervised Wearable-based Activity Segmentation and Recognition with Contrastive Learning and Order-Preserving Optimal Transport

S Xia, L Chu, L Pei, J Yang, W Yu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Human activity recognition (HAR) with wearables is one of the serviceable technologies in
ubiquitous and mobile computing applications. The sliding-window scheme is widely …