Temporal action segmentation: An analysis of modern techniques
Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …
minutes-long videos with multiple action classes. As a long-range video understanding task …
Error detection in egocentric procedural task videos
We present a new egocentric procedural error dataset containing videos with various types
of errors as well as normal videos and propose a new framework for procedural error …
of errors as well as normal videos and propose a new framework for procedural error …
Fact: Frame-action cross-attention temporal modeling for efficient action segmentation
Z Lu, E Elhamifar - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We study supervised action segmentation whose goal is to predict framewise action labels
of a video. To capture temporal dependencies over long horizons prior works either improve …
of a video. To capture temporal dependencies over long horizons prior works either improve …
Progress-aware online action segmentation for egocentric procedural task videos
Y Shen, E Elhamifar - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We address the problem of online action segmentation for egocentric procedural task
videos. While previous studies have mostly focused on offline action segmentation where …
videos. While previous studies have mostly focused on offline action segmentation where …
Permutation-aware activity segmentation via unsupervised frame-to-segment alignment
This paper presents an unsupervised transformer-based framework for temporal activity
segmentation which leverages not only frame-level cues but also segment-level cues. This …
segmentation which leverages not only frame-level cues but also segment-level cues. This …
Positive and Negative Set Designs in Contrastive Feature Learning for Temporal Action Segmentation
YC Chen, WT Chu - IEEE Transactions on Circuits and Systems …, 2024 - ieeexplore.ieee.org
When data labels are scarce, contrastive learning is often used to learn representations in a
weakly-supervised or unsupervised way. In contrastive learning, not only the learning …
weakly-supervised or unsupervised way. In contrastive learning, not only the learning …
Coherent Temporal Synthesis for Incremental Action Segmentation
Data replay is a successful incremental learning technique for images. It prevents
catastrophic forgetting by keeping a reservoir of previous data original or synthesized to …
catastrophic forgetting by keeping a reservoir of previous data original or synthesized to …
Learning temporal sentence grounding from narrated egovideos
The onset of long-form egocentric datasets such as Ego4D and EPIC-Kitchens presents a
new challenge for the task of Temporal Sentence Grounding (TSG). Compared to traditional …
new challenge for the task of Temporal Sentence Grounding (TSG). Compared to traditional …
Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment
A Xu, WS Zheng - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Weakly-supervised action segmentation is a task of learning to partition a long video into
several action segments where training videos are only accompanied by transcripts …
several action segments where training videos are only accompanied by transcripts …
Timestamp-supervised Wearable-based Activity Segmentation and Recognition with Contrastive Learning and Order-Preserving Optimal Transport
Human activity recognition (HAR) with wearables is one of the serviceable technologies in
ubiquitous and mobile computing applications. The sliding-window scheme is widely …
ubiquitous and mobile computing applications. The sliding-window scheme is widely …