Egocentric video task translation
Different video understanding tasks are typically treated in isolation, and even with distinct
types of curated data (eg, classifying sports in one dataset, tracking animals in another) …
types of curated data (eg, classifying sports in one dataset, tracking animals in another) …
Egocentric video task translation@ ego4d challenge 2022
This technical report describes the EgoTask Translation approach that explores relations
among a set of egocentric video tasks in the Ego4D challenge. To improve the primary task …
among a set of egocentric video tasks in the Ego4D challenge. To improve the primary task …
Efficient video representation learning via motion-aware token selection
Recently emerged Masked Video Modeling techniques demonstrated their potential by
significantly outperforming previous methods in self-supervised learning for video. However …
significantly outperforming previous methods in self-supervised learning for video. However …
Masked Autoencoders for Egocentric Video Understanding@ Ego4D Challenge 2022
In this report, we present our approach and empirical results of applying masked
autoencoders in two egocentric video understanding tasks, namely, Object State Change …
autoencoders in two egocentric video understanding tasks, namely, Object State Change …
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens
Masked video autoencoder approaches have demonstrated their potential by significantly
outperforming previous self-supervised learning methods in video representation learning …
outperforming previous self-supervised learning methods in video representation learning …