Causal reasoning meets visual representation learning: A prospective study
Visual representation learning is ubiquitous in various real-world applications, including
visual comprehension, video understanding, multi-modal analysis, human-computer …
visual comprehension, video understanding, multi-modal analysis, human-computer …
Beyond supervised learning for pervasive healthcare
The integration of machine/deep learning and sensing technologies is transforming
healthcare and medical practice. However, inherent limitations in healthcare data, namely …
healthcare and medical practice. However, inherent limitations in healthcare data, namely …
Skeletonmae: graph-based masked autoencoder for skeleton sequence pre-training
Skeleton sequence representation learning has shown great advantages for action
recognition due to its promising ability to model human joints and topology. However, the …
recognition due to its promising ability to model human joints and topology. However, the …
Cross-modal causal relational reasoning for event-level visual question answering
Existing visual question answering methods often suffer from cross-modal spurious
correlations and oversimplified event-level reasoning processes that fail to capture event …
correlations and oversimplified event-level reasoning processes that fail to capture event …
Tcgl: Temporal contrastive graph for self-supervised video representation learning
Video self-supervised learning is a challenging task, which requires significant expressive
power from the model to leverage rich spatial-temporal knowledge and generate effective …
power from the model to leverage rich spatial-temporal knowledge and generate effective …
[HTML][HTML] Video surveillance using deep transfer learning and deep domain adaptation: Towards better generalization
Y Himeur, S Al-Maadeed, H Kheddar… - … Applications of Artificial …, 2023 - Elsevier
Recently, developing automated video surveillance systems (VSSs) has become crucial to
ensure the security and safety of the population, especially during events involving large …
ensure the security and safety of the population, especially during events involving large …
A combined multiple action recognition and summarization for surveillance video sequences
Human action recognition and video summarization represent challenging tasks for several
computer vision applications including video surveillance, criminal investigations, and sports …
computer vision applications including video surveillance, criminal investigations, and sports …
Semantics-aware adaptive knowledge distillation for sensor-to-vision action recognition
Existing vision-based action recognition is susceptible to occlusion and appearance
variations, while wearable sensors can alleviate these challenges by capturing human …
variations, while wearable sensors can alleviate these challenges by capturing human …
Normality learning in multispace for video anomaly detection
Video anomaly detection is a challenging task owing to the rare and diverse nature of
abnormal events. However, most of the existing methods only learn the normality in a single …
abnormal events. However, most of the existing methods only learn the normality in a single …
Simone: View-invariant, temporally-abstracted object representations via unsupervised video decomposition
To help agents reason about scenes in terms of their building blocks, we wish to extract the
compositional structure of any given scene (in particular, the configuration and …
compositional structure of any given scene (in particular, the configuration and …