Causal reasoning meets visual representation learning: A prospective study

Y Liu, YS Wei, H Yan, GB Li, L Lin - Machine Intelligence Research, 2022 - Springer
Visual representation learning is ubiquitous in various real-world applications, including
visual comprehension, video understanding, multi-modal analysis, human-computer …

Beyond supervised learning for pervasive healthcare

X Gu, F Deligianni, J Han, X Liu, W Chen… - IEEE Reviews in …, 2023 - ieeexplore.ieee.org
The integration of machine/deep learning and sensing technologies is transforming
healthcare and medical practice. However, inherent limitations in healthcare data, namely …

Skeletonmae: graph-based masked autoencoder for skeleton sequence pre-training

H Yan, Y Liu, Y Wei, Z Li, G Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Skeleton sequence representation learning has shown great advantages for action
recognition due to its promising ability to model human joints and topology. However, the …

Cross-modal causal relational reasoning for event-level visual question answering

Y Liu, G Li, L Lin - IEEE Transactions on Pattern Analysis and …, 2023 - ieeexplore.ieee.org
Existing visual question answering methods often suffer from cross-modal spurious
correlations and oversimplified event-level reasoning processes that fail to capture event …

Tcgl: Temporal contrastive graph for self-supervised video representation learning

Y Liu, K Wang, L Liu, H Lan, L Lin - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video self-supervised learning is a challenging task, which requires significant expressive
power from the model to leverage rich spatial-temporal knowledge and generate effective …

[HTML][HTML] Video surveillance using deep transfer learning and deep domain adaptation: Towards better generalization

Y Himeur, S Al-Maadeed, H Kheddar… - … Applications of Artificial …, 2023 - Elsevier
Recently, developing automated video surveillance systems (VSSs) has become crucial to
ensure the security and safety of the population, especially during events involving large …

A combined multiple action recognition and summarization for surveillance video sequences

O Elharrouss, N Almaadeed, S Al-Maadeed… - Applied …, 2021 - Springer
Human action recognition and video summarization represent challenging tasks for several
computer vision applications including video surveillance, criminal investigations, and sports …

Semantics-aware adaptive knowledge distillation for sensor-to-vision action recognition

Y Liu, K Wang, G Li, L Lin - IEEE Transactions on Image …, 2021 - ieeexplore.ieee.org
Existing vision-based action recognition is susceptible to occlusion and appearance
variations, while wearable sensors can alleviate these challenges by capturing human …

Normality learning in multispace for video anomaly detection

Y Zhang, X Nie, R He, M Chen… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Video anomaly detection is a challenging task owing to the rare and diverse nature of
abnormal events. However, most of the existing methods only learn the normality in a single …

Simone: View-invariant, temporally-abstracted object representations via unsupervised video decomposition

R Kabra, D Zoran, G Erdogan… - Advances in …, 2021 - proceedings.neurips.cc
To help agents reason about scenes in terms of their building blocks, we wish to extract the
compositional structure of any given scene (in particular, the configuration and …