Embodied navigation with multi-modal information: A survey from tasks to methodology

Y Wu, P Zhang, M Gu, J Zheng, X Bai - Information Fusion, 2024 - Elsevier
Embodied AI aims to create agents that complete complex tasks by interacting with the
environment. A key problem in this field is embodied navigation which understands multi …

PAR-Net: An Enhanced Dual-Stream CNN–ESN Architecture for Human Physical Activity Recognition

IU Khan, JW Lee - Sensors, 2024 - mdpi.com
Physical exercise affects many facets of life, including mental health, social interaction,
physical fitness, and illness prevention, among many others. Therefore, several AI-driven …

VAX: Using Existing Video and Audio-based Activity Recognition Models to Bootstrap Privacy-Sensitive Sensors

P Patidar, M Goel, Y Agarwal - Proceedings of the ACM on Interactive …, 2023 - dl.acm.org
The use of audio and video modalities for Human Activity Recognition (HAR) is common,
given the richness of the data and the availability of pre-trained ML models using a large …

A work-related musculoskeletal disorders (wmsds) risk-assessment system using a single-view pose estimation model

YJ Kwon, DH Kim, BC Son, KH Choi, S Kwak… - International Journal of …, 2022 - mdpi.com
Musculoskeletal disorders are an unavoidable occupational health problem. In particular,
workers who perform repetitive tasks onsite in the manufacturing industry suffer from …

Automated, high-throughput image calibration for parallel-laser photogrammetry

JL Richardson, EJ Levy, R Ranjithkumar, H Yang… - Mammalian …, 2022 - Springer
Parallel-laser photogrammetry is growing in popularity as a way to collect non-invasive body
size data from wild mammals. Despite its many appeals, this method requires researchers to …

Quantitative physical ergonomics assessment of teleoperation interfaces

S Gholami, M Lorenzini, E De Momi… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Human factors and ergonomics are the essential constituents of teleoperation interfaces,
which can significantly affect the human operator's performance. Thus, a quantitative …

Realishuman: A two-stage approach for refining malformed human parts in generated images

B Wang, J Zhou, J Bai, Y Yang, W Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, diffusion models have revolutionized visual generation, outperforming
traditional frameworks like Generative Adversarial Networks (GANs). However, generating …

[HTML][HTML] Enhancing Badminton Game Analysis: An Approach to Shot Refinement via a Fusion of Shuttlecock Tracking and Hit Detection from Monocular Camera

YH Hsu, CC Yu, HY Cheng - Sensors, 2024 - mdpi.com
Extracting the flight trajectory of the shuttlecock in a single turn in badminton games is
important for automated sports analytics. This study proposes a novel method to extract …

Privacy-Preserving Video Anomaly Detection: A Survey

J Liu, Y Liu, X Zhu - arXiv preprint arXiv:2411.14565, 2024 - arxiv.org
Video Anomaly Detection (VAD) aims to automatically analyze spatiotemporal patterns in
surveillance videos collected from open spaces to detect anomalous events that may cause …

optNet-50: An Optimized Residual Neural Network Architecture of Deep Learning for Driver's Distraction

T Abbas, SF Ali, AZ Khan… - 2020 IEEE 23rd …, 2020 - ieeexplore.ieee.org
Over the last few decades, human facial recognition has gained significant popularity in
areas ranging from surveillance, tracking, and access control to more recent developments …