Embodied navigation with multi-modal information: A survey from tasks to methodology
Embodied AI aims to create agents that complete complex tasks by interacting with the
environment. A key problem in this field is embodied navigation which understands multi …
environment. A key problem in this field is embodied navigation which understands multi …
PAR-Net: An Enhanced Dual-Stream CNN–ESN Architecture for Human Physical Activity Recognition
IU Khan, JW Lee - Sensors, 2024 - mdpi.com
Physical exercise affects many facets of life, including mental health, social interaction,
physical fitness, and illness prevention, among many others. Therefore, several AI-driven …
physical fitness, and illness prevention, among many others. Therefore, several AI-driven …
VAX: Using Existing Video and Audio-based Activity Recognition Models to Bootstrap Privacy-Sensitive Sensors
The use of audio and video modalities for Human Activity Recognition (HAR) is common,
given the richness of the data and the availability of pre-trained ML models using a large …
given the richness of the data and the availability of pre-trained ML models using a large …
A work-related musculoskeletal disorders (wmsds) risk-assessment system using a single-view pose estimation model
Musculoskeletal disorders are an unavoidable occupational health problem. In particular,
workers who perform repetitive tasks onsite in the manufacturing industry suffer from …
workers who perform repetitive tasks onsite in the manufacturing industry suffer from …
Automated, high-throughput image calibration for parallel-laser photogrammetry
JL Richardson, EJ Levy, R Ranjithkumar, H Yang… - Mammalian …, 2022 - Springer
Parallel-laser photogrammetry is growing in popularity as a way to collect non-invasive body
size data from wild mammals. Despite its many appeals, this method requires researchers to …
size data from wild mammals. Despite its many appeals, this method requires researchers to …
Quantitative physical ergonomics assessment of teleoperation interfaces
Human factors and ergonomics are the essential constituents of teleoperation interfaces,
which can significantly affect the human operator's performance. Thus, a quantitative …
which can significantly affect the human operator's performance. Thus, a quantitative …
Realishuman: A two-stage approach for refining malformed human parts in generated images
In recent years, diffusion models have revolutionized visual generation, outperforming
traditional frameworks like Generative Adversarial Networks (GANs). However, generating …
traditional frameworks like Generative Adversarial Networks (GANs). However, generating …
[HTML][HTML] Enhancing Badminton Game Analysis: An Approach to Shot Refinement via a Fusion of Shuttlecock Tracking and Hit Detection from Monocular Camera
Extracting the flight trajectory of the shuttlecock in a single turn in badminton games is
important for automated sports analytics. This study proposes a novel method to extract …
important for automated sports analytics. This study proposes a novel method to extract …
Privacy-Preserving Video Anomaly Detection: A Survey
J Liu, Y Liu, X Zhu - arXiv preprint arXiv:2411.14565, 2024 - arxiv.org
Video Anomaly Detection (VAD) aims to automatically analyze spatiotemporal patterns in
surveillance videos collected from open spaces to detect anomalous events that may cause …
surveillance videos collected from open spaces to detect anomalous events that may cause …
optNet-50: An Optimized Residual Neural Network Architecture of Deep Learning for Driver's Distraction
Over the last few decades, human facial recognition has gained significant popularity in
areas ranging from surveillance, tracking, and access control to more recent developments …
areas ranging from surveillance, tracking, and access control to more recent developments …