Exploration of Attention Mechanism-Enhanced Deep Learning Models in the Mining of Medical Textual Data

L Xiao, M Li, Y Feng, M Wang, Z Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
The research explores the utilization of a deep learning model employing an attention
mechanism in medical text mining. It targets the challenge of analyzing unstructured text …

pix2gestalt: Amodal segmentation by synthesizing wholes

E Ozguroglu, R Liu, D Surís, D Chen, A Dave… - 2024 IEEE/CVF …, 2024 - computer.org
We introduce pix2gestalt, a framework for zero-shot amodal segmentation, which learns to
estimate the shape and appearance of whole objects that are only partially visible behind …

Deep learning for 3D human pose estimation and mesh recovery: A survey

Y Liu, C Qiu, Z Zhang - Neurocomputing, 2024 - Elsevier
3D human pose estimation and mesh recovery have attracted widespread research
interest in many areas, such as computer vision, autonomous driving, and robotics. Deep …

RoHM: Robust Human Motion Reconstruction via Diffusion

S Zhang, BL Bhatnagar, Y Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
We propose RoHM, an approach for robust 3D human motion reconstruction from monocular
RGB(-D) videos in the presence of noise and occlusions. Most previous approaches either …

Application of multimodal fusion deep learning model in disease recognition

X Liu, H Qiu, M Li, Z Yu, Y Yang… - 2024 IEEE 2nd …, 2024 - ieeexplore.ieee.org
This paper introduces an innovative multi-modal fusion deep learning approach to
overcome the drawbacks of traditional single-modal recognition techniques. These …

Enhancing Medical Imaging with GANs Synthesizing Realistic Images from Limited Data

Y Feng, B Zhang, L Xiao, Y Yang… - 2024 IEEE 4th …, 2024 - ieeexplore.ieee.org
In this research, we introduce an innovative method for synthesizing medical images using
generative adversarial networks (GANs). Our proposed GANs method demonstrates the …

Deep learning-based lung medical image recognition

X Fei, Y Wang, L Dai, M Sui - International Journal of …, 2024 - ijircst.irpublications.org
Pulmonary nodules serve as critical indicators for early lung cancer diagnosis, making their
detection and classification essential. The prevalent use of transfer learning in recognition …

PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos

Y Zhang, JO Kephart, Z Cui, Q Ji - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
While current methods have shown promising progress on estimating 3D human motion
from monocular videos, their motion estimates are often physically unrealistic because they …

Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning

J Jeong, D Park, KJ Yoon - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Human pose forecasting garners attention for its diverse applications. However, challenges
in modeling the multi-modal nature of human motion and intricate interactions among agents …

Neural textured deformable meshes for robust analysis-by-synthesis

A Wang, W Ma, A Yuille… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Human vision demonstrates higher robustness than current AI algorithms under out-of-
distribution scenarios. It has been conjectured that such robustness benefits from performing …