Survey on automatic lip-reading in the era of deep learning

A Fernandez-Lopez, FM Sukno - Image and Vision Computing, 2018 - Elsevier
In the last few years, there has been an increasing interest in developing systems for
Automatic Lip-Reading (ALR). Similarly to other computer vision applications, methods …

mSilent: Towards general corpus silent speech recognition using COTS mmWave radar

S Zeng, H Wan, S Shi, W Wang - Proceedings of the ACM on Interactive …, 2023 - dl.acm.org
Silent speech recognition (SSR) allows users to speak to the device without making a
sound, avoiding being overheard or disturbing others. Compared to the video-based …

Analyzing lower half facial gestures for lip reading applications: Survey on vision techniques

SJ Preethi - Computer Vision and Image Understanding, 2023 - Elsevier
Lip reading has gained popularity due to the proliferation of emerging real-world
applications. This article provides a comprehensive review of benchmark datasets available …

[PDF][PDF] Lip reading using CNN and LSTM

A Garg, J Noyola, S Bagadia - Technical report, Stanford …, 2016 - vision.stanford.edu
Here we present various methods to predict words and phrases from only video without any
audio signal. We employ a VGGNet pre-trained on human faces of celebrities from IMDB …

Human motion tracking using 3d image features with a long short-term memory mechanism model—an example of forward reaching

KY Chen, LW Chou, HM Lee, ST Young, CH Lin… - Sensors, 2021 - mdpi.com
Human motion tracking is widely applied to rehabilitation tasks, and inertial measurement
unit (IMU) sensors are a well-known approach for recording motion behavior. IMU sensors …

Deep hybrid architectures and DenseNet35 in speaker-dependent visual speech recognition

PJ Seegehalli, BN Krupa - Signal, Image and Video Processing, 2024 - Springer
Visual speech recognition (VSR) translates the visual speech cues into transcription.
Speaker-dependent VSR (SD-VSR) can be used for authentication and secure human …

Human machine interaction via visual speech spotting

A Rekik, A Ben-Hamadou, W Mahdi - … 2015, Catania, Italy, October 26-29 …, 2015 - Springer
In this paper, we propose an automatic visual speech spotting system adapted for RGB-D
cameras and based on Hidden Markov Models (HMMs). Our system is based on two main …

A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation

L Liu, L Gao, W Lei, F Ma, X Lin, J Wang - arXiv preprint arXiv:2308.08849, 2023 - arxiv.org
Body language (BL) refers to the non-verbal communication expressed through physical
movements, gestures, facial expressions, and postures. It is a form of communication that …

[PDF][PDF] 唇语识别的视觉特征提取方法综述

马金林, 巩元文, 马自萍, 陈德光, 朱艳彬… - 计算机科学与 …, 2021 - scholar.archive.org
现有唇语识别研究多专注于提高识别精度, 研究多模态输入特征等方面, 对提高唇部视觉特征的
有效性关注不多. 而唇部的视觉信息在视觉语音识别和唇语识别中起着关键作用 …

[PDF][PDF] The state of the art and prospects of lip reading

C Xiao-Ding, S Chang-Chong, K Gang-Yao, L Li - Acta Autom. Sin, 2020 - aas.net.cn
Lip reading, also known as visual speech recognition, aims to infer the content of a speech
through the motion of the speakers mouth. Lip reading is an important issue in the field of …