An adaptive approach for lip-reading using image and depth data

A Fernandez-Lopez, FM Sukno - Image and Vision Computing, 2018 - Elsevier

In the last few years, there has been an increasing interest in developing systems for
Automatic Lip-Reading (ALR). Similarly to other computer vision applications, methods …

Cited by 156 - Related articles - All 3 versions

mSilent: Towards general corpus silent speech recognition using COTS mmWave radar

S Zeng, H Wan, S Shi, W Wang - Proceedings of the ACM on Interactive …, 2023 - dl.acm.org

Silent speech recognition (SSR) allows users to speak to the device without making a
sound, avoiding being overheard or disturbing others. Compared to the video-based …

Cited by 14 - Related articles

Analyzing lower half facial gestures for lip reading applications: Survey on vision techniques

SJ Preethi - Computer Vision and Image Understanding, 2023 - Elsevier

Lip reading has gained popularity due to the proliferation of emerging real-world
applications. This article provides a comprehensive review of benchmark datasets available …

Cited by 8 - Related articles - All 2 versions

Lip reading using CNN and LSTM

A Garg, J Noyola, S Bagadia - Technical report, Stanford …, 2016 - vision.stanford.edu

Here we present various methods to predict words and phrases from only video without any
audio signal. We employ a VGGNet pre-trained on human faces of celebrities from IMDB …

Cited by 82 - Related articles - All 2 versions

Human motion tracking using 3d image features with a long short-term memory mechanism model—an example of forward reaching

KY Chen, LW Chou, HM Lee, ST Young, CH Lin… - Sensors, 2021 - mdpi.com

Human motion tracking is widely applied to rehabilitation tasks, and inertial measurement
unit (IMU) sensors are a well-known approach for recording motion behavior. IMU sensors …

Cited by 9 - Related articles - All 11 versions

Deep hybrid architectures and DenseNet35 in speaker-dependent visual speech recognition

PJ Seegehalli, BN Krupa - Signal, Image and Video Processing, 2024 - Springer

Visual speech recognition (VSR) translates the visual speech cues into transcription.
Speaker-dependent VSR (SD-VSR) can be used for authentication and secure human …

Cited by 1 - Related articles

Human machine interaction via visual speech spotting

A Rekik, A Ben-Hamadou, W Mahdi - … 2015, Catania, Italy, October 26-29 …, 2015 - Springer

In this paper, we propose an automatic visual speech spotting system adapted for RGB-D
cameras and based on Hidden Markov Models (HMMs). Our system is based on two main …

Cited by 33 - Related articles - All 2 versions

A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation

L Liu, L Gao, W Lei, F Ma, X Lin, J Wang - arXiv preprint arXiv:2308.08849, 2023 - arxiv.org

Body language (BL) refers to the non-verbal communication expressed through physical
movements, gestures, facial expressions, and postures. It is a form of communication that …

Cited by 3 - Related articles - All 2 versions

A survey of visual feature extraction methods for lip reading [唇语识别的视觉特征提取方法综述]

Ma Jinlin, Gong Yuanwen, Ma Ziping, Chen Deguang, Zhu Yanbin… - Computer Science and …, 2021 - scholar.archive.org

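Existing lip-reading research mostly focuses on improving recognition accuracy and on studying multimodal input features, paying little attention to improving the effectiveness
of lip visual features. Yet the visual information of the lips plays a key role in visual speech recognition and lip reading …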

Cited by 2 - Related articles - All 2 versions

The state of the art and prospects of lip reading

C Xiao-Ding, S Chang-Chong, K Gang-Yao, L Li - Acta Autom. Sin, 2020 - aas.net.cn

Lip reading, also known as visual speech recognition, aims to infer the content of a speech
through the motion of the speaker's mouth. Lip reading is an important issue in the field of …

Cited by 3 - Related articles - All 2 versions