Survey on automatic lip-reading in the era of deep learning

A Fernandez-Lopez, FM Sukno - Image and Vision Computing, 2018 - Elsevier
In the last few years, there has been an increasing interest in developing systems for
Automatic Lip-Reading (ALR). Similarly to other computer vision applications, methods …

A survey of research on lipreading technology

M Hao, M Mamut, N Yadikar, A Aysa, K Ubul - IEEE Access, 2020 - ieeexplore.ieee.org
Although automatic speech recognition (ASR) technology is mature, there are still some
unsolved problems, such as how to accurately identify what the speaker is saying in a noisy …

EchoSpeech: Continuous Silent Speech Recognition on Minimally-obtrusive Eyewear Powered by Acoustic Sensing

R Zhang, K Li, Y Hao, Y Wang, Z Lai… - Proceedings of the …, 2023 - dl.acm.org
We present EchoSpeech, a minimally-obtrusive silent speech interface (SSI) powered by
low-power active acoustic sensing. EchoSpeech uses speakers and microphones mounted …

Lip reading sentences using deep learning with only visual cues

S Fenghour, D Chen, K Guo, P Xiao - IEEE Access, 2020 - ieeexplore.ieee.org
In this paper, a neural network-based lip reading system is proposed. The system is lexicon-
free and uses purely visual cues. With only a limited number of visemes as classes to …

Lipformer: learning to lipread unseen speakers based on visual-landmark transformers

F Xue, Y Li, D Liu, Y Xie, L Wu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Lipreading refers to understanding and further translating the speech of a video speaker into
textual outputs. State-of-the-art lipreading methods excel in interpreting overlap speakers, ie …

Speechin: A smart necklace for silent speech recognition

R Zhang, M Chen, B Steeper, Y Li, Z Yan… - Proceedings of the …, 2021 - dl.acm.org
This paper presents SpeeChin, a smart necklace that can recognize 54 English and 44
Chinese silent speech commands. A customized infrared (IR) imaging system is mounted on …

A Lightweight Driver Drowsiness Detection System Using 3DCNN With LSTM.

SA Alameen, AM Alhothali - Computer Systems Science & …, 2023 - search.ebscohost.com
Today, fatalities, physical injuries, and significant economic losses occur due to car
accidents. Among the leading causes of car accidents is drowsiness behind the wheel …

Improving the DBLSTM for on-line Arabic handwriting recognition

R Maalej, M Kherallah - Multimedia Tools and Applications, 2020 - Springer
Various applications involved in the computer recognition of pen-input handwritten words,
such as the online form filling, text editing, note taking, and so on. Therefore, a great deal of …

End-to-end visual speech recognition for small-scale datasets

S Petridis, Y Wang, P Ma, Z Li, M Pantic - Pattern Recognition Letters, 2020 - Elsevier
Visual speech recognition models traditionally consist of two stages, feature extraction and
classification. Several deep learning approaches have been recently presented aiming to …

Robust dual-modal speech keyword spotting for XR headsets

Z Cai, Y Ma, F Lu - IEEE Transactions on Visualization and …, 2024 - ieeexplore.ieee.org
While speech interaction finds widespread utility within the Extended Reality (XR) domain,
conventional vocal speech keyword spotting systems continue to grapple with formidable …