End-to-end low-resource lip-reading with maxout CNN and LSTM

A Fernandez-Lopez, FM Sukno - Image and Vision Computing, 2018 - Elsevier

In the last few years, there has been an increasing interest in developing systems for
Automatic Lip-Reading (ALR). Similarly to other computer vision applications, methods …

被引用次数：156 相关文章所有 3 个版本

[PDF] ieee.org

A survey of research on lipreading technology

M Hao, M Mamut, N Yadikar, A Aysa, K Ubul - IEEE Access, 2020 - ieeexplore.ieee.org

Although automatic speech recognition (ASR) technology is mature, there are still some
unsolved problems, such as how to accurately identify what the speaker is saying in a noisy …

被引用次数：46 相关文章所有 3 个版本

EchoSpeech: Continuous Silent Speech Recognition on Minimally-obtrusive Eyewear Powered by Acoustic Sensing

R Zhang, K Li, Y Hao, Y Wang, Z Lai… - Proceedings of the …, 2023 - dl.acm.org

We present EchoSpeech, a minimally-obtrusive silent speech interface (SSI) powered by
low-power active acoustic sensing. EchoSpeech uses speakers and microphones mounted …

被引用次数：27 相关文章

[PDF] ieee.org

Lip reading sentences using deep learning with only visual cues

S Fenghour, D Chen, K Guo, P Xiao - IEEE Access, 2020 - ieeexplore.ieee.org

In this paper, a neural network-based lip reading system is proposed. The system is lexicon-
free and uses purely visual cues. With only a limited number of visemes as classes to …

被引用次数：63 相关文章所有 3 个版本

[PDF] arxiv.org

Lipformer: learning to lipread unseen speakers based on visual-landmark transformers

F Xue, Y Li, D Liu, Y Xie, L Wu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Lipreading refers to understanding and further translating the speech of a video speaker into
textual outputs. State-of-the-art lipreading methods excel in interpreting overlap speakers, ie …

被引用次数：14 相关文章所有 4 个版本

[PDF] czhang.org

Speechin: A smart necklace for silent speech recognition

R Zhang, M Chen, B Steeper, Y Li, Z Yan… - Proceedings of the …, 2021 - dl.acm.org

This paper presents SpeeChin, a smart necklace that can recognize 54 English and 44
Chinese silent speech commands. A customized infrared (IR) imaging system is mounted on …

被引用次数：26 相关文章所有 3 个版本

A Lightweight Driver Drowsiness Detection System Using 3DCNN With LSTM.

SA Alameen, AM Alhothali - Computer Systems Science & …, 2023 - search.ebscohost.com

Today, fatalities, physical injuries, and significant economic losses occur due to car
accidents. Among the leading causes of car accidents is drowsiness behind the wheel …

被引用次数：17 相关文章

Improving the DBLSTM for on-line Arabic handwriting recognition

R Maalej, M Kherallah - Multimedia Tools and Applications, 2020 - Springer

Various applications involved in the computer recognition of pen-input handwritten words,
such as the online form filling, text editing, note taking, and so on. Therefore, a great deal of …

被引用次数：38 相关文章所有 4 个版本

[PDF] arxiv.org

End-to-end visual speech recognition for small-scale datasets

S Petridis, Y Wang, P Ma, Z Li, M Pantic - Pattern Recognition Letters, 2020 - Elsevier

Visual speech recognition models traditionally consist of two stages, feature extraction and
classification. Several deep learning approaches have been recently presented aiming to …

被引用次数：49 相关文章所有 9 个版本

[PDF] arxiv.org

Robust dual-modal speech keyword spotting for XR headsets

Z Cai, Y Ma, F Lu - IEEE Transactions on Visualization and …, 2024 - ieeexplore.ieee.org

While speech interaction finds widespread utility within the Extended Reality (XR) domain,
conventional vocal speech keyword spotting systems continue to grapple with formidable …

被引用次数：4 相关文章所有 7 个版本