Combining multiple views for visual speech recognition

A method of amino acid terahertz spectrum recognition based on the convolutional neural network and bidirectional gated recurrent network model

T Li, Y Xu, J Luo, J He, S Lin - Scientific Programming, 2021 - Wiley Online Library

In order to improve the accuracy of amino acid identification, a model based on the
convolutional neural network (CNN) and bidirectional gated recurrent network (BiGRU) is …

Cited by 16 · Related articles · All 6 versions

[PDF] mdpi.com

End-to-end sentence-level multi-view lipreading architecture with spatial attention module integrated multiple CNNs and cascaded local self-attention-CTC

S Jeon, MS Kim - Sensors, 2022 - mdpi.com

Concomitant with the recent advances in deep learning, automatic speech recognition and
visual speech recognition (VSR) have received considerable attention. However, although …

Cited by 8 · Related articles · All 7 versions

[PDF] mdpi.com

Multi-angle lipreading with angle classification-based feature extraction and its application to audio-visual speech recognition

S Isobe, S Tamura, S Hayamizu, Y Gotoh, M Nose - Future Internet, 2021 - mdpi.com

Recently, automatic speech recognition (ASR) and visual speech recognition (VSR) have
been widely researched owing to the development in deep learning. Most VSR research …

Cited by 12 · Related articles · All 8 versions

[PDF] researchgate.net

Deep view2view mapping for view-invariant lipreading

A Koumparoulis, G Potamianos - 2018 IEEE Spoken Language …, 2018 - ieeexplore.ieee.org

Recently, visual-only and audio-visual speech recognition have made significant progress
thanks to deep-learning based, trainable visual front-ends (VFEs), with most research …

Cited by 17 · Related articles · All 3 versions

[PDF] ssrn.com

Improving speech recognition performance using spectral subtraction with artificial neural network

J Umamaheswari, A Akila - International Journal of Advanced …, 2018 - papers.ssrn.com

This study proposes a new technique for speech enhancement using Time-Delay Neural
Network Spectral Subtraction (TDNN-SS) in the presence of background noises. Spectral …

Cited by 5 · Related articles

[PDF] iccspa.org

Multi-angle lipreading using angle classification and angle-specific feature integration

S Isobe, S Tamura, S Hayamizu… - … Processing, and their …, 2021 - ieeexplore.ieee.org

Recently, visual speech recognition (VSR), or namely lipreading, has been widely
researched due to development of Deep Learning (DL). The most lipreading researches …

Cited by 4 · Related articles · All 4 versions

[CITATION][C] Visual Lip-Reading for Arabic Alphabets and Quranic Words using Deep Learning

NF Aljohani - 2023 - King Abdulaziz University