A method of amino acid terahertz spectrum recognition based on the convolutional neural network and bidirectional gated recurrent network model
T Li, Y Xu, J Luo, J He, S Lin - Scientific Programming, 2021 - Wiley Online Library
In order to improve the accuracy of amino acid identification, a model based on the
convolutional neural network (CNN) and bidirectional gated recurrent network (BiGRU) is …
End-to-end sentence-level multi-view lipreading architecture with spatial attention module integrated multiple CNNs and cascaded local self-attention-CTC
S Jeon, MS Kim - Sensors, 2022 - mdpi.com
Concomitant with the recent advances in deep learning, automatic speech recognition and
visual speech recognition (VSR) have received considerable attention. However, although …
Multi-angle lipreading with angle classification-based feature extraction and its application to audio-visual speech recognition
S Isobe, S Tamura, S Hayamizu, Y Gotoh, M Nose - Future Internet, 2021 - mdpi.com
Recently, automatic speech recognition (ASR) and visual speech recognition (VSR) have
been widely researched owing to the development in deep learning. Most VSR research …
Deep view2view mapping for view-invariant lipreading
A Koumparoulis, G Potamianos - 2018 IEEE Spoken Language …, 2018 - ieeexplore.ieee.org
Recently, visual-only and audio-visual speech recognition have made significant progress
thanks to deep-learning based, trainable visual front-ends (VFEs), with most research …
Improving speech recognition performance using spectral subtraction with artificial neural network
J Umamaheswari, A Akila - International Journal of Advanced …, 2018 - papers.ssrn.com
This study proposes a new technique for speech enhancement using Time-Delay Neural
Network Spectral Subtraction (TDNN-SS) in the presence of background noises. Spectral …
Multi-angle lipreading using angle classification and angle-specific feature integration
S Isobe, S Tamura, S Hayamizu… - … Processing, and their …, 2021 - ieeexplore.ieee.org
Recently, visual speech recognition (VSR), or namely lipreading, has been widely
researched due to development of Deep Learning (DL). The most lipreading researches …