Visual speech recognition using optical flow and hidden Markov model

N Koppula, KS Rao, SA Nabi, A Balaram - Wireless Personal …, 2023 - Springer

Speech is a unique characteristic of humans that expresses one's emotional viewpoint to
others. Speech emotion recognition (SER) identifies the speaker's emotion from the speech …

被引用次数：16 相关文章所有 5 个版本

Awfc: Preventing label flipping attacks towards federated learning for intelligent iot

Z Lv, H Cao, F Zhang, Y Ren, B Wang… - The Computer …, 2022 - academic.oup.com

Centralized machine learning methods require the aggregation of data collected from
clients. Due to the awareness of data privacy, however, the aggregation of raw data …

被引用次数：11 相关文章所有 3 个版本

[PDF] smu.edu.sg

CATNet: Cross-modal fusion for audio–visual speech recognition

X Wang, J Mi, B Li, Y Zhao, J Meng - Pattern Recognition Letters, 2024 - Elsevier

Automatic speech recognition (ASR) is a typical pattern recognition technology that converts
human speeches into texts. With the aid of advanced deep learning models, the …

被引用次数：5 相关文章所有 4 个版本

Prevention of gan-based privacy inferring attacks towards federated learning

H Cao, Y Zhu, Y Ren, B Wang, M Hu, W Wang… - International Conference …, 2022 - Springer

With the increasing amount of data, data privacy has drawn great concern in machine
learning among the public. Federated Learning, which is a new kind of distributed learning …

被引用次数：4 相关文章所有 2 个版本

[PDF] wiley.com Full View

Control system and speech recognition of exhibition hall digital media based on computer technology

Y Zhao - Mobile Information Systems, 2022 - Wiley Online Library

With environmental noise in the exhibition hall, speakers tend to change their speech
production to preserve intelligible communication. While great evolution has been prepared …

被引用次数：3 相关文章所有 4 个版本

Spatio-temporal Weber Gradient Directional feature for visual and audio-visual phrase recognition systems

S Nandakishor, D Pati - International Journal of Information Technology, 2024 - Springer

Visual phrase recognition needs lip movement related visual features, while audio-visual
phrase recognition requires both acoustic and visual features. In this work, we propose a …

[PDF] sciendo.com

[PDF][PDF] HMM-based phoneme speech recognition system for the control and command of industrial robots

A Naik - Technical Transactions, 2021 - sciendo.com

In recent years, the integration of human-robot interaction with speech recognition has
gained a lot of pace in the manufacturing industries. Conventional methods to control the …

被引用次数：5 相关文章所有 18 个版本

[PDF] spiedigitallibrary.org Full View

Siamese decoupling network for speaker-independent lipreading

L Lu, X Xu, J Fu - Journal of Electronic Imaging, 2022 - spiedigitallibrary.org

Lipreading aims to decode the speech content from a moving mouth. It is a very challenging
task because lip appearance variations and speech contents are coupled together in the …

被引用次数：1 相关文章所有 3 个版本

LIP reading for specially abled persons

N Sukritha, M Mohan - 2021 12th International Conference on …, 2021 - ieeexplore.ieee.org

Lip Reading is also called Visual Speech Recognition, is a technique that utilizes only the
visual scene of speech recognition. The proposed framework deals with three main steps …

被引用次数：1 相关文章

Articulator Muscle Contraction Evaluation in Vowel Phonemes Articulation Based on Facial Images and Electromyography Signal

FL Sadida, MI Mandasari - 2023 8th International …, 2023 - ieeexplore.ieee.org

Utilization of motor signal information of articulatory muscles and facial images separately in
speech recognition shows that these two pieces of information are related to the articulation …