A novel optimized recurrent network-based automatic system for speech emotion identification

N Koppula, KS Rao, SA Nabi, A Balaram - Wireless Personal …, 2023 - Springer
Speech is a unique characteristic of humans that expresses one's emotional viewpoint to
others. Speech emotion recognition (SER) identifies the speaker's emotion from the speech …

Awfc: Preventing label flipping attacks towards federated learning for intelligent iot

Z Lv, H Cao, F Zhang, Y Ren, B Wang… - The Computer …, 2022 - academic.oup.com
Centralized machine learning methods require the aggregation of data collected from
clients. Due to the awareness of data privacy, however, the aggregation of raw data …

CATNet: Cross-modal fusion for audio–visual speech recognition

X Wang, J Mi, B Li, Y Zhao, J Meng - Pattern Recognition Letters, 2024 - Elsevier
Automatic speech recognition (ASR) is a typical pattern recognition technology that converts
human speeches into texts. With the aid of advanced deep learning models, the …

Prevention of gan-based privacy inferring attacks towards federated learning

H Cao, Y Zhu, Y Ren, B Wang, M Hu, W Wang… - International Conference …, 2022 - Springer
With the increasing amount of data, data privacy has drawn great concern in machine
learning among the public. Federated Learning, which is a new kind of distributed learning …

Control system and speech recognition of exhibition hall digital media based on computer technology

Y Zhao - Mobile Information Systems, 2022 - Wiley Online Library
With environmental noise in the exhibition hall, speakers tend to change their speech
production to preserve intelligible communication. While great evolution has been prepared …

Spatio-temporal Weber Gradient Directional feature for visual and audio-visual phrase recognition systems

S Nandakishor, D Pati - International Journal of Information Technology, 2024 - Springer
Visual phrase recognition needs lip movement related visual features, while audio-visual
phrase recognition requires both acoustic and visual features. In this work, we propose a …

[PDF][PDF] HMM-based phoneme speech recognition system for the control and command of industrial robots

A Naik - Technical Transactions, 2021 - sciendo.com
In recent years, the integration of human-robot interaction with speech recognition has
gained a lot of pace in the manufacturing industries. Conventional methods to control the …

Siamese decoupling network for speaker-independent lipreading

L Lu, X Xu, J Fu - Journal of Electronic Imaging, 2022 - spiedigitallibrary.org
Lipreading aims to decode the speech content from a moving mouth. It is a very challenging
task because lip appearance variations and speech contents are coupled together in the …

LIP reading for specially abled persons

N Sukritha, M Mohan - 2021 12th International Conference on …, 2021 - ieeexplore.ieee.org
Lip Reading is also called Visual Speech Recognition, is a technique that utilizes only the
visual scene of speech recognition. The proposed framework deals with three main steps …

Articulator Muscle Contraction Evaluation in Vowel Phonemes Articulation Based on Facial Images and Electromyography Signal

FL Sadida, MI Mandasari - 2023 8th International …, 2023 - ieeexplore.ieee.org
Utilization of motor signal information of articulatory muscles and facial images separately in
speech recognition shows that these two pieces of information are related to the articulation …