A novel optimized recurrent network-based automatic system for speech emotion identification
Speech is a unique characteristic of humans that expresses one's emotional viewpoint to
others. Speech emotion recognition (SER) identifies the speaker's emotion from the speech …
AWFC: Preventing label flipping attacks towards federated learning for intelligent IoT
Z Lv, H Cao, F Zhang, Y Ren, B Wang… - The Computer …, 2022 - academic.oup.com
Centralized machine learning methods require the aggregation of data collected from
clients. Due to the awareness of data privacy, however, the aggregation of raw data …
CATNet: Cross-modal fusion for audio–visual speech recognition
X Wang, J Mi, B Li, Y Zhao, J Meng - Pattern Recognition Letters, 2024 - Elsevier
Automatic speech recognition (ASR) is a typical pattern recognition technology that converts
human speeches into texts. With the aid of advanced deep learning models, the …
Prevention of GAN-based privacy inferring attacks towards federated learning
With the increasing amount of data, data privacy has drawn great concern in machine
learning among the public. Federated Learning, which is a new kind of distributed learning …
Control system and speech recognition of exhibition hall digital media based on computer technology
Y Zhao - Mobile Information Systems, 2022 - Wiley Online Library
With environmental noise in the exhibition hall, speakers tend to change their speech
production to preserve intelligible communication. While great evolution has been prepared …
Spatio-temporal Weber Gradient Directional feature for visual and audio-visual phrase recognition systems
S Nandakishor, D Pati - International Journal of Information Technology, 2024 - Springer
Visual phrase recognition needs lip movement related visual features, while audio-visual
phrase recognition requires both acoustic and visual features. In this work, we propose a …
HMM-based phoneme speech recognition system for the control and command of industrial robots
A Naik - Technical Transactions, 2021 - sciendo.com
In recent years, the integration of human-robot interaction with speech recognition has
gained a lot of pace in the manufacturing industries. Conventional methods to control the …
Siamese decoupling network for speaker-independent lipreading
L Lu, X Xu, J Fu - Journal of Electronic Imaging, 2022 - spiedigitallibrary.org
Lipreading aims to decode the speech content from a moving mouth. It is a very challenging
task because lip appearance variations and speech contents are coupled together in the …
LIP reading for specially abled persons
N Sukritha, M Mohan - 2021 12th International Conference on …, 2021 - ieeexplore.ieee.org
Lip Reading is also called Visual Speech Recognition, is a technique that utilizes only the
visual scene of speech recognition. The proposed framework deals with three main steps …
Articulator Muscle Contraction Evaluation in Vowel Phonemes Articulation Based on Facial Images and Electromyography Signal
FL Sadida, MI Mandasari - 2023 8th International …, 2023 - ieeexplore.ieee.org
Utilization of motor signal information of articulatory muscles and facial images separately in
speech recognition shows that these two pieces of information are related to the articulation …