Ensemble deep learning in speech signal tasks: a review
Abstract Machine learning methods are extensively used for processing and analysing
speech signals by virtue of their performance gains over multiple domains. Deep learning …
speech signals by virtue of their performance gains over multiple domains. Deep learning …
Speech emotion recognition approaches: A systematic review
A Hashem, M Arif, M Alghamdi - Speech Communication, 2023 - Elsevier
The speech emotion recognition (SER) field has been active since it became a crucial
feature in advanced Human-Computer Interaction (HCI), and wide real-life applications use …
feature in advanced Human-Computer Interaction (HCI), and wide real-life applications use …
Multimodal emotion detection via attention-based fusion of extracted facial and speech features
Methods for detecting emotions that employ many modalities at the same time have been
found to be more accurate and resilient than those that rely on a single sense. This is due to …
found to be more accurate and resilient than those that rely on a single sense. This is due to …
A survey on databases for multimodal emotion recognition and an introduction to the VIRI (visible and InfraRed image) database
Multimodal human–computer interaction (HCI) systems pledge a more human–human-like
interaction between machines and humans. Their prowess in emanating an unambiguous …
interaction between machines and humans. Their prowess in emanating an unambiguous …
Self-labeling with feature transfer for speech emotion recognition
Most speech emotion recognition methods based on frames have obtained good results in
many applications. However, they segment each speech sample into smaller frames that are …
many applications. However, they segment each speech sample into smaller frames that are …
A novel heterogeneous parallel convolution Bi-LSTM for speech emotion recognition
H Zhang, H Huang, H Han - Applied Sciences, 2021 - mdpi.com
Speech emotion recognition is a substantial component of natural language processing
(NLP). It has strict requirements for the effectiveness of feature extraction and that of the …
(NLP). It has strict requirements for the effectiveness of feature extraction and that of the …
A review on speech emotion recognition: a survey, recent advances, challenges, and the influence of noise
SM George, PM Ilyas - Neurocomputing, 2024 - Elsevier
Affective Computing systems can detect the emotional state and mindset of an individual.
Speech Emotion Recognition (SER) is a unimodal affect computing system based on …
Speech Emotion Recognition (SER) is a unimodal affect computing system based on …
Temporal relation inference network for multimodal speech emotion recognition
Speech emotion recognition (SER) is a non-trivial task for humans, while it remains
challenging for automatic SER due to the linguistic complexity and contextual distortion …
challenging for automatic SER due to the linguistic complexity and contextual distortion …
Applying segment-level attention on bi-modal transformer encoder for audio-visual emotion recognition
Emotions can be expressed through multiple complementary modalities. This study selected
speech and facial expressions as modalities by which to recognize emotions. Current …
speech and facial expressions as modalities by which to recognize emotions. Current …
CENN: Capsule-enhanced neural network with innovative metrics for robust speech emotion recognition
H Zhang, H Huang, P Zhao, X Zhu, Z Yu - Knowledge-Based Systems, 2024 - Elsevier
Speech emotion recognition (SER) plays a pivotal role in enhancing Human-computer
interaction (HCI) systems. This paper introduces a groundbreaking Capsule-enhanced …
interaction (HCI) systems. This paper introduces a groundbreaking Capsule-enhanced …