Ensemble deep learning in speech signal tasks: a review

M Tanveer, A Rastogi, V Paliwal, MA Ganaie, AK Malik… - Neurocomputing, 2023 - Elsevier
Abstract Machine learning methods are extensively used for processing and analysing
speech signals by virtue of their performance gains over multiple domains. Deep learning …

Speech emotion recognition approaches: A systematic review

A Hashem, M Arif, M Alghamdi - Speech Communication, 2023 - Elsevier
The speech emotion recognition (SER) field has been active since it became a crucial
feature in advanced Human-Computer Interaction (HCI), and wide real-life applications use …

Multimodal emotion detection via attention-based fusion of extracted facial and speech features

D Mamieva, AB Abdusalomov, A Kutlimuratov… - Sensors, 2023 - mdpi.com
Methods for detecting emotions that employ many modalities at the same time have been
found to be more accurate and resilient than those that rely on a single sense. This is due to …

A survey on databases for multimodal emotion recognition and an introduction to the VIRI (visible and InfraRed image) database

MFH Siddiqui, P Dhakal, X Yang, AY Javaid - Multimodal Technologies …, 2022 - mdpi.com
Multimodal human–computer interaction (HCI) systems pledge a more human–human-like
interaction between machines and humans. Their prowess in emanating an unambiguous …

Self-labeling with feature transfer for speech emotion recognition

G Wen, H Liao, H Li, P Wen, T Zhang, S Gao… - Knowledge-Based …, 2022 - Elsevier
Most speech emotion recognition methods based on frames have obtained good results in
many applications. However, they segment each speech sample into smaller frames that are …

A novel heterogeneous parallel convolution Bi-LSTM for speech emotion recognition

H Zhang, H Huang, H Han - Applied Sciences, 2021 - mdpi.com
Speech emotion recognition is a substantial component of natural language processing
(NLP). It has strict requirements for the effectiveness of feature extraction and that of the …

A review on speech emotion recognition: a survey, recent advances, challenges, and the influence of noise

SM George, PM Ilyas - Neurocomputing, 2024 - Elsevier
Affective Computing systems can detect the emotional state and mindset of an individual.
Speech Emotion Recognition (SER) is a unimodal affect computing system based on …

Temporal relation inference network for multimodal speech emotion recognition

GN Dong, CM Pun, Z Zhang - IEEE Transactions on Circuits …, 2022 - ieeexplore.ieee.org
Speech emotion recognition (SER) is a non-trivial task for humans, while it remains
challenging for automatic SER due to the linguistic complexity and contextual distortion …

Applying segment-level attention on bi-modal transformer encoder for audio-visual emotion recognition

JH Hsu, CH Wu - IEEE Transactions on Affective Computing, 2023 - ieeexplore.ieee.org
Emotions can be expressed through multiple complementary modalities. This study selected
speech and facial expressions as modalities by which to recognize emotions. Current …

CENN: Capsule-enhanced neural network with innovative metrics for robust speech emotion recognition

H Zhang, H Huang, P Zhao, X Zhu, Z Yu - Knowledge-Based Systems, 2024 - Elsevier
Speech emotion recognition (SER) plays a pivotal role in enhancing Human-computer
interaction (HCI) systems. This paper introduces a groundbreaking Capsule-enhanced …