Deep learning techniques for speech emotion recognition, from databases to models

BJ Abbaschian, D Sierra-Sosa, A Elmaghraby - Sensors, 2021 - mdpi.com
The advancements in neural networks and the on-demand need for accurate and near real-
time Speech Emotion Recognition (SER) in human–computer interactions make it …

Automated emotion recognition: Current trends and future perspectives

M Maithri, U Raghavendra, A Gudigar… - Computer methods and …, 2022 - Elsevier
Background Human emotions greatly affect the actions of a person. The automated emotion
recognition has applications in multiple domains such as health care, e-learning …

An ensemble 1D-CNN-LSTM-GRU model with data augmentation for speech emotion recognition

MR Ahmed, S Islam, AKMM Islam… - Expert Systems with …, 2023 - Elsevier
Precise recognition of emotion from speech signals aids in enhancing human–computer
interaction (HCI). The performance of a speech emotion recognition (SER) system depends …

Hybrid LSTM-transformer model for emotion recognition from speech audio files

F Andayani, LB Theng, MT Tsun, C Chua - IEEE Access, 2022 - ieeexplore.ieee.org
Emotion is a vital component in daily human communication and it helps people understand
each other. Emotion recognition plays a crucial role in developing human-computer …

Speech emotion recognition approaches: A systematic review

A Hashem, M Arif, M Alghamdi - Speech Communication, 2023 - Elsevier
The speech emotion recognition (SER) field has been active since it became a crucial
feature in advanced Human-Computer Interaction (HCI), and wide real-life applications use …

Rawboost: A raw data boosting and augmentation method applied to automatic speaker verification anti-spoofing

H Tak, M Kamble, J Patino, M Todisco… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
This paper introduces RawBoost, a data boosting and augmentation method for the design
of more reliable spoofing detection solutions which operate directly upon raw waveform …

Emotion intensity and its control for emotional voice conversion

K Zhou, B Sisman, R Rana… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Emotional voice conversion (EVC) seeks to convert the emotional state of an utterance while
preserving the linguistic content and speaker identity. In EVC, emotions are usually treated …

Speech synthesis with mixed emotions

K Zhou, B Sisman, R Rana… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Emotional speech synthesis aims to synthesize human voices with various emotional effects.
The current studies are mostly focused on imitating an averaged style belonging to a specific …

Speech emotion recognition using convolution neural networks and multi-head convolutional transformer

R Ullah, M Asif, WA Shah, F Anjam, I Ullah… - Sensors, 2023 - mdpi.com
Speech emotion recognition (SER) is a challenging task in human–computer interaction
(HCI) systems. One of the key challenges in speech emotion recognition is to extract the …

[HTML][HTML] Hybrid data augmentation and deep attention-based dilated convolutional-recurrent neural networks for speech emotion recognition

NT Pham, DNM Dang, ND Nguyen, TT Nguyen… - Expert Systems with …, 2023 - Elsevier
Recently, speech emotion recognition (SER) has become an active research area in speech
processing, particularly with the advent of deep learning (DL). Numerous DL-based methods …