Deep learning techniques for speech emotion recognition, from databases to models
The advancements in neural networks and the on-demand need for accurate and near real-
time Speech Emotion Recognition (SER) in human–computer interactions make it …
Automated emotion recognition: Current trends and future perspectives
Background: Human emotions greatly affect the actions of a person. The automated emotion
recognition has applications in multiple domains such as health care, e-learning …
An ensemble 1D-CNN-LSTM-GRU model with data augmentation for speech emotion recognition
Precise recognition of emotion from speech signals aids in enhancing human–computer
interaction (HCI). The performance of a speech emotion recognition (SER) system depends …
Hybrid LSTM-transformer model for emotion recognition from speech audio files
Emotion is a vital component in daily human communication and it helps people understand
each other. Emotion recognition plays a crucial role in developing human-computer …
Speech emotion recognition approaches: A systematic review
A Hashem, M Arif, M Alghamdi - Speech Communication, 2023 - Elsevier
The speech emotion recognition (SER) field has been active since it became a crucial
feature in advanced Human-Computer Interaction (HCI), and wide real-life applications use …
Rawboost: A raw data boosting and augmentation method applied to automatic speaker verification anti-spoofing
This paper introduces RawBoost, a data boosting and augmentation method for the design
of more reliable spoofing detection solutions which operate directly upon raw waveform …
Emotion intensity and its control for emotional voice conversion
Emotional voice conversion (EVC) seeks to convert the emotional state of an utterance while
preserving the linguistic content and speaker identity. In EVC, emotions are usually treated …
Speech synthesis with mixed emotions
Emotional speech synthesis aims to synthesize human voices with various emotional effects.
The current studies are mostly focused on imitating an averaged style belonging to a specific …
Speech emotion recognition using convolution neural networks and multi-head convolutional transformer
Speech emotion recognition (SER) is a challenging task in human–computer interaction
(HCI) systems. One of the key challenges in speech emotion recognition is to extract the …
Hybrid data augmentation and deep attention-based dilated convolutional-recurrent neural networks for speech emotion recognition
Recently, speech emotion recognition (SER) has become an active research area in speech
processing, particularly with the advent of deep learning (DL). Numerous DL-based methods …