[HTML][HTML] Deep learning techniques for speech emotion recognition, from databases to models
The advancements in neural networks and the on-demand need for accurate and near real-
time Speech Emotion Recognition (SER) in human–computer interactions make it …
time Speech Emotion Recognition (SER) in human–computer interactions make it …
[HTML][HTML] A review on speech emotion recognition using deep learning and attention mechanism
Emotions are an integral part of human interactions and are significant factors in determining
user satisfaction or customer opinion. speech emotion recognition (SER) modules also play …
user satisfaction or customer opinion. speech emotion recognition (SER) modules also play …
Survey of deep representation learning for speech emotion recognition
Traditionally, speech emotion recognition (SER) research has relied on manually
handcrafted acoustic features using feature engineering. However, the design of …
handcrafted acoustic features using feature engineering. However, the design of …
x-vectors meet emotions: A study on dependencies between emotion and speaker recognition
In this work, we explore the dependencies between speaker recognition and emotion
recognition. We first show that knowledge learned for speaker recognition can be reused for …
recognition. We first show that knowledge learned for speaker recognition can be reused for …
[PDF][PDF] Data Augmentation Using GANs for Speech Emotion Recognition.
In this work, we address the problem of data imbalance for the task of Speech Emotion
Recognition (SER). We investigate conditioned data augmentation using Generative …
Recognition (SER). We investigate conditioned data augmentation using Generative …
Improving speech emotion recognition with adversarial data augmentation network
L Yi, MW Mak - IEEE transactions on neural networks and …, 2020 - ieeexplore.ieee.org
When training data are scarce, it is challenging to train a deep neural network without
causing the overfitting problem. For overcoming this challenge, this article proposes a new …
causing the overfitting problem. For overcoming this challenge, this article proposes a new …
Multi-task semi-supervised adversarial autoencoding for speech emotion recognition
Inspite the emerging importance of Speech Emotion Recognition (SER), the state-of-the-art
accuracy is quite low and needs improvement to make commercial applications of SER …
accuracy is quite low and needs improvement to make commercial applications of SER …
Generative adversarial networks for speech processing: A review
Generative adversarial networks (GANs) have seen remarkable progress in recent years.
They are used as generative models for all kinds of data such as text, images, audio, music …
They are used as generative models for all kinds of data such as text, images, audio, music …
Self supervised adversarial domain adaptation for cross-corpus and cross-language speech emotion recognition
Despite the recent advancement in speech emotion recognition (SER) within a single corpus
setting, the performance of these SER systems degrades significantly for cross-corpus and …
setting, the performance of these SER systems degrades significantly for cross-corpus and …
[HTML][HTML] Data augmentation for audio-visual emotion recognition with an efficient multimodal conditional GAN
Audio-visual emotion recognition is the research of identifying human emotional states by
combining the audio modality and the visual modality simultaneously, which plays an …
combining the audio modality and the visual modality simultaneously, which plays an …