On enhancing speech emotion recognition using generative adversarial networks

BJ Abbaschian, D Sierra-Sosa, A Elmaghraby - Sensors, 2021 - mdpi.com

The advancements in neural networks and the on-demand need for accurate and near real-
time Speech Emotion Recognition (SER) in human–computer interactions make it …

被引用次数：259 相关文章所有 9 个版本

[HTML] mdpi.com

[HTML][HTML] A review on speech emotion recognition using deep learning and attention mechanism

E Lieskovská, M Jakubec, R Jarina, M Chmulík - Electronics, 2021 - mdpi.com

Emotions are an integral part of human interactions and are significant factors in determining
user satisfaction or customer opinion. speech emotion recognition (SER) modules also play …

被引用次数：140 相关文章所有 4 个版本

[PDF] techrxiv.org

Survey of deep representation learning for speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions …, 2021 - ieeexplore.ieee.org

Traditionally, speech emotion recognition (SER) research has relied on manually
handcrafted acoustic features using feature engineering. However, the design of …

被引用次数：98 相关文章所有 10 个版本

[PDF] arxiv.org

x-vectors meet emotions: A study on dependencies between emotion and speaker recognition

R Pappagari, T Wang, J Villalba… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

In this work, we explore the dependencies between speaker recognition and emotion
recognition. We first show that knowledge learned for speaker recognition can be reused for …

被引用次数：140 相关文章所有 5 个版本

[PDF] github.io

[PDF][PDF] Data Augmentation Using GANs for Speech Emotion Recognition.

A Chatziagapi, G Paraskevopoulos, D Sgouropoulos… - Interspeech, 2019 - slp-ntua.github.io

In this work, we address the problem of data imbalance for the task of Speech Emotion
Recognition (SER). We investigate conditioned data augmentation using Generative …

被引用次数：144 相关文章所有 5 个版本

[PDF] polyu.edu.hk

Improving speech emotion recognition with adversarial data augmentation network

L Yi, MW Mak - IEEE transactions on neural networks and …, 2020 - ieeexplore.ieee.org

When training data are scarce, it is challenging to train a deep neural network without
causing the overfitting problem. For overcoming this challenge, this article proposes a new …

被引用次数：84 相关文章所有 5 个版本

[PDF] arxiv.org

Multi-task semi-supervised adversarial autoencoding for speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions …, 2020 - ieeexplore.ieee.org

Inspite the emerging importance of Speech Emotion Recognition (SER), the state-of-the-art
accuracy is quite low and needs improvement to make commercial applications of SER …

被引用次数：117 相关文章所有 10 个版本

Generative adversarial networks for speech processing: A review

A Wali, Z Alamgir, S Karim, A Fawaz, MB Ali… - Computer Speech & …, 2022 - Elsevier

Generative adversarial networks (GANs) have seen remarkable progress in recent years.
They are used as generative models for all kinds of data such as text, images, audio, music …

被引用次数：51 相关文章所有 2 个版本

[PDF] arxiv.org

Self supervised adversarial domain adaptation for cross-corpus and cross-language speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Despite the recent advancement in speech emotion recognition (SER) within a single corpus
setting, the performance of these SER systems degrades significantly for cross-corpus and …

被引用次数：40 相关文章所有 8 个版本

[HTML] mdpi.com

[HTML][HTML] Data augmentation for audio-visual emotion recognition with an efficient multimodal conditional GAN

F Ma, Y Li, S Ni, SL Huang, L Zhang - Applied Sciences, 2022 - mdpi.com

Audio-visual emotion recognition is the research of identifying human emotional states by
combining the audio modality and the visual modality simultaneously, which plays an …

被引用次数：44 相关文章所有 5 个版本