A survey of speech emotion recognition in natural environment

MS Fahad, A Ranjan, J Yadav, A Deepak - Digital signal processing, 2021 - Elsevier
While speech emotion recognition (SER) has been an active research field since the last
three decades, the techniques that deal with the natural environment have only emerged in …

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arXiv preprint arXiv …, 2020 - arxiv.org
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

Survey of deep representation learning for speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Traditionally, speech emotion recognition (SER) research has relied on manually
handcrafted acoustic features using feature engineering. However, the design of …

Face recognition based on convolutional neural network

M Coşkun, A Uçar, Ö Yildirim… - … conference on modern …, 2017 - ieeexplore.ieee.org
Face recognition is of great importance to real world applications such as video surveillance,
human machine interaction and security systems. As compared to traditional machine …

A multi-task learning framework for emotion recognition using 2D continuous space

R Xia, Y Liu - IEEE Transactions on affective computing, 2015 - ieeexplore.ieee.org
Dimensional models have been proposed in psychology studies to represent complex
human emotional expressions. Activation and valence are two common dimensions in such …

Using regional saliency for speech emotion recognition

Z Aldeneh, EM Provost - 2017 IEEE international conference on …, 2017 - ieeexplore.ieee.org
In this paper, we show that convolutional neural networks can be directly applied to temporal
low-level acoustic features to identify emotionally salient regions without the need for …

Multi-head attention for speech emotion recognition with auxiliary learning of gender recognition

A Nediyanchath, P Paramasivam… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
The paper presents a Multi-Head Attention deep learning network for Speech Emotion
Recognition (SER) using Log mel-Filter Bank Energies (LFBE) spectral features as the input …

End-to-end speech emotion recognition with gender information

TW Sun - IEEE Access, 2020 - ieeexplore.ieee.org
Many works have focused on speech emotion recognition algorithms. However, most rely on
the proper selection of speech acoustic features. In this paper, we propose a novel emotion …

An open-source speaker gender detection framework for monitoring gender equality

D Doukhan, J Carrive, F Vallet… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
This paper presents an approach based on acoustic analysis to describe gender equality in
French audiovisual streams, through the estimation of male and female speaking time …

Multi-modal dimensional emotion recognition using recurrent neural networks

S Chen, Q Jin - Proceedings of the 5th International Workshop on …, 2015 - dl.acm.org
Emotion recognition has been an active research area with both wide applications and big
challenges. This paper presents our effort for the Audio/Visual Emotion Challenge …