A survey of speech emotion recognition in natural environment
While speech emotion recognition (SER) has been an active research field since the last
three decades, the techniques that deal with the natural environment have only emerged in …
three decades, the techniques that deal with the natural environment have only emerged in …
Deep representation learning in speech processing: Challenges, recent advances, and future trends
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …
engineered acoustic features (feature engineering) as a separate distinct problem from the …
Survey of deep representation learning for speech emotion recognition
Traditionally, speech emotion recognition (SER) research has relied on manually
handcrafted acoustic features using feature engineering. However, the design of …
handcrafted acoustic features using feature engineering. However, the design of …
Face recognition based on convolutional neural network
Face recognition is of great importance to real world applications such as video surveillance,
human machine interaction and security systems. As compared to traditional machine …
human machine interaction and security systems. As compared to traditional machine …
A multi-task learning framework for emotion recognition using 2D continuous space
Dimensional models have been proposed in psychology studies to represent complex
human emotional expressions. Activation and valence are two common dimensions in such …
human emotional expressions. Activation and valence are two common dimensions in such …
Using regional saliency for speech emotion recognition
Z Aldeneh, EM Provost - 2017 IEEE international conference on …, 2017 - ieeexplore.ieee.org
In this paper, we show that convolutional neural networks can be directly applied to temporal
low-level acoustic features to identify emotionally salient regions without the need for …
low-level acoustic features to identify emotionally salient regions without the need for …
Multi-head attention for speech emotion recognition with auxiliary learning of gender recognition
A Nediyanchath, P Paramasivam… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
The paper presents a Multi-Head Attention deep learning network for Speech Emotion
Recognition (SER) using Log mel-Filter Bank Energies (LFBE) spectral features as the input …
Recognition (SER) using Log mel-Filter Bank Energies (LFBE) spectral features as the input …
End-to-end speech emotion recognition with gender information
TW Sun - IEEE Access, 2020 - ieeexplore.ieee.org
Many works have focused on speech emotion recognition algorithms. However, most rely on
the proper selection of speech acoustic features. In this paper, we propose a novel emotion …
the proper selection of speech acoustic features. In this paper, we propose a novel emotion …
An open-source speaker gender detection framework for monitoring gender equality
This paper presents an approach based on acoustic analysis to describe gender equality in
French audiovisual streams, through the estimation of male and female speaking time …
French audiovisual streams, through the estimation of male and female speaking time …
Multi-modal dimensional emotion recognition using recurrent neural networks
Emotion recognition has been an active research area with both wide applications and big
challenges. This paper presents our effort for the Audio/Visual Emotion Challenge …
challenges. This paper presents our effort for the Audio/Visual Emotion Challenge …