Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers

MB Akçay, K Oğuz - Speech Communication, 2020 - Elsevier
Speech is the most natural way of expressing ourselves as humans. It is only natural then to
extend this communication medium to computer applications. We define speech emotion …

A comprehensive review of speech emotion recognition systems

TM Wani, TS Gunawan, SAA Qadri, M Kartiwi… - IEEE …, 2021 - ieeexplore.ieee.org
During the last decade, Speech Emotion Recognition (SER) has emerged as an integral
component within Human-computer Interaction (HCI) and other high-end speech processing …

Emotion recognition from speech using wav2vec 2.0 embeddings

L Pepino, P Riera, L Ferrer - arXiv preprint arXiv:2104.03502, 2021 - arxiv.org
Emotion recognition datasets are relatively small, making the use of the more sophisticated
deep learning approaches challenging. In this work, we propose a transfer learning method …

Multimodal speech emotion recognition using audio and text

S Yoon, S Byun, K Jung - 2018 IEEE spoken language …, 2018 - ieeexplore.ieee.org
Speech emotion recognition is a challenging task, and extensive reliance has been placed
on models that use audio features in building well-performing classifiers. In this paper, we …

Evaluating deep learning architectures for speech emotion recognition

HM Fayek, M Lech, L Cavedon - Neural Networks, 2017 - Elsevier
Abstract Speech Emotion Recognition (SER) can be regarded as a static or dynamic
classification problem, which makes SER an excellent test bed for investigating and …

Databases, features and classifiers for speech emotion recognition: a review

M Swain, A Routray, P Kabisatpathy - International Journal of Speech …, 2018 - Springer
Speech is an effective medium to express emotions and attitude through language. Finding
the emotional content from a speech signal and identify the emotions from the speech …

Speech emotion recognition using deep neural network and extreme learning machine

K Han, D Yu, I Tashev - Interspeech 2014, 2014 - microsoft.com
Speech emotion recognition is a challenging problem partly because it is unclear what
features are effective for the task. In this paper we propose to utilize deep neural networks …

[HTML][HTML] Deep learning analysis of mobile physiological, environmental and location sensor data for emotion detection

E Kanjo, EMG Younis, CS Ang - Information Fusion, 2019 - Elsevier
The detection and monitoring of emotions are important in various applications, eg, to
enable naturalistic and personalised human-robot interaction. Emotion detection often …

[PDF][PDF] Speech emotion recognition with multi-task learning.

X Cai, J Yuan, R Zheng, L Huang, K Church - Interspeech, 2021 - academia.edu
Speech emotion recognition (SER) classifies speech into emotion categories such as:
Happy, Angry, Sad and Neutral. Recently, deep learning has been applied to the SER task …

[PDF][PDF] Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning.

Y Li, T Zhao, T Kawahara - Interspeech, 2019 - isca-archive.org
Accurately recognizing emotion from speech is a necessary yet challenging task due to the
variability in speech and emotion. In this paper, we propose a speech emotion recognition …