Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge

B Schuller, A Batliner, S Steidl, D Seppi - Speech communication, 2011 - Elsevier
More than a decade has passed since research on automatic recognition of emotion from
speech has become a new field of research in line with its 'big brothers' speech and speaker …

Emotion based hate speech detection using multimodal learning

A Rana, S Jha - arXiv preprint arXiv:2202.06218, 2022 - arxiv.org
In recent years, monitoring hate speech and offensive language on social media platforms
has become paramount due to its widespread usage among all age groups, races, and …

Dealing with disagreements: Looking beyond the majority vote in subjective annotations

AM Davani, M Díaz, V Prabhakaran - Transactions of the Association …, 2022 - direct.mit.edu
Majority voting and averaging are common approaches used to resolve annotator
disagreements and derive single ground truth labels from multiple annotations. However …

Training deep networks for facial expression recognition with crowd-sourced label distribution

E Barsoum, C Zhang, CC Ferrer, Z Zhang - Proceedings of the 18th ACM …, 2016 - dl.acm.org
Crowd sourcing has become a widely adopted scheme to collect ground truth labels.
However, it is a well-known problem that these labels can be very noisy. In this paper, we …

Building naturalistic emotionally balanced speech corpus by retrieving emotional speech from existing podcast recordings

R Lotfian, C Busso - IEEE Transactions on Affective Computing, 2017 - ieeexplore.ieee.org
The lack of a large, natural emotional database is one of the key barriers to translate results
on speech emotion recognition in controlled conditions into real-life applications. Collecting …

MSP-IMPROV: An acted corpus of dyadic interactions to study emotion perception

C Busso, S Parthasarathy, A Burmania… - IEEE Transactions …, 2016 - ieeexplore.ieee.org
We present the MSP-IMPROV corpus, a multimodal emotional database, where the goal is
to have control over lexical content and emotion while also promoting naturalness in the …

Paralinguistics in speech and language—state-of-the-art and the challenge

B Schuller, S Steidl, A Batliner, F Burkhardt… - Computer Speech & …, 2013 - Elsevier
Paralinguistic analysis is increasingly turning into a mainstream topic in speech and
language processing. This article aims to provide a broad overview of the constantly …

Behavioral signal processing: Deriving human behavioral informatics from speech and language

S Narayanan, PG Georgiou - Proceedings of the IEEE, 2013 - ieeexplore.ieee.org
The expression and experience of human behavior are complex and multimodal and
characterized by individual and contextual heterogeneity and variability. Speech and …

A framework for automatic human emotion classification using emotion profiles

E Mower, MJ Matarić… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
Automatic recognition of emotion is becoming an increasingly important component in the
design process for affect-sensitive human-machine interaction (HMI) systems. Well …

[PDF][PDF] Jointly Predicting Arousal, Valence and Dominance with Multi-Task Learning.

S Parthasarathy, C Busso - Interspeech, 2017 - isca-archive.org
An appealing representation of emotions is the use of emotional attributes such as arousal
(passive versus active), valence (negative versus positive) and dominance (weak versus …