Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge

B Schuller, A Batliner, S Steidl, D Seppi - Speech communication, 2011 - Elsevier
More than a decade has passed since research on automatic recognition of emotion from
speech has become a new field of research in line with its 'big brothers' speech and speaker …

Cross-corpus acoustic emotion recognition: Variances and strategies

B Schuller, B Vlasenko, F Eyben… - IEEE Transactions …, 2010 - ieeexplore.ieee.org
As the recognition of emotion from speech has matured to a degree where it becomes
applicable in real-life settings, it is time for a realistic view on obtainable performances. Most …

Paralinguistics in speech and language—state-of-the-art and the challenge

B Schuller, S Steidl, A Batliner, F Burkhardt… - Computer Speech & …, 2013 - Elsevier
Paralinguistic analysis is increasingly turning into a mainstream topic in speech and
language processing. This article aims to provide a broad overview of the constantly …

FeatureHouse: Language-independent, automated software composition

S Apel, C Kästner, C Lengauer - 2009 IEEE 31st International …, 2009 - ieeexplore.ieee.org
Superimposition is a composition technique that has been applied successfully in many
areas of software development. Although superimposition is a general-purpose concept, it …

Anger recognition in speech using acoustic and linguistic cues

T Polzehl, A Schmitt, F Metze, M Wagner - Speech Communication, 2011 - Elsevier
The present study elaborates on the exploitation of both linguistic and acoustic feature
modeling for anger classification. In terms of acoustic modeling we generate statistics from …

Shape-based modeling of the fundamental frequency contour for emotion detection in speech

JP Arias, C Busso, NB Yoma - Computer Speech & Language, 2014 - Elsevier
This paper proposes the use of neutral reference models to detect local emotional
prominence in the fundamental frequency. A novel approach based on functional data …

Temporal Bayesian fusion for affect sensing: Combining video, audio, and lexical modalities

A Savran, H Cao, A Nenkova… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
The affective state of people changes in the course of conversations and these changes are
expressed externally in a variety of channels, including facial expressions, voice, and …

Modeling phonetic pattern variability in favor of the creation of robust emotion classifiers for real-life applications

B Vlasenko, D Prylipko, R Böck… - Computer Speech & …, 2014 - Elsevier
The role of automatic emotion recognition from speech is growing continuously because of
the accepted importance of reacting to the emotional state of the user in human–computer …

Recognizing affect from speech prosody using hierarchical graphical models

R Fernandez, R Picard - Speech Communication, 2011 - Elsevier
In this work we develop and apply a class of hierarchical directed graphical models on the
task of recognizing affective categories from prosody in both acted and natural speech. A …

[BOOK][B] Towards adaptive spoken dialog systems

A Schmitt, W Minker - 2012 - books.google.com
In Monitoring Adaptive Spoken Dialog Systems, authors Alexander Schmitt and Wolfgang
Minker investigate statistical approaches that allow for recognition of negative dialog …