Speech emotion recognition using deep 1D & 2D CNN LSTM networks
We aimed at learning deep emotion features to recognize speech emotion. Two
convolutional neural network and long short-term memory (CNN LSTM) networks, one 1D …
convolutional neural network and long short-term memory (CNN LSTM) networks, one 1D …
Learning deep features to recognise speech emotion using merged deep CNN
This study aims at learning deep features from different data to recognise speech emotion.
The authors designed a merged convolutional neural network (CNN), which had two …
The authors designed a merged convolutional neural network (CNN), which had two …
Exploiting the potentialities of features for speech emotion recognition
In recent years, studies on speech signals have increasingly paid attention to emotional
information. The most challenging aspect in speech emotion recognition (SER) is choosing …
information. The most challenging aspect in speech emotion recognition (SER) is choosing …
Discriminatively trained recurrent neural networks for continuous dimensional emotion recognition from audio
Continuous dimensional emotion recognition from audio is a sequential regression problem,
where the goal is to maximize correlation between sequences of regression outputs and …
where the goal is to maximize correlation between sequences of regression outputs and …
On-line continuous-time music mood regression with deep recurrent neural networks
This paper proposes a novel machine learning approach for the task of on-line continuous-
time music mood regression, ie, low-latency prediction of the time-varying arousal and …
time music mood regression, ie, low-latency prediction of the time-varying arousal and …
[PDF][PDF] Musical audio synthesis using autoencoding neural nets
With an optimal network topology and tuning of hyperpa-rameters, artificial neural networks
(ANNs) may be trained to learn a mapping from low level audio features to one or more …
(ANNs) may be trained to learn a mapping from low level audio features to one or more …
A systematic evaluation of the bag-of-frames representation for music information retrieval
There has been an increasing attention on learning feature representations from the
complex, high-dimensional audio data applied in various music information retrieval (MIR) …
complex, high-dimensional audio data applied in various music information retrieval (MIR) …
Understanding affective content of music videos through learned representations
In consideration of the ever-growing available multimedia data, annotating multimedia
content automatically with feeling (s) expected to arise in users is a challenging problem. In …
content automatically with feeling (s) expected to arise in users is a challenging problem. In …
Emotion recognition from thermal infrared images using deep Boltzmann machine
Facial expression and emotion recognition from thermal infrared images has attracted more
and more attentions in recent years. However, the features adopted in current work are …
and more attentions in recent years. However, the features adopted in current work are …
Recognizing semi-natural and spontaneous speech emotions using deep neural networks
We needed to find deep emotional features to identify emotions from audio signals.
Identifying emotions in spontaneous speech is a novel and challenging subject of research …
Identifying emotions in spontaneous speech is a novel and challenging subject of research …