Survey of deep learning paradigms for speech processing
KB Bhangale, M Kothandaraman - Wireless Personal Communications, 2022 - Springer
Over the past decades, a particular focus is given to research on machine learning
techniques for speech processing applications. However, in the past few years, research …
techniques for speech processing applications. However, in the past few years, research …
A comprehensive survey on multi-modal conversational emotion recognition with deep learning
Multi-modal conversation emotion recognition (MCER) aims to recognize and track the
speaker's emotional state using text, speech, and visual information in the conversation …
speaker's emotional state using text, speech, and visual information in the conversation …
Curriculum learning: A survey
Training machine learning models in a meaningful order, from the easy samples to the hard
ones, using curriculum learning can provide performance improvements over the standard …
ones, using curriculum learning can provide performance improvements over the standard …
AVEC 2019 workshop and challenge: state-of-mind, detecting depression with AI, and cross-cultural affect recognition
The Audio/Visual Emotion Challenge and Workshop (AVEC 2019)'State-of-Mind, Detecting
Depression with AI, and Cross-cultural Affect Recognition'is the ninth competition event …
Depression with AI, and Cross-cultural Affect Recognition'is the ninth competition event …
x-vectors meet emotions: A study on dependencies between emotion and speaker recognition
In this work, we explore the dependencies between speaker recognition and emotion
recognition. We first show that knowledge learned for speaker recognition can be reused for …
recognition. We first show that knowledge learned for speaker recognition can be reused for …
[PDF][PDF] Self-Attention for Speech Emotion Recognition.
L Tarantino, PN Garner, A Lazaridis - Interspeech, 2019 - publications.idiap.ch
Abstract Speech Emotion Recognition (SER) has been shown to benefit from many of the
recent advances in deep learning, including recurrent based and attention based neural …
recent advances in deep learning, including recurrent based and attention based neural …
Expressive TTS training with frame and style reconstruction loss
We propose a novel training strategy for Tacotron-based text-to-speech (TTS) system that
improves the speech styling at utterance level. One of the key challenges in prosody …
improves the speech styling at utterance level. One of the key challenges in prosody …
Adaptive curriculum learning
Inspired by the human learning principle that learning easier concepts first and then
gradually paying more attention to harder ones, curriculum learning uses the non-uniform …
gradually paying more attention to harder ones, curriculum learning uses the non-uniform …
Multi-classifier interactive learning for ambiguous speech emotion recognition
In recent years, speech emotion recognition technology is of great significance in
widespread applications such as call centers, social robots and health care. Thus, the …
widespread applications such as call centers, social robots and health care. Thus, the …
EmoBed: Strengthening monomodal emotion recognition via training with crossmodal emotion embeddings
Despite remarkable advances in emotion recognition, they are severely restrained from
either the essentially limited property of the employed single modality, or the synchronous …
either the essentially limited property of the employed single modality, or the synchronous …