[HTML][HTML] Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion
Speech emotion recognition (SER) is traditionally performed using merely acoustic
information. Acoustic features, commonly are extracted per frame, are mapped into emotion …
information. Acoustic features, commonly are extracted per frame, are mapped into emotion …
Audio, speech, language, & signal processing for covid-19: A comprehensive overview
G Deshpande, BW Schuller - arXiv preprint arXiv:2011.14445, 2020 - arxiv.org
The Coronavirus (COVID-19) pandemic has been the research focus world-wide in the year
2020. Several efforts, from collection of COVID-19 patients' data to screening them for the …
2020. Several efforts, from collection of COVID-19 patients' data to screening them for the …
End-to-end modeling and transfer learning for audiovisual emotion recognition in-the-wild
As emotions play a central role in human communication, automatic emotion recognition has
attracted increasing attention in the last two decades. While multimodal systems enjoy high …
attracted increasing attention in the last two decades. While multimodal systems enjoy high …
[PDF][PDF] Ensembling End-to-End Deep Models for Computational Paralinguistics Tasks: ComParE 2020 Mask and Breathing Sub-Challenges.
This paper describes deep learning approaches for the Mask and Breathing Sub-
Challenges (SCs), which are addressed by the INTERSPEECH 2020 Computational …
Challenges (SCs), which are addressed by the INTERSPEECH 2020 Computational …
A multimodal approach for mania level prediction in bipolar disorder
Bipolar disorder is a mental health disorder that causes mood swings that range from
depression to mania. Clinical diagnosis of bipolar disorder is based on patient interviews …
depression to mania. Clinical diagnosis of bipolar disorder is based on patient interviews …
A computational look at oral history archives
F Pessanha, AA Salah - ACM Journal on Computing and Cultural …, 2021 - dl.acm.org
Computational technologies have revolutionized the archival sciences field, prompting new
approaches to process the extensive data in these collections. Automatic speech recognition …
approaches to process the extensive data in these collections. Automatic speech recognition …
[PDF][PDF] Annotation Confidence vs. Training Sample Size: Trade-Off Solution for Partially-Continuous Categorical Emotion Recognition.
Commonly adapted design of emotional corpora includes multiple annotations for the same
instance from several annotators. Most of the previous studies assume the ground truth to be …
instance from several annotators. Most of the previous studies assume the ground truth to be …
[PDF][PDF] Mind the gap: On the value of silence representations to lexical-based speech emotion recognition.
Speech timing and non-speech regions (here referred to as “silence”), often play a critical
role in the perception of spoken language. Silence represents an important paralinguistic …
role in the perception of spoken language. Silence represents an important paralinguistic …
[PDF][PDF] A Bimodal Approach for Speech Emotion Recognition using Audio and Text.
O Verkholyak, A Dvoynikova, A Karpov - J. Internet Serv. Inf. Secur., 2021 - jisis.org
This paper presents a novel bimodal speech emotion recognition system based on analysis
of acoustic and linguistic information. We propose a novel decision-level fusion strategy that …
of acoustic and linguistic information. We propose a novel decision-level fusion strategy that …
A persian asr-based ser: modification of sharif emotional speech database and investigation of persian text corpora
A Yazdani, Y Shekofteh - arXiv preprint arXiv:2211.09956, 2022 - arxiv.org
Speech Emotion Recognition (SER) is one of the essential perceptual methods of humans in
understanding the situation and how to interact with others, therefore, in recent years, it has …
understanding the situation and how to interact with others, therefore, in recent years, it has …