[HTML][HTML] Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion

BT Atmaja, A Sasou, M Akagi - Speech Communication, 2022 - Elsevier
Speech emotion recognition (SER) is traditionally performed using merely acoustic
information. Acoustic features, commonly are extracted per frame, are mapped into emotion …

Audio, speech, language, & signal processing for covid-19: A comprehensive overview

G Deshpande, BW Schuller - arXiv preprint arXiv:2011.14445, 2020 - arxiv.org
The Coronavirus (COVID-19) pandemic has been the research focus world-wide in the year
2020. Several efforts, from collection of COVID-19 patients' data to screening them for the …

End-to-end modeling and transfer learning for audiovisual emotion recognition in-the-wild

D Dresvyanskiy, E Ryumina, H Kaya… - Multimodal …, 2022 - mdpi.com
As emotions play a central role in human communication, automatic emotion recognition has
attracted increasing attention in the last two decades. While multimodal systems enjoy high …

[PDF][PDF] Ensembling End-to-End Deep Models for Computational Paralinguistics Tasks: ComParE 2020 Mask and Breathing Sub-Challenges.

M Markitantov, D Dresvyanskiy, D Mamontov… - …, 2020 - isca-archive.org
This paper describes deep learning approaches for the Mask and Breathing Sub-
Challenges (SCs), which are addressed by the INTERSPEECH 2020 Computational …

A multimodal approach for mania level prediction in bipolar disorder

P Baki, H Kaya, E Çiftçi, H Güleç… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Bipolar disorder is a mental health disorder that causes mood swings that range from
depression to mania. Clinical diagnosis of bipolar disorder is based on patient interviews …

A computational look at oral history archives

F Pessanha, AA Salah - ACM Journal on Computing and Cultural …, 2021 - dl.acm.org
Computational technologies have revolutionized the archival sciences field, prompting new
approaches to process the extensive data in these collections. Automatic speech recognition …

[PDF][PDF] Annotation Confidence vs. Training Sample Size: Trade-Off Solution for Partially-Continuous Categorical Emotion Recognition.

E Ryumina, O Verkholyak, A Karpov - Interspeech, 2021 - isca-archive.org
Commonly adapted design of emotional corpora includes multiple annotations for the same
instance from several annotators. Most of the previous studies assume the ground truth to be …

[PDF][PDF] Mind the gap: On the value of silence representations to lexical-based speech emotion recognition.

M Perez, M Jaiswal, M Niu, C Gorrostieta… - …, 2022 - researchgate.net
Speech timing and non-speech regions (here referred to as “silence”), often play a critical
role in the perception of spoken language. Silence represents an important paralinguistic …

[PDF][PDF] A Bimodal Approach for Speech Emotion Recognition using Audio and Text.

O Verkholyak, A Dvoynikova, A Karpov - J. Internet Serv. Inf. Secur., 2021 - jisis.org
This paper presents a novel bimodal speech emotion recognition system based on analysis
of acoustic and linguistic information. We propose a novel decision-level fusion strategy that …

A persian asr-based ser: modification of sharif emotional speech database and investigation of persian text corpora

A Yazdani, Y Shekofteh - arXiv preprint arXiv:2211.09956, 2022 - arxiv.org
Speech Emotion Recognition (SER) is one of the essential perceptual methods of humans in
understanding the situation and how to interact with others, therefore, in recent years, it has …