Is Everything Fine, Grandma? Acoustic and Linguistic Modeling for Robust Elderly Speech Emotion...

Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion

BT Atmaja, A Sasou, M Akagi - Speech Communication, 2022 - Elsevier

Speech emotion recognition (SER) is traditionally performed using merely acoustic
information. Acoustic features, commonly extracted per frame, are mapped into emotion …

Cited by 69 · Related articles · All 7 versions

Audio, speech, language, & signal processing for COVID-19: A comprehensive overview

G Deshpande, BW Schuller - arXiv preprint arXiv:2011.14445, 2020 - arxiv.org

The Coronavirus (COVID-19) pandemic has been the research focus world-wide in the year
2020. Several efforts, from collection of COVID-19 patients' data to screening them for the …

Cited by 28 · Related articles · All 4 versions

End-to-end modeling and transfer learning for audiovisual emotion recognition in-the-wild

D Dresvyanskiy, E Ryumina, H Kaya… - Multimodal …, 2022 - mdpi.com

As emotions play a central role in human communication, automatic emotion recognition has
attracted increasing attention in the last two decades. While multimodal systems enjoy high …

Cited by 26 · Related articles · All 10 versions

Ensembling End-to-End Deep Models for Computational Paralinguistics Tasks: ComParE 2020 Mask and Breathing Sub-Challenges

M Markitantov, D Dresvyanskiy, D Mamontov… - …, 2020 - isca-archive.org

This paper describes deep learning approaches for the Mask and Breathing Sub-
Challenges (SCs), which are addressed by the INTERSPEECH 2020 Computational …

Cited by 42 · Related articles · All 7 versions

A multimodal approach for mania level prediction in bipolar disorder

P Baki, H Kaya, E Çiftçi, H Güleç… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Bipolar disorder is a mental health disorder that causes mood swings that range from
depression to mania. Clinical diagnosis of bipolar disorder is based on patient interviews …

Cited by 16 · Related articles · All 3 versions

A computational look at oral history archives

F Pessanha, AA Salah - ACM Journal on Computing and Cultural …, 2021 - dl.acm.org

Computational technologies have revolutionized the archival sciences field, prompting new
approaches to process the extensive data in these collections. Automatic speech recognition …

Cited by 18 · Related articles

Annotation Confidence vs. Training Sample Size: Trade-Off Solution for Partially-Continuous Categorical Emotion Recognition

E Ryumina, O Verkholyak, A Karpov - Interspeech, 2021 - isca-archive.org

Commonly adapted design of emotional corpora includes multiple annotations for the same
instance from several annotators. Most of the previous studies assume the ground truth to be …

Cited by 14 · Related articles · All 5 versions

Mind the gap: On the value of silence representations to lexical-based speech emotion recognition

M Perez, M Jaiswal, M Niu, C Gorrostieta… - …, 2022 - researchgate.net

Speech timing and non-speech regions (here referred to as “silence”), often play a critical
role in the perception of spoken language. Silence represents an important paralinguistic …

Cited by 6 · Related articles · All 5 versions

A Bimodal Approach for Speech Emotion Recognition using Audio and Text

O Verkholyak, A Dvoynikova, A Karpov - J. Internet Serv. Inf. Secur., 2021 - jisis.org

This paper presents a novel bimodal speech emotion recognition system based on analysis
of acoustic and linguistic information. We propose a novel decision-level fusion strategy that …

Cited by 12 · Related articles · All 4 versions

A Persian ASR-based SER: modification of Sharif Emotional Speech Database and investigation of Persian text corpora

A Yazdani, Y Shekofteh - arXiv preprint arXiv:2211.09956, 2022 - arxiv.org

Speech Emotion Recognition (SER) is one of the essential perceptual methods of humans in
understanding the situation and how to interact with others, therefore, in recent years, it has …

Cited by 2 · Related articles · All 2 versions