Evaluating self-supervised speech representations for speech emotion recognition

S Madanian, T Chen, O Adeleye, JM Templeton… - Intelligent systems with …, 2023 - Elsevier

Speech emotion recognition (SER) as a Machine Learning (ML) problem continues to
garner a significant amount of research interest, especially in the affective computing …

被引用次数：24 相关文章所有 4 个版本

[PDF] arxiv.org

EMO-SUPERB: An in-depth look at speech emotion recognition

H Wu, HC Chou, KW Chang, L Goncalves, J Du… - arXiv preprint arXiv …, 2024 - arxiv.org

Speech emotion recognition (SER) is a pivotal technology for human-computer interaction
systems. However, 80.77% of SER papers yield results that cannot be reproduced. We …

被引用次数：8 相关文章所有 2 个版本

Automatic Speech Emotion Recognition: a Systematic Literature Review

HH Mustafa, NR Darwish, HA Hefny - International Journal of Speech …, 2024 - Springer

Abstract Automatic Speech Emotion Recognition (ASER) has recently garnered attention
across various fields including artificial intelligence, pattern recognition, and human …

Transforming the embeddings: A lightweight technique for speech emotion recognition tasks

OC Phukan, AB Buduru, R Sharma - arXiv preprint arXiv:2305.18640, 2023 - arxiv.org

Speech emotion recognition (SER) is a field that has drawn a lot of attention due to its
applications in diverse fields. A current trend in methods used for SER is to leverage …

被引用次数：8 相关文章所有 4 个版本

[PDF] arxiv.org

A comparative study of pre-trained speech and audio embeddings for speech emotion recognition

OC Phukan, AB Buduru, R Sharma - arXiv preprint arXiv:2304.11472, 2023 - arxiv.org

Pre-trained models (PTMs) have shown great promise in the speech and audio domain.
Embeddings leveraged from these models serve as inputs for learning algorithms with …

被引用次数：7 相关文章

[PDF] researchgate.net

[PDF][PDF] Open-Emotion: A Reproducible EMOSUPERB for Speech Emotion Recognition Systems

H Wu, HC Chou, KW Chang, L Goncalves… - 2024 IEEE Spoken …, 2024 - researchgate.net

Speech emotion recognition (SER) is an essential technology for human-computer
interaction systems. However, the previous study reveals that 80.77% of SER papers yield …

被引用次数：3 相关文章

[PDF] arxiv.org

Learning Arousal-Valence Representation from Categorical Emotion Labels of Speech

E Zhou, Y Zhang, Z Duan - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org

Dimensional representations of speech emotions such as the arousal-valence (AV)
representation provide a continuous and fine-grained description and control than their …

被引用次数：2 相关文章所有 4 个版本

Comparing hysteresis comparator and RMS threshold methods for automatic single cough segmentations

BT Atmaja, Zanjabila, Suyanto, A Sasou - International Journal of …, 2024 - Springer

Research on diagnosing diseases based on voice signals is rapidly increasing, including
cough-related diseases. When training the cough sound signals into deep learning models …

被引用次数：1 相关文章所有 2 个版本

Multilingual, Cross-lingual, and Monolingual Speech Emotion Recognition on EmoFilm Dataset

BT Atmaja, A Sasou - 2023 Asia Pacific Signal and Information …, 2023 - ieeexplore.ieee.org

Research on speech emotion recognition has been actively conducted; most are in
monolingual settings. Considering that emotion expressed in speech is universal, it is …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition

YC Lin, H Wu, HC Chou, CC Lee, H Lee - arXiv preprint arXiv:2406.05065, 2024 - arxiv.org

The rapid growth of Speech Emotion Recognition (SER) has diverse global applications,
from improving human-computer interactions to aiding mental health diagnostics. However …

被引用次数：2 相关文章所有 2 个版本