Speech emotion recognition via multi-level attention network

K Liu, D Wang, D Wu, Y Liu… - IEEE Signal Processing …, 2022 - ieeexplore.ieee.org
Aiming to improve the performance of human speech emotion recognition (SER), the
existing work has made great progress based on the popular mel-scale frequency cepstral …

Speech emotion recognition based on discriminative features extraction

K Liu, J Hu, Y Liu, J Feng - 2022 IEEE International conference …, 2022 - ieeexplore.ieee.org
In intelligent human-computer interaction systems, speech emotion recognition (SER) is a
fundamental task for understanding user intention. One vital challenge for emotion inferring …

[HTML][HTML] MFGCN: Multimodal fusion graph convolutional network for speech emotion recognition

X Qi, Y Wen, P Zhang, H Huang - Neurocomputing, 2025 - Elsevier
Speech emotion recognition (SER) is challenging owing to the complexity of emotional
representation. Hence, this article focuses on multimodal speech emotion recognition that …

CCTG-NET: Contextualized Convolutional Transformer-GRU Network for speech emotion recognition

M Tellai, Q Mao - International Journal of Speech Technology, 2023 - Springer
Speech is a crucial aspect of human-to-human interactions and plays a fundamental role in
the advancement of human–computer interaction (HCI) systems. Developing an accurate …

Speech emotion recognition based on low-level auto-extracted time-frequency features

K Liu, J Hu, J Feng - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Deep-learning based methods that aim to extract effective high-level features have steadily
improved the performance on the speech emotion recognition. However, low-level features …

Emohrnet: High-Resolution Neural Network Based Speech Emotion Recognition

A Muppidi, M Radfar - ICASSP 2024-2024 IEEE International …, 2024 - ieeexplore.ieee.org
Speech emotion recognition (SER) is pivotal for enhancing human-machine interactions.
This paper introduces" EmoHRNet", a novel adaptation of High-Resolution Networks …

[PDF][PDF] FTA-net: A Frequency and Time Attention Network for Speech Depression Detection

Q Li, D Wang, Y Ren, Y Gao, Y Li - Conference of the International …, 2023 - isca-archive.org
Depression is one of the most common mental diseases nowadays, which seriously affects
the health of individuals. Some researchers have shown an association between the level of …

Accurate time-frequency estimation in sαs noise via memory-dependent derivative

P Huang, J Xiao, W Sun… - Advances in Mechanical …, 2023 - journals.sagepub.com
This letter presents a time-frequency estimation approach based on memory-dependent
derivative to obtain accurate spectrograph interpolation information. The memory correlation …

Applying Image Classification Model to Spectrograms for Speech Emotion Recognition

Y Zhou, L Yang, J Mao - 2023 7th Asian Conference on …, 2023 - ieeexplore.ieee.org
Speech Emotion Recognition (SER) aims to enable machines to comprehend the subjective
emotions of individuals by analyzing their vocal expressions. The challenge of SER lies in …