Speech emotion recognition via multi-level attention network
K Liu, D Wang, D Wu, Y Liu… - IEEE Signal Processing …, 2022 - ieeexplore.ieee.org
Aiming to improve the performance of human speech emotion recognition (SER), the
existing work has made great progress based on the popular mel-scale frequency cepstral …
existing work has made great progress based on the popular mel-scale frequency cepstral …
Speech emotion recognition based on discriminative features extraction
K Liu, J Hu, Y Liu, J Feng - 2022 IEEE International conference …, 2022 - ieeexplore.ieee.org
In intelligent human-computer interaction systems, speech emotion recognition (SER) is a
fundamental task for understanding user intention. One vital challenge for emotion inferring …
fundamental task for understanding user intention. One vital challenge for emotion inferring …
[HTML][HTML] MFGCN: Multimodal fusion graph convolutional network for speech emotion recognition
X Qi, Y Wen, P Zhang, H Huang - Neurocomputing, 2025 - Elsevier
Speech emotion recognition (SER) is challenging owing to the complexity of emotional
representation. Hence, this article focuses on multimodal speech emotion recognition that …
representation. Hence, this article focuses on multimodal speech emotion recognition that …
CCTG-NET: Contextualized Convolutional Transformer-GRU Network for speech emotion recognition
Speech is a crucial aspect of human-to-human interactions and plays a fundamental role in
the advancement of human–computer interaction (HCI) systems. Developing an accurate …
the advancement of human–computer interaction (HCI) systems. Developing an accurate …
Speech emotion recognition based on low-level auto-extracted time-frequency features
K Liu, J Hu, J Feng - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Deep-learning based methods that aim to extract effective high-level features have steadily
improved the performance on the speech emotion recognition. However, low-level features …
improved the performance on the speech emotion recognition. However, low-level features …
Emohrnet: High-Resolution Neural Network Based Speech Emotion Recognition
A Muppidi, M Radfar - ICASSP 2024-2024 IEEE International …, 2024 - ieeexplore.ieee.org
Speech emotion recognition (SER) is pivotal for enhancing human-machine interactions.
This paper introduces" EmoHRNet", a novel adaptation of High-Resolution Networks …
This paper introduces" EmoHRNet", a novel adaptation of High-Resolution Networks …
[PDF][PDF] FTA-net: A Frequency and Time Attention Network for Speech Depression Detection
Depression is one of the most common mental diseases nowadays, which seriously affects
the health of individuals. Some researchers have shown an association between the level of …
the health of individuals. Some researchers have shown an association between the level of …
Accurate time-frequency estimation in sαs noise via memory-dependent derivative
P Huang, J Xiao, W Sun… - Advances in Mechanical …, 2023 - journals.sagepub.com
This letter presents a time-frequency estimation approach based on memory-dependent
derivative to obtain accurate spectrograph interpolation information. The memory correlation …
derivative to obtain accurate spectrograph interpolation information. The memory correlation …
Applying Image Classification Model to Spectrograms for Speech Emotion Recognition
Y Zhou, L Yang, J Mao - 2023 7th Asian Conference on …, 2023 - ieeexplore.ieee.org
Speech Emotion Recognition (SER) aims to enable machines to comprehend the subjective
emotions of individuals by analyzing their vocal expressions. The challenge of SER lies in …
emotions of individuals by analyzing their vocal expressions. The challenge of SER lies in …