Temporal modeling matters: A novel temporal emotional modeling approach for speech emotion recognition

J Ye, XC Wen, Y Wei, Y Xu, K Liu… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Speech emotion recognition (SER) plays a vital role in improving the interactions between
humans and machines by inferring human emotion and affective states from speech signals …

Multiscale-multichannel feature extraction and classification through one-dimensional convolutional neural network for Speech emotion recognition

M Liu, ANJ Raj, V Rajangam, K Ma, Z Zhuang… - Speech …, 2024 - Elsevier
Speech emotion recognition (SER) is a crucial field of research in artificial intelligence and
human–computer interaction. Extracting effective speech features for emotion recognition is …

Metts: Multilingual emotional text-to-speech by cross-speaker and cross-lingual emotion transfer

X Zhu, Y Lei, T Li, Y Zhang, H Zhou… - … /ACM Transactions on …, 2024 - ieeexplore.ieee.org
Previous multilingual text-to-speech (TTS) approaches have considered leveraging
monolingual speaker data to enable cross-lingual speech synthesis. However, such data …

CENN: Capsule-enhanced neural network with innovative metrics for robust speech emotion recognition

H Zhang, H Huang, P Zhao, X Zhu, Z Yu - Knowledge-Based Systems, 2024 - Elsevier
Speech emotion recognition (SER) plays a pivotal role in enhancing Human-computer
interaction (HCI) systems. This paper introduces a groundbreaking Capsule-enhanced …

Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition

J Ye, Y Wei, XC Wen, C Ma, Z Huang, K Liu… - Proceedings of the 31st …, 2023 - dl.acm.org
Cross-corpus speech emotion recognition (SER) seeks to generalize the ability of inferring
speech emotion from a well-labeled corpus to an unlabeled one, which is a rather …

[HTML][HTML] Combining wav2vec 2.0 fine-tuning and ConLearnNet for speech emotion recognition

C Sun, Y Zhou, X Huang, J Yang, X Hou - Electronics, 2024 - mdpi.com
Speech emotion recognition poses challenges due to the varied expression of emotions
through intonation and speech rate. In order to reduce the loss of emotional information …

A Triplet Multimodel Transfer Learning Network for Speech Disorder Screening of Parkinson's Disease

A Zhao, N Wang, X Niu, M Chen… - International Journal of …, 2024 - Wiley Online Library
Deterioration in the quality of a person's voice and speech is an early sign of Parkinson's
disease (PD). Although a number of computer‐based methods have been invested to use …

Multimodal and Multitask Learning with Additive Angular Penalty Focus Loss for Speech Emotion Recognition

G Wen, S Ye, H Li, P Wen… - International Journal of …, 2023 - Wiley Online Library
Speech emotion recognition has lots of applications such as human‐computer interaction
and health management. The current methods are challenged with the problems of fuzzy …

What if we intervene?: Higher-order cross-lagged causal model with interventional approach under observational design

C Castro, K Michell, W Kristjanpoller… - Neural Computing and …, 2024 - Springer
Experimental design allows us to more accurately determine the causal relationship
between variables correlated over time as compared to observational design based on …

Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition

D Li, Y Zhang, Y Wang, K Funakoshi… - Journal of Natural …, 2024 - jstage.jst.go.jp
Speech emotion recognition (SER) has garnered increasing attention due to its wide range
of applications in various fields, including human-machine interaction, virtual assistants, and …