Deep learning-based multimodal emotion recognition from audio, visual, and text modalities: A systematic review of recent advancements and future prospects

S Zhang, Y Yang, C Chen, X Zhang, Q Leng… - Expert Systems with …, 2024 - Elsevier
Emotion recognition has recently attracted extensive interest due to its significant
applications to human–computer interaction. The expression of human emotion depends on …

Artificial intelligence of things for smarter healthcare: A survey of advancements, challenges, and opportunities

S Baker, W Xiang - IEEE Communications Surveys & Tutorials, 2023 - ieeexplore.ieee.org
Healthcare systems are under increasing strain due to a myriad of factors, from a steadily
ageing global population to the current COVID-19 pandemic. In a world where we have …

Conversational emotion recognition studies based on graph convolutional neural networks and a dependent syntactic analysis

Y Shou, T Meng, W Ai, S Yang, K Li - Neurocomputing, 2022 - Elsevier
Abstract Multimodal Emotion Recognition for Conversation (ERC) is a challenging multi-
class classification task that requires recognizing multiple speakers' emotions in text, audio …

Transformers in speech processing: A survey

S Latif, A Zaidi, H Cuayahuitl, F Shamshad… - arXiv preprint arXiv …, 2023 - arxiv.org
The remarkable success of transformers in the field of natural language processing has
sparked the interest of the speech-processing community, leading to an exploration of their …

Revisiting disentanglement and fusion on modality and context in conversational multimodal emotion recognition

B Li, H Fei, L Liao, Y Zhao, C Teng, TS Chua… - Proceedings of the 31st …, 2023 - dl.acm.org
It has been a hot research topic to enable machines to understand human emotions in
multimodal contexts under dialogue scenarios, which is tasked with multimodal emotion …

Mer 2023: Multi-label learning, modality robustness, and semi-supervised learning

Z Lian, H Sun, L Sun, K Chen, M Xu, K Wang… - Proceedings of the 31st …, 2023 - dl.acm.org
The first Multimodal Emotion Recognition Challenge (MER 2023) 1 was successfully held at
ACM Multimedia. The challenge focuses on system robustness and consists of three distinct …

Transformer-based multimodal feature enhancement networks for multimodal depression detection integrating video, audio and remote photoplethysmograph signals

H Fan, X Zhang, Y Xu, J Fang, S Zhang, X Zhao, J Yu - Information Fusion, 2024 - Elsevier
Depression stands as one of the most widespread psychological disorders and has
garnered increasing attention. Currently, how to effectively achieve automatic multimodal …

Robust emotion recognition in context debiasing

D Yang, K Yang, M Li, S Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Context-aware emotion recognition (CAER) has recently boosted the practical applications
of affective computing techniques in unconstrained environments. Mainstream CAER …

Robust multimodal emotion recognition from conversation with transformer-based crossmodality fusion

B Xie, M Sidulova, CH Park - Sensors, 2021 - mdpi.com
Decades of scientific research have been conducted on developing and evaluating methods
for automated emotion recognition. With exponentially growing technology, there is a wide …

DA-Net: Dual-attention network for multivariate time series classification

R Chen, X Yan, S Wang, G Xiao - Information Sciences, 2022 - Elsevier
Multivariate time series classification is one of the increasingly important issues in machine
learning. Existing methods focus on establishing the global long-range dependencies or …