End-to-end modeling and transfer learning for audiovisual emotion recognition in-the-wild

D Ryumin, D Ivanko, E Ryumina - Sensors, 2023 - mdpi.com

Audio-visual speech recognition (AVSR) is one of the most promising solutions for reliable
speech recognition, particularly when audio is corrupted by noise. Additional visual …

被引用次数：47 相关文章所有 9 个版本

In search of a robust facial expressions recognition model: A large-scale visual cross-corpus study

E Ryumina, D Dresvyanskiy, A Karpov - Neurocomputing, 2022 - Elsevier

Many researchers have been seeking robust emotion recognition system for already last two
decades. It would advance computer systems to a new level of interaction, providing much …

被引用次数：45 相关文章所有 4 个版本

[PDF] mdpi.com

Multimodal emotion detection via attention-based fusion of extracted facial and speech features

D Mamieva, AB Abdusalomov, A Kutlimuratov… - Sensors, 2023 - mdpi.com

Methods for detecting emotions that employ many modalities at the same time have been
found to be more accurate and resilient than those that rely on a single sense. This is due to …

被引用次数：26 相关文章所有 8 个版本

[PDF] mdpi.com

Advances in Facial Expression Recognition: A Survey of Methods, Benchmarks, Models, and Datasets

T Kopalidis, V Solachidis, N Vretos, P Daras - Information, 2024 - mdpi.com

Recent technological developments have enabled computers to identify and categorize
facial expressions to determine a person's emotional state in an image or a video. This …

被引用次数：1 相关文章所有 2 个版本

Facial Emotion Recognition in-the-Wild Using Deep Neural Networks: A Comprehensive Review

H Boughanem, H Ghazouani, W Barhoumi - SN Computer Science, 2023 - Springer

Facial expressions are a crucial aspect of human communication that provide information
about emotions, intentions, interactions, and social relationships. They are a universal signal …

被引用次数：4 相关文章所有 2 个版本

[PDF] thecvf.com

Multi-modal Arousal and Valence Estimation under Noisy Conditions

D Dresvyanskiy, M Markitantov, J Yu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Automatic emotion recognition has gained significant attention over the past two decades
due to the central role that emotions play in human communication. While multi-modal …

被引用次数：1 相关文章

[PDF] mdpi.com

Comparing approaches for explaining DNN-based facial expression classifications

K ter Burg, H Kaya - Algorithms, 2022 - mdpi.com

Classifying facial expressions is a vital part of developing systems capable of aptly
interacting with users. In this field, the use of deep-learning models has become the …

被引用次数：10 相关文章所有 5 个版本

[PDF] mdpi.com

Multi-corpus learning for audio–visual emotions and sentiment recognition

E Ryumina, M Markitantov, A Karpov - Mathematics, 2023 - mdpi.com

Recognition of emotions and sentiment (affective states) from human audio–visual
information is widely used in healthcare, education, entertainment, and other fields; …

被引用次数：3 相关文章所有 6 个版本

[PDF] arxiv.org

SUN Team's Contribution to ABAW 2024 Competition: Audio-visual Valence-Arousal Estimation and Expression Recognition

D Dresvyanskiy, M Markitantov, J Yu, P Li… - arXiv preprint arXiv …, 2024 - arxiv.org

As emotions play a central role in human communication, automatic emotion recognition has
attracted increasing attention in the last two decades. While multimodal systems enjoy high …

被引用次数：4 相关文章所有 2 个版本

[PDF] isca-archive.org

[PDF][PDF] Biometric Russian Audio-Visual Extended MASKS (BRAVE-MASKS) Corpus: Multimodal Mask Type Recognition Task.

M Markitantov, E Ryumina, D Ryumin, A Karpov - INTERSPEECH, 2022 - isca-archive.org

In this paper, we present a new multimodal corpus called Biometric Russian Audio-Visual
Extended MASKS (BRAVEMASKS), which is designed to analyze voice and facial …

被引用次数：10 相关文章所有 4 个版本