IEMOCAP: Interactive emotional dyadic motion capture database

Y Wang, W Song, W Tao, A Liotta, D Yang, X Li, S Gao… - Information …, 2022 - Elsevier

Affective computing conjoins the research topics of emotion recognition and sentiment
analysis, and can be realized with unimodal or multimodal data, consisting primarily of …

Cited by 277 Related articles All 5 versions

[HTML] sciencedirect.com

Emotion recognition and artificial intelligence: A systematic review (2014–2023) and research recommendations

SK Khare, V Blanes-Vidal, ES Nadimi, UR Acharya - Information Fusion, 2023 - Elsevier

Emotion recognition is the ability to precisely infer human emotions from numerous sources
and modalities using questionnaires, physical signals, and physiological signals. Recently …

Cited by 68 Related articles All 7 versions

[PDF] arxiv.org

Dawn of the transformer era in speech emotion recognition: closing the valence gap

J Wagner, A Triantafyllopoulos… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Recent advances in transformer-based architectures have shown promise in several
machine learning tasks. In the audio domain, such architectures have been successfully …

Cited by 208 Related articles All 8 versions

[PDF] thecvf.com

MVImgNet: A large-scale dataset of multi-view images

X Yu, M Xu, Y Zhang, H Liu, C Ye… - Proceedings of the …, 2023 - openaccess.thecvf.com

Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of "learning from large-scale data" in computer vision …

Cited by 81 Related articles All 5 versions

[PDF] arxiv.org

SUPERB: Speech processing universal performance benchmark

S Yang, PH Chi, YS Chuang, CIJ Lai… - arXiv preprint arXiv …, 2021 - arxiv.org

Self-supervised learning (SSL) has proven vital for advancing research in natural language
processing (NLP) and computer vision (CV). The paradigm pretrains a shared model on …

Cited by 743 Related articles All 11 versions

An introduction to deep learning in natural language processing: Models, techniques, and tools

I Lauriola, A Lavelli, F Aiolli - Neurocomputing, 2022 - Elsevier

Natural Language Processing (NLP) is a branch of artificial intelligence that
involves the design and implementation of systems and algorithms able to interact through …

Cited by 381 Related articles All 4 versions

[PDF] arxiv.org

BEATs: Audio pre-training with acoustic tokenizers

S Chen, Y Wu, C Wang, S Liu, D Tompkins… - arXiv preprint arXiv …, 2022 - arxiv.org

The massive growth of self-supervised learning (SSL) has been witnessed in language,
vision, speech, and audio domains over the past few years. While discrete label prediction is …

Cited by 140 Related articles All 8 versions

[PDF] aaai.org

SSAST: Self-supervised audio spectrogram transformer

Y Gong, CI Lai, YA Chung, J Glass - … of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org

Recently, neural networks based purely on self-attention, such as the Vision Transformer
(ViT), have been shown to outperform deep learning models constructed with convolutional …

Cited by 241 Related articles All 11 versions

[PDF] arxiv.org

Emotion recognition from speech using wav2vec 2.0 embeddings

L Pepino, P Riera, L Ferrer - arXiv preprint arXiv:2104.03502, 2021 - arxiv.org

Emotion recognition datasets are relatively small, making the use of the more sophisticated
deep learning approaches challenging. In this work, we propose a transfer learning method …

Cited by 325 Related articles All 8 versions

A comprehensive survey on feature selection in the various fields of machine learning

P Dhal, C Azad - Applied Intelligence, 2022 - Springer

In Machine Learning (ML), Feature Selection (FS) plays a crucial part in reducing
data's dimensionality and enhancing any proposed framework's performance. However, in …

Cited by 242 Related articles All 3 versions