A systematic review on affective computing: Emotion models, databases, and recent advances
Affective computing conjoins the research topics of emotion recognition and sentiment
analysis, and can be realized with unimodal or multimodal data, consisting primarily of …
Emotion recognition and artificial intelligence: A systematic review (2014–2023) and research recommendations
Emotion recognition is the ability to precisely infer human emotions from numerous sources
and modalities using questionnaires, physical signals, and physiological signals. Recently …
EVA-CLIP: Improved training techniques for CLIP at scale
Contrastive language-image pre-training, CLIP for short, has gained increasing attention for
its potential in various scenarios. In this paper, we propose EVA-CLIP, a series of models …
Vision-language models for vision tasks: A survey
Most visual recognition studies rely heavily on crowd-labelled data in deep neural networks
(DNNs) training, and they usually train a DNN for each single visual recognition task …
EVA-02: A visual representation for neon genesis
We launch EVA-02, a next-generation Transformer-based visual representation pre-trained
to reconstruct strong and robust language-aligned vision features via masked image …
SLIP: Self-supervision meets language-image pre-training
Recent work has shown that self-supervised pre-training leads to improvements over
supervised learning on challenging visual recognition tasks. CLIP, an exciting new …
InternVL: Scaling up vision foundation models and aligning for generic visual-linguistic tasks
The exponential growth of large language models (LLMs) has opened up numerous
possibilities for multi-modal AGI systems. However, the progress in vision and vision …
Learning transferable visual models from natural language supervision
State-of-the-art computer vision systems are trained to predict a fixed set of predetermined
object categories. This restricted form of supervision limits their generality and usability since …
Federated learning for secure IoMT-applications in smart healthcare systems: A comprehensive review
Recent developments in the Internet of Things (IoT) and various communication
technologies have reshaped numerous application areas. Nowadays, IoT is assimilated into …
Learn from all: Erasing attention consistency for noisy label facial expression recognition
Noisy label Facial Expression Recognition (FER) is more challenging than
traditional noisy label classification tasks due to the inter-class similarity and the annotation …