Prompting visual-language models for dynamic facial expression recognition
This paper presents a novel visual-language model called DFER-CLIP, which is based on
the CLIP model and designed for in-the-wild Dynamic Facial Expression Recognition …
the CLIP model and designed for in-the-wild Dynamic Facial Expression Recognition …
Improving fairness using vision-language driven image augmentation
Fairness is crucial when training a deep-learning discriminative model, especially in the
facial domain. Models tend to correlate specific characteristics (such as age and skin color) …
facial domain. Models tend to correlate specific characteristics (such as age and skin color) …
MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Generating human portraits is a hot topic in the image generation area, eg mask-to-face
generation and text-to-face generation. However, these unimodal generation methods lack …
generation and text-to-face generation. However, these unimodal generation methods lack …
Are CLIP features all you need for Universal Synthetic Image Origin Attribution?
The steady improvement of Diffusion Models for visual synthesis has given rise to many new
and interesting use cases of synthetic images but also has raised concerns about their …
and interesting use cases of synthetic images but also has raised concerns about their …
A Hitchhikers Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning
Explainability in artificial intelligence is crucial for restoring trust, particularly in areas like
face forgery detection, where viewers often struggle to distinguish between real and …
face forgery detection, where viewers often struggle to distinguish between real and …
Understanding the Limitations of Diffusion Concept Algebra Through Food
Image generation techniques, particularly latent diffusion models, have exploded in
popularity in recent years. Many techniques have been developed to manipulate and clarify …
popularity in recent years. Many techniques have been developed to manipulate and clarify …
Exploring CLIP for Real World, Text-based Image Retrieval
M Sultan, L Jacobs, A Stylianou… - 2023 IEEE Applied …, 2023 - ieeexplore.ieee.org
We consider the ability of CLIP features to support text-driven image retrieval. Traditional
image-based queries sometimes misalign with user intentions due to their focus on …
image-based queries sometimes misalign with user intentions due to their focus on …