Boosting few-shot action recognition with graph-guided hybrid matching
Class prototype construction and matching are core aspects of few-shot action recognition.
Previous methods mainly focus on designing spatiotemporal relation modeling modules or …
SOC: Semantic-assisted object cluster for referring video object segmentation
This paper studies referring video object segmentation (RVOS) by boosting video-level
visual-linguistic alignment. Recent approaches model the RVOS task as a sequence …
A Comprehensive Review of Few-shot Action Recognition
Few-shot action recognition aims to address the high cost and impracticality of manually
labeling complex and variable video data in action recognition. It requires accurately …
Consistency Prototype Module and Motion Compensation for few-shot action recognition (CLIP-CPM2C)
F Guo, YK Wang, H Qi, L Zhu, J Sun - Neurocomputing, 2025 - Elsevier
Recently, few-shot action recognition has progressed significantly, as it has learned the
feature discriminability and designed suitable comparison methods. Still, there are the …
Semantic-aware Video Representation for Few-shot Action Recognition
Recent work on action recognition leverages 3D features and textual information to achieve
state-of-the-art performance. However, most of the current few-shot action recognition …
Trajectory-aligned Space-time Tokens for Few-shot Action Recognition
P Kumar, N Padmanabhan, L Luo… - … on Computer Vision, 2025 - Springer
We propose a simple yet effective approach for few-shot action recognition, emphasizing the
disentanglement of motion and appearance representations. By harnessing recent progress …
Multi-view distillation based on multi-modal fusion for few-shot action recognition (CLIP-MDMF)
F Guo, YK Wang, H Qi, W Jin, L Zhu, J Sun - Knowledge-Based Systems, 2024 - Elsevier
In recent years, the field of few-shot action recognition (FSAR) has garnered significant
attention. Although many methods primarily rely on mono-modal data, there is a growing …
Fully Aligned Network for Referring Image Segmentation
Y Liu, R Xu, Y Tang - arXiv preprint arXiv:2409.19569, 2024 - arxiv.org
This paper focuses on the Referring Image Segmentation (RIS) task, which aims to segment
objects from an image based on a given language description. The critical problem of RIS is …
They Look Like Each Other: Case-based Reasoning for Explainable Depression Detection on Twitter using Large Language Models
Depression is a common mental health issue that requires prompt diagnosis and treatment.
Despite the promise of social media data for depression detection, the opacity of employed …
Few-Shot Relation Extraction with Hybrid Visual Evidence
J Gong, H Eldardiry - arXiv preprint arXiv:2403.00724, 2024 - arxiv.org
The goal of few-shot relation extraction is to predict relations between name entities in a
sentence when only a few labeled instances are available for training. Existing few-shot …