MORE'24 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities
Object re-identification (or object re-id) has gained significant attention in recent years,
fueled by the increasing demand for advanced video analysis and safety systems. In object …
PiPa: Pixel- and patch-wise self-supervised learning for domain adaptative semantic segmentation
Unsupervised Domain Adaptation (UDA) aims to enhance the generalization of the learned
model to other domains. The domain-invariant knowledge is transferred from the model …
An Overview of Text-based Person Search: Recent Advances and Future Directions
K Niu, Y Liu, Y Long, Y Huang, L Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Due to the practical significance in smart video surveillance systems, Text-Based Person
Search (TBPS) has been one of the research hotspots recently, which refers to searching for …
Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID
W Tan, C Ding, J Jiang, F Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-to-image person re-identification (ReID) retrieves pedestrian images according to
textual descriptions. Manually annotating textual descriptions is time-consuming, restricting …
Adaptive uncertainty-based learning for text-based person retrieval
Text-based person retrieval aims at retrieving a specific pedestrian image from a gallery
based on textual descriptions. The primary challenge is how to overcome the inherent …
Distilling CLIP with Dual Guidance for Learning Discriminative Human Body Shape Representation
Person Re-Identification (ReID) holds critical importance in computer vision with
pivotal applications in public safety and crime prevention. Traditional ReID methods reliant …
All in One Framework for Multimodal Re-identification in the Wild
In Re-identification (ReID), recent advancements yield noteworthy progress in both
unimodal and cross-modal retrieval tasks. However, the challenge persists in developing a …
Deep multimodal learning for information retrieval
Information retrieval (IR) is a fundamental technique that aims to acquire information from a
collection of documents, web pages, or other sources. While traditional text-based IR has …
Jointly harnessing prior structures and temporal consistency for sign language video generation
Sign language provides a way for differently-abled individuals to express their feelings and
emotions. However, learning sign language can be challenging and time consuming. An …
SequencePAR: Understanding pedestrian attributes via a sequence generation paradigm
Current pedestrian attribute recognition (PAR) algorithms are developed based on multi-
label or multi-task learning frameworks, which aim to discriminate the attributes using …