MORE'24 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities

Z Zheng, Y Wang, X Qian, Z Zhong, Z Wang… - Proceedings of the 2024 …, 2024 - dl.acm.org
Object re-identification (or object re-id) has gained significant attention in recent years,
fueled by the increasing demand for advanced video analysis and safety systems. In object …

PiPa: Pixel- and patch-wise self-supervised learning for domain adaptative semantic segmentation

M Chen, Z Zheng, Y Yang, TS Chua - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Unsupervised Domain Adaptation (UDA) aims to enhance the generalization of the learned
model to other domains. The domain-invariant knowledge is transferred from the model …

An Overview of Text-based Person Search: Recent Advances and Future Directions

K Niu, Y Liu, Y Long, Y Huang, L Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Due to the practical significance in smart video surveillance systems, Text-Based Person
Search (TBPS) has been one of the research hotspots recently, which refers to searching for …

Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID

W Tan, C Ding, J Jiang, F Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-to-image person re-identification (ReID) retrieves pedestrian images according to
textual descriptions. Manually annotating textual descriptions is time-consuming, restricting …

Adaptive uncertainty-based learning for text-based person retrieval

S Li, C He, X Xu, F Shen, Y Yang… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Text-based person retrieval aims at retrieving a specific pedestrian image from a gallery
based on textual descriptions. The primary challenge is how to overcome the inherent …

Distilling CLIP with Dual Guidance for Learning Discriminative Human Body Shape Representation

F Liu, M Kim, Z Ren, X Liu - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Person Re-Identification (ReID) holds critical importance in computer vision with
pivotal applications in public safety and crime prevention. Traditional ReID methods reliant …

All in One Framework for Multimodal Re-identification in the Wild

H Li, M Ye, M Zhang, B Du - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
In Re-identification (ReID), recent advancements yield noteworthy progress in both
unimodal and cross-modal retrieval tasks. However, the challenge persists in developing a …

Deep multimodal learning for information retrieval

W Ji, Y Wei, Z Zheng, H Fei, T Chua - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Information retrieval (IR) is a fundamental technique that aims to acquire information from a
collection of documents, web pages, or other sources. While traditional text-based IR has …

Jointly harnessing prior structures and temporal consistency for sign language video generation

Y Suo, Z Zheng, X Wang, B Zhang, Y Yang - ACM Transactions on …, 2024 - dl.acm.org
Sign language provides a way for differently-abled individuals to express their feelings and
emotions. However, learning sign language can be challenging and time consuming. An …

SequencePAR: Understanding pedestrian attributes via a sequence generation paradigm

J Jin, X Wang, C Li, L Huang, J Tang - arXiv preprint arXiv:2312.01640, 2023 - arxiv.org
Current pedestrian attribute recognition (PAR) algorithms are developed based on multi-label
or multi-task learning frameworks, which aim to discriminate the attributes using …