Towards unified text-based person retrieval: A large-scale multi-attribute and language search...

Z Zheng, Y Wang, X Qian, Z Zhong, Z Wang… - Proceedings of the 2024 …, 2024 - dl.acm.org

Object re-identification (or object re-id) has gained significant attention in recent years,
fueled by the increasing demand for advanced video analysis and safety systems. In object …

被引用次数：5 相关文章

[PDF] acm.org

Pipa: Pixel-and patch-wise self-supervised learning for domain adaptative semantic segmentation

M Chen, Z Zheng, Y Yang, TS Chua - Proceedings of the 31st ACM …, 2023 - dl.acm.org

Unsupervised Domain Adaptation (UDA) aims to enhance the generalization of the learned
model to other domains. The domain-invariant knowledge is transferred from the model …

被引用次数：41 相关文章所有 5 个版本

An Overview of Text-based Person Search: Recent Advances and Future Directions

K Niu, Y Liu, Y Long, Y Huang, L Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Due to the practical significance in smart video surveillance systems, Text-Based Person
Search (TBPS) has been one of the research hotspots recently, which refers to searching for …

被引用次数：2 相关文章

[PDF] thecvf.com

Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID

W Tan, C Ding, J Jiang, F Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-to-image person re-identification (ReID) retrieves pedestrian images according to
textual descriptions. Manually annotating textual descriptions is time-consuming restricting …

被引用次数：3 相关文章所有 3 个版本

[PDF] aaai.org

Adaptive uncertainty-based learning for text-based person retrieval

S Li, C He, X Xu, F Shen, Y Yang… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

Text-based person retrieval aims at retrieving a specific pedestrian image from a gallery
based on textual descriptions. The primary challenge is how to overcome the inherent …

被引用次数：6 相关文章

[PDF] thecvf.com

Distilling CLIP with Dual Guidance for Learning Discriminative Human Body Shape Representation

F Liu, M Kim, Z Ren, X Liu - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Abstract Person Re-Identification (ReID) holds critical importance in computer vision with
pivotal applications in public safety and crime prevention. Traditional ReID methods reliant …

被引用次数：2 相关文章所有 3 个版本

[PDF] thecvf.com

All in One Framework for Multimodal Re-identification in the Wild

H Li, M Ye, M Zhang, B Du - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com

Abstract In Re-identification (ReID) recent advancements yield noteworthy progress in both
unimodal and cross-modal retrieval tasks. However the challenge persists in developing a …

被引用次数：1 相关文章所有 3 个版本

[PDF] researchgate.net

Deep multimodal learning for information retrieval

W Ji, Y Wei, Z Zheng, H Fei, T Chua - Proceedings of the 31st ACM …, 2023 - dl.acm.org

Information retrieval (IR) is a fundamental technique that aims to acquire information from a
collection of documents, web pages, or other sources. While traditional text-based IR has …

被引用次数：6 相关文章所有 3 个版本

[PDF] arxiv.org

Jointly harnessing prior structures and temporal consistency for sign language video generation

Y Suo, Z Zheng, X Wang, B Zhang, Y Yang - ACM Transactions on …, 2024 - dl.acm.org

Sign language provides a way for differently-abled individuals to express their feelings and
emotions. However, learning sign language can be challenging and time consuming. An …

被引用次数：11 相关文章所有 3 个版本

[PDF] arxiv.org

Sequencepar: Understanding pedestrian attributes via a sequence generation paradigm

J Jin, X Wang, C Li, L Huang, J Tang - arXiv preprint arXiv:2312.01640, 2023 - arxiv.org

Current pedestrian attribute recognition (PAR) algorithms are developed based on multi-
label or multi-task learning frameworks, which aim to discriminate the attributes using …

被引用次数：5 相关文章所有 2 个版本