Cross-modal retrieval: a systematic review of methods and future directions

T Wang, F Li, L Zhu, J Li, Z Zhang, HT Shen - arXiv preprint arXiv …, 2023 - arxiv.org
With the exponential surge in diverse multi-modal data, traditional uni-modal retrieval
methods struggle to meet the needs of users seeking access to data across various …

Multi-modal hashing for efficient multimedia retrieval: A survey

L Zhu, C Zheng, W Guan, J Li, Y Yang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the explosive growth of multimedia contents, multimedia retrieval is facing
unprecedented challenges on both storage cost and retrieval speed. Hashing technique can …

Structured multi-modal feature embedding and alignment for image-sentence retrieval

X Ge, F Chen, JM Jose, Z Ji, Z Wu, X Liu - Proceedings of the 29th ACM …, 2021 - dl.acm.org
The current state-of-the-art image-sentence retrieval methods implicitly align the visual-
textual fragments, like regions in images and words in sentences, and adopt attention …

Deep adaptively-enhanced hashing with discriminative similarity guidance for unsupervised cross-modal retrieval

Y Shi, Y Zhao, X Liu, F Zheng, W Ou… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Cross-modal hashing that leverages hash functions to project high-dimensional data from
different modalities into the compact common hamming space, has shown immeasurable …

Teacher-student learning: Efficient hierarchical message aggregation hashing for cross-modal retrieval

W Tan, L Zhu, J Li, H Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Inspired by the powerful representation capability of deep neural networks, deep cross-
modal hashing methods have recently drawn much attention and various deep cross-modal …

Adaptive partial multi-view hashing for efficient social image retrieval

C Zheng, L Zhu, Z Cheng, J Li… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Social networks allow users to actively upload images and descriptive tags, which has led to
an explosive growth in the number of social images. Multi-view hashing is an efficient …

Supervised hierarchical deep hashing for cross-modal retrieval

YW Zhan, X Luo, Y Wang, XS Xu - Proceedings of the 28th ACM …, 2020 - dl.acm.org
Cross-modal hashing has attracted much attention in the large-scale multimedia search
area. In many real applications, labels of samples have hierarchical structure which also …

Deep relation embedding for cross-modal retrieval

Y Zhang, W Zhou, M Wang, Q Tian… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Cross-modal retrieval aims to identify relevant data across different modalities. In this work,
we are dedicated to cross-modal retrieval between images and text sentences, which is …

Partial multi-modal hashing via neighbor-aware completion learning

W Tan, L Zhu, J Li, Z Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Multi-modal hashing technology can support large-scale multimedia retrieval well, because
of its fast query speed and low storage consumption. Although many multi-modal hashing …

Graph convolutional incomplete multi-modal hashing

X Shen, Y Chen, S Pan, W Liu, Y Zheng - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Multi-modal hashing (MMH) encodes multi-modal data into latent hash code, and has been
widely applied for efficient large-scale multi-modal retrieval. In practice it is common that …