Comparative analysis on cross-modal information retrieval: A review

P Kaur, HS Pannu, AK Malhi - Computer Science Review, 2021 - Elsevier
Human beings experience life through a spectrum of modes such as vision, taste, hearing,
smell, and touch. These multiple modes are integrated for information processing in our …

Simple unsupervised graph representation learning

Y Mo, L Peng, J Xu, X Shi, X Zhu - … of the AAAI conference on artificial …, 2022 - ojs.aaai.org
In this paper, we propose a simple unsupervised graph representation learning method to
conduct effective and efficient contrastive learning. Specifically, the proposed multiplet loss …

Deep fuzzy hashing network for efficient image retrieval

H Lu, M Zhang, X Xu, Y Li… - IEEE transactions on fuzzy …, 2020 - ieeexplore.ieee.org
Hashing methods for efficient image retrieval aim at learning hash functions that map similar
images to semantically correlated binary codes in the Hamming space with similarity well …

Cross-modal attention with semantic consistence for image–text matching

X Xu, T Wang, Y Yang, L Zuo, F Shen… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
The task of image-text matching refers to measuring the visual-semantic similarity between
an image and a sentence. Recently, the fine-grained matching methods that explore the …

Survey on deep multi-modal data analytics: Collaboration, rivalry, and fusion

Y Wang - ACM Transactions on Multimedia Computing …, 2021 - dl.acm.org
With the development of web technology, multi-modal or multi-view data has surged as a
major stream for big data, where each modal/view encodes individual property of data …

Exploiting subspace relation in semantic labels for cross-modal hashing

HT Shen, L Liu, Y Yang, X Xu, Z Huang… - … on Knowledge and …, 2020 - ieeexplore.ieee.org
Hashing methods have been extensively applied to efficient multimedia data indexing and
retrieval on account of the explosion of multimedia data. Cross-modal hashing usually …

Deep incomplete multi-view clustering via mining cluster complementarity

J Xu, C Li, Y Ren, L Peng, Y Mo, X Shi… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Incomplete multi-view clustering (IMVC) is an important unsupervised approach to group the
multi-view data containing missing data in some views. Previous IMVC methods suffer from …

Deep incomplete multi-view clustering with cross-view partial sample and prototype alignment

J Jin, S Wang, Z Dong, X Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The success of existing multi-view clustering relies on the assumption of sample integrity
across multiple views. However, in real-world scenarios, samples of multi-view are partially …

Cross-modal retrieval: a systematic review of methods and future directions

L Zhu, T Wang, F Li, J Li, Z Zhang, HT Shen - arXiv preprint arXiv …, 2023 - arxiv.org
With the exponential surge in diverse multi-modal data, traditional uni-modal retrieval
methods struggle to meet the needs of users demanding access to data from various …

SCCGAN: style and characters inpainting based on CGAN

R Liu, X Wang, H Lu, Z Wu, Q Fan, S Li… - Mobile networks and …, 2021 - Springer
With the development of deep learning technology, many deep learning methods have been
applied to font recognition and generation. However, few studies focus on font inpainting …