Ternary adversarial networks with self-supervision for zero-shot cross-modal retrieval

P Kaur, HS Pannu, AK Malhi - Computer Science Review, 2021 - Elsevier

Human beings experience life through a spectrum of modes such as vision, taste, hearing,
smell, and touch. These multiple modes are integrated for information processing in our …

被引用次数：79 相关文章所有 3 个版本

[PDF] aaai.org

Simple unsupervised graph representation learning

Y Mo, L Peng, J Xu, X Shi, X Zhu - … of the AAAI conference on artificial …, 2022 - ojs.aaai.org

In this paper, we propose a simple unsupervised graph representation learning method to
conduct effective and efficient contrastive learning. Specifically, the proposed multiplet loss …

被引用次数：106 相关文章所有 5 个版本

Deep fuzzy hashing network for efficient image retrieval

H Lu, M Zhang, X Xu, Y Li… - IEEE transactions on fuzzy …, 2020 - ieeexplore.ieee.org

Hashing methods for efficient image retrieval aim at learning hash functions that map similar
images to semantically correlated binary codes in the Hamming space with similarity well …

被引用次数：298 相关文章所有 2 个版本

Cross-modal attention with semantic consistence for image–text matching

X Xu, T Wang, Y Yang, L Zuo, F Shen… - IEEE transactions on …, 2020 - ieeexplore.ieee.org

The task of image-text matching refers to measuring the visual-semantic similarity between
an image and a sentence. Recently, the fine-grained matching methods that explore the …

被引用次数：217 相关文章所有 3 个版本

[PDF] arxiv.org

Survey on deep multi-modal data analytics: Collaboration, rivalry, and fusion

Y Wang - ACM Transactions on Multimedia Computing …, 2021 - dl.acm.org

With the development of web technology, multi-modal or multi-view data has surged as a
major stream for big data, where each modal/view encodes individual property of data …

被引用次数：190 相关文章所有 3 个版本

Exploiting subspace relation in semantic labels for cross-modal hashing

HT Shen, L Liu, Y Yang, X Xu, Z Huang… - … on Knowledge and …, 2020 - ieeexplore.ieee.org

Hashing methods have been extensively applied to efficient multimedia data indexing and
retrieval on account of the explosion of multimedia data. Cross-modal hashing usually …

被引用次数：185 相关文章所有 3 个版本

[PDF] aaai.org

Deep incomplete multi-view clustering via mining cluster complementarity

J Xu, C Li, Y Ren, L Peng, Y Mo, X Shi… - Proceedings of the AAAI …, 2022 - ojs.aaai.org

Incomplete multi-view clustering (IMVC) is an important unsupervised approach to group the
multi-view data containing missing data in some views. Previous IMVC methods suffer from …

被引用次数：56 相关文章所有 4 个版本

[PDF] thecvf.com

Deep incomplete multi-view clustering with cross-view partial sample and prototype alignment

J Jin, S Wang, Z Dong, X Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

The success of existing multi-view clustering relies on the assumption of sample integrity
across multiple views. However, in real-world scenarios, samples of multi-view are partially …

被引用次数：26 相关文章所有 5 个版本

[PDF] arxiv.org

Cross-modal retrieval: a systematic review of methods and future directions

L Zhu, T Wang, F Li, J Li, Z Zhang, HT Shen - arXiv preprint arXiv …, 2023 - arxiv.org

With the exponential surge in diverse multi-modal data, traditional uni-modal retrieval
methods struggle to meet the needs of users demanding access to data from various …

被引用次数：6 相关文章所有 3 个版本

SCCGAN: style and characters inpainting based on CGAN

R Liu, X Wang, H Lu, Z Wu, Q Fan, S Li… - Mobile networks and …, 2021 - Springer

With the development of deep learning technology, many deep learning methods have been
applied to font recognition and generation. However, few studies focus on font inpainting …

被引用次数：89 相关文章所有 3 个版本