TIAR: Text-Image-Audio Retrieval with weighted multimodal re-ranking

P Chi, Y Feng, M Zhou, X Xiong, Y Wang, B Qiang - Applied Intelligence, 2023 - Springer
Cross-modal retrieval has developed remarkably recently and received extensive attention
as an essential method for multimodal interaction study. However, most existing models are …

Multi-scale motivated neural network for image-text matching

X Qin, L Li, G Pang - Multimedia Tools and Applications, 2024 - Springer
Existing mainstream image-text matching methods usually measure the relevance of image-
text pairs by capturing and aggregating the affinities between textual words and visual …

A novel individual-relational consistency for bad semi-supervised generative adversarial networks (IRC-BSGAN) in image classification and synthesis

MS Iraji, J Tanha, MA Balafar, MR Feizi-Derakhshi - Applied Intelligence, 2024 - Springer
Semi-supervised learning leverages both labeled and unlabeled images for model training,
addressing the scarcity of labeled data. However, challenges persist, including the …

Semi-supervised Text-based Person Search

D Gao, Y Bai, M Cao, H Dou, M Ye, M Zhang - arXiv preprint arXiv …, 2024 - arxiv.org
Text-based person search (TBPS) aims to retrieve images of a specific person from a large
image gallery based on a natural language description. Existing methods rely on massive …

Kernel-based sparse representation learning with global and local low-rank label constraint

L Teng, F Tang, Z Zheng, P Kang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Due to the large-scale and multiscale natures of social media data, sparse representation
(SR) learning methods are widely followed. However, there are three problems associated …

Dual discriminant adversarial cross-modal retrieval

P He, M Wang, D Tu, Z Wang - Applied Intelligence, 2023 - Springer
In order to improve the accuracy of cross-modal retrieval tasks and achieve flexible retrieval
between different modalities, we propose a Dual Discriminant Adversarial cross-modal …

Unsupervised Deep Relative Neighbor Relationship Preserving Cross-Modal Hashing

X Yang, Z Wang, N Wu, G Li, C Feng, P Liu - Mathematics, 2022 - mdpi.com
The image-text cross-modal retrieval task, which aims to retrieve the relevant image from text
and vice versa, is now attracting widespread attention. To quickly respond to the large-scale …

Semantic preserving asymmetric discrete hashing for cross-modal retrieval

F Yang, Q Zhang, X Ding, F Ma, J Cao, D Tong - Applied Intelligence, 2023 - Springer
In recent years, hashing technologies have garnered substantial attention and achieved
notable results due to their low storage costs and excellent retrieval efficiency. However, the …

[PDF][PDF] 面向视角非对齐数据的多视角聚类方法.

李骜, 冯聪, 牛宇童, 徐士彪, 张英涛… - Journal on …, 2022 - researchgate.net
如何在视角对齐关系错位时有效进行非对齐多视角学习是一类新的挑战性问题. 针对这一问题,
提出面向视角非对齐数据的多视角学习方法. 一方面, 为了捕获多视角异构特征的跨视角相似度 …