An overview of cross-media retrieval: Concepts, methodologies, benchmarks, and challenges

Y Peng, X Huang, Y Zhao - … on circuits and systems for video …, 2017 - ieeexplore.ieee.org
Multimedia retrieval plays an indispensable role in big data utilization. Past efforts mainly
focused on single-media retrieval. However, the requirements of users are highly flexible …

Challenges and opportunities: from big data to knowledge in AI 2.0

Y Zhuang, F Wu, C Chen, Y Pan - Frontiers of Information Technology & …, 2017 - Springer
In this paper, we review recent emerging theoretical and technological advances of artificial
intelligence (AI) in the big data settings. We conclude that integrating data-driven machine …

Cnn-rnn: A unified framework for multi-label image classification

J Wang, Y Yang, J Mao, Z Huang… - Proceedings of the …, 2016 - openaccess.thecvf.com
While deep convolutional neural networks (CNNs) have shown a great success in single-
label image classification, it is important to note that most real world images contain multiple …

Learning discriminative binary codes for large-scale cross-modal retrieval

X Xu, F Shen, Y Yang, HT Shen… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Hashing based methods have attracted considerable attention for efficient cross-modal
retrieval on large-scale multimedia data. The core problem of cross-modal hashing is how to …

Exploiting subspace relation in semantic labels for cross-modal hashing

HT Shen, L Liu, Y Yang, X Xu, Z Huang… - … on Knowledge and …, 2020 - ieeexplore.ieee.org
Hashing methods have been extensively applied to efficient multimedia data indexing and
retrieval on account of the explosion of multimedia data. Cross-modal hashing usually …

Large-scale cross-modality search via collective matrix factorization hashing

G Ding, Y Guo, J Zhou, Y Gao - IEEE Transactions on Image …, 2016 - ieeexplore.ieee.org
By transforming data into binary representation, ie, Hashing, we can perform high-speed
search with low storage cost, and thus, Hashing has collected increasing research interest in …

Modality-specific cross-modal similarity measurement with recurrent attention network

Y Peng, J Qi, Y Yuan - IEEE Transactions on Image Processing, 2018 - ieeexplore.ieee.org
Nowadays, cross-modal retrieval plays an important role to flexibly find useful information
across different modalities of data. Effectively measuring the similarity between different …

Topic-oriented image captioning based on order-embedding

N Yu, X Hu, B Song, J Yang… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
We present an image captioning framework that generates captions under a given topic. The
topic candidates are extracted from the caption corpus. A given image's topics are then …

Simple to complex cross-modal learning to rank

M Luo, X Chang, Z Li, L Nie, AG Hauptmann… - Computer Vision and …, 2017 - Elsevier
The heterogeneity-gap between different modalities brings a significant challenge to
multimedia information retrieval. Some studies formalize the cross-modal retrieval tasks as a …

A benchmark dataset and learning high-level semantic embeddings of multimedia for cross-media retrieval

SU Rehman, S Tu, Y Huang, OU Rehman - IEEE Access, 2018 - ieeexplore.ieee.org
The selection of semantic concepts for modal construction and data collection remains an
open research issue. It is highly demanding to choose good multimedia concepts with small …