Multimodal co-learning: Challenges, applications with datasets, recent advances and future directions
Multimodal deep learning systems that employ multiple modalities like text, image, audio,
video, etc., are showing better performance than individual modalities (ie, unimodal) …
video, etc., are showing better performance than individual modalities (ie, unimodal) …
A systematic literature review on multimodal machine learning: Applications, challenges, gaps and future directions
Multimodal machine learning (MML) is a tempting multidisciplinary research area where
heterogeneous data from multiple modalities and machine learning (ML) are combined to …
heterogeneous data from multiple modalities and machine learning (ML) are combined to …
Hierarchical consensus hashing for cross-modal retrieval
Cross-modal hashing (CMH) has gained much attention due to its effectiveness and
efficiency in facilitating efficient retrieval between different modalities. Whereas, most …
efficiency in facilitating efficient retrieval between different modalities. Whereas, most …
Proactive privacy-preserving learning for cross-modal retrieval
Deep cross-modal retrieval techniques have recently achieved remarkable performance,
which also poses severe threats to data privacy potentially. Nowadays, enormous user …
which also poses severe threats to data privacy potentially. Nowadays, enormous user …
Less is better: Exponential loss for cross-modal matching
Deep metric learning has become a key component of cross-modal retrieval. By learning to
pull the features of matched instances closer while pushing the features of mismatched …
pull the features of matched instances closer while pushing the features of mismatched …
Geometric Correspondence-Based Multimodal Learning for Ophthalmic Image Analysis
Color fundus photography (CFP) and Optical coherence tomography (OCT) images are two
of the most widely used modalities in the clinical diagnosis and management of retinal …
of the most widely used modalities in the clinical diagnosis and management of retinal …
Cross-Modality Knowledge Calibration Network for Video Corpus Moment Retrieval
Video corpus moment retrieval has become a hot topic recently, which aims to localize a
consequent video moments highly relevant to the given query language description from …
consequent video moments highly relevant to the given query language description from …
EDMH: Efficient discrete matrix factorization hashing for multi-modal similarity retrieval
Hashing has been an emerging topic and has recently attracted widespread attention in
multi-modal similarity search applications. However, most existing approaches rely on …
multi-modal similarity search applications. However, most existing approaches rely on …
CrowdTransfer: Enabling Crowd Knowledge Transfer in AIoT Community
Artificial Intelligence of Things (AIoT) is an emerging frontier based on the deep fusion of
Internet of Things (IoT) and Artificial Intelligence (AI) technologies. The fundamental goal of …
Internet of Things (IoT) and Artificial Intelligence (AI) technologies. The fundamental goal of …
Deep supervised dual cycle adversarial network for cross-modal retrieval
Cross-modal retrieval tasks, which are more natural and challenging than traditional
retrieval tasks, have attracted increasing interest from researchers in recent years. Although …
retrieval tasks, have attracted increasing interest from researchers in recent years. Although …