Hierarchical matching and reasoning for multi-query image retrieval
As a promising field, Multi-Query Image Retrieval (MQIR) aims at searching for the
semantically relevant image given multiple region-specific text queries. Existing works …
Multi-modal long document classification based on Hierarchical Prompt and Multi-modal Transformer
In the realm of long document classification (LDC), previous research has predominantly
focused on modeling unimodal texts, overlooking the potential of multi-modal documents …
Multi-level Symmetric Semantic Alignment Network for image-text matching
W Wang, X Di, M Liu, F Gao - Neurocomputing, 2024 - Elsevier
Image-text matching has attracted much attention as one of the visual-linguistic tasks. Most
of the existing methods tend to concentrate on single-level semantic similarity by global …
A unified multiple inducible co-attentions and edge guidance network for co-saliency detection
The learning-based methods have improved the performances of co-salient object detection
(CoSOD). Mining the intra-image saliency individuals and exploring the inter-image co …
A Unified Video Semantics Extraction and Noise Object Suppression Network for Video Saliency Detection
Video salient object detection (VSOD) aims to segment the most attractive objects from a
video sequence. Exploring video semantics and suppressing noise objects are two …
A Cross-Modal View to Utilize Label Semantics for Enhancing Student Network in Multi-label Classification
Knowledge transfer has become a promising approach for improving the
performance and efficiency of relatively lightweight networks. Previous research has focused …
Deep Learning Models for Fast Retrieval and Extraction of French Speech Vocabulary Applications
M Xu - Computational Intelligence and Neuroscience, 2022 - Wiley Online Library
Due to the large French vocabulary, how quickly retrieve and accurately identify the required
vocabulary is still a big challenge in French learning. In view of the above problems, we …
A Review on the Progress of Text-based Image Retrieval Technology
R Ma, X Cui, W Li, L Lu - Advances in Engineering …, 2024 - madison-proceedings.com