Hierarchical matching and reasoning for multi-query image retrieval

Z Ji, Z Li, Y Zhang, H Wang, Y Pang, X Li - Neural Networks, 2024 - Elsevier
As a promising field, Multi-Query Image Retrieval (MQIR) aims at searching for the
semantically relevant image given multiple region-specific text queries. Existing works …

Multi-modal long document classification based on Hierarchical Prompt and Multi-modal Transformer

T Liu, Y Hu, J Gao, J Wang, Y Sun, B Yin - Neural Networks, 2024 - Elsevier
In the realm of long document classification (LDC), previous research has predominantly
focused on modeling unimodal texts, overlooking the potential of multi-modal documents …

Multi-level Symmetric Semantic Alignment Network for image-text matching

W Wang, X Di, M Liu, F Gao - Neurocomputing, 2024 - Elsevier
Image-text matching has attracted much attention as one of the visual-linguistic tasks. Most
of the existing methods tend to concentrate on single-level semantic similarity by global …

A unified multiple inducible co-attentions and edge guidance network for co-saliency detection

Z Tan, X Gu - International Conference on Artificial Neural Networks, 2022 - Springer
The learning-based methods have improved the performances of co-salient object detection
(CoSOD). Mining the intra-image saliency individuals and exploring the inter-image co …

A Unified Video Semantics Extraction and Noise Object Suppression Network for Video Saliency Detection

Z Tan, X Gu - International Conference on Artificial Neural Networks, 2023 - Springer
Video salient object detection (VSOD) aims to segment the most attractive objects from a
video sequence. Exploring video semantics and suppressing noise objects are two …

A Cross-Modal View to Utilize Label Semantics for Enhancing Student Network in Multi-label Classification

Y Qin, H Liu, X Gu - International Conference on Artificial Neural Networks, 2023 - Springer
Knowledge transfer has become a promising approach for improving the
performance and efficiency of relatively lightweight networks. Previous research has focused …

Deep Learning Models for Fast Retrieval and Extraction of French Speech Vocabulary Applications

M Xu - Computational Intelligence and Neuroscience, 2022 - Wiley Online Library
Due to the large French vocabulary, how to quickly retrieve and accurately identify the required
vocabulary is still a big challenge in French learning. In view of the above problems, we …

A Review on the Progress of Text-based Image Retrieval Technology

R Ma, X Cui, W Li, L Lu - Advances in Engineering …, 2024 - madison-proceedings.com