Cosmo: Content-style modulation for image retrieval with text feedback
We tackle the task of image retrieval with text feedback, where a reference image and
modifier text are combined to identify the desired target image. We focus on designing an …
Multi-modal transformer with global-local alignment for composed query image retrieval
In this paper, we study the composed query image retrieval, which aims at retrieving the
target image similar to the composed query, i.e., a reference image and the desired …
Amc: Adaptive multi-expert collaborative network for text-guided image retrieval
Text-guided image retrieval integrates reference image and text feedback as a multimodal
query to search the image corresponding to user intention. Recent approaches employ multi …
Heterogeneous feature fusion and cross-modal alignment for composed image retrieval
G Zhang, S Wei, H Pang, Y Zhao - Proceedings of the 29th ACM …, 2021 - dl.acm.org
Composed image retrieval aims at performing image retrieval task by giving a reference
image and a complementary text piece. Since composing both image and text information …
A light touch approach to teaching transformers multi-view geometry
Y Bhalgat, JF Henriques… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Transformers are powerful visual learners, in large part due to their conspicuous lack of
manually-specified priors. This flexibility can be problematic in tasks that involve multiple …
Comprehensive relationship reasoning for composed query based image retrieval
Composed Query Based Image Retrieval (CQBIR) aims at searching images relevant to a
composed query, i.e., a reference image together with a modifier text. Compared with …
Geometry sensitive cross-modal reasoning for composed query based image retrieval
Composed Query Based Image Retrieval (CQBIR) aims at retrieving images relevant to a
composed query containing a reference image with a requested modification expressed via …
Veg-DenseCap: Dense captioning model for vegetable leaf disease images
W Sun, C Wang, J Gu, X Sun, J Li, F Liang - Agronomy, 2023 - mdpi.com
The plant disease recognition model based on deep learning has shown good performance
potential. However, high complexity and nonlinearity lead to the low transparency and poor …
Trace: Transform aggregate and compose visiolinguistic representations for image search with text feedback
The ability to efficiently search for images over an indexed database is the cornerstone for
several user experiences. Incorporating user feedback, through multi-modal inputs provide …
Filtering deep convolutional features for image retrieval
BJ Zhang, GH Liu, JK Hu - International Journal of Pattern …, 2022 - World Scientific
In image retrieval, highlighting target object and reducing the influence of background noise
remains challenging. To address this problem, we propose a novel weighting method that …