Cosmo: Content-style modulation for image retrieval with text feedback

S Lee, D Kim, B Han - … of the IEEE/CVF Conference on …, 2021 - openaccess.thecvf.com
We tackle the task of image retrieval with text feedback, where a reference image and
modifier text are combined to identify the desired target image. We focus on designing an …

Multi-modal transformer with global-local alignment for composed query image retrieval

Y Xu, Y Bin, J Wei, Y Yang, G Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In this paper, we study the composed query image retrieval, which aims at retrieving the
target image similar to the composed query, ie, a reference image and the desired …

Amc: Adaptive multi-expert collaborative network for text-guided image retrieval

H Zhu, Y Wei, Y Zhao, C Zhang, S Huang - ACM Transactions on …, 2023 - dl.acm.org
Text-guided image retrieval integrates reference image and text feedback as a multimodal
query to search the image corresponding to user intention. Recent approaches employ multi …

Heterogeneous feature fusion and cross-modal alignment for composed image retrieval

G Zhang, S Wei, H Pang, Y Zhao - Proceedings of the 29th ACM …, 2021 - dl.acm.org
Composed image retrieval aims at performing image retrieval task by giving a reference
image and a complementary text piece. Since composing both image and text information …

A light touch approach to teaching transformers multi-view geometry

Y Bhalgat, JF Henriques… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Transformers are powerful visual learners, in large part due to their conspicuous lack of
manually-specified priors. This flexibility can be problematic in tasks that involve multiple …

Comprehensive relationship reasoning for composed query based image retrieval

F Zhang, M Yan, J Zhang, C Xu - Proceedings of the 30th ACM …, 2022 - dl.acm.org
Composed Query Based Image Retrieval (CQBIR) aims at searching images relevant to a
composed query, ie, a reference image together with a modifier text. Compared with …

Geometry sensitive cross-modal reasoning for composed query based image retrieval

F Zhang, M Xu, C Xu - IEEE Transactions on Image Processing, 2021 - ieeexplore.ieee.org
Composed Query Based Image Retrieval (CQBIR) aims at retrieving images relevant to a
composed query containing a reference image with a requested modification expressed via …

Veg-DenseCap: Dense captioning model for vegetable leaf disease images

W Sun, C Wang, J Gu, X Sun, J Li, F Liang - Agronomy, 2023 - mdpi.com
The plant disease recognition model based on deep learning has shown good performance
potential. However, high complexity and nonlinearity lead to the low transparency and poor …

[PDF][PDF] Trace: Transform aggregate and compose visiolinguistic representations for image search with text feedback

S Jandial, A Chopra, P Badjatiya… - arXiv preprint arXiv …, 2020 - researchgate.net
The ability to efficiently search for images over an indexed database is the cornerstone for
several user experiences. Incorporating user feedback, through multi-modal inputs provide …

Filtering deep convolutional features for image retrieval

BJ Zhang, GH Liu, JK Hu - International Journal of Pattern …, 2022 - World Scientific
In image retrieval, highlighting target object and reducing the influence of background noise
remains challenging. To address this problem, we propose a novel weighting method that …