Negative sample is negative in its own way: Tailoring negative sentences for image-text retrieval

Constructing phrase-level semantic labels to form multi-grained supervision for image-text retrieval

Z Fan, Z Wei, Z Li, S Wang, H Shan, X Huang… - Proceedings of the 2022 …, 2022 - dl.acm.org

Existing research for image text retrieval mainly relies on sentence-level supervision to
distinguish matched and mismatched sentences for a query image. However, semantic …

Cited by 11 Related articles All 6 versions

[PDF] thecvf.com

Beyond Coarse-Grained Matching in Video-Text Retrieval

A Chen, H Doughty, X Li… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-to-video retrieval has seen significant advancements, yet the ability of models to
discern subtle differences in captions still requires verification. In this paper, we introduce a …

[HTML] The impact of hard and easy negative training data on vulnerability prediction performance

F Al Debeyan, L Madeyski, T Hall, D Bowes - Journal of Systems and …, 2024 - Elsevier

Vulnerability prediction models have been shown to perform poorly in the real world. We
examine how the composition of negative training data influences vulnerability prediction …

Cited by 1 Related articles All 2 versions

[PDF] arxiv.org

A unified continuous learning framework for multi-modal knowledge discovery and pre-training

Z Fan, Z Wei, J Chen, S Wang, Z Li, J Xu… - arXiv preprint arXiv …, 2022 - arxiv.org

Multi-modal pre-training and knowledge discovery are two important research topics in
multi-modal machine learning. Nevertheless, none of existing works make attempts to link …

Cited by 4 Related articles All 2 versions

[PDF] arxiv.org

Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval

H Liu, Y Song, X Wang, Z Xiangru, Z Li… - arXiv preprint arXiv …, 2024 - arxiv.org

With the explosive growth of multi-modal information on the Internet, unimodal search
cannot satisfy the requirement of Internet applications. Text-image retrieval research is …

AsCL: An Asymmetry-sensitive Contrastive Learning Method for Image-Text Retrieval with Cross-Modal Fusion

Z Gong, C Mai, Y Huang - arXiv preprint arXiv:2405.10029, 2024 - arxiv.org

The image-text retrieval task aims to retrieve relevant information from a given image or text.
The main challenge is to unify multimodal representation and distinguish fine-grained …

[PDF] The Journal of Systems & Software

F Al Debeyan, L Madeyski, T Hall, D Bowes - madeyski.e-informatyka.pl

Vulnerability prediction models have been shown to perform poorly in the real world. We
examine how the composition of negative training data influences vulnerability prediction …