SIFT meets CNN: A decade survey of instance retrieval

L Zheng, Y Yang, Q Tian - IEEE transactions on pattern …, 2017 - ieeexplore.ieee.org
In the early days, content-based image retrieval (CBIR) was studied with global features.
Since 2003, image retrieval based on local descriptors (de facto SIFT) has been extensively …

Near-duplicate video retrieval: Current research and future trends

J Liu, Z Huang, H Cai, HT Shen, CW Ngo… - ACM Computing Surveys …, 2013 - dl.acm.org
The exponential growth of online videos, along with increasing user involvement in video-
related activities, has been observed as a constant phenomenon during the last decade …

Unsupervised deep video hashing via balanced code for large-scale video retrieval

G Wu, J Han, Y Guo, L Liu, G Ding… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
This paper proposes a deep hashing framework, namely, unsupervised deep video hashing
(UDVH), for large-scale video similarity search with the aim to learn compact yet effective …

Effective multiple feature hashing for large-scale near-duplicate video retrieval

J Song, Y Yang, Z Huang, HT Shen… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
Near-duplicate video retrieval (NDVR) has recently attracted much research attention due to
the exponential growth of online videos. It has many applications, such as copyright …

It takes two to tango: Combining visual and textual information for detecting duplicate video-based bug reports

N Cooper, C Bernal-Cárdenas… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
When a bug manifests in a user-facing application, it is likely to be exposed through the
graphical user interface (GUI). Given the importance of visual information to the process of …

Visil: Fine-grained spatio-temporal video similarity learning

G Kordopatis-Zilos, S Papadopoulos… - Proceedings of the …, 2019 - openaccess.thecvf.com
In this paper we introduce ViSiL, a Video Similarity Learning architecture that considers fine-
grained Spatio-Temporal relations between pairs of videos--such relations are typically lost …

Transvcl: Attention-enhanced video copy localization network with flexible supervision

S He, Y He, M Lu, C Jiang, X Yang, F Qian… - Proceedings of the …, 2023 - ojs.aaai.org
Video copy localization aims to precisely localize all the copied segments within a pair of
untrimmed videos in video retrieval applications. Previous methods typically start from frame …

Deep video hashing

VE Liong, J Lu, YP Tan, J Zhou - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
In this work, we propose a deep video hashing (DVH) method for scalable video search.
Unlike most existing video hashing methods that first extract features for each single frame …

Reconstructing an image from its local descriptors

P Weinzaepfel, H Jégou, P Pérez - CVPR 2011, 2011 - ieeexplore.ieee.org
This paper shows that an image can be approximately reconstructed based on the output of
a blackbox local description software such as those classically used for image indexing. Our …

Flip-invariant SIFT for copy and object detection

WL Zhao, CW Ngo - IEEE Transactions on Image Processing, 2012 - ieeexplore.ieee.org
Scale-invariant feature transform (SIFT) feature has been widely accepted as an effective
local keypoint descriptor for its invariance to rotation, scale, and lighting changes in images …