A survey on locality sensitive hashing algorithms and their applications

O Jafari, P Maurya, P Nagarkar, KM Islam… - arXiv preprint arXiv …, 2021 - arxiv.org
Finding nearest neighbors in high-dimensional spaces is a fundamental operation in many
diverse application domains. Locality Sensitive Hashing (LSH) is one of the most popular …

Deep learning for approximate nearest neighbour search: A survey and future directions

M Li, YG Wang, P Zhang, H Wang, L Fan… - … on Knowledge and …, 2022 - ieeexplore.ieee.org
Approximate nearest neighbour search (ANNS) in high-dimensional space is an essential
and fundamental operation in many applications from many domains such as multimedia …

Learning space partitions for nearest neighbor search

Y Dong, P Indyk, I Razenshteyn, T Wagner - arXiv preprint arXiv …, 2019 - arxiv.org
Space partitions of $\mathbb {R}^ d $ underlie a vast and important class of fast nearest
neighbor search (NNS) algorithms. Inspired by recent theoretical work on NNS for general …

G-miner: an efficient task-oriented graph mining system

H Chen, M Liu, Y Zhao, X Yan, D Yan… - Proceedings of the …, 2018 - dl.acm.org
Graph mining is one of the most important areas in data mining. However, scalable solutions
for graph mining are still lacking as existing studies focus on sequential algorithms. While …

Attention-based saliency hashing for ophthalmic image retrieval

J Fang, Y Xu, X Zhang, Y Hu… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Deep hashing methods have been proved to be effective for the large-scale medical image
search assisting reference-based diagnosis for clinicians. However, when the salient region …

Signrff: Sign random fourier features

X Li, P Li - Advances in Neural Information Processing …, 2022 - proceedings.neurips.cc
The industry practice has been moving to embedding based retrieval (EBR). For example, in
many applications, the embedding vectors are trained by some form of two-tower models …

Large-scale network embedding in apache spark

W Lin - Proceedings of the 27th ACM SIGKDD Conference on …, 2021 - dl.acm.org
Network embedding has been widely used in social recommendation and network analysis,
such as recommendation systems and anomaly detection with graphs. However, most of …

Accelerating LSH-based distributed search with in-network computation

P Zhang, H Pan, Z Li, P He, Z Zhang… - IEEE INFOCOM 2021 …, 2021 - ieeexplore.ieee.org
Locality Sensitive Hashing (LSH) is widely adopted to index similar data in high-dimensional
space for approximate nearest neighbor search. With the rapid increase of datasets, recent …

A Survey on Efficient Processing of Similarity Queries over Neural Embeddings

Y Wang - arXiv preprint arXiv:2204.07922, 2022 - arxiv.org
Similarity query is the family of queries based on some similarity metrics. Unlike the
traditional database queries which are mostly based on value equality, similarity queries aim …

A general and efficient querying method for learning to hash

J Li, X Yan, J Zhang, A Xu, J Cheng, J Liu… - Proceedings of the …, 2018 - dl.acm.org
As an effective solution to the approximate nearest neighbors (ANN) search problem,
learning to hash (L2H) is able to learn similarity-preserving hash functions tailored for a …