ANN-Benchmarks: A benchmarking tool for approximate nearest neighbor algorithms

M Aumüller, E Bernhardsson, A Faithfull - Information Systems, 2020 - Elsevier
This paper describes ANN-Benchmarks, a tool for evaluating the performance of in-memory
approximate nearest neighbor algorithms. It provides a standard interface for measuring the …

Song: Approximate nearest neighbor search on gpu

W Zhao, S Tan, P Li - 2020 IEEE 36th International Conference …, 2020 - ieeexplore.ieee.org
Approximate nearest neighbor (ANN) searching is a fundamental problem in computer
science with numerous applications in (eg,) machine learning and data mining. Recent …

Mongoose: A learnable lsh framework for efficient neural network training

B Chen, Z Liu, B Peng, Z Xu, JL Li, T Dao… - International …, 2020 - openreview.net
Recent advances by practitioners in the deep learning community have breathed new life
into Locality Sensitive Hashing (LSH), using it to reduce memory and time bottlenecks in …

Locality sensitive teaching

Z Xu, B Chen, C Li, W Liu, L Song… - Advances in …, 2021 - proceedings.neurips.cc
The emergence of the Internet-of-Things (IoT) sheds light on applying the machine teaching
(MT) algorithms for online personalized education on home devices. This direction becomes …

Locality-sensitive hashing scheme based on longest circular co-substring

Y Lei, Q Huang, M Kankanhalli, AKH Tung - Proceedings of the 2020 …, 2020 - dl.acm.org
Locality-Sensitive Hashing (LSH) is one of the most popular methods for c-Approximate
Nearest Neighbor Search (c-ANNS) in high-dimensional spaces. In this paper, we propose a …

Dessert: An efficient algorithm for vector set search with vector set queries

J Engels, B Coleman, V Lakshman… - Advances in Neural …, 2024 - proceedings.neurips.cc
We study the problem of $\text {\emph {vector set search}} $ with $\text {\emph {vector set
queries}} $. This task is analogous to traditional near-neighbor search, with the exception …

Unique entity estimation with application to the Syrian conflict

B Chen, A Shrivastava, RC Steorts - The Annals of Applied Statistics, 2018 - JSTOR
Entity resolution identifies and removes duplicate entities in large, noisy databases and has
grown in both usage and new developments as a result of increased data availability …

Practical near neighbor search via group testing

J Engels, B Coleman… - Advances in Neural …, 2021 - proceedings.neurips.cc
We present a new algorithm for the approximate near neighbor problem that combines
classical ideas from group testing with locality-sensitive hashing (LSH). We reduce the near …

Online multimedia retrieval on CPU–GPU platforms with adaptive work partition

R Souza, A Fernandes, TSFX Teixeira… - Journal of Parallel and …, 2021 - Elsevier
Nearest neighbors search is a core operation found in several online multimedia services.
These services have to handle very large databases, while, at the same time, they must …

Fast approximation of similarity graphs with kernel density estimation

P Macgregor, H Sun - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Constructing a similarity graph from a set $ X $ of data points in $\mathbb {R}^ d $ is the first
step of many modern clustering algorithms. However, typical constructions of a similarity …