High-dimensional similarity query processing for data science
Similarity query (aka nearest neighbor query) processing has been an active research topic
for several decades. It is an essential procedure in a wide range of applications (eg …
for several decades. It is an essential procedure in a wide range of applications (eg …
Space and time efficient kernel density estimation in high dimensions
Abstract Recently, Charikar and Siminelakis (2017) presented a framework for kernel
density estimation in provably sublinear query time, for kernels that possess a certain …
density estimation in provably sublinear query time, for kernels that possess a certain …
{VBASE}: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity
Approximate similarity queries on high-dimensional vector indices have become the
cornerstone for many critical online services. An increasing need for more sophisticated …
cornerstone for many critical online services. An increasing need for more sophisticated …
Rehashing kernel evaluation in high dimensions
Kernel methods are effective but do not scale well to large scale data, especially in high
dimensions where the geometric data structures used to accelerate kernel evaluation suffer …
dimensions where the geometric data structures used to accelerate kernel evaluation suffer …
Kernel density estimation through density constrained near neighbor search
In this paper we revisit the kernel density estimation problem: given a kernel K (x, y) and a
dataset of n points in high dimensional Euclidean space, prepare a data structure that can …
dataset of n points in high dimensional Euclidean space, prepare a data structure that can …
Monotonic cardinality estimation of similarity selection: A deep learning approach
In this paper, we investigate the possibilities of utilizing deep learning for cardinality
estimation of similarity selection. Answering this problem accurately and efficiently is …
estimation of similarity selection. Answering this problem accurately and efficiently is …
Fast rotation kernel density estimation over data streams
Kernel density estimation method is a powerful tool and is widely used in many important
real-world applications such as anomaly detection and statistical learning. Unfortunately …
real-world applications such as anomaly detection and statistical learning. Unfortunately …
[PDF][PDF] Similarity query processing for high-dimensional data
Similarity query processing has been an active research topic for several decades. It is an
essential procedure in a wide range of applications. Recently, embedding and auto …
essential procedure in a wide range of applications. Recently, embedding and auto …
Consistent and flexible selectivity estimation for high-dimensional data
Selectivity estimation aims at estimating the number of database objects that satisfy a
selection criterion. Answering this problem accurately and efficiently is essential to many …
selection criterion. Answering this problem accurately and efficiently is essential to many …
Cardinality estimation of activity trajectory similarity queries using deep learning
Cardinality estimation, which involves estimating the result size of queries, is a critical aspect
of query processing and optimization. Deep Neural Networks (DNNs) are data hungry, and …
of query processing and optimization. Deep Neural Networks (DNNs) are data hungry, and …