Compression for quadratic similarity queries

A Ingber, T Courtade… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
The problem of performing similarity queries on compressed data is considered. We focus
on the quadratic similarity measure, and study the fundamental tradeoff between …

Genomic data compression and processing: theory, models, algorithms, and experiments

I Ochoa-Alvarez - 2016 - search.proquest.com
Recently, there has been growing interest in genome sequencing, driven by advancements
in the sequencing technology. Although early sequencing technologies required several …

Compression schemes for similarity queries

I Ochoa, A Ingber, T Weissman - 2014 Data Compression …, 2014 - ieeexplore.ieee.org
We consider compression of sequences in a database so that similarity queries can be
performed efficiently in the compressed domain. The fundamental limits for this problem …

Compression for quadratic similarity queries: Finite blocklength and practical schemes

F Steiner, S Dempfle, A Ingber… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
We study the problem of compression for the purpose of similarity identification, where
similarity is measured by the mean square Euclidean distance between vectors. While the …

Transform-based compression for quadratic similarity queries

H Wu, M Flierl - 2017 51st Asilomar Conference on Signals …, 2017 - ieeexplore.ieee.org
This paper considers the problem of compression for similarity queries [1] and discusses
transform-based compression schemes. Here, the focus is on the tradeoff between the rate …

Compression for similarity identification: Fundamental limits

A Ingber, T Weissman - 2014 IEEE International Symposium on …, 2014 - ieeexplore.ieee.org
We study the problem of compressing a source for the goal of answering similarity queries
from the compressed data. Unlike classical compression, here there is no requirement that …

Computing Similarity Queries for Correlated Gaussian Sources

H Wu, Q Wang, M Flierl - arXiv preprint arXiv:2001.07941, 2020 - arxiv.org
Among many current data processing systems, the objectives are often not the reproduction
of data, but to compute some answers based on the data resulting from queries. The …

Prediction-based similarity identification for autoregressive processes

H Wu, Q Wang, M Flierl - 2018 IEEE Global Conference on …, 2018 - ieeexplore.ieee.org
The task of similarity identification is to identify items in a database which are similar to a
given query item for a given metric. The identification rate of a compression scheme …

Identification Rates for Block-correlated Gaussian Sources

H Wu, Q Wang, M Flierl - 2018 52nd Asilomar Conference on …, 2018 - ieeexplore.ieee.org
Among many current data processing systems, the objectives are often not the reproduction
of data, but to compute some answers based on the data responding to some queries. The …

Compression-based Data Identification and Representation Learning

H Wu - 2020 - diva-portal.org
Large-scale data generation, acquisition, and processing are happening at every moment in
our society. This thesis explores the opportunities for applying lossy compression methods …