Rosetta: A robust space-time optimized range filter for key-value stores

S Luo, S Chatterjee, R Ketsetsidis, N Dayan… - Proceedings of the …, 2020 - dl.acm.org
We introduce Rosetta, a probabilistic range filter designed specifically for LSM-tree based
key-value stores. The core intuition is that we can sacrifice filter probe time because it is not …

Return of the Lernaean Hydra: Experimental evaluation of data series approximate similarity search

K Echihabi, K Zoumpatianos, T Palpanas… - arXiv preprint arXiv …, 2020 - arxiv.org
Data series are a special type of multidimensional data present in numerous domains,
where similarity search is a key operation that has been extensively studied in the data …

Chucky: A succinct cuckoo filter for LSM-tree

N Dayan, M Twitto - Proceedings of the 2021 International Conference …, 2021 - dl.acm.org
Modern key-value stores typically rely on an LSM-tree in storage (SSD) to handle writes and
Bloom filters in memory (DRAM) to optimize reads. With ongoing advances in SSD …

The log-structured merge-bush & the wacky continuum

N Dayan, S Idreos - Proceedings of the 2019 International Conference …, 2019 - dl.acm.org
Data-intensive key-value stores based on the Log-Structured Merge-Tree are used in
numerous modern applications ranging from social media and data science to cloud …

Matrix profile goes MAD: variable-length motif and discord discovery in data series

M Linardi, Y Zhu, T Palpanas, E Keogh - Data Mining and Knowledge …, 2020 - Springer
In the last 15 years, data series motif and discord discovery have emerged as two useful and
well-used primitives for data series mining, with applications to many domains, including …

Spooky: granulating LSM-tree compactions correctly

N Dayan, T Weiss, S Dashevsky, M Pan… - Proceedings of the …, 2022 - dl.acm.org
Modern storage engines and key-value stores have come to rely on the log-structured
merge-tree (LSM-tree) as their core data structure. LSM-tree operates by gradually merge …

Report on the first and second interdisciplinary time series analysis workshop (ITISA)

T Palpanas, V Beckmann - ACM SIGMOD Record, 2019 - dl.acm.org
The analysis of time-series data associated with modern-day industrial operations and
scientific experiments is now pushing both computational power and resources to their …

Fast adaptive similarity search through variance-aware quantization

J Paparrizos, I Edian, C Liu, AJ Elmore… - 2022 IEEE 38th …, 2022 - ieeexplore.ieee.org
With the explosive growth of high-dimensional data, approximate methods emerge as
promising solutions for nearest neighbor search. Among alternatives, quantization methods …

The LSM design space and its read optimizations

S Sarkar, N Dayan… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
Log-structured merge (LSM) trees have emerged as one of the most commonly used
storage-based data structures in modern data systems as they offer high throughput for writes and …

Dissecting, designing, and optimizing LSM-based data stores

S Sarkar, M Athanassoulis - … of the 2022 International Conference on …, 2022 - dl.acm.org
Log-structured merge (LSM) trees have emerged as one of the most commonly used
disk-based data structures in modern data systems. LSM-trees employ out-of-place ingestion to …