Survey and taxonomy of lossless graph compression and space-efficient graph representations

M Besta, T Hoefler - arXiv preprint arXiv:1806.01799, 2018 - arxiv.org
Various graphs such as web or social networks may contain up to trillions of edges.
Compressing such datasets can accelerate graph processing by reducing the amount of I/O …

Encoding, fast and slow: Low-Latency video processing using thousands of tiny threads

S Fouladi, RS Wahby, B Shacklett… - … USENIX Symposium on …, 2017 - usenix.org
This paper is included in the Proceedings of the 14th USENIX Symposium on Networked …

POCLib: A high-performance framework for enabling near orthogonal processing on compression

F Zhang, J Zhai, X Shen, O Mutlu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Parallel technology boosts data processing in recent years, and parallel direct data
processing on hierarchically compressed documents exhibits great promise. The high …

Jiffy: Elastic far-memory for stateful serverless analytics

A Khandelwal, Y Tang, R Agarwal, A Akella… - Proceedings of the …, 2022 - dl.acm.org
Stateful serverless analytics can be enabled using a remote memory system for inter-task
communication, and for storing and exchanging intermediate data. However, existing …

EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding

KV Rashmi, M Chowdhury, J Kosaian, I Stoica… - … USENIX Symposium on …, 2016 - usenix.org
Data-intensive clusters and object stores are increasingly relying on in-memory object
caching to meet the I/O performance demands. These systems routinely face the challenges …

SuRF: Practical range query filtering with fast succinct tries

H Zhang, H Lim, V Leis, DG Andersen… - Proceedings of the …, 2018 - dl.acm.org
We present the Succinct Range Filter (SuRF), a fast and compact data structure for
approximate membership tests. Unlike traditional Bloom filters, SuRF supports both single …

CompressDB: Enabling efficient compressed data direct processing for various databases

F Zhang, W Wan, C Zhang, J Zhai, Y Chai… - Proceedings of the 2022 …, 2022 - dl.acm.org
In modern data management systems, directly performing operations on compressed data
has been proven to be a big success facing big data problems. These systems have …

Pancake: Frequency smoothing for encrypted data stores

P Grubbs, A Khandelwal, MS Lacharité… - 29th USENIX Security …, 2020 - usenix.org
We present PANCAKE, the first system to protect key-value stores from access pattern
leakage attacks with small constant factor bandwidth overhead. PANCAKE uses a new …

TADOC: Text analytics directly on compression

F Zhang, J Zhai, X Shen, D Wang, Z Chen, O Mutlu… - The VLDB Journal, 2021 - Springer
This article provides a comprehensive description of text analytics directly on compression
(TADOC), which enables direct document analytics on compressed textual data. The article …

A deep dive into common open formats for analytical DBMSs

C Liu, A Pavlenko, M Interlandi, B Haynes - Proceedings of the VLDB …, 2023 - dl.acm.org
This paper evaluates the suitability of Apache Arrow, Parquet, and ORC as formats for
subsumption in an analytical DBMS. We systematically identify and explore the high-level …