Survey and taxonomy of lossless graph compression and space-efficient graph representations
Various graphs such as web or social networks may contain up to trillions of edges.
Compressing such datasets can accelerate graph processing by reducing the amount of I/O …
Encoding, fast and slow: Low-latency video processing using thousands of tiny threads
This paper is included in the Proceedings of the 14th USENIX Symposium on Networked …
POCLib: A high-performance framework for enabling near orthogonal processing on compression
Parallel technology boosts data processing in recent years, and parallel direct data
processing on hierarchically compressed documents exhibits great promise. The high …
Jiffy: Elastic far-memory for stateful serverless analytics
Stateful serverless analytics can be enabled using a remote memory system for inter-task
communication, and for storing and exchanging intermediate data. However, existing …
EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding
Data-intensive clusters and object stores are increasingly relying on in-memory object
caching to meet the I/O performance demands. These systems routinely face the challenges …
SuRF: Practical range query filtering with fast succinct tries
We present the Succinct Range Filter (SuRF), a fast and compact data structure for
approximate membership tests. Unlike traditional Bloom filters, SuRF supports both single …
CompressDB: Enabling efficient compressed data direct processing for various databases
In modern data management systems, directly performing operations on compressed data
has been proven to be a big success facing big data problems. These systems have …
Pancake: Frequency smoothing for encrypted data stores
We present PANCAKE, the first system to protect key-value stores from access pattern
leakage attacks with small constant factor bandwidth overhead. PANCAKE uses a new …
TADOC: Text analytics directly on compression
This article provides a comprehensive description of text analytics directly on compression
(TADOC), which enables direct document analytics on compressed textual data. The article …
A deep dive into common open formats for analytical DBMSs
This paper evaluates the suitability of Apache Arrow, Parquet, and ORC as formats for
subsumption in an analytical DBMS. We systematically identify and explore the high-level …