Evaluating Persistent Memory Range Indexes: Part Two [Extended Version]

Y He, D Lu, K Huang, T Wang - arXiv preprint arXiv:2201.13047, 2022 - arxiv.org
Scalable persistent memory (PM) has opened up new opportunities for building indexes that
operate and persist data directly on the memory bus, potentially enabling instant recovery …

The past, present and future of indexing on persistent memory

K Huang, Y He, T Wang - Proceedings of the VLDB Endowment, 2022 - dl.acm.org
Persistent memory (PM) based indexing techniques have been proposed to build fast yet
persistent indexes that sit on the memory bus. Over the past decade, numerous techniques …

Optimizing Data Pipelines for Machine Learning in Feature Stores

R Liu, K Park, F Psallidas, X Zhu, J Mo, R Sen… - Proceedings of the …, 2023 - dl.acm.org
Data pipelines (ie, converting raw data to features) are critical for machine learning (ML)
models, yet their development and management is time-consuming. Feature stores have …

MTM: Rethinking Memory Profiling and Migration for Multi-Tiered Large Memory

J Ren, D Xu, J Ryu, K Shin, D Kim, D Li - Proceedings of the Nineteenth …, 2024 - dl.acm.org
Multi-terabyte large memory systems are often characterized by more than two memory tiers
with different latency and bandwidth. Multi-tiered large memory systems call for rethinking of …

PhaST: Hierarchical Concurrent Log-Free Skip List for Persistent Memory

Z Li, B Jiao, S He, W Yu - IEEE Transactions on Parallel and …, 2022 - ieeexplore.ieee.org
Skip list (skiplist) is a competitive index structure that offers superior concurrency and
excellent performance but with high memory overhead and low access locality. Emerging …

A system for time series feature extraction in federated learning

S Wang, J Li, M Lu, Z Zheng, Y Chen, B He - Proceedings of the 31st …, 2022 - dl.acm.org
Federated learning (FL), which enables collaborative learning without revealing raw data, is
an emerging topic in privacy-preserving machine learning. Based on our experiences in …

Krypton: Real-Time Serving and Analytical SQL Engine at ByteDance

J Chen, R Shi, H Chen, L Zhang, R Li, W Ding… - Proceedings of the …, 2023 - dl.acm.org
In recent years, at ByteDance, we have started seeing more and more business scenarios
that require performing real-time data serving besides complex Ad Hoc analysis over large …

Febench: A benchmark for real-time relational data feature extraction

X Zhou, C Chen, K Li, B He, M Lu, Q Liu… - Proceedings of the …, 2023 - dl.acm.org
As the use of online AI inference services rapidly expands in various applications (eg, fraud
detection in banking, product recommendation in e-commerce), real-time feature extraction …

Write-optimized and consistent skiplists for non-volatile memory

R Xiao, D Feng, Y Hu, F Wang, X Wei, X Zou… - IEEE Access, 2021 - ieeexplore.ieee.org
Skiplist as an in-memory index performs pretty well on rapid insertions because there are no
rotations or reallocations for rebalancing. The emerging non-volatile memory (NVM) …

Specpmt: Speculative logging for resolving crash consistency overhead of persistent memory

C Ye, Y Xu, X Shen, Y Sha, X Liao, H Jin… - Proceedings of the 28th …, 2023 - dl.acm.org
Crash consistency overhead is a long-standing barrier to the adoption of byte-addressable
persistent memory in practice. Despite continuous progress, persistent transactions for crash …