Partial failure resilient memory management system for (cxl-based) distributed shared memory

M Zhang, T Ma, J Hua, Z Liu, K Chen, N Ding… - Proceedings of the 29th …, 2023 - dl.acm.org
The efficiency of distributed shared memory (DSM) has been greatly improved by recent
hardware technologies. But, the difficulty of distributed memory management can still be a …

A cloud-scale characterization of remote procedure calls

K Seemakhupt, BE Stephens, S Khan, S Liu… - Proceedings of the 29th …, 2023 - dl.acm.org
The global scale and challenging requirements of modern cloud applications have led to the
development of complex, widely distributed, service-oriented applications. One enabler of …

Chitu: accelerating serverless workflows with asynchronous state replication pipelines

Z Lei, X Shi, C Lv, X Yu, X Zhao - … of the 2023 ACM symposium on cloud …, 2023 - dl.acm.org
Serverless workflows are characterized as multi-stage computing, while downstream
functions require accessing intermediate states or the output of upstream functions for …

Cohabitation of intelligence and systems: Towards self-reference in digital anatomies

A Morichetta, A Lackinger… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
The modern computing scenario of the Computing Continuum exhibits large and complex
applications with heterogeneous requirements running on distributed infrastructure. Still …

DmRPC: Disaggregated Memory-aware Datacenter RPC for Data-intensive Applications

J Zhang, X Chen, Y Zhang… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
Modern datacenter applications are increasingly being built using a microservices
architecture. These microservices communicate with each other using datacenter RPCs …

Programming Fully Disaggregated Systems

C Anneser, L Vogel, F Gruber, M Bandle… - Proceedings of the 19th …, 2023 - dl.acm.org
With full resource disaggregation on the horizon, it is unclear what the most suitable
programming model is that enables dataflow developers to fully harvest the potential that …

Telepathic Datacenters: Fast RPCs using Shared CXL Memory

S Mahar, E Hajyjasini, S Lee, Z Zhang, M Shen… - arXiv preprint arXiv …, 2024 - arxiv.org
Datacenter applications often rely on remote procedure calls (RPCs) for fast, efficient, and
secure communication. However, RPCs are slow, inefficient, and hard to use as they require …

Exoshuffle: An extensible shuffle architecture

FS Luan, S Wang, S Yagati, S Kim, K Lien… - Proceedings of the …, 2023 - dl.acm.org
Shuffle is one of the most expensive communication primitives in distributed data processing
and is difficult to scale. Prior work addresses the scalability challenges of shuffle by building …

Don't Let RPCs Constrain Your API

D Bittman, R Soulé, EL Miller, V Shrivastav… - Proceedings of the 20th …, 2021 - dl.acm.org
As data becomes increasingly distributed, traditional RPC and data serialization limits
performance, result in rigidity, and hamper expressivity. We believe that technology trends …

Expedited Data Transfers for Serverless Clouds

D Ustiugov, S Jesalpura, MB Alper, M Baczun… - arXiv preprint arXiv …, 2023 - arxiv.org
Serverless computing has emerged as a popular cloud deployment paradigm. In serverless,
the developers implement their application as a set of chained functions that form a workflow …