An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems

Y Gan, Y Zhang, D Cheng, A Shetty, P Rathi… - Proceedings of the …, 2019 - dl.acm.org
Cloud services have recently started undergoing a major shift from monolithic applications,
to graphs of hundreds or thousands of loosely-coupled microservices. Microservices …

Benchmarking big data systems: A review

R Han, LK John, J Zhan - IEEE Transactions on Services …, 2017 - ieeexplore.ieee.org
With the fast development of big data systems in recent years, a variety of open-source
benchmarks have been built to evaluate and compare the workloads on these systems, and …

Shenango: Achieving high CPU efficiency for latency-sensitive datacenter workloads

A Ousterhout, J Fried, J Behrens, A Belay… - … USENIX Symposium on …, 2019 - usenix.org
Datacenter applications demand microsecond-scale tail latencies and high request rates
from operating systems, and most applications handle loads that have high variance over …

PARTIES: QoS-aware resource partitioning for multiple interactive services

S Chen, C Delimitrou, JF Martínez - Proceedings of the Twenty-Fourth …, 2019 - dl.acm.org
Multi-tenancy in modern datacenters is currently limited to a single latency-critical,
interactive service, running alongside one or more low-priority, best-effort jobs. This limits …

Serving DNNs like clockwork: Performance predictability from the bottom up

A Gujarati, R Karimi, S Alzayat, W Hao… - … USENIX Symposium on …, 2020 - usenix.org
Machine learning inference is becoming a core building block for interactive web
applications. As a result, the underlying model serving systems on which these applications …

The Demikernel datapath OS architecture for microsecond-scale datacenter systems

I Zhang, A Raybuck, P Patel, K Olynyk… - Proceedings of the …, 2021 - dl.acm.org
Datacenter systems and I/O devices now run at single-digit microsecond latencies, requiring
ns-scale operating systems. Traditional kernel-based operating systems impose an …

ZygOS: Achieving low tail latency for microsecond-scale networked tasks

G Prekas, M Kogias, E Bugnion - Proceedings of the 26th Symposium on …, 2017 - dl.acm.org
This paper focuses on the efficient scheduling on multicore systems of very fine-grain
networked tasks, which are the typical building block of online data-intensive applications …

Arrakis: The operating system is the control plane

S Peter, J Li, I Zhang, DRK Ports, D Woos… - ACM Transactions on …, 2015 - dl.acm.org
Recent device hardware trends enable a new approach to the design of network server
operating systems. In a traditional operating system, the kernel mediates access to device …

Shinjuku: Preemptive scheduling for μsecond-scale tail latency

K Kaffes, T Chong, JT Humphries, A Belay… - … USENIX Symposium on …, 2019 - usenix.org
The recently proposed dataplanes for microsecond scale applications, such as IX and
ZygOS, use non-preemptive policies to schedule requests to cores. For the many real-world …

Just say NO to Paxos overhead: Replacing consensus with network ordering

J Li, E Michael, NK Sharma, A Szekeres… - … USENIX Symposium on …, 2016 - usenix.org
Distributed applications use replication, implemented by protocols like Paxos, to ensure data
availability and transparently mask server failures. This paper presents a new approach to …