An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems
Cloud services have recently started undergoing a major shift from monolithic applications,
to graphs of hundreds or thousands of loosely-coupled microservices. Microservices …
Benchmarking big data systems: A review
With the fast development of big data systems in recent years, a variety of open-source
benchmarks have been built to evaluate and compare the workloads on these systems, and …
Shenango: Achieving high CPU efficiency for latency-sensitive datacenter workloads
Datacenter applications demand microsecond-scale tail latencies and high request rates
from operating systems, and most applications handle loads that have high variance over …
PARTIES: QoS-aware resource partitioning for multiple interactive services
Multi-tenancy in modern datacenters is currently limited to a single latency-critical,
interactive service, running alongside one or more low-priority, best-effort jobs. This limits …
Serving DNNs like clockwork: Performance predictability from the bottom up
Machine learning inference is becoming a core building block for interactive web
applications. As a result, the underlying model serving systems on which these applications …
The Demikernel datapath OS architecture for microsecond-scale datacenter systems
Datacenter systems and I/O devices now run at single-digit microsecond latencies, requiring
ns-scale operating systems. Traditional kernel-based operating systems impose an …
ZygOS: Achieving low tail latency for microsecond-scale networked tasks
This paper focuses on the efficient scheduling on multicore systems of very fine-grain
networked tasks, which are the typical building block of online data-intensive applications …
Arrakis: The operating system is the control plane
Recent device hardware trends enable a new approach to the design of network server
operating systems. In a traditional operating system, the kernel mediates access to device …
Shinjuku: Preemptive Scheduling for μsecond-scale Tail Latency
The recently proposed dataplanes for microsecond scale applications, such as IX and
ZygOS, use non-preemptive policies to schedule requests to cores. For the many real-world …
Just say NO to Paxos overhead: Replacing consensus with network ordering
Distributed applications use replication, implemented by protocols like Paxos, to ensure data
availability and transparently mask server failures. This paper presents a new approach to …