Tales of the tail: Hardware, OS, and application-level sources of tail latency

An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems

Y Gan, Y Zhang, D Cheng, A Shetty, P Rathi… - Proceedings of the …, 2019 - dl.acm.org

Cloud services have recently started undergoing a major shift from monolithic applications,
to graphs of hundreds or thousands of loosely-coupled microservices. Microservices …

Cited by: 698 · All 9 versions

Benchmarking big data systems: A review

R Han, LK John, J Zhan - IEEE Transactions on Services …, 2017 - ieeexplore.ieee.org

With the fast development of big data systems in recent years, a variety of open-source
benchmarks have been built to evaluate and compare the workloads on these systems, and …

Cited by: 95 · All 6 versions

Shenango: Achieving high CPU efficiency for latency-sensitive datacenter workloads

A Ousterhout, J Fried, J Behrens, A Belay… - … USENIX Symposium on …, 2019 - usenix.org

Datacenter applications demand microsecond-scale tail latencies and high request rates
from operating systems, and most applications handle loads that have high variance over …

Cited by: 354 · All 14 versions

PARTIES: QoS-aware resource partitioning for multiple interactive services

S Chen, C Delimitrou, JF Martínez - Proceedings of the Twenty-Fourth …, 2019 - dl.acm.org

Multi-tenancy in modern datacenters is currently limited to a single latency-critical,
interactive service, running alongside one or more low-priority, best-effort jobs. This limits …

Cited by: 329 · All 8 versions

Serving DNNs like clockwork: Performance predictability from the bottom up

A Gujarati, R Karimi, S Alzayat, W Hao… - … USENIX Symposium on …, 2020 - usenix.org

Machine learning inference is becoming a core building block for interactive web
applications. As a result, the underlying model serving systems on which these applications …

Cited by: 257 · All 16 versions

The Demikernel datapath OS architecture for microsecond-scale datacenter systems

I Zhang, A Raybuck, P Patel, K Olynyk… - Proceedings of the …, 2021 - dl.acm.org

Datacenter systems and I/O devices now run at single-digit microsecond latencies, requiring
ns-scale operating systems. Traditional kernel-based operating systems impose an …

Cited by: 111 · All 4 versions

ZygOS: Achieving low tail latency for microsecond-scale networked tasks

G Prekas, M Kogias, E Bugnion - Proceedings of the 26th Symposium on …, 2017 - dl.acm.org

This paper focuses on the efficient scheduling on multicore systems of very fine-grain
networked tasks, which are the typical building block of online data-intensive applications …

Cited by: 276 · All 9 versions

Arrakis: The operating system is the control plane

S Peter, J Li, I Zhang, DRK Ports, D Woos… - ACM Transactions on …, 2015 - dl.acm.org

Recent device hardware trends enable a new approach to the design of network server
operating systems. In a traditional operating system, the kernel mediates access to device …

Cited by: 578 · All 64 versions

Shinjuku: Preemptive scheduling for μsecond-scale tail latency

K Kaffes, T Chong, JT Humphries, A Belay… - … USENIX Symposium on …, 2019 - usenix.org

The recently proposed dataplanes for microsecond scale applications, such as IX and
ZygOS, use non-preemptive policies to schedule requests to cores. For the many real-world …

Cited by: 217 · All 12 versions

Just say NO to Paxos overhead: Replacing consensus with network ordering

J Li, E Michael, NK Sharma, A Szekeres… - … USENIX Symposium on …, 2016 - usenix.org

Distributed applications use replication, implemented by protocols like Paxos, to ensure data
availability and transparently mask server failures. This paper presents a new approach to …

Cited by: 263 · All 25 versions