Autopilot: workload autoscaling at google

K Rzadca, P Findeisen, J Swiderski, P Zych… - Proceedings of the …, 2020 - dl.acm.org
In many public and private Cloud systems, users need to specify a limit for the amount of
resources (CPU cores and RAM) to provision for their workloads. A job that exceeds its limits …

Pocket: Elastic ephemeral storage for serverless analytics

A Klimovic, Y Wang, P Stuedi, A Trivedi… - … USENIX Symposium on …, 2018 - usenix.org
Serverless computing is becoming increasingly popular, enabling users to quickly launch
thousands of shortlived tasks in the cloud with high elasticity and fine-grain billing. These …

Protean:{VM} allocation service at scale

O Hadary, L Marshall, I Menache, A Pan… - … USENIX Symposium on …, 2020 - usenix.org
We describe the design and implementation of Protean--the Microsoft Azure service
responsible for allocating Virtual Machines (VMs) to millions of servers around the globe. A …

Heracles: Improving resource efficiency at scale

D Lo, L Cheng, R Govindaraju… - Proceedings of the …, 2015 - dl.acm.org
User-facing, latency-sensitive services, such as websearch, underutilize their computing
resources during daily periods of low traffic. Reusing those resources for other tasks is rarely …

A survey of profit optimization techniques for cloud providers

P Cong, G Xu, T Wei, K Li - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
As the demand for computing resources grows, cloud computing becomes more and more
popular as a pay-as-you-go model, in which the computing resources and services are …

Mercury: Hybrid centralized and distributed scheduling in large shared clusters

K Karanasos, S Rao, C Curino, C Douglas… - 2015 USENIX Annual …, 2015 - usenix.org
Datacenter-scale computing for analytics workloads is increasingly common. High
operational costs force heterogeneous applications to share cluster resources for achieving …

Providing {SLOs} for {Resource-Harvesting}{VMs} in cloud platforms

P Ambati, Í Goiri, F Frujeri, A Gun, K Wang… - … USENIX Symposium on …, 2020 - usenix.org
Cloud providers rent the resources they do not allocate as evictable virtual machines (VMs),
like spot instances. In this paper, we first characterize the unallocated resources in Microsoft …

The power of prediction: microservice auto scaling via workload learning

S Luo, H Xu, K Ye, G Xu, L Zhang, G Yang… - Proceedings of the 13th …, 2022 - dl.acm.org
When deploying microservices in production clusters, it is critical to automatically scale
containers to improve cluster utilization and ensure service level agreements (SLA) …

On the diversity of cluster workloads and its impact on research results

G Amvrosiadis, JW Park, GR Ganger… - 2018 USENIX Annual …, 2018 - usenix.org
Six years ago, Google released an invaluable set of scheduler logs which has already been
used in more than 450 publications. We find that the scarcity of other data sources, however …

Memory-harvesting vms in cloud platforms

A Fuerst, S Novaković, Í Goiri, GI Chaudhry… - Proceedings of the 27th …, 2022 - dl.acm.org
loud platforms monetize their spare capacity by renting “Spot” virtual machines (VMs) that
can be evicted in favor of higher-priority VMs. Recent work has shown that resource …