Autopilot: workload autoscaling at google
K Rzadca, P Findeisen, J Swiderski, P Zych… - Proceedings of the …, 2020 - dl.acm.org
In many public and private Cloud systems, users need to specify a limit for the amount of
resources (CPU cores and RAM) to provision for their workloads. A job that exceeds its limits …
resources (CPU cores and RAM) to provision for their workloads. A job that exceeds its limits …
Pocket: Elastic ephemeral storage for serverless analytics
Serverless computing is becoming increasingly popular, enabling users to quickly launch
thousands of shortlived tasks in the cloud with high elasticity and fine-grain billing. These …
thousands of shortlived tasks in the cloud with high elasticity and fine-grain billing. These …
Protean:{VM} allocation service at scale
We describe the design and implementation of Protean--the Microsoft Azure service
responsible for allocating Virtual Machines (VMs) to millions of servers around the globe. A …
responsible for allocating Virtual Machines (VMs) to millions of servers around the globe. A …
Heracles: Improving resource efficiency at scale
User-facing, latency-sensitive services, such as websearch, underutilize their computing
resources during daily periods of low traffic. Reusing those resources for other tasks is rarely …
resources during daily periods of low traffic. Reusing those resources for other tasks is rarely …
A survey of profit optimization techniques for cloud providers
As the demand for computing resources grows, cloud computing becomes more and more
popular as a pay-as-you-go model, in which the computing resources and services are …
popular as a pay-as-you-go model, in which the computing resources and services are …
Mercury: Hybrid centralized and distributed scheduling in large shared clusters
Datacenter-scale computing for analytics workloads is increasingly common. High
operational costs force heterogeneous applications to share cluster resources for achieving …
operational costs force heterogeneous applications to share cluster resources for achieving …
Providing {SLOs} for {Resource-Harvesting}{VMs} in cloud platforms
Cloud providers rent the resources they do not allocate as evictable virtual machines (VMs),
like spot instances. In this paper, we first characterize the unallocated resources in Microsoft …
like spot instances. In this paper, we first characterize the unallocated resources in Microsoft …
The power of prediction: microservice auto scaling via workload learning
When deploying microservices in production clusters, it is critical to automatically scale
containers to improve cluster utilization and ensure service level agreements (SLA) …
containers to improve cluster utilization and ensure service level agreements (SLA) …
On the diversity of cluster workloads and its impact on research results
Six years ago, Google released an invaluable set of scheduler logs which has already been
used in more than 450 publications. We find that the scarcity of other data sources, however …
used in more than 450 publications. We find that the scarcity of other data sources, however …
Memory-harvesting vms in cloud platforms
loud platforms monetize their spare capacity by renting “Spot” virtual machines (VMs) that
can be evicted in favor of higher-priority VMs. Recent work has shown that resource …
can be evicted in favor of higher-priority VMs. Recent work has shown that resource …