Auto-scaling web applications in clouds: A taxonomy and survey

C Qu, RN Calheiros, R Buyya - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
Web application providers have been migrating their applications to cloud data centers,
attracted by the emerging cloud computing paradigm. One of the appealing features of the …

MArk: Exploiting cloud services for Cost-Effective, SLO-Aware machine learning inference serving

C Zhang, M Yu, W Wang, F Yan - 2019 USENIX Annual Technical …, 2019 - usenix.org
The advances of Machine Learning (ML) have sparked a growing demand of ML-as-a-
Service: developers train ML models and publish them in the cloud as online services to …

Cloud brokerage: A systematic survey

A Elhabbash, F Samreen, J Hadley… - ACM Computing Surveys …, 2019 - dl.acm.org
Background—The proliferation of cloud services has opened a space for cloud brokerage
services. Brokers intermediate between cloud customers and providers to assist the …

Deadline-constrained cost optimization approaches for workflow scheduling in clouds

Q Wu, F Ishikawa, Q Zhu, Y Xia… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Nowadays it is becoming more and more attractive to execute workflow applications in the
cloud because it enables workflow applications to use computing resources on demand …

Bamboo: Making preemptible instances resilient for affordable training of large DNNs

J Thorpe, P Zhao, J Eyolfson, Y Qiao, Z Jia… - … USENIX Symposium on …, 2023 - usenix.org
DNN models across many domains continue to grow in size, resulting in high resource
requirements for effective training, and unpalatable (and often unaffordable) costs for …

Cocktail: A multidimensional optimization for model serving in cloud

JR Gunasekaran, CS Mishra, P Thinakaran… - … USENIX Symposium on …, 2022 - usenix.org
With a growing demand for adopting ML models for a variety of application services, it is vital
that the frameworks serving these models are capable of delivering highly accurate …

InfiniCache: exploiting ephemeral serverless functions to build a cost-effective memory cache

A Wang, J Zhang, X Ma, A Anwar, L Rupprecht… - … USENIX conference on …, 2020 - usenix.org
Internet-scale web applications are becoming increasingly storage-intensive and rely
heavily on in-memory object caching to attain required I/O performance. We argue that the …

Fifer: Tackling resource underutilization in the serverless era

JR Gunasekaran, P Thinakaran… - Proceedings of the 21st …, 2020 - dl.acm.org
Datacenters are witnessing a rapid surge in the adoption of serverless functions for
microservices-based applications. A vast majority of these microservices typically span less …

MOELS: Multiobjective evolutionary list scheduling for cloud workflows

Q Wu, MC Zhou, Q Zhu, Y Xia… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Cloud computing has nowadays become a dominant technology to reduce the computation
cost by elastically providing resources to users on a pay-per-use basis. More and more …

Enabling sustainable clouds: The case for virtualizing the energy system

N Bashir, T Guo, M Hajiesmaili, D Irwin… - Proceedings of the …, 2021 - dl.acm.org
Cloud platforms' growing energy demand and carbon emissions are raising concern about
their environmental sustainability. The current approach to enabling sustainable clouds …