Serverless computing: a survey of opportunities, challenges, and applications
The emerging serverless computing paradigm has attracted attention from both academia
and industry. This paradigm brings benefits such as less operational complexity, a pay-as …
and industry. This paradigm brings benefits such as less operational complexity, a pay-as …
Wisefuse: Workload characterization and dag transformation for serverless workflows
We characterize production workloads of serverless DAGs at a major cloud provider. Our
analysis highlights two major factors that limit performance:(a) lack of efficient …
analysis highlights two major factors that limit performance:(a) lack of efficient …
The power of prediction: microservice auto scaling via workload learning
When deploying microservices in production clusters, it is critical to automatically scale
containers to improve cluster utilization and ensure service level agreements (SLA) …
containers to improve cluster utilization and ensure service level agreements (SLA) …
Vabus: Edge-cloud real-time video analytics via background understanding and subtraction
Edge-cloud collaborative video analytics is transforming the way data is being handled,
processed, and transmitted from the ever-growing number of surveillance cameras around …
processed, and transmitted from the ever-growing number of surveillance cameras around …
Optimizing video analytics with declarative model relationships
The availability of vast video collections and the accuracy of ML models has generated
significant interest in video analytics systems. Since naively processing all frames using …
significant interest in video analytics systems. Since naively processing all frames using …
Stepconf: Slo-aware dynamic resource configuration for serverless function workflows
Function-as-a-Service (FaaS) offers a fine-grained resource provision model, enabling
developers to build highly elastic cloud applications. User requests are handled by a series …
developers to build highly elastic cloud applications. User requests are handled by a series …
Honeycomb: Secure and Efficient {GPU} Executions via Static Validation
Graphics Processing Units (GPUs) unlock emerging use cases like large language models
and autonomous driving. They process a large amount of sensitive data, where security is of …
and autonomous driving. They process a large amount of sensitive data, where security is of …
Scrooge: A cost-effective deep learning inference system
Advances in deep learning (DL) have prompted the development of cloud-hosted DL-based
media applications that process video and audio streams in real-time. Such applications …
media applications that process video and audio streams in real-time. Such applications …
Clover: Toward sustainable ai with carbon-aware machine learning inference service
This paper presents a solution to the challenge of mitigating carbon emissions from hosting
large-scale machine learning (ML) inference services. ML inference is critical to modern …
large-scale machine learning (ML) inference services. ML inference is critical to modern …
FaST-GShare: Enabling efficient spatio-temporal GPU sharing in serverless computing for deep learning inference
Serverless computing (FaaS) has been extensively utilized for deep learning (DL) inference
due to the ease of deployment and pay-per-use benefits. However, existing FaaS platforms …
due to the ease of deployment and pay-per-use benefits. However, existing FaaS platforms …