A survey on the scheduling mechanisms in serverless computing: a taxonomy, challenges, and trends

M Ghorbian, M Ghobaei-Arani, L Esmaeili - Cluster Computing, 2024 - Springer
In recent years, serverless computing has received significant attention due to its innovative
approach to cloud computing. In this novel approach, a new payment model is presented …

How does it function? Characterizing long-term trends in production serverless workloads

A Joosen, A Hassan, M Asenov, R Singh… - Proceedings of the …, 2023 - dl.acm.org
This paper releases and analyzes two new Huawei cloud serverless traces. The traces span
a period of over 7 months with over 1.4 trillion function invocations combined. The first trace …

Owl: Performance-aware scheduling for resource-efficient function-as-a-service cloud

H Tian, S Li, A Wang, W Wang, T Wu… - Proceedings of the 13th …, 2022 - dl.acm.org
This work documents our experience of improving the scheduler in Alibaba Function
Compute, a public FaaS platform. It commences with our observation that memory and CPU …

Function as a Function

T Kuchler, M Giardino, T Roscoe… - Proceedings of the 2023 …, 2023 - dl.acm.org
Function as a Service (FaaS) and the associated serverless computing paradigm alleviates
users from resource management and allows cloud platforms to optimize system …

Golgi: Performance-aware, resource-efficient function scheduling for serverless computing

S Li, W Wang, J Yang, G Chen, D Lu - … of the 2023 ACM Symposium on …, 2023 - dl.acm.org
This paper introduces Golgi, a novel scheduling system designed for serverless functions,
with the goal of minimizing resource provisioning costs while meeting the function latency …

YuanRong: A production general-purpose serverless system for distributed applications in the cloud

Q Chen, J Qian, Y Che, Z Lin, J Wang, J Zhou… - Proceedings of the …, 2024 - dl.acm.org
We design, implement, and evaluate YuanRong, the first production general-purpose
serverless platform with a unified programming interface, multi-language runtime, and a …

Ditto: Efficient serverless analytics with elastic parallelism

C Jin, Z Zhang, X Xiang, S Zou, G Huang… - Proceedings of the ACM …, 2023 - dl.acm.org
Serverless computing provides fine-grained resource elasticity for data analytics: a job can
flexibly scale its resources for each stage, instead of sticking to a fixed pool of resources …

SimLess: simulate serverless workflows and their twins and siblings in federated FaaS

S Ristov, M Hautz, C Hollaus, R Prodan - Proceedings of the 13th …, 2022 - dl.acm.org
Many researchers migrate scientific serverless workflows or function choreographies (FCs)
on Function-as-a-Service (FaaS) to benefit from its high scalability and elasticity …

ServerlessLLM: Low-Latency Serverless Inference for Large Language Models

Y Fu, L Xue, Y Huang, AO Brabete, D Ustiugov… - … USENIX Symposium on …, 2024 - usenix.org
This paper presents ServerlessLLM, a distributed system designed to support low-latency
serverless inference for Large Language Models (LLMs). By harnessing the substantial near …

FaaScinating resilience for serverless function choreographies in federated clouds

S Ristov, D Kimovski, T Fahringer - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Cloud applications often benefit from deployment on serverless technology Function-as-a-
Service (FaaS), which may instantly spawn numerous functions and charges users for the …