Scalable load balancing in networked systems: A survey of recent advances

MV der Boor, SC Borst, JSH Van Leeuwaarden… - SIAM Review, 2022 - SIAM
In this survey we provide an overview of recent advances on scalable load balancing
schemes which provide favorable delay performance and yet require minimal …

Sharp waiting-time bounds for multiserver jobs

Y Hong, W Wang - Proceedings of the Twenty-Third International …, 2022 - dl.acm.org
Multiserver jobs, which are jobs that occupy multiple servers simultaneously during service,
are prevalent in today's computing clusters. But little is known about the delay performance …

Mean-field analysis for load balancing on spatial graphs

D Rutten, D Mukherjee - Abstract Proceedings of the 2023 ACM …, 2023 - dl.acm.org
A pivotal methodological tool behind the analysis of large-scale load balancing systems is
mean-field analysis. The high-level idea is to represent the system state by aggregate …

Steady‐state analysis of load balancing with Coxian‐2 distributed service times

X Liu, K Gong, L Ying - Naval Research Logistics (NRL), 2022 - Wiley Online Library
This paper studies load balancing for many‐server (N servers) systems. Each server has a
buffer of size b− 1, and can have at most one job in service and b− 1 jobs in the buffer. The …

Non-stationary queues with batch arrivals

A Daw, B Fralix, J Pender - arXiv preprint arXiv:2008.00625, 2020 - arxiv.org
Motivated by applications that involve setting proper staffing levels for multi-server queueing
systems with batch arrivals, we present a thorough study of the queue-length process $\{Q …

Large-system insensitivity of zero-waiting load balancing algorithms

X Liu, K Gong, L Ying - ACM SIGMETRICS Performance Evaluation …, 2022 - dl.acm.org
This paper studies the sensitivity (or insensitivity) of a class of load balancing algorithms that
achieve asymptotic zero-waiting in the sub-Halfin-Whitt regime, named LB-zero. Most …

[PDF][PDF] The finite-skip method for multiserver analysis

I Grosof, M Harchol-Balter… - arXiv preprint arXiv …, 2021 - academia.edu
In this paper we introduce the work-conserving finite-skip class of models. This class
includes many important models, such as the heterogeneous M/G/k, the limited processor …

Queueing analysis of imbalance between multiple server pools with an application to 3-phase EV charging

A Ferragut, F Paganini - ACM SIGMETRICS Performance Evaluation …, 2024 - dl.acm.org
We consider systems where multiple servers operate in parallel, with a particular feature:
servers are classified into d classes, and we wish to keep approximate balance between the …

Asymptotically Optimal Scheduling of Multiple Parallelizable Job Classes

B Berg, B Moseley, W Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Many modern computing workloads are composed of parallelizable jobs. A single
parallelizable job can be completed more quickly if it is run on additional servers, however …

Achieving zero asymptotic queueing delay for parallel jobs

W Weng, W Wang - ACM SIGMETRICS Performance Evaluation Review, 2021 - dl.acm.org
Zero queueing delay is highly desirable in large-scale computing systems. Existing work has
shown that it can be asymptotically achieved by using the celebrated Power-of-d-choices …