VASIM: Vertical Autoscaling Simulator Toolkit

A Pavlenko, K Saur, Y Zhu, B Kroth… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
In recent years, autoscaling has garnered significant attention in cloud computing,
emphasizing cost efficiency, performance optimization, and availability for dynamic …

A Tale of Two Scales: Reconciling Horizontal and Vertical Scaling for Inference Serving Systems

K Razavi, M Salmani, M Mühlhäuser… - arXiv preprint arXiv …, 2024 - arxiv.org
Inference serving is of great importance in deploying machine learning models in real-world
applications, ensuring efficient processing and quick responses to inference requests …