Performance interference of virtual machines: A survey
The rapid development of cloud computing with virtualization technology has benefited both
academia and industry. For any cloud data center at scale, one of the primary challenges is …
INFaaS: Automated model-less inference serving
Despite existing work in machine learning inference serving, ease-of-use and cost efficiency
remain challenges at large scales. Developers must manually search through thousands of …
Twig: Multi-agent task management for colocated latency-critical cloud services
Many of the important services running on data centres are latency-critical, time-varying, and
demand strict user satisfaction. Stringent tail-latency targets for colocated services and …
Warehouse-scale video acceleration: co-design and deployment in the wild
P Ranganathan, D Stodolsky, J Calow… - Proceedings of the 26th …, 2021 - dl.acm.org
Video sharing (e.g., YouTube, Vimeo, Facebook, TikTok) accounts for the majority of internet
traffic, and video processing is also foundational to several other key workloads (video …
Interference-aware scheduling for inference serving
Machine learning inference applications have proliferated through diverse domains such as
healthcare, security, and analytics. Recent work has proposed inference serving systems for …
CuttleSys: Data-driven resource management for interactive services on reconfigurable multicores
N Kulkarni, G Gonzalez-Pumariega… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org
Multi-tenancy for latency-critical applications leads to resource interference and
unpredictable performance. Core reconfiguration opens up more opportunities for …
INFaaS: A model-less and managed inference serving system
Despite existing work in machine learning inference serving, ease-of-use and cost efficiency
remain challenges at large scales. Developers must manually search through thousands of …
Workload consolidation in Alibaba clusters: the good, the bad, and the ugly
Web companies typically run latency-critical long-running services and resource-intensive,
throughput-hungry batch jobs in a shared cluster for improved utilization and reduced cost …
Adaptive performance modeling of data-intensive workloads for resource provisioning in virtualized environment
The processing of data-intensive workloads is a challenging and time-consuming task that
often requires massive infrastructure to ensure fast data analysis. The cloud platform is the …
RESTRAIN: A dynamic and cost-efficient resource management scheme for addressing performance interference in NFV-based systems
Network Functions Virtualization (NFV) replaces the conventional middleboxes by
their software counterparts known as Virtual Network Functions (VNFs) which run on general …