Sustainable ai: Environmental implications, challenges and opportunities

CJ Wu, R Raghavendra, U Gupta… - Proceedings of …, 2022 - proceedings.mlsys.org
This paper explores the environmental impact of the super-linear growth trends for AI from a
holistic perspective, spanning Data, Algorithms, and System Hardware. We characterize the …

{FIRM}: An intelligent fine-grained resource management framework for {SLO-Oriented} microservices

H Qiu, SS Banerjee, S Jha, ZT Kalbarczyk… - 14th USENIX symposium …, 2020 - usenix.org
User-facing latency-sensitive web services include numerous distributed,
intercommunicating microservices that promise to simplify software development and …

Sage: practical and scalable ML-driven performance debugging in microservices

Y Gan, M Liang, S Dev, D Lo, C Delimitrou - Proceedings of the 26th …, 2021 - dl.acm.org
Cloud applications are increasingly shifting from large monolithic services to complex
graphs of loosely-coupled microservices. Despite the advantages of modularity and …

ThunderGP: HLS-based graph processing framework on FPGAs

X Chen, H Tan, Y Chen, B He, WF Wong… - The 2021 ACM/SIGDA …, 2021 - dl.acm.org
FPGA has been an emerging computing infrastructure in datacenters benefiting from
features of fine-grained parallelism, energy efficiency, and reconfigurability. Meanwhile …

Understanding data storage and ingestion for large-scale deep recommendation model training: Industrial product

M Zhao, N Agarwal, A Basant, B Gedik, S Pan… - Proceedings of the 49th …, 2022 - dl.acm.org
Datacenter-scale AI training clusters consisting of thousands of domain-specific accelerators
(DSA) are used to train increasingly-complex deep learning models. These clusters rely on a …

{AWARE}: Automate workload autoscaling with reinforcement learning in production cloud systems

H Qiu, W Mao, C Wang, H Franke, A Youssef… - 2023 USENIX Annual …, 2023 - usenix.org
Workload autoscaling is widely used in public and private cloud systems to maintain stable
service performance and save resources. However, it remains challenging to set the optimal …

Cornflakes: Zero-copy serialization for microsecond-scale networking

D Raghavan, S Ravi, G Yuan, P Thaker… - Proceedings of the 29th …, 2023 - dl.acm.org
Data serialization is critical for many datacenter applications, but the memory copies
required to move application data into packets are costly. Recent zero-copy APIs expose …

A hardware accelerator for protocol buffers

S Karandikar, C Leary, C Kennelly, J Zhao… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Serialization frameworks are a fundamental component of scale-out systems, but introduce
significant compute overheads. However, they are amenable to acceleration with …

A cloud-scale characterization of remote procedure calls

K Seemakhupt, BE Stephens, S Khan, S Liu… - Proceedings of the 29th …, 2023 - dl.acm.org
The global scale and challenging requirements of modern cloud applications have led to the
development of complex, widely distributed, service-oriented applications. One enabler of …

Deepscaling: microservices autoscaling for stable cpu utilization in large scale cloud systems

Z Wang, S Zhu, J Li, W Jiang… - Proceedings of the 13th …, 2022 - dl.acm.org
Cloud service providers conservatively provision excessive resources to ensure service
level objectives (SLOs) are met. They often set lower CPU utilization targets to ensure …