Deep learning workload scheduling in gpu datacenters: A survey

Z Ye, W Gao, Q Hu, P Sun, X Wang, Y Luo… - ACM Computing …, 2024 - dl.acm.org
Deep learning (DL) has demonstrated its remarkable success in a wide variety of fields. The
development of a DL model is a time-consuming and resource-intensive procedure. Hence …

End-to-end optimization of machine learning prediction queries

K Park, K Saur, D Banda, R Sen, M Interlandi… - Proceedings of the …, 2022 - dl.acm.org
Prediction queries are widely used across industries to perform advanced analytics and
draw insights from data. They include a data processing part (eg, for joining, filtering …

Extending relational query processing with ML inference

K Karanasos, M Interlandi, D Xin, F Psallidas… - arXiv preprint arXiv …, 2019 - arxiv.org
The broadening adoption of machine learning in the enterprise is increasing the pressure for
strict governance and cost-effective performance, in particular for the common and …

A tensor compiler for unified machine learning prediction serving

S Nakandala, K Saur, GI Yu, K Karanasos… - … USENIX Symposium on …, 2020 - usenix.org
Machine Learning (ML) adoption in the enterprise requires simpler and more efficient
software infrastructure—the bespoke solutions typical in large web companies are simply …

Sommelier: Curating DNN models for the masses

P Guo, B Hu, W Hu - Proceedings of the 2022 International Conference …, 2022 - dl.acm.org
Deep learning model repositories are indispensable in machine learning ecosystems today
to facilitate model reuse. However, existing model repositories provide a bare-bone interface …

RALF: Accuracy-Aware Scheduling for Feature Store Maintenance

S Wooders, X Mo, A Narang, K Lin, I Stoica… - Proceedings of the …, 2023 - dl.acm.org
Feature stores (also sometimes referred to as embedding stores) are becoming ubiquitous
in model serving systems: downstream applications query these stores for auxiliary inputs at …

Amalur: Data integration meets machine learning

Z Li, W Sun, D Zhan, Y Kang, L Chen… - … on Knowledge and …, 2024 - ieeexplore.ieee.org
Machine learning (ML) training data is often scattered across disparate collections of
datasets, called data silos. This fragmentation poses a major challenge for data-intensive …

Metadata representations for queryable repositories of machine learning models

Z Li, H Kant, R Hai, A Katsifodimos, M Brambilla… - IEEE …, 2023 - ieeexplore.ieee.org
Machine learning (ML) practitioners and organizations are building model repositories of pre-
trained models, referred to as model zoos. These model zoos contain metadata describing …

Deep learning: Systems and responsibility

A Wasay, S Chatterjee, S Idreos - Proceedings of the 2021 International …, 2021 - dl.acm.org
Deep learning enables numerous applications across diverse areas. Data systems
researchers are also increasingly experimenting with deep learning to enhance data …

Accelerating deep learning inference via learned caches

A Balasubramanian, A Kumar, Y Liu, H Cao… - arXiv preprint arXiv …, 2021 - arxiv.org
Deep Neural Networks (DNNs) are witnessing increased adoption in multiple domains
owing to their high accuracy in solving real-world problems. However, this high accuracy …