End-to-end optimization of machine learning prediction queries

K Park, K Saur, D Banda, R Sen, M Interlandi… - Proceedings of the …, 2022 - dl.acm.org
Prediction queries are widely used across industries to perform advanced analytics and
draw insights from data. They include a data processing part (eg, for joining, filtering …

Query processing on tensor computation runtimes

D He, S Nakandala, D Banda, R Sen, K Saur… - arXiv preprint arXiv …, 2022 - arxiv.org
The huge demand for computation in artificial intelligence (AI) is driving unparalleled
investments in hardware and software systems for AI. This leads to an explosion in the …

Distributed deep learning on data systems: a comparative analysis of approaches

Y Zhang, F Mcquillan, N Jayaram, N Kak… - Proceedings of the …, 2021 - par.nsf.gov
Deep learning (DL) is growing in popularity for many data analytics applications, including
among enterprises. Large business-critical datasets in such settings typically reside in …

Autoscheduling for sparse tensor algebra with an asymptotic cost model

W Ahrens, F Kjolstad, S Amarasinghe - Proceedings of the 43rd ACM …, 2022 - dl.acm.org
While loop reordering and fusion can make big impacts on the constant-factor performance
of dense tensor programs, the effects on sparse tensor programs are asymptotic, often …

Optimizing tensor programs on flexible storage

M Schleich, A Shaikhha, D Suciu - … of the ACM on Management of Data, 2023 - dl.acm.org
Tensor programs often need to process large tensors (vectors, matrices, or higher order
tensors) that require a specialized storage format for their memory layout. Several such …

Auto-differentiation of relational computations for very large scale machine learning

Y Tang, Z Ding, D Jankov, B Yuan… - International …, 2023 - proceedings.mlr.press
The relational data model was designed to facilitate large-scale data management and
analytics. We consider the problem of how to differentiate computations expressed …

Bagua: scaling up distributed learning with system relaxations

S Gan, X Lian, R Wang, J Chang, C Liu, H Shi… - arXiv preprint arXiv …, 2021 - arxiv.org
Recent years have witnessed a growing list of systems for distributed data-parallel training.
Existing systems largely fit into two paradigms, ie, parameter server and MPI-style collective …

Indexed Streams: A Formal Intermediate Representation for Fused Contraction Programs

S Kovach, P Kolichala, T Gu, F Kjolstad - Proceedings of the ACM on …, 2023 - dl.acm.org
We introduce indexed streams, a formal operational model and intermediate representation
that describes the fused execution of a contraction language that encompasses both sparse …

nsDB: Architecting the Next Generation Database by Integrating Neural and Symbolic Systems

Y Yuan, B Tang, T Zhou, Z Zhang, J Qin - Proceedings of the VLDB …, 2024 - dl.acm.org
In this paper, we propose nsDB, a novel neuro-symbolic database system that integrates
neural and symbolic system architectures natively to address the weaknesses of each …

A Comparison of End-to-End Decision Forest Inference Pipelines

H Guan, S Masood, M Dwarampudi, V Gunda… - Proceedings of the …, 2023 - dl.acm.org
Decision forest, including RandomForest, XGBoost, and LightGBM, dominates the machine
learning tasks over tabular data. Recently, several frameworks were developed for decision …