End-to-end optimization of machine learning prediction queries
Prediction queries are widely used across industries to perform advanced analytics and
draw insights from data. They include a data processing part (eg, for joining, filtering …
draw insights from data. They include a data processing part (eg, for joining, filtering …
Query processing on tensor computation runtimes
The huge demand for computation in artificial intelligence (AI) is driving unparalleled
investments in hardware and software systems for AI. This leads to an explosion in the …
investments in hardware and software systems for AI. This leads to an explosion in the …
Distributed deep learning on data systems: a comparative analysis of approaches
Deep learning (DL) is growing in popularity for many data analytics applications, including
among enterprises. Large business-critical datasets in such settings typically reside in …
among enterprises. Large business-critical datasets in such settings typically reside in …
Autoscheduling for sparse tensor algebra with an asymptotic cost model
While loop reordering and fusion can make big impacts on the constant-factor performance
of dense tensor programs, the effects on sparse tensor programs are asymptotic, often …
of dense tensor programs, the effects on sparse tensor programs are asymptotic, often …
Optimizing tensor programs on flexible storage
Tensor programs often need to process large tensors (vectors, matrices, or higher order
tensors) that require a specialized storage format for their memory layout. Several such …
tensors) that require a specialized storage format for their memory layout. Several such …
Auto-differentiation of relational computations for very large scale machine learning
The relational data model was designed to facilitate large-scale data management and
analytics. We consider the problem of how to differentiate computations expressed …
analytics. We consider the problem of how to differentiate computations expressed …
Bagua: scaling up distributed learning with system relaxations
Recent years have witnessed a growing list of systems for distributed data-parallel training.
Existing systems largely fit into two paradigms, ie, parameter server and MPI-style collective …
Existing systems largely fit into two paradigms, ie, parameter server and MPI-style collective …
Indexed Streams: A Formal Intermediate Representation for Fused Contraction Programs
S Kovach, P Kolichala, T Gu, F Kjolstad - Proceedings of the ACM on …, 2023 - dl.acm.org
We introduce indexed streams, a formal operational model and intermediate representation
that describes the fused execution of a contraction language that encompasses both sparse …
that describes the fused execution of a contraction language that encompasses both sparse …
nsDB: Architecting the Next Generation Database by Integrating Neural and Symbolic Systems
In this paper, we propose nsDB, a novel neuro-symbolic database system that integrates
neural and symbolic system architectures natively to address the weaknesses of each …
neural and symbolic system architectures natively to address the weaknesses of each …
A Comparison of End-to-End Decision Forest Inference Pipelines
Decision forest, including RandomForest, XGBoost, and LightGBM, dominates the machine
learning tasks over tabular data. Recently, several frameworks were developed for decision …
learning tasks over tabular data. Recently, several frameworks were developed for decision …