Towards communication-efficient vertical federated learning training via cache-enabled local updates
Vertical federated learning (VFL) is an emerging paradigm that allows different parties (eg,
organizations or enterprises) to collaboratively build machine learning models with privacy …
organizations or enterprises) to collaboratively build machine learning models with privacy …
Auto-differentiation of relational computations for very large scale machine learning
The relational data model was designed to facilitate large-scale data management and
analytics. We consider the problem of how to differentiate computations expressed …
analytics. We consider the problem of how to differentiate computations expressed …
Database native model selection: Harnessing deep neural networks in database systems
The growing demand for advanced analytics beyond statistical aggregation calls for
database systems that support effective model selection of deep neural networks (DNNs) …
database systems that support effective model selection of deep neural networks (DNNs) …
Powering in-database dynamic model slicing for structured data analytics
Relational database management systems (RDBMS) are widely used for the storage and
retrieval of structured data. To derive insights beyond statistical aggregation, we typically …
retrieval of structured data. To derive insights beyond statistical aggregation, we typically …
Stochastic gradient descent without full data shuffle: with applications to in-database machine learning and deep learning systems
Modern machine learning (ML) systems commonly use stochastic gradient descent (SGD) to
train ML models. However, SGD relies on random data order to converge, which usually …
train ML models. However, SGD relies on random data order to converge, which usually …
A Selective Preprocessing Offloading Framework for Reducing Data Traffic in DL Training
Deep learning (DL) training is data-intensive and often bottlenecked by fetching data from
remote storage. Recognizing that many samples' sizes diminish during data preprocessing …
remote storage. Recognizing that many samples' sizes diminish during data preprocessing …
Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training
We explore the training dynamics of neural networks in a structured non-IID setting where
documents are presented cyclically in a fixed, repeated sequence. Typically, networks suffer …
documents are presented cyclically in a fixed, repeated sequence. Typically, networks suffer …
In-database query optimization on SQL with ML predicates
Extended SQL with machine learning (ML) predicates, commonly referred to as SQL+ ML,
integrates ML abilities into traditional SQL processing in databases. When processing SQL+ …
integrates ML abilities into traditional SQL processing in databases. When processing SQL+ …
NeurDB: On the Design and Implementation of an AI-powered Autonomous Database
Databases are increasingly embracing AI to provide autonomous system optimization and
intelligent in-database analytics, aiming to relieve end-user burdens across various industry …
intelligent in-database analytics, aiming to relieve end-user burdens across various industry …
moduli: A Disaggregated Data Management Architecture for Data-Intensive Workflows
P Ceravolo, T Catarci, M Console… - ACM SIGWEB …, 2024 - dl.acm.org
As companies store, process, and analyse bigger and bigger volumes of highly
heterogeneous data, novel research and technological challenges are emerging. Traditional …
heterogeneous data, novel research and technological challenges are emerging. Traditional …