Various properties of various ultrafilters, various graph width parameters, and various connectivity systems

T Fujita - arXiv preprint arXiv, 2024 - researchgate.net
This paper investigates ultrafilters in the context of connectivity systems, defined as pairs (X,
f) where X is a finite set and f is a symmetric submodular function. Ultrafilters, essential in …

Adopting worst-case optimal joins in relational database systems

M Freitag, M Bandle, T Schmidt, A Kemper… - Proceedings of the …, 2020 - dl.acm.org
Worst-case optimal join algorithms are attractive from a theoretical point of view, as they offer
asymptotically better runtime than binary joins on certain types of queries. In particular, they …

Joinboost: Grow trees over normalized data using only SQL

Z Huang, R Sen, J Liu, E Wu - arXiv preprint arXiv:2307.00422, 2023 - arxiv.org
Although dominant for tabular data, ML libraries that train tree models over normalized
databases (e.g., LightGBM, XGBoost) require the data to be denormalized as a single table …

Saibot: A Differentially Private Data Search Platform

Z Huang, J Liu, D Alabi, RC Fernandez… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent data search platforms use ML task-based utility measures rather than metadata-
based keywords, to search large dataset corpora. Requesters submit a training dataset and …

The fine-grained complexity of boolean conjunctive queries and sum-product problems

AZ Fan, P Koutris, H Zhao - arXiv preprint arXiv:2304.14557, 2023 - arxiv.org
We study the fine-grained complexity of evaluating Boolean Conjunctive Queries and their
generalization to sum-of-product problems over an arbitrary semiring. For these problems …

The relational data borg is learning

D Olteanu - arXiv preprint arXiv:2008.07864, 2020 - arxiv.org
This paper overviews an approach that addresses machine learning over relational data as
a database problem. This is justified by two observations. First, the input to the learning task …

Lightweight Materialization for Fast Dashboards Over Joins

Z Huang, E Wu - Proceedings of the ACM on Management of Data, 2023 - dl.acm.org
Dashboards are vital in modern business intelligence tools, providing non-technical users
with an interface to access comprehensive business data. With the rise of cloud technology …

Tight fine-grained bounds for direct access on join queries

K Bringmann, N Carmeli, S Mengel - … of the 41st ACM SIGMOD-SIGACT …, 2022 - dl.acm.org
We consider the task of lexicographic direct access to query answers. That is, we want to
simulate an array containing the answers of a join query sorted in a lexicographic order …

Improved approximation algorithms for relational clustering

A Esmailpour, S Sintos - Proceedings of the ACM on Management of …, 2024 - dl.acm.org
Clustering plays a crucial role in computer science, facilitating data analysis and problem-
solving across numerous fields. By partitioning large datasets into meaningful groups …

Optimal join algorithms meet top-k

N Tziavelis, W Gatterbauer, M Riedewald - Proceedings of the 2020 …, 2020 - dl.acm.org
Top-k queries have been studied intensively in the database community and they are an
important means to reduce query cost when only the "best" or "most interesting" results are …