Check out the big brain on BRAD: simplifying cloud data processing with learned automated data meshes

T Kraska, T Li, S Madden, M Markakis, A Ngom… - Proceedings of the …, 2023 - dl.acm.org
The last decade of database research has led to the prevalence of specialized systems for
different workloads. Consequently, organizations often rely on a combination of specialized …

Towards instance-optimized data systems

T Kraska - Proceedings of the VLDB Endowment, 2021 - par.nsf.gov
In recent years, we have seen increased interest in applying machine learning to system
problems. For example, there has been work on applying machine learning to improve …

Proteus: Autonomous adaptive storage for mixed workloads

M Abebe, H Lazu, K Daudjee - … of the 2022 International Conference on …, 2022 - dl.acm.org
Enterprises use distributed database systems to meet the demands of mixed or hybrid
transaction/analytical processing (HTAP) workloads that contain both transactional (OLTP) …

Sagedb: An instance-optimized data analytics system

J Ding, R Marcus, A Kipf, V Nathan… - Proceedings of the …, 2022 - par.nsf.gov
Modern data systems are typically both complex and general-purpose. They are complex
because of the numerous internal knobs and parameters that users need to manually tune in …

High-throughput vector similarity search in knowledge graphs

J Mohoney, A Pacaci, SR Chowdhury… - Proceedings of the …, 2023 - dl.acm.org
There is an increasing adoption of machine learning for encoding data into vectors to serve
online recommendation and search use cases. As a result, recent data management …

Enhancing In-Memory Spatial Indexing with Learned Search

V Pandey, A Van Renen, ET Zacharatou, A Kipf… - arXiv preprint arXiv …, 2023 - arxiv.org
Spatial data is ubiquitous. Massive amounts of data are generated every day from a plethora
of sources such as billions of GPS-enabled devices (eg, cell phones, cars, and sensors) …

The “AI+ R”-tree: An Instance-optimized R-tree

CMR Haider, J Wang, WG Aref - 2022 23rd IEEE …, 2022 - ieeexplore.ieee.org
The emerging class of instance-optimized systems has shown potential to achieve high
performance by specializing to a specific data and query workloads. Particularly, Machine …

Machine learning for data management: A system view

G Li, X Zhou - 2022 IEEE 38th International Conference on …, 2022 - ieeexplore.ieee.org
Machine learning techniques have been proposed to optimize data management in recent
years. Compared with traditional empirical data management, learning-based methods …

PLATON: Top-down R-tree Packing with Learned Partition Policy

J Yang, G Cong - Proceedings of the ACM on Management of Data, 2023 - dl.acm.org
The exponential growth of spatial data poses new challenges to the performance of spatial
databases. Spatial indexes like R-tree greatly accelerate the query performance and can be …

Anser: Adaptive Information Sharing Framework of AnalyticDB

L Lin, Y Li, B Wu, H Mai, R Lou, J Tan, F Li - Proceedings of the VLDB …, 2023 - dl.acm.org
The surge in data analytics has fostered burgeoning demand for AnalyticDB on Alibaba
Cloud, which has well served thousands of customers from various business sectors. The …