Cloud-native database systems at Alibaba: Opportunities and challenges

F Li - Proceedings of the VLDB Endowment, 2019 - dl.acm.org
Cloud-native databases become increasingly important for the era of cloud computing, due
to the needs for elasticity and on-demand usage by various applications. These challenges …

Survey of vector database management systems

JJ Pan, J Wang, G Li - The VLDB Journal, 2024 - Springer
There are now over 20 commercial vector database management systems (VDBMSs), all
produced within the past five years. But embedding-based retrieval has been studied for …

A structured review of data management technology for interactive visualization and analysis

L Battle, C Scheidegger - IEEE transactions on visualization …, 2020 - ieeexplore.ieee.org
In the last two decades, interactive visualization and analysis have become a central tool in
data-driven decision making. Concurrently to the contributions in data visualization …

AnalyticDB-V: a hybrid analytical engine towards query fusion for structured and unstructured data

C Wei, B Wu, S Wang, R Lou, C Zhan, F Li… - Proceedings of the VLDB …, 2020 - dl.acm.org
With the explosive growth of unstructured data (such as images, videos, and audios),
unstructured data analytics is widespread in a rich vein of real-world applications. Many …

Data management for machine learning: A survey

C Chai, J Wang, Y Luo, Z Niu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Machine learning (ML) has widespread applications and has revolutionized many
industries, but suffers from several challenges. First, sufficient high-quality training data is …

Selective data acquisition in the wild for model charging

C Chai, J Liu, N Tang, G Li, Y Luo - Proceedings of the VLDB …, 2022 - dl.acm.org
The lack of sufficient labeled data is a key bottleneck for practitioners in many real-world
supervised machine learning (ML) tasks. In this paper, we study a new problem, namely …

The case for distributed shared-memory databases with RDMA-enabled memory disaggregation

R Wang, J Wang, S Idreos, MT Özsu… - arXiv preprint arXiv …, 2022 - arxiv.org
Memory disaggregation (MD) allows for scalable and elastic data center design by
separating compute (CPU) from memory. With MD, compute and memory are no longer …

Cloud-Native Databases: A Survey

H Dong, C Zhang, G Li, H Zhang - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Cloud databases have been widely accepted and deployed due to their unique advantages,
such as high elasticity, high availability, and low cost. Many new techniques, such as …

Disaggregated database systems

J Wang, Q Zhang - Companion of the 2023 International Conference on …, 2023 - dl.acm.org
Disaggregated database systems achieve unprecedented excellence in elasticity and
resource utilization at the cloud scale and have gained great momentum from both industry …

RW-tree: A learned workload-aware framework for R-tree construction

H Dong, C Chai, Y Luo, J Liu, J Feng… - 2022 IEEE 38th …, 2022 - ieeexplore.ieee.org
R-tree is a popular index which supports efficient queries on multi-dimensional data. The
performance of R-tree mostly depends on how the tree structure is built if new data instances …