Milvus: A purpose-built vector data management system

J Wang, X Yi, R Guo, H Jin, P Xu, S Li, X Wang… - Proceedings of the …, 2021 - dl.acm.org
Recently, there has been a pressing need to manage high-dimensional vector data in data
science and AI applications. This trend is fueled by the proliferation of unstructured data and …

Challenges in KNN classification

S Zhang - IEEE Transactions on Knowledge and Data …, 2021 - ieeexplore.ieee.org
The KNN algorithm is one of the most popular data mining algorithms. It has been widely
and successfully applied to data analysis applications across a variety of research topics in …

Towards efficient index construction and approximate nearest neighbor search in high-dimensional spaces

X Zhao, Y Tian, K Huang, B Zheng, X Zhou - Proceedings of the VLDB …, 2023 - dl.acm.org
The approximate nearest neighbor (ANN) search in high-dimensional spaces is a
fundamental but computationally very expensive problem. Many methods have been …

Manu: a cloud native vector database management system

R Guo, X Luan, L Xiang, X Yan, X Yi, J Luo… - arXiv preprint arXiv …, 2022 - arxiv.org
With the development of learning-based embedding models, embedding vectors are widely
used for analyzing and searching unstructured data. As vector collections exceed billion …

Approximate Nearest Neighbor Search in High Dimensional Vector Databases: Current Research and Future Directions.

Y Tian, Z Yue, R Zhang, X Zhao, B Zheng… - IEEE Data Eng …, 2023 - sites.computer.org
Approximate nearest neighbor search is an important research topic with a wide range of
applications. In this study, we first introduce the problem and review major research results …

High-dimensional approximate nearest neighbor search: with reliable and efficient distance comparison operations

J Gao, C Long - Proceedings of the ACM on Management of Data, 2023 - dl.acm.org
Approximate K nearest neighbor (AKNN) search in the high-dimensional Euclidean vector
space is a fundamental and challenging problem. We observe that in high-dimensional …

DB-LSH 2.0: Locality-sensitive hashing with query-based dynamic bucketing

Y Tian, X Zhao, X Zhou - IEEE Transactions on Knowledge and …, 2023 - ieeexplore.ieee.org
Locality-sensitive hashing (LSH) is a promising family of methods for the high-dimensional
approximate nearest neighbor (ANN) search problem due to its sub-linear query time and …

HVS: hierarchical graph structure based on voronoi diagrams for solving approximate nearest neighbor search

K Lu, M Kudo, C Xiao, Y Ishikawa - Proceedings of the VLDB Endowment, 2021 - dl.acm.org
Approximate nearest neighbor search (ANNS) is a fundamental problem that has a wide
range of applications in information retrieval and data mining. Among state-of-the-art in …

Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data Segment

M Wang, W Xu, X Yi, S Wu, Z Peng, X Ke… - Proceedings of the …, 2024 - dl.acm.org
High-dimensional vector similarity search (HVSS) is gaining prominence as a powerful tool
for various data science and AI applications. As vector data scales up, in-memory indexes …

Must: An effective and scalable framework for multimodal search of target modality

M Wang, X Ke, X Xu, L Chen, Y Gao… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
We investigate the problem of multimodal search of target modality, where the task involves
enhancing a query in a specific target modality by integrating information from auxiliary …