Searching in one billion vectors: re-rank with source coding- 学术资源搜索

Searching in one billion vectors: re-rank with source coding

H Jégou, R Tavenard, M Douze… - 2011 IEEE International …, 2011 - ieeexplore.ieee.org

2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011•ieeexplore.ieee.org

Recent indexing techniques inspired by source coding have been shown successful to index billions of high-dimensional vectors in memory. In this paper, we propose an approach that re-ranks the neighbor hypotheses obtained by these compressed-domain indexing methods. In contrast to the usual post-verification scheme, which performs exact distance calculation on the short-list of hypotheses, the estimated distances are refined based on short quantization codes, to avoid reading the full vectors from disk. We have released a new public dataset of one billion 128 dimensional vectors and proposed an experimental setup to evaluate high dimensional indexing algorithms on a realistic scale. Experiments show that our method accurately and efficiently re-ranks the neighbor hypotheses using little memory compared to the full vectors representation.

ieeexplore.ieee.org

展开收起

被引用次数：336 相关文章所有 22 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果