作者
Yunpeng Cai, Yijun Sun
发表日期
2011/8/1
期刊
Nucleic acids research
卷号
39
期号
14
页码范围
e95-e95
出版商
Oxford University Press
简介
Taxonomy-independent analysis plays an essential role in microbial community analysis. Hierarchical clustering is one of the most widely employed approaches to finding operational taxonomic units, the basis for many downstream analyses. Most existing algorithms have quadratic space and computational complexities, and thus can be used only for small or medium-scale problems. We propose a new online learning-based algorithm that simultaneously addresses the space and computational issues of prior work. The basic idea is to partition a sequence space into a set of subspaces using a partition tree constructed using a pseudometric, then recursively refine a clustering structure in these subspaces. The technique relies on new methods for fast closest-pair searching and efficient dynamic insertion and deletion of tree nodes. To avoid exhaustive computation of pairwise distances between clusters, we …
引用总数
201120122013201420152016201720182019202020212022202320242172016271921912614891