作者
Pankaj K Agarwal, Graham Cormode, Zengfeng Huang, Jeff M Phillips, Zhewei Wei, Ke Yi
发表日期
2013/12/4
期刊
ACM Transactions on Database Systems (TODS)
卷号
38
期号
4
页码范围
1-28
出版商
ACM
简介
We study the mergeability of data summaries. Informally speaking, mergeability requires that, given two summaries on two datasets, there is a way to merge the two summaries into a single summary on the two datasets combined together, while preserving the error and size guarantees. This property means that the summaries can be merged in a way akin to other algebraic operators such as sum and max, which is especially useful for computing summaries on massive distributed data. Several data summaries are trivially mergeable by construction, most notably all the sketches that are linear functions of the datasets. But some other fundamental ones, like those for heavy hitters and quantiles, are not (known to be) mergeable. In this article, we demonstrate that these summaries are indeed mergeable or can be made mergeable after appropriate modifications. Specifically, we show that for ε-approximate heavy …
引用总数
2012201320142015201620172018201920202021202220232024119810101621212219317
学术搜索中的文章
PK Agarwal, G Cormode, Z Huang, JM Phillips, Z Wei… - ACM Transactions on Database Systems (TODS), 2013