Data streams: Algorithms and applications

S Muthukrishnan - Foundations and Trends® in Theoretical …, 2005 - nowpublishers.com
In the data stream scenario, input arrives very rapidly and there is limited memory to store
the input. Algorithms have to work with one or few passes over the data, space less than …

Methods for mining frequent items in data streams: an overview

H Liu, Y Lin, J Han - Knowledge and information systems, 2011 - Springer
In many real-world applications, information such as web click data, stock ticker data, sensor
network data, phone call records, and traffic monitoring data appear in the form of data …

Quantitative data cleaning for large databases

JM Hellerstein - 2013 - biblioteca.unisced.edu.mz
Data collection has become a ubiquitous function of large organizations {not only for record
keeping, but to support a variety of data analysis tasks that are critical to the organizational …

Constant time updates in hierarchical heavy hitters

R Ben Basat, G Einziger, R Friedman… - Proceedings of the …, 2017 - dl.acm.org
Monitoring tasks, such as anomaly and DDoS detection, require identifying frequent flow
aggregates based on common IP prefixes. These are known as hierarchical heavy hitters …

Space-optimal heavy hitters with strong error bounds

R Berinde, P Indyk, G Cormode… - ACM Transactions on …, 2010 - dl.acm.org
The problem of finding heavy hitters and approximating the frequencies of items is at the
heart of many problems in data stream analysis. It has been observed that several proposed …

Finding hierarchical heavy hitters in streaming data

G Cormode, F Korn, S Muthukrishnan… - ACM Transactions on …, 2008 - dl.acm.org
Data items that arrive online as streams typically have attributes which take values from one
or more hierarchies (time and geographic location, source and destination IP addresses …

Memento: Making sliding windows efficient for heavy hitters

RB Basat, G Einziger, I Keslassy, A Orda… - Proceedings of the 14th …, 2018 - dl.acm.org
Cloud operators require real-time identification of Heavy Hitters (HH) and Hierarchical
Heavy Hitters (HHH) for applications such as load balancing, traffic engineering, and attack …

Hierarchical heavy hitters with the space saving algorithm

M Mitzenmacher, T Steinke, J Thaler - 2012 Proceedings of the Fourteenth …, 2012 - SIAM
Abstract The Hierarchical Heavy Hitters problem extends the notion of frequent items to data
arranged in a hierarchy. This problem has applications to network traffic monitoring …

A high-performance algorithm for identifying frequent items in data streams

D Anderson, P Bevan, K Lang, E Liberty… - Proceedings of the …, 2017 - dl.acm.org
Estimating frequencies of items over data streams is a common building block in streaming
data measurement and analysis. Misra and Gries introduced their seminal algorithm for the …

An Optimal Algorithm for ℓ1-Heavy Hitters in Insertion Streams and Related Problems

A Bhattacharyya, P Dey, DP Woodruff - ACM Transactions on Algorithms …, 2018 - dl.acm.org
We give the first optimal bounds for returning the ℓ1-heavy hitters in a data stream of
insertions, together with their approximate frequencies, closing a long line of work on this …