Data streams: Algorithms and applications
S Muthukrishnan - Foundations and Trends® in Theoretical …, 2005 - nowpublishers.com
In the data stream scenario, input arrives very rapidly and there is limited memory to store
the input. Algorithms have to work with one or few passes over the data, space less than …
the input. Algorithms have to work with one or few passes over the data, space less than …
Methods for mining frequent items in data streams: an overview
In many real-world applications, information such as web click data, stock ticker data, sensor
network data, phone call records, and traffic monitoring data appear in the form of data …
network data, phone call records, and traffic monitoring data appear in the form of data …
Quantitative data cleaning for large databases
JM Hellerstein - 2013 - biblioteca.unisced.edu.mz
Data collection has become a ubiquitous function of large organizations {not only for record
keeping, but to support a variety of data analysis tasks that are critical to the organizational …
keeping, but to support a variety of data analysis tasks that are critical to the organizational …
Constant time updates in hierarchical heavy hitters
Monitoring tasks, such as anomaly and DDoS detection, require identifying frequent flow
aggregates based on common IP prefixes. These are known as hierarchical heavy hitters …
aggregates based on common IP prefixes. These are known as hierarchical heavy hitters …
Space-optimal heavy hitters with strong error bounds
The problem of finding heavy hitters and approximating the frequencies of items is at the
heart of many problems in data stream analysis. It has been observed that several proposed …
heart of many problems in data stream analysis. It has been observed that several proposed …
Finding hierarchical heavy hitters in streaming data
G Cormode, F Korn, S Muthukrishnan… - ACM Transactions on …, 2008 - dl.acm.org
Data items that arrive online as streams typically have attributes which take values from one
or more hierarchies (time and geographic location, source and destination IP addresses …
or more hierarchies (time and geographic location, source and destination IP addresses …
Memento: Making sliding windows efficient for heavy hitters
Cloud operators require real-time identification of Heavy Hitters (HH) and Hierarchical
Heavy Hitters (HHH) for applications such as load balancing, traffic engineering, and attack …
Heavy Hitters (HHH) for applications such as load balancing, traffic engineering, and attack …
Hierarchical heavy hitters with the space saving algorithm
Abstract The Hierarchical Heavy Hitters problem extends the notion of frequent items to data
arranged in a hierarchy. This problem has applications to network traffic monitoring …
arranged in a hierarchy. This problem has applications to network traffic monitoring …
A high-performance algorithm for identifying frequent items in data streams
D Anderson, P Bevan, K Lang, E Liberty… - Proceedings of the …, 2017 - dl.acm.org
Estimating frequencies of items over data streams is a common building block in streaming
data measurement and analysis. Misra and Gries introduced their seminal algorithm for the …
data measurement and analysis. Misra and Gries introduced their seminal algorithm for the …
An Optimal Algorithm for ℓ1-Heavy Hitters in Insertion Streams and Related Problems
We give the first optimal bounds for returning the ℓ1-heavy hitters in a data stream of
insertions, together with their approximate frequencies, closing a long line of work on this …
insertions, together with their approximate frequencies, closing a long line of work on this …