Processing flows of information: From data stream to complex event processing

G Cugola, A Margara - ACM Computing Surveys (CSUR), 2012 - dl.acm.org
A large number of distributed applications requires continuous and timely processing of
information as it flows from the periphery to the center of the system. Examples include …

The big data system, components, tools, and technologies: a survey

TR Rao, P Mitra, R Bhatt, A Goswami - Knowledge and Information …, 2019 - Springer
The traditional databases are not capable of handling unstructured data and high volumes
of real-time datasets. Diverse datasets are unstructured lead to big data, and it is laborious …

Videoedge: Processing camera streams using hierarchical clusters

CC Hung, G Ananthanarayanan… - 2018 IEEE/ACM …, 2018 - ieeexplore.ieee.org
Organizations deploy a hierarchy of clusters-cameras, private clusters, public clouds-for
analyzing live video feeds from their cameras. Video analytics queries have many …

Live video analytics at scale with approximation and {Delay-Tolerance}

H Zhang, G Ananthanarayanan, P Bodik… - … USENIX Symposium on …, 2017 - usenix.org
Video cameras are pervasively deployed for security and smart city scenarios, with millions
of them in large cities worldwide. Achieving the potential of these cameras requires …

Sonata: Query-driven streaming network telemetry

A Gupta, R Harrison, M Canini, N Feamster… - Proceedings of the …, 2018 - dl.acm.org
Managing and securing networks requires collecting and analyzing network traffic data in
real time. Existing telemetry systems do not allow operators to express the range of queries …

Millwheel: Fault-tolerant stream processing at internet scale

T Akidau, A Balikov, K Bekiroğlu, S Chernyak… - Proceedings of the …, 2013 - dl.acm.org
MillWheel is a framework for building low-latency data-processing applications that is widely
used at Google. Users specify a directed computation graph and application code for …

Data streams: Algorithms and applications

S Muthukrishnan - Foundations and Trends® in Theoretical …, 2005 - nowpublishers.com
In the data stream scenario, input arrives very rapidly and there is limited memory to store
the input. Algorithms have to work with one or few passes over the data, space less than …

[PDF][PDF] MapReduce online.

T Condie, N Conway, P Alvaro, JM Hellerstein… - Nsdi, 2010 - usenix.org
MapReduce is a popular framework for data-intensive distributed computing of batch jobs.
To simplify fault tolerance, many implementations of MapReduce materialize the entire …

Aurora: a new model and architecture for data stream management

DJ Abadi, D Carney, U Cetintemel, M Cherniack… - the VLDB Journal, 2003 - Springer
This paper describes the basic processing model and architecture of Aurora, a new system
to manage data streams for monitoring applications. Monitoring applications differ …

Integrating scale out and fault tolerance in stream processing using operator state management

R Castro Fernandez, M Migliavacca… - Proceedings of the …, 2013 - dl.acm.org
As users of" big data" applications expect fresh results, we witness a new breed of stream
processing systems (SPS) that are designed to scale to large numbers of cloud-hosted …