Predicting and preventing inconsistencies in deployed distributed systems

M Yabandeh, N Knežević, D Kostić… - ACM Transactions on …, 2010 - dl.acm.org
We propose a new approach for developing and deploying distributed systems, in which
nodes predict distributed consequences of their actions and use this information to detect …

Enhanced monitoring-as-a-service for effective cloud management

S Meng, L Liu - IEEE Transactions on Computers, 2012 - ieeexplore.ieee.org
This paper introduces the concept of monitoring-as-a-service (MaaS), its main components,
and a suite of key functional requirements of MaaS in cloud. We argue that MaaS should …

Themis: Fairness in federated stream processing under overload

E Kalyvianaki, M Fiscato, T Salonidis… - Proceedings of the 2016 …, 2016 - dl.acm.org
Federated stream processing systems, which utilise nodes from multiple independent
domains, can be found increasingly in multi-provider cloud deployments, internet-of-things …

Deployment strategies for distributed complex event processing

G Cugola, A Margara - Computing, 2013 - Springer
Several complex event processing (CEP) middleware solutions have been proposed in the
past. They act by processing primitive events generated by sources, extracting new …

Do you know your IQ? A research agenda for information quality in systems

K Keeton, P Mehra, J Wilkes - ACM SIGMETRICS Performance …, 2010 - dl.acm.org
Information quality (IQ) is a measure of how fit information is for a purpose. Sometimes
called Quality of Information (QoI) by analogy with Quality of Service (QoS), it quantifies …

Toward high-performance distributed stream processing via approximate fault tolerance

Q Huang, PPC Lee - Proceedings of the VLDB Endowment, 2016 - dl.acm.org
Fault tolerance is critical for distributed stream processing systems, yet achieving error-free
fault tolerance often incurs substantial performance overhead. We present AF-Stream, a …

Reliable state monitoring in cloud datacenters

S Meng, AK Iyengar, IM Rouvellou, L Liu… - 2012 IEEE Fifth …, 2012 - ieeexplore.ieee.org
State monitoring is widely used for detecting critical events and abnormalities of distributed
systems. As the scale of such systems grows and the degree of workload consolidation …

Performance troubleshooting in data centers: an annotated bibliography?

C Wang, SP Kavulya, J Tan, L Hu, M Kutare… - ACM SIGOPS …, 2013 - dl.acm.org
In the emerging cloud computing era, enterprise data centers host a plethora of web
services and applications, including those for e-Commerce, distributed multimedia, and …

State monitoring in cloud datacenters

S Meng, L Liu, T Wang - IEEE transactions on Knowledge and …, 2011 - ieeexplore.ieee.org
Monitoring global states of a distributed cloud application is a critical functionality for cloud
datacenter management. State monitoring requires meeting two demanding objectives: high …

LiMoSense: live monitoring in dynamic sensor networks

I Eyal, I Keidar, R Rom - Distributed computing, 2014 - Springer
We present LiMoSense, a fault-tolerant live monitoring algorithm for dynamic sensor
networks. This is the first asynchronous robust average aggregation algorithm that performs …