Predicting and preventing inconsistencies in deployed distributed systems
M Yabandeh, N Knežević, D Kostić… - ACM Transactions on …, 2010 - dl.acm.org
We propose a new approach for developing and deploying distributed systems, in which
nodes predict distributed consequences of their actions and use this information to detect …
nodes predict distributed consequences of their actions and use this information to detect …
Enhanced monitoring-as-a-service for effective cloud management
S Meng, L Liu - IEEE Transactions on Computers, 2012 - ieeexplore.ieee.org
This paper introduces the concept of monitoring-as-a-service (MaaS), its main components,
and a suite of key functional requirements of MaaS in cloud. We argue that MaaS should …
and a suite of key functional requirements of MaaS in cloud. We argue that MaaS should …
Themis: Fairness in federated stream processing under overload
E Kalyvianaki, M Fiscato, T Salonidis… - Proceedings of the 2016 …, 2016 - dl.acm.org
Federated stream processing systems, which utilise nodes from multiple independent
domains, can be found increasingly in multi-provider cloud deployments, internet-of-things …
domains, can be found increasingly in multi-provider cloud deployments, internet-of-things …
Deployment strategies for distributed complex event processing
Several complex event processing (CEP) middleware solutions have been proposed in the
past. They act by processing primitive events generated by sources, extracting new …
past. They act by processing primitive events generated by sources, extracting new …
Do you know your IQ? A research agenda for information quality in systems
Information quality (IQ) is a measure of how fit information is for a purpose. Sometimes
called Quality of Information (QoI) by analogy with Quality of Service (QoS), it quantifies …
called Quality of Information (QoI) by analogy with Quality of Service (QoS), it quantifies …
Toward high-performance distributed stream processing via approximate fault tolerance
Fault tolerance is critical for distributed stream processing systems, yet achieving error-free
fault tolerance often incurs substantial performance overhead. We present AF-Stream, a …
fault tolerance often incurs substantial performance overhead. We present AF-Stream, a …
Reliable state monitoring in cloud datacenters
State monitoring is widely used for detecting critical events and abnormalities of distributed
systems. As the scale of such systems grows and the degree of workload consolidation …
systems. As the scale of such systems grows and the degree of workload consolidation …
Performance troubleshooting in data centers: an annotated bibliography?
In the emerging cloud computing era, enterprise data centers host a plethora of web
services and applications, including those for e-Commerce, distributed multimedia, and …
services and applications, including those for e-Commerce, distributed multimedia, and …
State monitoring in cloud datacenters
Monitoring global states of a distributed cloud application is a critical functionality for cloud
datacenter management. State monitoring requires meeting two demanding objectives: high …
datacenter management. State monitoring requires meeting two demanding objectives: high …
LiMoSense: live monitoring in dynamic sensor networks
We present LiMoSense, a fault-tolerant live monitoring algorithm for dynamic sensor
networks. This is the first asynchronous robust average aggregation algorithm that performs …
networks. This is the first asynchronous robust average aggregation algorithm that performs …