Graph-based root cause analysis for service-oriented and microservice architectures

Á Brandón, M Solé, A Huélamo, D Solans… - Journal of Systems and …, 2020 - Elsevier
Service-oriented architectures and microservices define two ways of designing
software with the aim of dividing an application into loosely-coupled services that …

Sieve: Actionable insights from monitored metrics in distributed systems

J Thalheim, A Rodrigues, IE Akkus, P Bhatotia… - Proceedings of the 18th …, 2017 - dl.acm.org
Major cloud computing operators provide powerful monitoring tools to understand the
current (and prior) state of the distributed systems deployed in their infrastructure. While …

Microservices monitoring with event logs and black box execution tracing

M Cinque, R Della Corte… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Monitoring is a core practice in any software system. Trends in microservices systems
exacerbate the role of monitoring and pose novel challenges to data sources being used for …

Advancing monitoring in microservices systems

M Cinque, R Della Corte… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
Monitoring is a core reliability engineering practice to gain insights into production systems.
New trends in microservices exacerbate the role of monitoring. This paper discusses key …

Computer identification and quantification of fissured tongue diagnosis

HK Zhang, YY Hu, LJ Wang… - … on bioinformatics and …, 2018 - ieeexplore.ieee.org
This paper presents a new computer identification method for the diagnosis of the fissured
tongue, which can not only detect the existence of fissures, but also quantify the severity …

Autotune: Improving end-to-end performance and resource efficiency for microservice applications

MA Chang, A Panda, H Wang, Y Tsai… - arXiv preprint arXiv …, 2021 - arxiv.org
Most large web-scale applications are now built by composing collections (from a few up to
100s or 1000s) of microservices. Operators need to decide how many resources are …

BALANCE: Bayesian Linear Attribution for Root Cause Localization

C Chen, H Yu, Z Lei, J Li, S Ren, T Zhang… - Proceedings of the …, 2023 - dl.acm.org
Root Cause Analysis (RCA) plays an indispensable role in distributed data system
maintenance and operations, as it bridges the gap between fault detection and system …

An exploratory study on zeroconf monitoring of microservices systems

M Cinque, R Della Corte, R Iorio… - 2018 14th European …, 2018 - ieeexplore.ieee.org
This paper presents an explorative study on microservices monitoring. The study paves the
way for MetroFunnel, our novel application-transparent and zeroconf monitoring tool, which …

CCA: An ML Pipeline for Cloud Anomaly Troubleshooting

L Georgieva, I Giurgiu, S Monney, H Pozidis… - Proceedings of the …, 2022 - ojs.aaai.org
Cloud Causality Analyzer (CCA) is an ML-based analytical pipeline to automate the
tedious process of Root Cause Analysis (RCA) of Cloud IT events. The 3-stage pipeline is …

Root cause analysis for large-scale cloud-native applications

B Zurkowski - 2022 - doktoraty.iet.agh.edu.pl
This chapter introduces the research problem considered in this dissertation. Followed by
presenting the problem motivation, the dissertation thesis is formulated. Then, the research …