Performance anomaly detection and bottleneck identification
O Ibidunmoye, F Hernández-Rodriguez… - ACM Computing Surveys …, 2015 - dl.acm.org
In order to meet stringent performance requirements, system administrators must effectively
detect undesirable performance behaviours, identify potential root causes, and take …
Sage: practical and scalable ML-driven performance debugging in microservices
Cloud applications are increasingly shifting from large monolithic services to complex
graphs of loosely-coupled microservices. Despite the advantages of modularity and …
A systematic mapping study in AIOps
IT systems of today are becoming larger and more complex, rendering their human
supervision more difficult. Artificial Intelligence for IT Operations (AIOps) has been proposed …
A survey of AIOps methods for failure management
Modern society is increasingly moving toward complex and distributed computing systems.
The increase in scale and complexity of these systems challenges O&M teams that perform …
Studying the effectiveness of application performance management (APM) tools for detecting performance regressions for web applications: an experience report
Performance regressions, such as a higher CPU utilization than in the previous version of an
application, are caused by software application updates that negatively affect the …
Automated dynamic firmware analysis at scale: a case study on embedded web interfaces
Embedded devices are becoming more widespread, interconnected, and web-enabled than
ever. However, recent studies showed that embedded devices are far from being secure …
Understanding and detecting real-world performance bugs
Developers frequently use inefficient code sequences that could be fixed by simple patches.
These inefficient code sequences can cause significant performance degradation and …
Microscope: Pinpoint performance issues with causal graphs in micro-service environments
Driven by the emerging business models (e.g., digital sales) and IT technologies (e.g., DevOps
and Cloud computing), the architecture of software is shifting from monolithic to microservice …
Learning to log: Helping developers make informed logging decisions
Logging is a common programming practice of practical importance to collect system
runtime information for postmortem analysis. Strategic logging placement is desired to cover …
Structured comparative analysis of systems logs to diagnose performance problems
Diagnosis and correction of performance issues in modern, large-scale distributed systems
can be a daunting task, since a single developer is unlikely to be familiar with the entire …