Foundation models for time series analysis: A tutorial and survey

Y Liang, H Wen, Y Nie, Y Jiang, M Jin, D Song… - Proceedings of the 30th …, 2024 - dl.acm.org
Time series analysis stands as a focal point within the data mining community, serving as a
cornerstone for extracting valuable insights crucial to a myriad of real-world applications …

Ai for it operations (aiops) on cloud platforms: Reviews, opportunities and challenges

Q Cheng, D Sahoo, A Saha, W Yang, C Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Artificial Intelligence for IT operations (AIOps) aims to combine the power of AI with the big
data generated by IT Operations processes, particularly in cloud infrastructures, to provide …

TFAD: A decomposition time series anomaly detection architecture with time-frequency analysis

C Zhang, T Zhou, Q Wen, L Sun - Proceedings of the 31st ACM …, 2022 - dl.acm.org
Time series anomaly detection is a challenging problem due to the complex temporal
dependencies and the limited label data. Although some algorithms including both …

Adgym: Design choices for deep anomaly detection

M Jiang, C Hou, A Zheng, S Han… - Advances in …, 2024 - proceedings.neurips.cc
Deep learning (DL) techniques have recently found success in anomaly detection (AD)
across various fields such as finance, medical services, and cloud computing. However …

Robust time series analysis and applications: An industrial perspective

Q Wen, L Yang, T Zhou, L Sun - Proceedings of the 28th ACM SIGKDD …, 2022 - dl.acm.org
Time series analysis is ubiquitous and important in various areas, such as Artificial
Intelligence for IT Operations (AIOps) in cloud computing, AI-powered Business Intelligence …

Rcagent: Cloud root cause analysis by autonomous agents with tool-augmented large language models

Z Wang, Z Liu, Y Zhang, A Zhong, J Wang… - Proceedings of the 33rd …, 2024 - dl.acm.org
Large language model (LLM) applications in cloud root cause analysis (RCA) have been
actively explored recently. However, current methods are still reliant on manual workflow …

[PDF][PDF] Empowering practical root cause analysis by large language models for cloud incidents

Y Chen, H Xie, M Ma, Y Kang, X Gao… - arXiv preprint arXiv …, 2023 - yinfangchen.github.io
Ensuring the reliability and availability of cloud services necessitates efficient root cause
analysis (RCA) for cloud incidents. Traditional RCA methods, which rely on manual …

Nezha: Interpretable fine-grained root causes analysis for microservices on multi-modal observability data

G Yu, P Chen, Y Li, H Chen, X Li, Z Zheng - Proceedings of the 31st …, 2023 - dl.acm.org
Root cause analysis (RCA) in large-scale microservice systems is a critical and challenging
task. To understand and localize root causes of unexpected faults, modern observability …

Robust failure diagnosis of microservice system through multimodal data

S Zhang, P Jin, Z Lin, Y Sun, B Zhang… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Automatic failure diagnosis is crucial for large microservice systems. Currently, most failure
diagnosis methods rely solely on single-modal data (ie, using either metrics, logs, or traces) …

KGroot: A knowledge graph-enhanced method for root cause analysis

T Wang, G Qi, T Wu - Expert Systems with Applications, 2024 - Elsevier
Fault localization in online microservices is a challenging task due to the vast amount of
monitoring data, diversity of types and events, and complex interdependencies among …