Autonomic Service Operation for Cloud Applications: Safe Actuation and Risk Management

J Tomás, A Bento, J Soares, L Ribeiro… - … Computing-EDCC 2021 …, 2021 - Springer
Dependable Computing-EDCC 2021 Workshops: DREAMS, DSOGRI, SERENE 2021, Munich …, 2021 - Springer
Abstract
Cloud-native applications consist of highly specialized and decoupled services that can be deployed, scaled and managed independently. Keeping such applications available is a complex task for operators, because software defects and other kinds of faults can be hard to diagnose and repair quickly enough to resume operations. Autonomic service operation is therefore a promising approach. However, guaranteeing safe autonomic actuation carries risks, which must be managed. This paper discusses the challenges identified during the development of a platform for autonomic service operation and describes the software architecture of the platform. Results show mean times to detect, diagnose and repair failures on the order of tens of seconds.