作者
Mbarka Soualhia, Foutse Khomh, Sofiene Tahar
发表日期
2018/2/13
期刊
IEEE Transactions on Cloud Computing
卷号
8
期号
2
页码范围
553-569
出版商
IEEE
简介
Hadoop has become a popular framework for processing data-intensive applications in cloud environments. A core constituent of Hadoop is the scheduler, which is responsible for scheduling and monitoring the jobs and tasks, and rescheduling them in case of failures. Although fault-tolerance mechanisms have been proposed for Hadoop, the performance of Hadoop can be significantly impacted by unforeseen events in the cloud environment. In this paper, we introduce a dynamic and failure-aware framework that can be integrated within Hadoop scheduler and adjust the scheduling decisions based on collected information about the cloud environment. Our framework relies on predictions made by machine learning algorithms and scheduling policies generated by a Markovian Decision Process (MDP), to adjust its scheduling decisions on the fly. Instead of the fixed heartbeat-based failure detection commonly …
引用总数
2019202020212022202320242489155
学术搜索中的文章
M Soualhia, F Khomh, S Tahar - IEEE Transactions on Cloud Computing, 2018