[HTML][HTML] Fault tolerance in big data storage and processing systems: A review on challenges and solutions

M Saadoon, SHA Hamid, H Sofian… - Ain Shams Engineering …, 2022 - Elsevier
Big data systems are sufficiently stable to store and process a massive volume of rapidly
changing data. However, big data systems are composed of large-scale hardware resources …

Effectiveness review of the machine learning algorithms for scheduling in cloud environment

GU Srikanth, R Geetha - Archives of Computational Methods in …, 2023 - Springer
Cloud computing becomes rapid growing technology using virtualization and Service
Oriented Architecture (SOA), developed as one of the sought-after computing models. It is a …

Reinforcement learning-based application autoscaling in the cloud: A survey

Y Garí, DA Monge, E Pacini, C Mateos… - … Applications of Artificial …, 2021 - Elsevier
Reinforcement Learning (RL) has demonstrated a great potential for automatically solving
decision-making problems in complex, uncertain environments. RL proposes a …

Failure-resilient DAG task scheduling in edge computing

L Cai, X Wei, C Xing, X Zou, G Zhang, X Wang - Computer Networks, 2021 - Elsevier
Through placing computation, storage, and communications facilities near the data source,
Edge Computing (EC) is anticipated to extend the intelligence from the central cloud to the …

Deep reinforcement learning for fault-tolerant workflow scheduling in cloud environment

T Dong, F Xue, H Tang, C Xiao - Applied Intelligence, 2023 - Springer
Cloud computing is widely used in various fields, which can provide sufficient computing
resources to address users' demands (workflows) quickly and effectively. However, resource …

Replication-Based Query Management for Resource Allocation Using Hadoop and MapReduce over Big Data

A Kumar, N Varshney, S Bhatiya… - Big Data Mining and …, 2023 - ieeexplore.ieee.org
We live in an age where everything around us is being created. Data generation rates are so
scary, creating pressure to implement costly and straightforward data storage and recovery …

Proactive failure-aware task scheduling framework for cloud computing

Y Alahmad, T Daradkeh, A Agarwal - IEEE Access, 2021 - ieeexplore.ieee.org
Cloud computing is a widely adopted platform for executing tasks of different application
types that belong to the end users. In the cloud, application task is prone to failure for several …

A comparison of reinforcement learning frameworks for software testing tasks

PS Nouwou Mindom, A Nikanjam, F Khomh - Empirical Software …, 2023 - Springer
Software testing activities scrutinize the artifacts and the behavior of a software product to
find possible defects and ensure that the product meets its expected requirements. Although …

An effective scheduling strategy based on hypergraph partition in geographically distributed datacenters

C Li, Y Zhang, Z Hao, Y Luo - Computer Networks, 2020 - Elsevier
As the complexity of workflow applications increase, the scheduling and execution of
workflow incur more waste of resources. In order to achieve load balancing and reduce the …

A Q-learning approach for the autoscaling of scientific workflows in the cloud

Y Garí, DA Monge, C Mateos - Future Generation Computer Systems, 2022 - Elsevier
Autoscaling strategies aim to exploit the elasticity, resource heterogeneity and varied prices
options of a Cloud infrastructure to improve efficiency in the execution of resource-hungry …