A survey of data-intensive scientific workflow management
Nowadays, more and more computer-based scientific experiments need to handle massive
amounts of data. Their data processing consists of multiple computational steps and …
amounts of data. Their data processing consists of multiple computational steps and …
Toward data lakes as central building blocks for data management and analysis
P Wieder, H Nolte - Frontiers in big Data, 2022 - frontiersin.org
Data lakes are a fundamental building block for many industrial data analysis solutions and
becoming increasingly popular in research. Often associated with big data use cases, data …
becoming increasingly popular in research. Often associated with big data use cases, data …
A provenance-based adaptive scheduling heuristic for parallel scientific workflows in clouds
In the last years, scientific workflows have emerged as a fundamental abstraction for
structuring and executing scientific experiments in computational environments. Scientific …
structuring and executing scientific experiments in computational environments. Scientific …
Comparing futuregrid, amazon ec2, and open science grid for scientific workflows
Scientists have many computing infrastructures available to conduct their research,
including grids and public or private clouds. This article explores the use of these cyber …
including grids and public or private clouds. This article explores the use of these cyber …
Performance evaluation of parallel strategies in public clouds: A study with phylogenomic workflows
Data analysis is an exploratory process that demands high performance computing (HPC).
SciPhylomics, for example, is a data-intensive workflow that aims at producing …
SciPhylomics, for example, is a data-intensive workflow that aims at producing …
Beeflow: A workflow management system for in situ processing across hpc and cloud systems
In this paper, we propose BeeFlow-an in situ analysis enabled workflow management
system across multiple platforms using Docker containers. BeeFlow can support both …
system across multiple platforms using Docker containers. BeeFlow can support both …
Cloud autoscaling simulation based on queueing network model
For the development of a predictive autoscaler for private clouds, an evaluation method was
needed. A survey of available tools was made, but none were found suitable. The …
needed. A survey of available tools was made, but none were found suitable. The …
[PDF][PDF] Big data workflows: A reference architecture and the DATAVIEW system
The big data era is here, a natural result of the digital revolution of the last few decades. The
emergence of big data in virtually all areas of life raises a fundamental question-how can we …
emergence of big data in virtually all areas of life raises a fundamental question-how can we …
A method for trust quantification in cloud computing environments
X Li, J He, B Zhao, J Fang, Y Zhang… - International Journal of …, 2016 - journals.sagepub.com
Cloud computing and Internet of Things (IoT) are emerging technologies that have
experienced rapid development in recent years. While cloud computing presents a new …
experienced rapid development in recent years. While cloud computing presents a new …
A reinforcement learning scheduling strategy for parallel cloud-based workflows
A Nascimento, V Olimpio, V Silva… - 2019 IEEE …, 2019 - ieeexplore.ieee.org
Scientific experiments can be modeled as Workflows. Such Workflows are usually
computing-and data-intensive, demanding the use of High-Performance Computing …
computing-and data-intensive, demanding the use of High-Performance Computing …