Provenance and scientific workflows: challenges and opportunities

SB Davidson, J Freire - Proceedings of the 2008 ACM SIGMOD …, 2008 - dl.acm.org
Provenance in the context of workflows, both for the data they derive and for their
specification, is an essential component to allow for result reproducibility, sharing, and …

Iron behaving badly: inappropriate iron chelation as a major contributor to the aetiology of vascular and other progressive inflammatory and degenerative diseases

DB Kell - BMC medical genomics, 2009 - Springer
Background The production of peroxide and superoxide is an inevitable consequence of
aerobic metabolism, and while these particular 'reactive oxygen species' (ROSs) can exhibit a …

A survey on provenance: What for? What form? What from?

M Herschel, R Diestelkämper, H Ben Lahmar - The VLDB Journal, 2017 - Springer
Provenance refers to any information describing the production process of an end product,
which can be anything from a piece of digital data to a physical object. While this survey …

[HTML] Nipype: a flexible, lightweight and extensible neuroimaging data processing framework in python

K Gorgolewski, CD Burns, C Madison… - Frontiers in …, 2011 - frontiersin.org
Current neuroimaging software offer users an incredible opportunity to analyze their data in
different ways, with different underlying assumptions. Several sophisticated software …

[Book] Principles of data integration

AH Doan, A Halevy, Z Ives - 2012 - books.google.com
Principles of Data Integration is the first comprehensive textbook of data integration,
covering theoretical principles and implementation issues as well as current challenges …

The future of scientific workflows

E Deelman, T Peterka, I Altintas… - … Journal of High …, 2018 - journals.sagepub.com
Today's computational, experimental, and observational sciences rely on computations that
involve many related tasks. The success of a scientific mission often hinges on the computer …

[PDF] DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language

Y Yu, M Isard, D Fetterly, M Budiu, Ú Erlingsson, PK Gunda, J Currey - Proc. LSDS-IR, 2009 - usenix.org
DryadLINQ is a system and a set of language extensions that enable a new programming
model for large scale distributed computing. It generalizes previous execution environments …

Taverna: a tool for building and running workflows of services

D Hull, K Wolstencroft, R Stevens, C Goble… - Nucleic acids …, 2006 - academic.oup.com
Taverna is an application that eases the use and integration of the growing number of
molecular biology tools and databases available on the web, especially web services. It …

Finding related tables in data lakes for interactive data science

Y Zhang, ZG Ives - Proceedings of the 2020 ACM SIGMOD International …, 2020 - dl.acm.org
Many modern data science applications build on data lakes, schema-agnostic repositories
of data files and data products that offer limited organization and management capabilities …

Dynamic QoS management and optimization in service-based systems

R Calinescu, L Grunske, M Kwiatkowska… - IEEE Transactions …, 2010 - ieeexplore.ieee.org
Service-based systems that are dynamically composed at runtime to provide complex,
adaptive functionality are currently one of the main development paradigms in software …