Big data resource management & networks: Taxonomy, survey, and future directions

FM Awaysheh, M Alazab, S Garg… - … Surveys & Tutorials, 2021 - ieeexplore.ieee.org
Big Data (BD) platforms have a long tradition of leveraging trends and technologies from the
broader computer network and communication community. For several years, dedicated …

A review on hadoop—HDFS infrastructure extensions

AK Karun, K Chitharanjan - 2013 IEEE conference on …, 2013 - ieeexplore.ieee.org
Apache's Hadoop 1 as of now is pretty good but there are scopes of extensions and
enhancements. A large number of improvements are proposed to Hadoop which is an open …

The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update

E Afgan, D Baker, B Batut, M Van Den Beek… - Nucleic acids …, 2018 - academic.oup.com
Galaxy (homepage: https://galaxyproject.org, main public server: https://usegalaxy.org)
is a web-based scientific analysis platform used by tens of thousands of scientists …

SkyPilot: An intercloud broker for sky computing

Z Yang, Z Wu, M Luo, WL Chiang, R Bhardwaj… - … USENIX Symposium on …, 2023 - usenix.org
To comply with the increasing number of government regulations about data placement and
processing, and to protect themselves against major cloud outages, many users want the …

Protean: VM allocation service at scale

O Hadary, L Marshall, I Menache, A Pan… - … USENIX Symposium on …, 2020 - usenix.org
We describe the design and implementation of Protean--the Microsoft Azure service
responsible for allocating Virtual Machines (VMs) to millions of servers around the globe. A …

Pegasus, a workflow management system for science automation

E Deelman, K Vahi, G Juve, M Rynge… - Future Generation …, 2015 - Elsevier
Modern science often requires the execution of large-scale, multi-stage simulation and data
analysis pipelines to enable the study of complex systems. The amount of computation and …

Apache Hadoop YARN: Yet another resource negotiator

VK Vavilapalli, AC Murthy, C Douglas… - Proceedings of the 4th …, 2013 - dl.acm.org
The initial design of Apache Hadoop [1] was tightly focused on running massive,
MapReduce jobs to process a web crawl. For increasingly diverse companies, Hadoop has …

FireWorks: a dynamic workflow system designed for high‐throughput applications

A Jain, SP Ong, W Chen, B Medasani… - Concurrency and …, 2015 - Wiley Online Library
This paper introduces FireWorks, a workflow software for running high‐throughput
calculation workflows at supercomputing centers. FireWorks has been used to complete …

PyCBC Inference: A Python-based parameter estimation toolkit for compact binary coalescence signals

CM Biwer, CD Capano, S De, M Cabero… - Publications of the …, 2019 - iopscience.iop.org
We introduce new modules in the open-source PyCBC gravitational-wave astronomy toolkit
that implement Bayesian inference for compact-object binary mergers. We review the …

Digital cameras with designs inspired by the arthropod eye

YM Song, Y Xie, V Malyarchuk, J Xiao, I Jung, KJ Choi… - Nature, 2013 - nature.com
In arthropods, evolution has created a remarkably sophisticated class of imaging systems,
with a wide-angle field of view, low aberrations, high acuity to motion and an infinite depth of …