Implementing a genomic data management system using iRODS in the Wellcome Trust Sanger Institute

GT Chiang, P Clapham, G Qi, K Sale, G Coates - BMC Bioinformatics, 2011 - Springer
Background Increasingly large amounts of DNA sequencing data are being generated
within the Wellcome Trust Sanger Institute (WTSI). The traditional file system struggles to …

Survey and comparison for Open and closed sources in cloud computing

NK Salih, T Zang - arXiv preprint arXiv:1207.5480, 2012 - arxiv.org
Cloud computing is a new technology widely studied in recent years. Now there are many
cloud platforms both in industry and in academic circle. How to understand and use these …

HydroCloud: A cloud-based system for hydrologic data integration and analysis

MP McGuire, MC Roberge, J Lian - 2014 Fifth International …, 2014 - ieeexplore.ieee.org
The analysis of rainfall and runoff to characterize watershed response to storm events is a
critical area of hydrologic research. A wealth of data exists to perform this analysis, but it is …

SQL or NoSQL? Which is the best choice for storing big spatio-temporal climate data?

J Lian, S Miao, M McGuire, Z Tang - … : ER 2018 Workshops Emp-ER, MoBiD …, 2018 - Springer
Management of big spatio-temporal data such as the results from large scale global climate
models has long been a challenge because of the sheer vastness of the dataset. Although …

An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice

X Yang, MT Dove, RP Bruin… - Concurrency and …, 2013 - Wiley Online Library
Grid‐based simulation usually involves large quantities of data at each stage of the
simulation process. These data include simulation input and output files, intermediate results …

Methodological approach to data-centric cloudification of scientific iterative workflows

S Caíno-Lores, A Lapin, P Kropf, J Carretero - International Conference on …, 2016 - Springer
The computational complexity and the constantly increasing amount of input data for
scientific computing models is threatening their scalability. In addition, this is leading …

The design of a collaborative social network for watershed science

MP McGuire, MC Roberge - … Conference, GRMSE 2014, Ypsilanti, MI, USA …, 2015 - Springer
There is a strong and persistent demand amongst scientists, citizen scientists and the
general public for hydrologic data such as NEXRAD imagery and stream gauge time-series …

Channeling the water data deluge: a system for flexible integration and analysis of hydrologic data

MP McGuire, MC Roberge, J Lian - International Journal of Digital …, 2016 - Taylor & Francis
The hydrologic cycle and understanding the relationship between rainfall and runoff is an
important component of earth system science, sustainable development, and natural …

(2020). Applying big data paradigms to a large scale scientific workflow: Lessons learned and future directions. Future Generation Computer Systems, 110, pp …

S Caíno-Lores, A Lapin, J Carretero, P Kropf - 2020 - e-archivo.uc3m.es
The increasing amounts of data related to the execution of scientific workflows has raised
awareness of their shift towards parallel data-intensive problems. In this paper, we deliver …

On the convergence of big data analytics and high-performance computing: a novel approach for runtime interoperability

SC Lores - 2019 - documat.unirioja.es
The information technology ecosystem is currently in transition to a new generation of
applications requiring intensive data acquisition, processing and storage. As a result of this …