Acceleration of BAM I/O on distributed file systems

S Ito, S Miyano, K Ono - 2023 IEEE International Conference on …, 2023 - ieeexplore.ieee.org
Rapid advances in high-throughput sequencers have made it possible to obtain large
amounts of whole genome data quickly and inexpensively. As the amount of data increases …

ParaMEDIC: Parallel metadata environment for distributed I/O and computing

P Balaji, W Feng, J Archuleta, H Lin… - IEEE/ACM International …, 2007 - Citeseer
BLAST is a widely used software toolkit for genomic sequence search. mpiBLAST is a freely
available, open-source parallelization of BLAST that uses database segmentation to allow …

Filesystem aware scalable I/O framework for data-intensive parallel applications

R Xu, M Araya-Polo, B Chapman - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
The growing speed gap between CPU and memory makes I/O the main bottleneck of many
industrial applications. Some applications need to perform I/O operations for very large …

HDF5 Cache VOL: Efficient and scalable parallel I/O through caching data on node-local storage

H Zheng, V Vishwanath, Q Koziol… - 2022 22nd IEEE …, 2022 - ieeexplore.ieee.org
Modern-era high performance computing (HPC) systems are providing multiple levels of
memory and storage layers to bridge the performance gap between fast memory and slow …

Scalable in-memory computing

A Uta, A Sandu, S Costache… - 2015 15th IEEE/ACM …, 2015 - ieeexplore.ieee.org
Data-intensive scientific workflows are composed of many tasks that exhibit data
precedence constraints leading to communication schemes expressed by means of …

Opiom: Off-processor I/O with Myrinet

P Geoffray - Future Generation Computer Systems, 2002 - Elsevier
As processors become more powerful and clusters larger, users will exploit this increased
power to progressively run larger and larger problems. Today's datasets in biology, physics …

Opiom: Off-processor I/O with Myrinet

P Geoffray - … First IEEE/ACM International Symposium on …, 2001 - ieeexplore.ieee.org
As processors become more powerful and clusters larger, users will exploit this increased
power to progressively run larger and larger problems. Today's datasets in biology, physics …

Parallel file system analysis through application I/O tracing

SA Wright, SD Hammond, SJ Pennycook… - The Computer …, 2013 - academic.oup.com
Input/Output (I/O) operations can represent a significant proportion of the run-time of
parallel scientific computing applications. Although there have been several advances in file …

An approach for parallel reading in multiple sequence alignment

SH Ko, V Gancheva - 2020 International Conference …, 2020 - ieeexplore.ieee.org
We propose an approach for faster file reading of multiple sequence alignment input through
the use of MPI-I/O over a subset of MPI cores. The idea is to allow a subset of MPI cores that …

Optimization of I/O intensive genome assemblies on the Cori supercomputer with burst buffer

J Pritchett, B Andreopoulos - Proceedings of the 7th ACM International …, 2016 - dl.acm.org
Since the development of next generation sequencing technologies, genome assembly has
become one of the most computational and I/O intensive analyses done on the genomic …