Gpu-enabled asynchronous multi-level checkpoint caching and prefetching

A Maurya, MM Rafique, T Tonellot, HJ AlSalem… - Proceedings of the …, 2023 - dl.acm.org
Checkpointing is an I/O intensive operation increasingly used by High-Performance
Computing (HPC) applications to revisit previous intermediate datasets at scale. Unlike the …

Towards efficient I/O scheduling for collaborative multi-level checkpointing

A Maurya, B Nicolae, MM Rafique… - … , and Simulation of …, 2021 - ieeexplore.ieee.org
Efficient checkpointing of distributed data structures periodically at key moments during
runtime is a recurring fundamental pattern in a large number of uses cases: fault tolerance …

Towards Efficient Cache Allocation for High-Frequency Checkpointing

A Maurya, B Nicolae, MM Rafique… - 2022 IEEE 29th …, 2022 - ieeexplore.ieee.org
While many HPC applications are known to have long runtimes, this is not always because
of single large runs: in many cases, this is due to ensembles composed of many short runs …

Maximizing I/O bandwidth for reverse time migration on heterogeneous large-scale systems

T Alturkestani, H Ltaief, D Keyes - … , Warsaw, Poland, August 24–28, 2020 …, 2020 - Springer
Abstract Reverse Time Migration (RTM) is an important scientific application for oil and gas
exploration. The 3D RTM simulation generates terabytes of intermediate data that does not …

Towards Efficient I/O Pipelines using Accumulated Compression

A Maurya, B Nicolae, MM Rafique… - 2023 IEEE 30th …, 2023 - ieeexplore.ieee.org
High-Performance Computing (HPC) workloads generate large volumes of data at high-
frequency during their execution, which needs to be captured concurrently at scale. These …

Reverse Time Migration with Lossy and Lossless Wavefield Compression

CHS Barbosa, ALGA Coutinho - 2023 IEEE 35th International …, 2023 - ieeexplore.ieee.org
Seismic imaging techniques like Reverse Time Migration (RTM) are time-consuming and
data-intensive activities in the field of geophysical exploration. The computational cost …

Evaluating Asynchronous Parallel I/O on HPC Systems

J Ravi, S Byna, Q Koziol, H Tang… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Parallel I/O is an effective method to optimize data movement between memory and storage
for many scientific applications. Poor performance of traditional disk-based file systems has …

An Active File Mode Transition Mechanism Based on Directory Activation Ratio in File Synchronization Service

M Lim - Applied Sciences, 2023 - mdpi.com
In this paper, we propose an active file mode change mechanism in which the file
synchronization system of cloud storage automatically changes files in a directory of a client …

Toward high performance asynchronous RTM with temporal blocking and buffered I/O

L Qu, H Ltaief, D Keyes - Fifth EAGE Workshop on High Performance …, 2021 - earthdoc.org
During the forward and backward modeling in Reverse Time Migration (RTM), stencil
computations constitute one of the main computationally intensive components. Their classic …

[图书][B] Toward Scalable Middleware for Shared HPC Resources

JJ Ravi - 2023 - search.proquest.com
Exascale HPC systems often utilize accelerators like GPUs to accelerate entire or parts of
applications. Often, large-scale applications struggle to use the available resources fully …