Argobots: A lightweight low-level threading and tasking framework

S Seo, A Amer, P Balaji, C Bordage… - … on Parallel and …, 2017 - ieeexplore.ieee.org
In the past few decades, a number of user-level threading and tasking models have been
proposed in the literature to address the shortcomings of OS-level threads, primarily with …

Unified fault-tolerance framework for hybrid task-parallel message-passing applications

O Subasi, T Martsinkevich… - … Journal of High …, 2018 - journals.sagepub.com
We present a unified fault-tolerance framework for task-parallel message-passing
applications to mitigate transient errors. First, we propose a fault-tolerant message-logging …

Nanocheckpoints: A task-based asynchronous dataflow framework for efficient and scalable checkpoint/restart

O Subasi, J Arias, O Unsal, J Labarta… - 2015 23rd Euromicro …, 2015 - ieeexplore.ieee.org
In this paper, we present NanoCheckpoints which is a lightweight software-based
checkpoint/restart scheme for task-parallel HPC applications. We leverage OmpSs, a task …

RADAR: Runtime-assisted dead region management for last-level caches

M Manivannan, V Papaefstathiou… - … Symposium on High …, 2016 - ieeexplore.ieee.org
Last-level caches (LLCs) bridge the processor/memory speed gap and reduce energy
consumed per access. Unfortunately, LLCs are poorly utilized because of the relatively large …

Vits: video tagging system from massive web multimedia collections

D Fernández, D Varas, J Espadaler… - Proceedings of the …, 2017 - openaccess.thecvf.com
The popularization of multimedia content on the Web has arised the need to automatically
understand, index and retrieve it. In this paper we present ViTS, an automatic Video Tagging …

Dash: a benchmark suite for hybrid dataflow and shared memory programming models: with comparative evaluation of three hybrid dataflow models

V Gajinov, S Stipić, I Erić, OS Unsal… - Proceedings of the 11th …, 2014 - dl.acm.org
The current trend in development of parallel programming models is to combine different
well established models into a single programming model in order to support efficient …

Fault-tolerant protocol for hybrid task-parallel message-passing applications

T Martsinkevich, O Subasi, O Unsal… - 2015 IEEE …, 2015 - ieeexplore.ieee.org
We present a fault-tolerant protocol for task-parallel message-passing applications to
mitigate transient errors. The protocol requires the restart only of the task that experienced …

Dashmm: Dynamic adaptive system for hierarchical multipole methods

J DeBuhr, B Zhang, A Tsueda… - Communications in …, 2016 - cambridge.org
We present DASHMM, a general library implementing multipole methods (including both
Barnes-Hut and the Fast Multipole Method). DASHMM relies on dynamic adaptive runtime …

Asynchronous task scheduling of the fast multipole method using various runtime systems

B Zhang - 2014 Fourth Workshop on Data-Flow Execution …, 2014 - ieeexplore.ieee.org
In this paper, we explore data-driven execution of the adaptive fast multipole method by
asynchronously scheduling available computational tasks using Cilk, C++ 11 standard …

Marriage between coordinated and uncoordinated checkpointing for the exascale era

O Subasi, F Zyulkyarov, O Unsal… - 2015 IEEE 17th …, 2015 - ieeexplore.ieee.org
The state-of-the-art checkpointing techniques are projected to be prohibitively expensive in
the Exascale era. These techniques are most often holistic in nature which prevents them to …