Towards exascale co-design in a runtime system

T Sterling, M Anderson, PK Bohan, M Brodowicz… - … Software Challenges for …, 2015 - Springer
Achieving the performance potential of an Exascale machine depends on realizing both
operational efficiency and scalability in high performance computing applications. This …

Accelerating the 3-D FFT using a heterogeneous FPGA architecture

M Anderson, M Brodowicz, M Swany… - Euro-Par 2017: Parallel …, 2018 - Springer
Future Exascale architectures will likely make extensive use of computing accelerators such
as Field Programmable Gate Arrays (FPGAs) given that these accelerators are very power …

Particle-in-cell simulation using asynchronous tasking

N Guidotti, P Ceyrat, J Barreto, J Monteiro… - Euro-Par 2021: Parallel …, 2021 - Springer
Recently, task-based programming models have emerged as a prominent alternative among
shared-memory parallel programming paradigms. Inherently asynchronous, these models …

Hardware-based scheduling and synchronization for light-weight, fine-grained multi-threading.

S Haddad, J Cook - … Journal of New Computer Architectures and Their …, 2020 - go.gale.com
Fine-grained, light-weight multi-threading enhances the performance and scalability of many
applications due to its ability to hide memory and network latency and enhance load …

[图书][B] The utilization of hardware-based thread scheduling and synchronization to increase the performance of graph-based applications

SH Haddad - 2015 - search.proquest.com
Graph-based applications have been repeatedly shown to demonstrate poor scaling
performance on contemporary architectures due to their characteristics that exacerbate …

Towards Exascale Co-design in a Runtime System

A Kulkarni, B Zhang - … , EASC 2014, Stockholm, Sweden, April 2 …, 2015 - books.google.com
Achieving the performance potential of an Exascale machine depends on realizing both
operational efficiency and scalability in high performance computing applications. This …