The survey on ARM processors for HPC
The ongoing effort to reach the exascale computing barrier has led to a myriad of research
and publications in the topic of alternative energy-efficient architectures, such as ARM, for …
and publications in the topic of alternative energy-efficient architectures, such as ARM, for …
Effective exploration of thread throttling and thread/page mapping on numa systems
J Schwarzrock, HMGA Rocha… - 2020 IEEE 22nd …, 2020 - ieeexplore.ieee.org
NUMA systems have become commonly used in HPC. However, to fully take advantage of
these systems, the right thread-to-core allocation and page placement are essential. On top …
these systems, the right thread-to-core allocation and page placement are essential. On top …
Dynamic concurrency throttling on numa systems and data migration impacts
Many parallel applications do not scale as the number of threads increases, which means
that using the maximum number of threads will not always deliver the best outcome in …
that using the maximum number of threads will not always deliver the best outcome in …
PAMPAR: A new parallel benchmark for performance and energy consumption evaluation
A Marques Garcia, C Schepke… - … : Practice and Experience, 2020 - Wiley Online Library
This paper presents PAMPAR, a new benchmark to evaluate the performance and energy
consumption of different Parallel Programming Interfaces (PPIs). The benchmark is …
consumption of different Parallel Programming Interfaces (PPIs). The benchmark is …
How programming languages and paradigms affect performance and energy in multithreaded applications
GG Magalhaes, AL Sartor, AF Lorenzon… - 2016 VI Brazilian …, 2016 - ieeexplore.ieee.org
Considering that multithreaded applications may be implemented using several
programming languages and paradigms, in this work we show how they influence …
programming languages and paradigms, in this work we show how they influence …
On the influence of data migration in dynamic thread management of parallel applications
Many parallel applications do not scale as the number of threads increases, which means
that using the maximum number of threads will not always deliver the best outcome in …
that using the maximum number of threads will not always deliver the best outcome in …
Transparent aging-aware thread throttling
TS Medeiros, L Pereira, FD Rossi… - 2019 31st …, 2019 - ieeexplore.ieee.org
To satisfy the rising performance demands of modern applications, the number of cores in a
single chip package has been increasing. However, the power dissipated and temperature …
single chip package has been increasing. However, the power dissipated and temperature …
A new parallel benchmark for performance evaluation and energy consumption
This paper presents a new benchmark to evaluate performance and energy consumption of
different Parallel Programming Interfaces (PPIs). The benchmark is composed of 11 …
different Parallel Programming Interfaces (PPIs). The benchmark is composed of 11 …
Automatic tuning tlp and dvfs for edp with a non-intrusive genetic algorithm framework
CC De Oliveira, AF Lorenzon… - 2018 VIII Brazilian …, 2018 - ieeexplore.ieee.org
New applications have been pushing multithreaded processing to another level of
performance and energy requirements. However, many aspects prevent linear …
performance and energy requirements. However, many aspects prevent linear …
Searching for the Ideal Number of Threads on Asymmetric Multiprocessors
MK Moori, HMGA Rocha, AF Lorenzon… - 2023 XIII Brazilian …, 2023 - ieeexplore.ieee.org
Asymmetric multicore processors (AMP) combine high-performance cores with more energy-
efficient ones, capitalizing on the diverse performance demands of modern devices (eg …
efficient ones, capitalizing on the diverse performance demands of modern devices (eg …