The survey on ARM processors for HPC

D Yokoyama, B Schulze, F Borges… - The Journal of …, 2019 - Springer
The ongoing effort to reach the exascale computing barrier has led to a myriad of research
and publications in the topic of alternative energy-efficient architectures, such as ARM, for …

Effective exploration of thread throttling and thread/page mapping on NUMA systems

J Schwarzrock, HMGA Rocha… - 2020 IEEE 22nd …, 2020 - ieeexplore.ieee.org
NUMA systems have become commonly used in HPC. However, to fully take advantage of
these systems, the right thread-to-core allocation and page placement are essential. On top …

Dynamic concurrency throttling on NUMA systems and data migration impacts

J Schwarzrock, MG Jordan, G Korol, CC Oliveira… - Design Automation for …, 2021 - Springer
Many parallel applications do not scale as the number of threads increases, which means
that using the maximum number of threads will not always deliver the best outcome in …

PAMPAR: A new parallel benchmark for performance and energy consumption evaluation

A Marques Garcia, C Schepke… - … : Practice and Experience, 2020 - Wiley Online Library
This paper presents PAMPAR, a new benchmark to evaluate the performance and energy
consumption of different Parallel Programming Interfaces (PPIs). The benchmark is …

How programming languages and paradigms affect performance and energy in multithreaded applications

GG Magalhaes, AL Sartor, AF Lorenzon… - 2016 VI Brazilian …, 2016 - ieeexplore.ieee.org
Considering that multithreaded applications may be implemented using several
programming languages and paradigms, in this work we show how they influence …

On the influence of data migration in dynamic thread management of parallel applications

J Schwarzrock, MG Jordan, G Korol… - 2019 IX Brazilian …, 2019 - ieeexplore.ieee.org
Many parallel applications do not scale as the number of threads increases, which means
that using the maximum number of threads will not always deliver the best outcome in …

Transparent aging-aware thread throttling

TS Medeiros, L Pereira, FD Rossi… - 2019 31st …, 2019 - ieeexplore.ieee.org
To satisfy the rising performance demands of modern applications, the number of cores in a
single chip package has been increasing. However, the power dissipated and temperature …

A new parallel benchmark for performance evaluation and energy consumption

AM Garcia, C Schepke, AG Girardi… - … Conference on Vector and …, 2018 - Springer
This paper presents a new benchmark to evaluate performance and energy consumption of
different Parallel Programming Interfaces (PPIs). The benchmark is composed of 11 …

Automatic tuning TLP and DVFS for EDP with a non-intrusive genetic algorithm framework

CC De Oliveira, AF Lorenzon… - 2018 VIII Brazilian …, 2018 - ieeexplore.ieee.org
New applications have been pushing multithreaded processing to another level of
performance and energy requirements. However, many aspects prevent linear …

Searching for the Ideal Number of Threads on Asymmetric Multiprocessors

MK Moori, HMGA Rocha, AF Lorenzon… - 2023 XIII Brazilian …, 2023 - ieeexplore.ieee.org
Asymmetric multicore processors (AMP) combine high-performance cores with more
energy-efficient ones, capitalizing on the diverse performance demands of modern devices (e.g. …