Aurora: Seamless optimization of OpenMP applications

AF Lorenzon, CC De Oliveira… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Efficiently exploiting thread-level parallelism has been challenging for software developers.
As many parallel applications do not scale with the number of cores, the task of rightly …

A runtime and non-intrusive approach to optimize EDP by tuning threads and CPU frequency for OpenMP applications

J Schwarzrock, CC de Oliveira, M Ritt… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Efficiently exploiting thread-level parallelism has been challenging. Many parallel
applications are not sufficiently balanced or CPU-bound to take advantage of the increasing …

Improving the efficiency of graph algorithm executions on high‐performance computing

MK Moori, HMG de A. Rocha… - Concurrency and …, 2023 - Wiley Online Library
The growing need for extracting information from large graphs has been pushing the
development of parallel graph algorithms. However, the highly irregular structure of the real …

Adapt burstable containers to variable CPU resources

H Huang, Y Zhao, J Rao, S Wu, H Jin… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
In the age of the cloud-native, container technology, referred to as OS-level virtualization, is
increasingly adopted to deploy cloud applications. Compared with virtual machines …

ETCG: Energy-aware CPU thread throttling for CPU-GPU collaborative environments

T Knorst, MG Jordan, AF Lorenzon… - 2021 34th SBC …, 2021 - ieeexplore.ieee.org
High-Performance computing systems have been constantly adopting CPU-GPU
architectures as a collaborative environment to accelerate applications by partitioning …

Dynamic concurrency throttling on NUMA systems and data migration impacts

J Schwarzrock, MG Jordan, G Korol, CC Oliveira… - Design Automation for …, 2021 - Springer
Many parallel applications do not scale as the number of threads increases, which means
that using the maximum number of threads will not always deliver the best outcome in …

Synergically rebalancing parallel execution via DCT and turbo boosting

SM Marques, TS Medeiros, FD Rossi… - 2021 58th ACM/IEEE …, 2021 - ieeexplore.ieee.org
The increasing use of cloud and HPC systems puts more pressure on the efficient utilization
of hardware resources to keep costs low. Many dynamic concurrency throttling (DCT) …

ETCF–Energy-aware CPU thread throttling and workload balancing framework for CPU-FPGA collaborative environments

T Knorst, MG Jordan, AF Lorenzon… - 2021 XI Brazilian …, 2021 - ieeexplore.ieee.org
Warehouses and Cloud Servers have been adopting collaborative CPU-FPGA architectures
as an alternative to enable extra acceleration for data-parallel applications by distributing …

Online thread auto-tuning for performance improvement and resource saving

G Luan, P Pang, Q Chen, S Xue… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Multi-threading is a common way for programs to benefit from the multi/many-core design.
However, the performance of some parallel programs does not increase, or even decreases, as …

Low learning-cost offline strategies for EDP optimization of parallel applications

GP Berned, FD Rossi, MC Luizelli, SX de Souza… - Journal of Systems …, 2021 - Elsevier
Many parallel applications do not scale with the number of threads. Several online and
offline strategies have been proposed in order to optimize this number. While the former …