Aurora: Seamless optimization of openmp applications
AF Lorenzon, CC De Oliveira… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Efficiently exploiting thread-level parallelism has been challenging for software developers.
As many parallel applications do not scale with the number of cores, the task of rightly …
As many parallel applications do not scale with the number of cores, the task of rightly …
A runtime and non-intrusive approach to optimize edp by tuning threads and cpu frequency for openmp applications
J Schwarzrock, CC de Oliveira, M Ritt… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Efficiently exploiting thread-level parallelism has been challenging. Many parallel
applications are not sufficiently balanced or CPU-bound to take advantage of the increasing …
applications are not sufficiently balanced or CPU-bound to take advantage of the increasing …
Improving the efficiency of graph algorithm executions on high‐performance computing
MK Moori, HMG de A. Rocha… - Concurrency and …, 2023 - Wiley Online Library
The growing need for extracting information from large graphs has been pushing the
development of parallel graph algorithms. However, the highly irregular structure of the real …
development of parallel graph algorithms. However, the highly irregular structure of the real …
Adapt burstable containers to variable CPU resources
In the age of the cloud-native, container technology, referred as OS-level virtualization, is
increasingly adopted to deploy cloud applications. Compared with virtual machines …
increasingly adopted to deploy cloud applications. Compared with virtual machines …
Etcg: Energy-aware cpu thread throttling for cpu-gpu collaborative environments
High-Performance computing systems have been constantly adopting CPU-GPU
architectures as a collaborative environment to accelerate applications by partitioning …
architectures as a collaborative environment to accelerate applications by partitioning …
Dynamic concurrency throttling on numa systems and data migration impacts
Many parallel applications do not scale as the number of threads increases, which means
that using the maximum number of threads will not always deliver the best outcome in …
that using the maximum number of threads will not always deliver the best outcome in …
Synergically rebalancing parallel execution via dct and turbo boosting
SM Marques, TS Medeiros, FD Rossi… - 2021 58th ACM/IEEE …, 2021 - ieeexplore.ieee.org
The increasing use of cloud and HPC systems put more pressure on the efficient utilization
of hardware resources to keep costs low. Many dynamic concurrency throttling (DCT) …
of hardware resources to keep costs low. Many dynamic concurrency throttling (DCT) …
Etcf–energy-aware cpu thread throttling and workload balancing framework for cpu-fpga collaborative environments
Warehouses and Cloud Servers have been adopting collaborative CPU-FPGA architectures
as an alternative to enable extra acceleration for data-parallel applications by distributing …
as an alternative to enable extra acceleration for data-parallel applications by distributing …
Online thread auto-tuning for performance improvement and resource saving
G Luan, P Pang, Q Chen, S Xue… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Multi-threading is a common way for programs to benefit from the multi/many-core design.
However, the performance of some parallel programs does not increase/even decrease as …
However, the performance of some parallel programs does not increase/even decrease as …
Low learning-cost offline strategies for EDP optimization of parallel applications
Many parallel applications do not scale with the number of threads. Several online and
offline strategies have been proposed in order to optimize this number. While the former …
offline strategies have been proposed in order to optimize this number. While the former …