Run-to-run variability on Xeon Phi based Cray XC systems

S Chunduri, K Harms, S Parker, V Morozov… - Proceedings of the …, 2017 - dl.acm.org
The increasing complexity of HPC systems has introduced new sources of variability, which
can contribute to significant differences in run-to-run performance of applications. With …

A novel data-partitioning algorithm for performance optimization of data-parallel applications on heterogeneous HPC platforms

H Khaleghzadeh, RR Manumachu… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Modern HPC platforms have become highly heterogeneous owing to tight integration of
multicore CPUs and accelerators (such as Graphics Processing Units, Intel Xeon Phis, or …

Vapro: Performance variance detection and diagnosis for production-run parallel applications

L Zheng, J Zhai, X Tang, H Wang, T Yu, Y Jin… - Proceedings of the 27th …, 2022 - dl.acm.org
Performance variance is a serious problem for parallel applications, which can cause
performance degradation and make applications' behavior hard to understand. Therefore …

Parallel data partitioning algorithms for optimization of data-parallel applications on modern extreme-scale multicore platforms for performance and energy

RR Manumachu, A Lastovetsky - IEEE Access, 2018 - ieeexplore.ieee.org
Data partitioning algorithms aiming to minimize the execution time and the energy of
computations in self-adaptable data-parallel applications on modern extreme-scale …

Memory access optimization of molecular dynamics simulation software crystal-md on sunway taihulight

J Li, J Lin, P Du, K Zhang, J Wu - Tsinghua Science and …, 2020 - ieeexplore.ieee.org
The radiation damage effect of key structural materials is one of the main research subjects
of the numerical reactor. From the perspective of experimental safety and feasibility …

Detecting performance variance for parallel applications without source code

J Zhai, L Zheng, F Zhang, X Tang… - … on Parallel and …, 2022 - ieeexplore.ieee.org
For parallel applications, performance variance is a critical issue that can degrade
performance and make applications' behavior difficult to explain. Therefore, users and …

Characterizing security monitor and embedded system performance across distinct risc-v ip-cores

JC Tullos - 2021 - scholar.afit.edu
Embedded systems have seen a rapid integration into all forms of industry as they continue
to shrink in size and cost. The increased demand has highlighted a need for secure systems …

并行程序中同步瓶颈的检测和优化方法.

张杨, 李柳旭 - Journal of National University of Defense …, 2022 - search.ebscohost.com
针对并发程序中锁的不当使用可能导致性能瓶颈的问题, 提出检测和优化并发程序中同步瓶颈的
方法IdeSync. IdeSync 使用静态分析方法获取同步方法和同步块, 构建静态同步依赖图 …

Computer comparisons in the presence of performance variation

S Irving, B Li, S Chen, L Peng, W Zhang… - Frontiers of Computer …, 2020 - Springer
Performance variability, stemming from non-deterministic hardware and software behaviors
or deterministic behaviors such as measurement bias, is a well-known phenomenon of …

[PDF][PDF] Novel Data-Partitioning Algorithms for Performance and Energy Optimization of Data-Parallel Applications on Modern Heterogeneous HPC Platforms

H Khaleghzadeh - 2019 - researchgate.net
Heterogeneity has turned into one of the most profound and challenging characteristics of
today's HPC environments. Modern HPC platforms have become highly heterogeneous …