Identifying HPC codes via performance logs and machine learning
O DeMasi, T Samak, DH Bailey - Proceedings of the first workshop on …, 2013 - dl.acm.org
We aim here to leverage supervised learning to enable large-scale analysis of performance
logs, in order to accurately classify code runs and understand the importance of different …
logs, in order to accurately classify code runs and understand the importance of different …
Visualizing distributed memory computations with hive plots
A hive plot is a network layout algorithm that uses a parallel coordinate plot in which axes
are radially arranged and node position is based on structural properties of that node [8]. We …
are radially arranged and node position is based on structural properties of that node [8]. We …
ASCR Cybersecurity for Scientific Computing Integrity
S Piesert - 2015 - escholarship.org
The Department of Energy (DOE) has the responsibility to address the energy,
environmental, and nuclear security challenges that face our nation. Much of DOE's …
environmental, and nuclear security challenges that face our nation. Much of DOE's …
Characterizing loop-level communication patterns in shared memory
Communication patterns extracted from parallel programs can provide a valuable source of
information for parallel pattern detection, application auto-tuning, and runtime workload …
information for parallel pattern detection, application auto-tuning, and runtime workload …
Fingerprinting anomalous computation with RNN for GPU-accelerated HPC machines
This paper presents a workload classification framework that discriminates illicit computation
from authorized workloads on GPU-accelerated HPC systems. As such heterogeneous …
from authorized workloads on GPU-accelerated HPC systems. As such heterogeneous …
Multiclass classification of distributed memory parallel computations
High Performance Computing (HPC) is a field concerned with solving large-scale problems
in science and engineering. However, the computational infrastructure of HPC systems can …
in science and engineering. However, the computational infrastructure of HPC systems can …
Towards Real-Time Classification of HPC Workloads via Out-of-band Telemetry
S Presser - 2022 IEEE International Conference on Cluster …, 2022 - ieeexplore.ieee.org
Detecting illicit workloads on High Performance Computing (HPC) systems is an important
task. Such workloads might indicate a user account is being misused, for example to run …
task. Such workloads might indicate a user account is being misused, for example to run …
Unveiling thread communication bottlenecks using hardware-independent metrics
A critical factor for developing robust shared-memory applications is the efficient use of the
cache and the communication between threads. Inappropriate data structures, algorithm …
cache and the communication between threads. Inappropriate data structures, algorithm …
Detecting anomalous computation with rnns on gpu-accelerated hpc machines
This paper presents a workload classification framework that accurately discriminates illicit
computation from authorized workloads on GPU-accelerated HPC systems at runtime. As …
computation from authorized workloads on GPU-accelerated HPC systems at runtime. As …
Catch me if you can: Using power analysis to identify HPC activity
B Copos, S Peisert - arXiv preprint arXiv:2005.03135, 2020 - arxiv.org
Monitoring users on large computing platforms such as high performance computing (HPC)
and cloud computing systems is non-trivial. Utilities such as process viewers provide limited …
and cloud computing systems is non-trivial. Utilities such as process viewers provide limited …