受强制性开放获取政策约束的文章 - Christian Engelmann了解详情
可在其他位置公开访问的文章:47 篇
Failures in large scale systems: Long-term measurement, analysis, and implications
S Gupta, T Patel, C Engelmann, D Tiwari
Proceedings of the International Conference for High Performance Computing …, 2017
强制性开放获取政策: US Department of Energy
Machine learning models for GPU error prediction in a large scale HPC system
B Nie, J Xue, S Gupta, T Patel, C Engelmann, E Smirni, D Tiwari
2018 48th Annual IEEE/IFIP International Conference on Dependable Systems …, 2018
强制性开放获取政策: US National Science Foundation, US Department of Energy
Characterizing temperature, power, and soft-error behaviors in data center systems: Insights, challenges, and opportunities
B Nie, J Xue, S Gupta, C Engelmann, E Smirni, D Tiwari
2017 IEEE 25th International Symposium on Modeling, Analysis, and Simulation …, 2017
强制性开放获取政策: US National Science Foundation, US Department of Energy
Reducing waste in extreme scale systems through introspective analysis
L Bautista-Gomez, A Gainaru, S Perarnau, D Tiwari, S Gupta, ...
2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016
强制性开放获取政策: US Department of Energy
GPU lifetimes on Titan supercomputer: Survival analysis and reliability
G Ostrouchov, D Maxwell, RA Ashraf, C Engelmann, M Shankar, ...
SC20: International Conference for High Performance Computing, Networking …, 2020
强制性开放获取政策: US Department of Energy
Big data meets HPC log analytics: Scalable approach to understanding systems at extreme scale
BH Park, S Hukerikar, R Adamson, C Engelmann
2017 IEEE International Conference on Cluster Computing (CLUSTER), 758-765, 2017
强制性开放获取政策: US Department of Energy
Shrink or Substitute: Handling Process Failures in HPC Systems using In-situ Recovery
RA Ashraf, S Hukerikar, C Engelmann
Parallel, Distributed and Network-based Processing (PDP), 2018 26th …, 2018
强制性开放获取政策: US Department of Energy
Power-capping aware checkpointing: On the interplay among power-capping, temperature, reliability, performance, and energy
K Tang, D Tiwari, S Gupta, P Huang, Q Lu, C Engelmann, X He
2016 46th Annual IEEE/IFIP International Conference on Dependable Systems …, 2016
强制性开放获取政策: US National Science Foundation, US Department of Energy
Scalable and fault tolerant failure detection and consensus
A Katti, G Di Fatta, T Naughton, C Engelmann
Proceedings of the 22nd European MPI Users' Group Meeting, 1-9, 2015
强制性开放获取政策: US Department of Energy
A big data analytics framework for HPC log data: Three case studies using the Titan supercomputer log
BH Park, Y Hui, S Boehm, RA Ashraf, C Layton, C Engelmann
2018 IEEE International Conference on Cluster Computing (CLUSTER), 571-579, 2018
强制性开放获取政策: US Department of Energy
Resilience Design Patterns: A Structured Approach to Resilience at Extreme Scale (Version 1.0)
S Hukerikar, C Engelmann
doi: 10.2172/1338552, 2016
强制性开放获取政策: US Department of Energy
Epidemic failure detection and consensus for extreme parallelism
A Katti, G Di Fatta, T Naughton, C Engelmann
The International Journal of High Performance Computing Applications 32 (5 …, 2018
强制性开放获取政策: US Department of Energy
Understanding and Analyzing Interconnect Errors and Network Congestion on a Large Scale HPC System
M Kumar, S Gupta, T Patel, M Wilder, W Shi, S Fu, C Engelmann, D Tiwari
2018 48th Annual IEEE/IFIP International Conference on Dependable Systems …, 2018
强制性开放获取政策: US National Science Foundation, US Department of Energy
Resiliency in numerical algorithm design for extreme scale simulations
E Agullo, M Altenbernd, H Anzt, L Bautista-Gomez, T Benacchio, ...
The International Journal of High Performance Computing Applications 36 (2 …, 2022
强制性开放获取政策: US Department of Energy
Mini-ckpts: Surviving OS failures in persistent memory
D Fiala, F Mueller, K Ferreira, C Engelmann
Proceedings of the 2016 International Conference on Supercomputing, 7, 2016
强制性开放获取政策: US National Science Foundation, US Department of Energy
The INTERSECT Open Federated Architecture for the Laboratory of the Future
C Engelmann, O Kuchar, S Boehm, MJ Brim, T Naughton, S Somnath, ...
Smoky Mountains Computational Sciences and Engineering Conference, 173-190, 2022
强制性开放获取政策: US Department of Energy
Pattern-based modeling of multiresilience solutions for high-performance computing
RA Ashraf, S Hukerikar, C Engelmann
Proceedings of the 2018 ACM/SPEC International Conference on Performance …, 2018
强制性开放获取政策: US Department of Energy
Analyzing the impact of system reliability events on applications in the Titan supercomputer
RA Ashraf, C Engelmann
2018 IEEE/ACM 8th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS …, 2018
强制性开放获取政策: US Department of Energy
A pattern language for high-performance computing resilience
S Hukerikar, C Engelmann
Proceedings of the 22nd European Conference on Pattern Languages of Programs …, 2017
强制性开放获取政策: US Department of Energy
A comprehensive informative metric for analyzing HPC system status using the LogSCAN platform
Y Hui, BH Park, C Engelmann
2018 IEEE/ACM 8th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS …, 2018
强制性开放获取政策: US Department of Energy
出版信息和资助信息由计算机程序自动确定