Anatomy of microarchitecture-level reliability assessment: Throughput and accuracy

A Chatzidimitriou, D Gizopoulos - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
2016 IEEE International Symposium on Performance Analysis of …, 2016ieeexplore.ieee.org
The increasing density and complexity of modern microprocessors, which is driven by
manufacturing technologies scaling, significantly affect their reliability. Reliability evaluation
during the early design stages is a challenging process for microprocessor designers.
Statistical fault-injection on microarchitecture simulators is commonly used, among other
techniques, since it can deliver early and accurate reliability estimations for many important
microprocessor hardware structures. However, full-system microarchitectural simulators …
The increasing density and complexity of modern microprocessors, which is driven by manufacturing technologies scaling, significantly affect their reliability. Reliability evaluation during the early design stages is a challenging process for microprocessor designers. Statistical fault-injection on microarchitecture simulators is commonly used, among other techniques, since it can deliver early and accurate reliability estimations for many important microprocessor hardware structures. However, full-system microarchitectural simulators have a relatively small simulation throughput. Thus, the number of injection experiments that can be performed during a fault injection campaign can be limited and therefore lead to smaller statistical significance of the reliability assessment. Aiming to boost the throughput of microarchitecture-level fault injection, we present, in this paper, a multi-faceted microarchitecture-level toolset for reliability assessment of modern microprocessors. The framework is built around the Gem5 simulator and provides several modes of operation which employ acceleration features for all stages of a fault-injection based reliability assessment campaign. The tool throughput and the accuracy of the delivered reliability assessments can be traded off and allow architects to make informed decisions about the most suitable error protection mechanisms of any given microarchitecture and workload by studying the reports delivered by the toolset. We provide experimental results of the different modes of the toolset for both the x86 and ARM out-of-order models of Gem5. Our experimental results show that up to 8x acceleration of the fault injection campaigns can be achieved with less than 0.5 percentile points of accuracy loss.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果