FIMSIM: A fault injection infrastructure for microarchitectural simulators

G Yalcin, OS Unsal, A Cristal… - 2011 IEEE 29th …, 2011 - ieeexplore.ieee.org
Fault injection is a widely used approach for experiment-based dependability evaluation.
Injecting faults to microarchitectural simulators is particularly appealing for researchers …

FaulTM: error detection and recovery using hardware transactional memory

G Yalcin, O Unsal, A Cristal - … & Test in Europe Conference & …, 2013 - ieeexplore.ieee.org
Reliability is an essential concern for processor designers due to increasing transient and
permanent fault rates. Executing instruction streams redundantly in chip multi processors …

Combining error detection and transactional memory for energy-efficient computing below safe operation margins

G Yalcin, A Sobe, D Harmanci… - … , and Network-Based …, 2014 - ieeexplore.ieee.org
The power envelope has become a major issue for the design of computer systems. One
way of reducing energy consumption is to downscale the voltage of microprocessors …

Revisiting symptom-based fault tolerant techniques against soft errors

H So, M Didehban, Y Ko, R Jeyapaul, J Kim, Y Kim… - Electronics, 2021 - mdpi.com
Aggressive technology scaling and near-threshold computing have made soft error reliability
one of the leading design considerations in modern embedded microprocessors. Although …

Applying Hardware Transactional Memory for {Concurrency-Bug} Failure Recovery in Production Runs

Y Chen, S Wang, S Lu, K Sankaralingam - 2018 USENIX Annual …, 2018 - usenix.org
Concurrency bugs widely exist and severely threaten system availability. Techniques that
help recover from concurrency-bug failures during production runs are highly desired. This …

Transactional encoding for tolerating transient hardware errors

JT Wamhoff, M Schwalbe, R Faqeh, C Fetzer… - Stabilization, Safety, and …, 2013 - Springer
The decreasing feature size of integrated circuits leads to less reliable hardware with higher
likelihood for errors. Without adding additional failure detection and masking mechanisms …

[PDF][PDF] Leveraging transactional memory for energyefficient computing below safe operation margins

A Cristal, P Felber, C Fetzer, D Harmanci… - Proceedings of the 8th …, 2013 - academia.edu
The power envelope has become a major issue for the design of computer systems. One
way of reducing energy consumption is to downscale the voltage of microprocessors …

Detection of Silent Data Corruption in fault-tolerant distributed systems on board spacecraft

M Fayyaz, T Vladimirova - 2014 NASA/ESA Conference on …, 2014 - ieeexplore.ieee.org
In this paper a novel distributed architecture for system level Fault Detection, Isolation and
Recovery (FDIR) aimed at spacecraft applications is presented. The architecture …

Task Oriented Fault-Tolerant Distributed Computing for Use on Board Spacecraft

M Fayyaz - 2016 - figshare.le.ac.uk
Current and future space missions demand highly reliable, High Performance Embedded
Computing (HPEC). The review of the literature has shown that no single solution could …

Applying transactional memory for concurrency-bug failure recovery in production runs

Y Chen, S Wang, S Lu… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Concurrency bugs widely exist and severely threaten system availability. Techniques that
help recover from concurrency-bug failures during production runs are highly desired. This …