Reliable on-chip systems in the nano-era: Lessons learnt and future trends

J Henkel, L Bauer, N Dutt, P Gupta, S Nassif… - Proceedings of the 50th …, 2013 - dl.acm.org
Reliability concerns due to technology scaling have been a major focus of researchers and
designers for several technology nodes. Therefore, many new techniques for enhancing and …

Characterizing application memory error vulnerability to optimize datacenter cost via heterogeneous-reliability memory

Y Luo, S Govindan, B Sharma… - 2014 44th Annual …, 2014 - ieeexplore.ieee.org
Memory devices represent a key component of datacenter total cost of ownership (TCO),
and techniques used to reduce errors that occur on these devices increase this cost. Existing …

A survey on simulation-based fault injection tools for complex systems

M Kooli, G Di Natale - … Conference on Design & Technology of …, 2014 - ieeexplore.ieee.org
Dependability is a key decision factor in today's global business environment. A powerful
method that permits to evaluate the dependability of a system is the fault injection. The …

Test sequences generation from LUSTRE descriptions: GATEL

B Marre, A Arnould - Proceedings ASE 2000. Fifteenth IEEE …, 2000 - ieeexplore.ieee.org
We describe a test sequence generation method from LUSTRE descriptions and its
companion tool, GATEL. The LUSTRE language is declarative and describes synchronous …

Applying lightweight soft error mitigation techniques to embedded mixed precision deep neural networks

G Abich, J Gava, R Garibotti, R Reis… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Deep neural networks (DNNs) are being incorporated in resource-constrained IoT devices,
which typically rely on reduced memory footprint and low-performance processors. While …

A source-to-source compiler for generating dependable software

M Rebaudengo, MS Reorda, M Violante… - … Workshop on Source …, 2001 - ieeexplore.ieee.org
Over the last years, an increasing number of safety-critical tasks have been demanded for
computer systems. In particular, safety-critical computer-based applications are hitting …

SOFIA: An automated framework for early soft error assessment, identification, and mitigation

J Gava, V Bandeira, F Rosa, R Garibotti, R Reis… - Journal of Systems …, 2022 - Elsevier
The occurrence of radiation-induced soft errors in electronic computing systems can either
affect non-essential system functionalities or violate safety-critical conditions, which might …

Generative software-based memory error detection and correction for operating system data structures

C Borchert, H Schirmeier… - 2013 43rd Annual IEEE …, 2013 - ieeexplore.ieee.org
Recent studies indicate that the number of system failures caused by main memory errors is
much higher than expected. In contrast to the commonly used hardware-based …

A watchdog processor to detect data and control flow errors

A Benso, S Di Carlo, G Di Natale… - 9th IEEE On-Line …, 2003 - ieeexplore.ieee.org
A watchdog processor for the MOTOROLA M68040 microprocessor is presented. Its main
task is to protect from transient faults caused by SEUs the transmission of data between the …

Hardware error detection using AN-codes

U Schiffel - 2011 - academia.edu
Due to the continuously decreasing feature sizes and the increasing complexity of integrated
circuits, commercial off-the-shelf (COTS) hardware is becoming less and less reliable …