A survey on multithreading alternatives for soft error fault tolerance

I Oz, S Arslan - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Smaller transistor sizes and reduction in voltage levels in modern microprocessors induce
higher soft error rates. This trend makes reliability a primary design constraint for computer …

Survey on Redundancy Based-Fault tolerance methods for Processors and Hardware accelerators-Trends in Quantum Computing, Heterogeneous Systems and …

S Venkatesha, R Parthasarathi - ACM Computing Surveys, 2024 - dl.acm.org
Rapid progress in the CMOS technology for the past 25 years has increased the
vulnerability of processors towards faults. Subsequently, focus of computer architects shifted …

Architectures for online error detection and recovery in multicore processors

D Gizopoulos, M Psarakis, SV Adve… - … , Automation & Test …, 2011 - ieeexplore.ieee.org
The huge investment in the design and production of multicore processors may be put at risk
because the emerging highly miniaturized but unreliable fabrication technologies will …

Design and architectures for dependable embedded systems

J Henkel, L Bauer, J Becker, O Bringmann… - Proceedings of the …, 2011 - dl.acm.org
The paper presents an overview of a major research project on dependable embedded
systems that has started in Fall 2010 and is running for a projected duration of six years. Aim …

Adaptive, efficient, parallel execution of parallel programs

S Sridharan, G Gupta, GS Sohi - Proceedings of the 35th ACM SIGPLAN …, 2014 - dl.acm.org
Future multicore processors will be heterogeneous, be increasingly less reliable, and
operate in dynamically changing operating conditions. Such environments will result in a …

LESS-MICS: A low energy standby-sparing scheme for mixed-criticality systems

S Safari, S Hessabi, G Ershadi - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Multicore platforms are becoming the dominant trend in mixed-criticality systems (MCSs).
Multicores provide great opportunities to realize task-level redundancy for reliability …

Thermal-aware standby-sparing technique on heterogeneous real-time embedded systems

M Ansari, S Safari, S Yari-Karin… - … on Emerging Topics …, 2021 - ieeexplore.ieee.org
Low power consumption, real-time computing, and high reliability are three key
requirements/design objectives of real-time embedded systems. The standby-sparing …

DRVS: Power-efficient reliability management through dynamic redundancy and voltage scaling under variations

M Salehi, MK Tavana, S Rehman… - 2015 IEEE/ACM …, 2015 - ieeexplore.ieee.org
Many-core processors facilitate coarse-grained reliability by exploiting available cores for
redundant multithreading. However, ensuring high reliability with reduced power …

Exploiting temporal data diversity for detecting safety-critical faults in AV compute systems

S Jha, S Cui, T Tsai, SKS Hari… - 2022 52nd Annual …, 2022 - ieeexplore.ieee.org
Silent data corruption caused by random hardware faults in autonomous vehicle (AV)
computational elements is a significant threat to vehicle safety. Previous research has …

Software-only based diverse redundancy for asil-d automotive applications on embedded hpc platforms

S Alcaide, L Kosmidis, C Hernandez… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
High-Performance Computing (HPC) platforms become a must in automotive systems to
enable autonomous driving. However, automotive platforms must avoid Common Cause …