Two-state checkpointing for energy-efficient fault tolerance in hard real-time systems

M Salehi, MK Tavana, S Rehman… - … Transactions on Very …, 2016 - ieeexplore.ieee.org
Checkpointing with rollback recovery is a well-established technique to tolerate transient
faults. However, it incurs significant time and energy overheads, which go wasted in fault …

Peak power management to meet thermal design power in fault-tolerant embedded systems

M Ansari, S Safari, A Yeganeh-Khaksar… - … on Parallel and …, 2018 - ieeexplore.ieee.org
Multicore platforms provide a great opportunity for implementation of fault-tolerance
techniques to achieve high reliability in real-time embedded systems. Passive redundancy is …

Thermal-aware standby-sparing technique on heterogeneous real-time embedded systems

M Ansari, S Safari, S Yari-Karin… - … on Emerging Topics …, 2021 - ieeexplore.ieee.org
Low power consumption, real-time computing, and high reliability are three key
requirements/design objectives of real-time embedded systems. The standby-sparing …

DRVS: Power-efficient reliability management through dynamic redundancy and voltage scaling under variations

M Salehi, MK Tavana, S Rehman… - 2015 IEEE/ACM …, 2015 - ieeexplore.ieee.org
Many-core processors facilitate coarse-grained reliability by exploiting available cores for
redundant multithreading. However, ensuring high reliability with reduced power …

Two-phase low-energy N-modular redundancy for hard real-time multi-core systems

M Salehi, A Ejlali, BM Al-Hashimi - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
This paper proposes an N-modular redundancy (NMR) technique with low energy-overhead
for hard real-time multi-core systems. NMR is well-suited for multi-core platforms as they …

Peak-power-aware energy management for periodic real-time applications

M Ansari, A Yeganeh-Khaksar… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Two main objectives in designing real-time embedded systems are high reliability and low
power consumption. Hardware replication (e.g., standby-sparing) can provide high reliability …

Exploiting primary/backup mechanism for energy efficiency in dependable real-time systems

Y Guo, D Zhu, H Aydin, JJ Han, LT Yang - Journal of Systems Architecture, 2017 - Elsevier
Primary/Backup has been well studied as an effective fault-tolerance technique. In this
paper, with the objectives of tolerating a single permanent fault and maintaining system …

Energy-aware mixed partitioning scheduling in standby-sparing systems

Y Zhang - Computer Standards & Interfaces, 2019 - Elsevier
Previous standby-sparing techniques assume that tasks do not access shared
resources. In addition, primary tasks and backup tasks are allocated to the primary …

Energy-aware primary/backup scheduling of periodic real-time tasks on heterogeneous multicore systems

A Roy, H Aydin, D Zhu - Sustainable Computing: Informatics and Systems, 2021 - Elsevier
For real-time embedded systems, energy management and fault tolerance are both critical.
However, these two objectives are often at odds, because extra resources needed to tolerate …

Peak-power-aware primary-backup technique for efficient fault-tolerance in multicore embedded systems

M Ansari, M Salehi, S Safari, A Ejlali, M Shafique - IEEE Access, 2020 - ieeexplore.ieee.org
Multicore platforms offer great potential for task-level redundancy to achieve a degree of
fault-tolerance/reliability in embedded systems by exploiting the idle cores. However, due to …