Research Repository

RASA: Reliability-Aware Scheduling Approach for FPGA-Based Resilient Embedded Systems in Extreme Environments

Saha, Sangeet and Zhai, Xiaojun and Ehsan, Shoaib and Majeed, Shakaiba and McDonald-Maier, Klaus (2021) 'RASA: Reliability-Aware Scheduling Approach for FPGA-Based Resilient Embedded Systems in Extreme Environments.' IEEE Transactions on Systems, Man, and Cybernetics: Systems. ISSN 2168-2216

[img]
Preview
Text
09430894.pdf - Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview

Abstract

Field-programmable gate arrays (FPGAs) offer the flexibility of general-purpose processors along with the performance efficiency of dedicated hardware that essentially renders it as a platform of choice for modern-day robotic systems for achieving real-time performance. Such robotic systems when deployed in harsh environments often get plagued by faults due to extreme conditions. Consequently, the real-time applications running on FPGA become susceptible to errors which call for a reliability-aware task scheduling approach, the focus of this article. We attempt to address this challenge using a hybrid offline-online approach. Given a set of periodic real-time tasks that require to be executed, the offline component generates a feasible preemptive schedule with specific preemption points. At runtime, these preemption events are utilized for fault detection. Upon detecting any faulty execution at such distinct points, the reliability-aware scheduling approach, RASA, orchestrates the recovery mechanism to remediate the scenario without jeopardizing the predefined schedule. Effectiveness of the proposed strategy has been verified through simulation-based experiments and we observed that the RASA is able to achieve 72% of task acceptance rate even under 70% of system workloads with high fault occurrence rates.

Item Type: Article
Uncontrolled Keywords: Extreme environments (EEs), field-programmable gate array (FPGA), partial reconfiguration, real-time scheduling, reliability, resilient systems, single-event upsets (SEUs)
Divisions: Faculty of Science and Health > Computer Science and Electronic Engineering, School of
Depositing User: Elements
Date Deposited: 18 May 2021 12:38
Last Modified: 18 May 2021 12:38
URI: http://repository.essex.ac.uk/id/eprint/30370

Actions (login required)

View Item View Item