Skip to main navigation Skip to search Skip to main content

Systematic Reliability Evaluation of FPGA Implemented CNN Accelerators

  • Zhen Gao*
  • , Shihui Gao
  • , Yi Yao
  • , Qiang Liu
  • , Shulin Zeng
  • , Guangjun Ge
  • , Yu Wang
  • , Anees Ullah
  • , Pedro Reviriego
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

14 Scopus citations

Abstract

Convolutional neural networks (CNN) have become essential for many scientific and industrial applications, such as image classification and pattern detection. Among the devices that can implement neural networks, SRAM based FPGAs are a popular option due to their excellent parallel computing capability and good flexibility. However, SRAM-FPGAs are susceptible to radiation effects, which limits its application on safety critical applications. In this paper, the reliability of an accelerator based on the advanced Instruction-Set Architecture is evaluated based on hardware fault injection experiments. Each main module of the accelerator is evaluated separately, and the impact of parallelism and model features on the accelerator reliability is also examined. The experimental results reveal some important conclusions in terms of general hardware reliability and also of the particular model reliability. First, over 99% of SEUs on the computation modules will cause accuracy loss, and the reliability improves for higher parallelism. Second, a large portion of SEUs on the data mover and the instruction scheduler will cause system corruptions due to abnormal interactions with the ARM or other modules. Third, nonlinear activation and pooling layers are effective in reducing the effect of SEUs on computation modules, so models that use these layers tend to be more robust. The results provide a deep understanding of the impact of errors on CNNs implemented on ISA based FPGA accelerators (e.g., the Xilinx DPU).

Original languageEnglish
Pages (from-to)116-126
Number of pages11
JournalIEEE Transactions on Device and Materials Reliability
Volume23
Issue number1
DOIs
StatePublished - 1 Mar 2023

Bibliographical note

Publisher Copyright:
© 2001-2011 IEEE.

Keywords

  • Convolutional neural networks (CNNs)
  • fault injection
  • FPGA accelerator
  • reliability
  • single bit upsets (SEUs)

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Safety, Risk, Reliability and Quality
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Systematic Reliability Evaluation of FPGA Implemented CNN Accelerators'. Together they form a unique fingerprint.

Cite this