Fault Injection Experiments with the CLAMR Hydrodynamics Mini-App

Hits: 1210
Year:
2014
Type of Publication:
Article
Authors:
  • Atkinson, B.
  • DeBardeleben, N.
  • Guan, Qiang
  • Robey, R.
  • Jones, W. M.
Journal:
Software Reliability Engineering Workshops (ISSREW), 2014 IEEE International Symposium
Pages:
6-9
BibTex:
Abstract:
In this paper, we present a resilience analysis of the impact of soft errors on CLAMR, a hydrodynamics mini-app for high performance computing (HPC). We utilize F-SEFI, a fine grainedfault injection tool, to inject faults into the kernel routines of CLAMR. We demonstrate visually the impact of these faults as they are either benign (have no impact on the results), cause silent data corruption (SDC), or cause the application to crash due to instabilities. We quantify the probability that an injected fault will cause CLAMR to transition to one of the above three states using F-SEFI. Finally, we explore the relationship between the application's fault characteristics and when the fault is injected in simulation time. Overall, we find that 17% and 24% of the faults propagate into SDC and crashes respectively.
Back

© 2017 New Mexico Consortium