TY - JOUR
T1 - Instruction-level impact analysis of low-level faults in a modern microprocessor controller
AU - Maniatakos, Michail
AU - Karimi, Naghmeh
AU - Tirumurti, Chandra
AU - Jas, Abhijit
AU - Makris, Yiorgos
N1 - Funding Information:
This work is supported by a generous gift from Intel Corp. The first two authors contributed equally to this work. The second author performed this research while being a visiting student at Yale University. Preliminary versions of parts of the results reported herein were presented at the 2008 International Test Conference [17] and the 2009 VLSI Test Symposium [18]. The authors would like to thank Professor Sanjay Patel and Nicholas Wang from the University of Illinois at Urbana-Champaign for sharing the IVM microprocessor model and for providing technical assistance in its installation and usage.
PY - 2011
Y1 - 2011
N2 - We investigate the correlation between low-level faults in the control logic of a modern microprocessor and their instruction-level impact on the execution of typical workload. Such information can prove immensely useful in accurately assessing and prioritizing faults with regards to their criticality, as well as commensurately allocating resources to enhance online testability and error/fault resilience through concurrent error detection/correction methods. To this end, we developed an extensive fault simulation infrastructure which allows injection of stuck-at faults and transient errors of arbitrary starting time and duration, as well as cost-effective simulation and classification of their repercussions into various instruction-level error types. As a test vehicle for our study, we employ a superscalar, dynamically-scheduled, out-of-order, Alpha-like microprocessor, on which we execute SPEC2000 integer benchmarks. Extensive fault injection campaigns in control modules of this microprocessor facilitate valuable observations regarding the distribution of low-level faults into the instruction-level error types that they cause. Experimentation with both Register Transfer (RT-) and Gate-Level faults, as well as with both stuck-at faults and transient errors, confirms the validity and corroborates the utility of these observations.
AB - We investigate the correlation between low-level faults in the control logic of a modern microprocessor and their instruction-level impact on the execution of typical workload. Such information can prove immensely useful in accurately assessing and prioritizing faults with regards to their criticality, as well as commensurately allocating resources to enhance online testability and error/fault resilience through concurrent error detection/correction methods. To this end, we developed an extensive fault simulation infrastructure which allows injection of stuck-at faults and transient errors of arbitrary starting time and duration, as well as cost-effective simulation and classification of their repercussions into various instruction-level error types. As a test vehicle for our study, we employ a superscalar, dynamically-scheduled, out-of-order, Alpha-like microprocessor, on which we execute SPEC2000 integer benchmarks. Extensive fault injection campaigns in control modules of this microprocessor facilitate valuable observations regarding the distribution of low-level faults into the instruction-level error types that they cause. Experimentation with both Register Transfer (RT-) and Gate-Level faults, as well as with both stuck-at faults and transient errors, confirms the validity and corroborates the utility of these observations.
KW - Fault simulation
KW - concurrent error detection
KW - instruction-level error
KW - microprocessor controller
UR - http://www.scopus.com/inward/record.url?scp=79961077712&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79961077712&partnerID=8YFLogxK
U2 - 10.1109/TC.2010.60
DO - 10.1109/TC.2010.60
M3 - Article
AN - SCOPUS:79961077712
SN - 0018-9340
VL - 60
SP - 1260
EP - 1273
JO - IEEE Transactions on Computers
JF - IEEE Transactions on Computers
IS - 9
M1 - 5432157
ER -