Reliability Measurement of Fault-Tolerant Onboard Memory System under Fault Clustering
Advances in spaceborne vehicular technology have made long-duration missions in harsh cosmic environments possible. Reliability and data integrity are commonly emphasized requirements of spaceborne solid-state mass storage systems, because faults caused by the harsh environment, such as extreme radiation, can occur throughout the mission. Acceptable dependability for these instruments has been achieved by using redundancy and repair. Reconfiguration (repair) of memory arrays using spare memory lines is the most common technique for enhancing the reliability of memories with faults. Faulty cells in memory arrays are known to show spatial locality, a physical phenomenon referred to as fault clustering. This paper first investigates a quadrat-based fault model for memory arrays under clustered faults to establish a sound foundation for measurement. Then, the long-life dependability of a fault-tolerant spaceborne memory system with hierarchical active redundancy, which consists of spare columns in each memory module and redundant memory modules, is measured in terms of reliability (i.e., the conditional probability that the system performs correctly throughout the mission) and mean time to failure (MTTF, i.e., the expected time that a system will operate before it fails).
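The two measures named in the abstract are linked by the standard identity MTTF = integral of R(t) dt from 0 to infinity. The following minimal sketch illustrates that identity for module-level active redundancy only. It is not the paper's quadrat-based model: it assumes independent module failures at a constant rate (so it ignores fault clustering and spare columns), and the function names, the rate, and the module counts are hypothetical values chosen for illustration.

```python
import math


def module_reliability(t, rate):
    """Reliability of one module at time t, assuming a constant failure rate."""
    return math.exp(-rate * t)


def system_reliability(t, rate, n_modules, n_required):
    """Probability that at least n_required of n_modules survive to time t.

    Assumes all modules are active and fail independently (an illustrative
    simplification; clustered faults would violate this assumption).
    """
    r = module_reliability(t, rate)
    return sum(
        math.comb(n_modules, k) * r**k * (1 - r) ** (n_modules - k)
        for k in range(n_required, n_modules + 1)
    )


def mttf(reliability, horizon, steps=100_000):
    """Approximate MTTF as the integral of R(t) over [0, horizon] (trapezoid rule)."""
    dt = horizon / steps
    total = 0.5 * (reliability(0.0) + reliability(horizon))
    for i in range(1, steps):
        total += reliability(i * dt)
    return total * dt


if __name__ == "__main__":
    rate = 0.5  # hypothetical failures per unit time
    single = mttf(lambda t: module_reliability(t, rate), horizon=60.0)
    duplex = mttf(lambda t: system_reliability(t, rate, 2, 1), horizon=60.0)
    print(f"single module MTTF ~ {single:.4f}")
    print(f"1-out-of-2 system MTTF ~ {duplex:.4f}")
```

Under these assumptions the single module should come out near 1/rate and the 1-out-of-2 system near 1.5/rate, which shows how one redundant module lengthens the expected operating time.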
M. Choi et al., "Reliability Measurement of Fault-Tolerant Onboard Memory System under Fault Clustering," Proceedings of the 19th IEEE Instrumentation and Measurement Technology Conference (2002, Anchorage, AK), vol. 2, pp. 1161-1166, Institute of Electrical and Electronics Engineers (IEEE), May 2002.
The definitive version is available at http://dx.doi.org/10.1109/IMTC.2002.1007121
19th IEEE Instrumentation and Measurement Technology Conference: IMTC (2002: May 21-23, Anchorage, AK)
Electrical and Computer Engineering
Keywords and Phrases
Computer Simulation; Fault Tolerant Computer Systems; Mathematical Models; Probability; Radiation; Reliability; Fault Clustering; Fault Tolerant Onboard Memory System; Mean Time to Failure; Reliability Measurement; Spaceborne Vehicular Technology; Data Storage Equipment; Clustered Faults; Hierarchical Active Redundancy; Memory Reconfiguration (Repair); Onboard Memory Systems; Quadrat-Based Fault Model
Article - Conference proceedings
© 2002 Institute of Electrical and Electronics Engineers (IEEE), All rights reserved.