Reliability Modeling of Declustered-Parity RAID Considering Uncorrectable Bit Errors

Xuefeng WU  Jie LI  Hisao KAMEDA  

IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences, Vol.E80-A, No.8, pp.1508-1515
Publication Date: 1997/08/25
Print ISSN: 0916-8508
Type of Manuscript: PAPER
Category: Reliability and Fault Analysis
Keywords: uncorrectable bit errors, reliability, analytic model, disk arrays, RAID


UNcorrectable Bit Errors (UNBEs) are an important factor in the reliability of Redundant Arrays of Inexpensive Disks (RAID). However, existing reliability analyses of RAID have either ignored them or not studied them in detail. In this paper, we present an analytic model of the reliability of declustered-parity RAID that takes UNBEs into account. Using the model, we obtain optimistic and pessimistic estimates of the probability that data loss occurs because of a UNBE during data reconstruction after a disk failure (we call this DB data loss). We then obtain optimistic and pessimistic estimates of the Mean Time To Data Loss (MTTDL) that account for both DB data loss and the data loss caused by two independent disk failures (we call this DD data loss). Finally, we study by numerical analysis how the MTTDL depends on the number of units in a parity stripe, the rebuild time of a failed disk, and the write fraction of data accesses.
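
The abstract does not reproduce the paper's equations, so the following is only a rough, textbook-style sketch of how DB and DD data loss might be combined into an MTTDL figure; it is not the authors' optimistic or pessimistic model. It assumes independent bit errors, exponentially distributed disk lifetimes, a balanced declustered layout in which any second disk failure during rebuild loses data, and a rebuild that reads the other G - 1 units of every affected parity stripe. All function and parameter names are hypothetical, and the write fraction studied in the paper is not modeled here.

```python
import math


def p_db_loss(disk_bits, stripe_units, bit_error_rate):
    """Probability of at least one UNBE while rebuilding one failed disk.

    Assumption: each lost unit is rebuilt by reading the other
    (stripe_units - 1) units of its parity stripe, and bit errors
    are independent with probability bit_error_rate per bit read.
    """
    bits_read = (stripe_units - 1) * disk_bits
    # 1 - (1 - ber)^bits_read, evaluated in a numerically stable way.
    return -math.expm1(bits_read * math.log1p(-bit_error_rate))


def p_dd_loss(num_disks, mttf_hours, rebuild_hours):
    """Approximate probability that a second disk fails during rebuild.

    Assumption: rebuild_hours is much smaller than mttf_hours, so the
    linear approximation (C - 1) * MTTR / MTTF is used, capped at 1.
    """
    return min(1.0, (num_disks - 1) * rebuild_hours / mttf_hours)


def mttdl_hours(num_disks, mttf_hours, p_dd, p_db):
    """Approximate MTTDL when each first failure leads to data loss
    with probability 1 - (1 - p_dd) * (1 - p_db).

    Assumption: p_dd and p_db are not both zero.
    """
    p_loss = 1.0 - (1.0 - p_dd) * (1.0 - p_db)
    return mttf_hours / (num_disks * p_loss)


if __name__ == "__main__":
    # Illustrative, made-up parameters: 20 disks of 8e12 bits each,
    # 1e-14 bit error rate, 1e5-hour disk MTTF, 10-hour rebuild.
    for g in (5, 10, 20):
        p_db = p_db_loss(8e12, g, 1e-14)
        p_dd = p_dd_loss(20, 1e5, 10.0)
        print(g, p_db, mttdl_hours(20, 1e5, p_dd, p_db))
```

Under these assumptions, a wider parity stripe means more bits are read per rebuild, which raises the DB loss probability, and a longer rebuild raises the DD loss probability; both lower the estimated MTTDL.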