On Reducing Rollback Propagation Effect of Optimistic Message Logging for Group-Based Distributed Systems

Jinho AHN  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E96-D   No.11   pp.2473-2477
Publication Date: 2013/11/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.E96.D.2473
Print ISSN: 0916-8532
Type of Manuscript: LETTER
Category: Dependable Computing
Keyword: 
distributed computing,  scalability,  fault-tolerance,  group communication,  message logging and recovery,  

Full Text: PDF(2.1MB)>>
Buy this Article




Summary: 
This paper presents a new scalable method to considerably reduce the rollback propagation effect of the conventional optimistic message logging by utilizing positive features of reliable FIFO group communication links. To satisfy this goal, the proposed method forces group members to replicate different receive sequence numbers (RSNs), which they assigned for each identical message to their group respectively, into their volatile memories. As the degree of redundancy of RSNs increases, the possibility of local recovery for each crashed process may significantly be higher. Experimental results show that our method can outperform the previous one in terms of the rollback distance of non-faulty processes with a little normal time overhead.