Efficient Techniques for Adaptive Independent Checkpointing in Distributed Systems
IEICE TRANSACTIONS on Information and Systems
Publication Date: 2000/08/25
Print ISSN: 0916-8532
Type of Manuscript: PAPER
Category: Fault Tolerance
Keyword: distributed systems, fault tolerance, checkpointing, failure recovery
This work presents two novel algorithms that prevent rollback propagation in independent checkpointing: an efficient adaptive independent checkpointing algorithm and an optimized adaptive independent checkpointing algorithm. Both employ the last opportunity strategy, which yields better performance than the conservation strategy, to prevent useless checkpoints on both causal and non-causal rewinding paths. The two proposed methods are domino-effect-free and require only a limited amount of control information. They also take fewer unnecessary adaptive checkpoints than other algorithms. Furthermore, experimental results indicate that, for service-providing applications, the checkpoint overhead of the proposed techniques is lower than that of coordinated checkpointing and of other domino-effect-free algorithms.