An Asynchronous Checkpointing Algorithm Using Virtual Checkpointing On Distributed Systems 


Vol. 6,  No. 5, pp. 1203-1211, May  1999
10.3745/KIPSTE.1999.6.5.1203


PDF
  Abstract

Checkpointing is the one of fault-tolerant techniques to restore faults and to restart job fast. Checkpointing algorithms in distributed systems have been studied for many years. These algorithms can be classified into synchronous Checkpointing algorithms and asynchronous Checkpointing algorithms. In this paper, we propose an independent Checkpointing algorithm that has a minimum Checkpointing counts equal to periodic Checkpointing algorithm, and relatively short rollback distance at faulty situation. Checkpointing count is directly related to task completion time in a fault-free situation and short rollback distance is directly related to task completion time in a faulty situation. The proposed algorithm is compared with the previously proposed asynchronous Checkpointing algorithms using simulation. In the simulation, the proposed Checkpointing algorithm produces better results than other algorithms in terms of task completion time in fault-free as well as faulty situations.

  Statistics


  Cite this article

[IEEE Style]

K. D. Hyung, P. C. Soon, K. Jong, "An Asynchronous Checkpointing Algorithm Using Virtual Checkpointing On Distributed Systems," The Transactions of the Korea Information Processing Society (1994 ~ 2000), vol. 6, no. 5, pp. 1203-1211, 1999. DOI: 10.3745/KIPSTE.1999.6.5.1203.

[ACM Style]

Kim Do Hyung, Park Chang Soon, and Kim Jong. 1999. An Asynchronous Checkpointing Algorithm Using Virtual Checkpointing On Distributed Systems. The Transactions of the Korea Information Processing Society (1994 ~ 2000), 6, 5, (1999), 1203-1211. DOI: 10.3745/KIPSTE.1999.6.5.1203.