Beta
32714

Non-blocking minimum processes coordinated checkpointing for hierarchical computational grid

Article

Last updated: 04 Jan 2025

Subjects

-

Tags

-

Abstract

Fault tolerance is an important property in grid computing as the dependability of individual grid resources may not be able to be guaranteed. Common fault tolerance techniques in distributed systems are normally achieved with checkpoint recovery, message logging with checkpointing, or through task replication on alternative resources in cases of a system outage. In this paper, we present a mailbox-based non-blocking minimum processes coordinated checkpoint protocol for hierarchical grid. In our grid model, processes on different processors communicate indirectly by sending messages over the network through mailbox-based technique at a shared node. The mailbox of each process can be exploited as an events logger since it logs the messages sent to the process in strict FIFO order. The main advantages of our approach are achieving more parallelism and suiting the highly dynamic environment where processes frequently migrate from one
node to anotherز

DOI

10.21608/iceeng.2012.32714

Keywords

Coordinated checkpointing, Fault tolerant, non-blocking, message logging

Authors

First Name

Gamal

Last Name

El-Sayed

MiddleName

A.

Affiliation

Electrical Engineering Department, Assiut University, Egypt.

Email

-

City

-

Orcid

-

First Name

Aref

Last Name

Abdullah

MiddleName

M.

Affiliation

Electrical Engineering Department, Assiut University, Egypt.

Email

-

City

-

Orcid

-

Volume

8

Article Issue

8th International Conference on Electrical Engineering ICEENG 2012

Related Issue

5272

Issue Date

2012-05-01

Receive Date

2019-05-22

Publish Date

2012-05-01

Page Start

1

Page End

13

Print ISSN

2636-4433

Online ISSN

2636-4441

Link

https://iceeng.journals.ekb.eg/article_32714.html

Detail API

https://iceeng.journals.ekb.eg/service?article_code=32714

Order

90

Type

Original Article

Type Code

833

Publication Type

Journal

Publication Title

The International Conference on Electrical Engineering

Publication Link

https://iceeng.journals.ekb.eg/

MainTitle

Non-blocking minimum processes coordinated checkpointing for hierarchical computational grid

Details

Type

Article

Created At

22 Jan 2023