Fault-tolerant scalable hierarchical scheduling in grid computing

Article

Last updated: 04 Jan 2025

Abstract

Computational grids have the potential to solve large-scale scientific applications using heterogeneous, distributed and possibly non-dedicated resources. Because the grid environment is dynamic in nature, scalable and fault-tolerant scheduling is much needed for parallel applications with inter-process communication. In this paper, we propose a hierarchical and fault-tolerant scheduling approach in which an application's processes communicate indirectly, sending messages over the network through a mailbox-based communication technique at a shared node. In a grid, processes often migrate from one node to another, so this technique ensures the reliable delivery of messages and prevents messages sent to a migrating process from being lost. A non-evolutionary mapping heuristic based on the Max-Min approach is also proposed for mapping such applications onto grid resources. Finally, the MPICH-V1 protocol is integrated into our scheduling framework, which exploits the mailbox-based technique instead of channel memories. Simulation results demonstrate that the proposed approach as a whole schedules grid applications effectively in a scalable and fault-tolerant way, thereby ensuring that an application is executed within its deadline and making the grid environment trustworthy.
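
The abstract does not spell out the mapping heuristic itself, only that it is based on the Max-Min approach. As a rough illustration of that underlying idea, the following is a minimal sketch of the classical Max-Min heuristic, not the authors' variant. The function name `max_min_schedule`, the `etc` matrix (expected time to compute each task on each machine) and the tie-breaking rule are assumptions made for this example.

```python
def max_min_schedule(etc, ready=None):
    """Classical Max-Min mapping (illustrative sketch, not the paper's variant).

    etc[t][m] is the assumed execution time of task t on machine m.
    ready[m] is the time at which machine m becomes free (defaults to 0).
    Returns (assignment, makespan), where assignment maps task -> machine.
    """
    n_machines = len(etc[0])
    ready = list(ready) if ready is not None else [0.0] * n_machines
    unassigned = set(range(len(etc)))
    assignment = {}

    while unassigned:
        # Step 1: for every unassigned task, find the machine that gives
        # its earliest completion time (the "min" part).
        best = {}
        for t in unassigned:
            m = min(range(n_machines), key=lambda j: ready[j] + etc[t][j])
            best[t] = (ready[m] + etc[t][m], m)

        # Step 2: pick the task whose earliest completion time is largest
        # (the "max" part). Ties go to the lowest task index (an assumption).
        task = max(best, key=lambda t: (best[t][0], -t))
        finish, machine = best[task]

        # Step 3: commit the mapping and advance that machine's ready time.
        assignment[task] = machine
        ready[machine] = finish
        unassigned.remove(task)

    return assignment, max(ready)


if __name__ == "__main__":
    # Three tasks, two machines; the numbers are made up for illustration.
    etc = [[2, 4], [6, 3], [8, 10]]
    mapping, makespan = max_min_schedule(etc)
    print(mapping, makespan)
```

Scheduling the longest tasks first tends to keep a single large task from dominating the makespan, which is the usual motivation for Max-Min over Min-Min when a few tasks are much longer than the rest.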

DOI

10.21608/iceeng.2012.32716

Keywords

Grid scheduling, fault-tolerance, MPICH-V1, rescheduling, checkpointing

Authors

First Name

Gamal

Last Name

El-Sayed

MiddleName

A.

Affiliation

Electrical Engineering Department, Assiut University, Egypt.

First Name

Aref

Last Name

Abdullah

MiddleName

M.

Affiliation

Electrical Engineering Department, Assiut University, Egypt.

Volume

8

Article Issue

8th International Conference on Electrical Engineering ICEENG 2012

Related Issue

5272

Issue Date

2012-05-01

Receive Date

2019-05-22

Publish Date

2012-05-01

Page Start

1

Page End

27

Print ISSN

2636-4433

Online ISSN

2636-4441

Link

https://iceeng.journals.ekb.eg/article_32716.html

Detail API

https://iceeng.journals.ekb.eg/service?article_code=32716

Order

91

Type

Original Article

Type Code

833

Publication Type

Journal

Publication Title

The International Conference on Electrical Engineering

Publication Link

https://iceeng.journals.ekb.eg/

Details

Type

Article

Created At

22 Jan 2023