Beta
407141

A Distributed Feature Selection Approach over Hadoop for Accurate Classification based on Grasshopper Algorithm and Rough Sets

Article

Last updated: 01 Feb 2025

Subjects

-

Tags

AI & Expert Systems

Abstract

In the specialized field of data analysis, precise feature selection has become paramount, especially given the extensive and in- tricate datasets available. Many of these datasets house a plethora of features, of which a substantial number may be redundant, leading to potential inaccuracies and increased computational demands. Although the Rough Set (RS) and Multigranular Rough Set (MGRS) models have demonstrated efficacy in feature selection, their computational complexities can be limiting. To address this, we introduce an innovative solution, integrating the MGRS with the Grasshopper Optimization Algorithm (GOA)-a meta- heuristic technique derived from grasshopper foraging behaviors. To manage large-scale data, we employ the Hadoop framework for streamlined distributed processing. By distributing the enhanced GOA tasks within Hadoop, we aspire to efficiently process large-scale datasets. The proposed algorithm's efficacy is assessed using dedicated datasets, benchmarked via classifiers such as Random Forest and K-Nearest Neighbor. Preliminary results highlight the superior performance of our approach compared to prevalent metaheuristic strategies, with the MGRS model enhancing performance notably when employed as an objective function.

DOI

10.21608/djis.2025.351506.1006

Keywords

Multigranular Rough Set (MGRS), Grasshopper Optimization Algorithm (GOA), Hadoop framework, Large-scale datasets, Feature Selection

Authors

First Name

Ahmed

Last Name

Hamed

MiddleName

-

Affiliation

Department of Computer Science, Faculty of Computers and Information, Damanhour University, Damanhour, 22511, Egypt

Email

ahmed_hamed@cis.dmu.edu.eg

City

-

Orcid

-

Volume

1

Article Issue

1

Related Issue

52014

Issue Date

2025-01-01

Receive Date

2025-01-08

Publish Date

2025-01-26

Print ISSN

3062-5017

Link

https://djis.journals.ekb.eg/article_407141.html

Detail API

http://journals.ekb.eg?_action=service&article_code=407141

Order

407,141

Type

Original Article

Type Code

3,325

Publication Type

Journal

Publication Title

Damanhour Journal of Intelligent Systems and Informatics

Publication Link

https://djis.journals.ekb.eg/

MainTitle

A Distributed Feature Selection Approach over Hadoop for Accurate Classification based on Grasshopper Algorithm and Rough Sets

Details

Type

Article

Created At

01 Feb 2025