411113

SpamML: An Efficient Framework for Detecting Spam Emails Using Machine Learning

Article

Last updated: 15 Feb 2025

Subjects

-

Tags

-

Abstract

Spam detection or anti-spam techniques are methods to identify and filter out unwanted, unsolicited, or malicious emails, commonly known as spam. These techniques aim to enhance email security, reduce the risk of phishing attacks, and improve the overall user experience. The prediction of spam emails falls under the broader email filtering or classification category. Specifically, it is a part of the field of machine learning and data mining, where techniques are employed to automatically categorize emails into different classes, such as "spam" or "non-spam" (ham). This process involves using various algorithms and features to analyze emails' content, structure, and metadata to determine whether they will likely be spam or legitimate messages. Our objective is to use Machine Learning to predict and identify simplistically whether the Email is Spam Or Not. It was concluded and considered that the two datasets we can use have many Machine Learning algorithms. The proposed algorithms were tested: k-nearest Neighbor, Gradient Boosting, Random Forest, Naïve Bayes, Decision Tree, and Logistic Regression. After rigorous testing, the only algorithm, Gradiant boosting, stayed dominant in most of the testing, achieving accuracies of 98.5%; also, the other dataset with the best algorithm was Gradiant boosting, which scored the highest accuracy in all the testing, which was 98.6%. As shown in this paper, Machine Learning algorithms, such as supervised or unsupervised models, are trained on datasets containing examples of both spam and legitimate emails. These models then use the learned patterns to classify incoming emails. Can adapt to new spam patterns, effectively handling complex relationships in data.

DOI

10.21608/jocc.2025.411113

Keywords

Spam Email Prediction, Machine Learning, classification, Naïve Bayes, Gradient boosting, Linear Regression, K-nearest neighbor

Authors

First Name

Diaa

Last Name

AbdElminaam

MiddleName

s

Affiliation

Department of Data Science , Faculty of Computer Science , Misr International University , Cairo , Egypt

Email

diaa.salama@miuegypt.edu.eg

City

-

Orcid

0000-0002-1544-9906

First Name

Maged

Last Name

Farouk

MiddleName

-

Affiliation

Department of Business Information Systems, Faculty of Business, Alamein International University, Alamein, Egypt

Email

melsayed@aiu.edu.eg

City

Alamein

Orcid

-

First Name

Nashwa

Last Name

Shaker

MiddleName

-

Affiliation

Department of Business Information Systems, Faculty of Business, Alamein International University, Alamein, Egypt

Email

nragab@aiu.edu.eg

City

Alamein

Orcid

-

First Name

Omnia

Last Name

Elrashidy

MiddleName

-

Affiliation

Department of Business Information Systems, Faculty of Business, Alamein International University, Alamein, Egypt

Email

oelrashidy@aiu.edu.eg

City

Alamein

Orcid

-

First Name

Reda

Last Name

Elazab

MiddleName

-

Affiliation

Department of Business Information Systems, Faculty of Business, Alamein International University, Alamein, Egypt

Email

relazab@aiu.edu.eg

City

Alamein

Orcid

-

Volume

4

Article Issue

1

Related Issue

53728

Issue Date

2025-02-01

Receive Date

2024-01-08

Publish Date

2025-02-01

Page Start

43

Page End

54

Online ISSN

2636-3577

Link

https://jocc.journals.ekb.eg/article_411113.html

Detail API

http://journals.ekb.eg?_action=service&article_code=411113

Order

4

Type

Original Article

Type Code

731

Publication Type

Journal

Publication Title

Journal of Computing and Communication

Publication Link

https://jocc.journals.ekb.eg/

MainTitle

SpamML: An Efficient Framework for Detecting Spam Emails Using Machine Learning

Details

Type

Article

Created At

15 Feb 2025