Beta
22749

Towards Ontology-Based web text Document Classification

Article

Last updated: 24 Dec 2024

Subjects

-

Tags

-

Abstract

The data on the web is generally stored in structured, semi-structured and un- structured formats; from the survey the most of the information of an organization is stored in unstructured textual form .so, the task of categorizing this huge number of unstructured web text documents has become one of the most important tasks when dealing with web. Categorization, Classification, of web text documents aims in assigning one or more class labels, Categories, to the un-labeled ones; the assignment process depends mainly on the contents of the document itself with the help of using one or more of machine learning techniques. Different learning algorithms have been applied on the content of text documents for the classification process. In this paper experiments uses a subset of Reuters-21578 dataset to highlight the leakage and limitations of traditional techniques for feature generation and dimensionality reduction, showing the results of classification accuracy, and F-measure when applying different classification algorithms.

DOI

10.21608/asat.2017.22749

Keywords

Feature Extraction, natural language processing, web text documents classification, Vector Space Mode, KNN, Principle component analysis, dimensionality reduction, term frequency inverse document frequency

Authors

First Name

Mohamed

Last Name

Elhadad

MiddleName

K.

Affiliation

Egyptian Armed Forces, Egypt.

Email

-

City

-

Orcid

-

First Name

Khaled

Last Name

Badran

MiddleName

M.

Affiliation

Egyptian Armed Forces, Egypt.

Email

-

City

-

Orcid

-

First Name

Gouda

Last Name

Salama

MiddleName

I.

Affiliation

Egyptian Armed Forces, Egypt.

Email

-

City

-

Orcid

-

Volume

17

Article Issue

AEROSPACE SCIENCES & AVIATION TECHNOLOGY, ASAT - 17 – April 11 - 13, 2017

Related Issue

4266

Issue Date

2017-04-01

Receive Date

2018-12-24

Publish Date

2017-04-01

Page Start

1

Page End

8

Print ISSN

2090-0678

Online ISSN

2636-364X

Link

https://asat.journals.ekb.eg/article_22749.html

Detail API

https://asat.journals.ekb.eg/service?article_code=22749

Order

61

Type

Original Article

Type Code

737

Publication Type

Journal

Publication Title

International Conference on Aerospace Sciences and Aviation Technology

Publication Link

https://asat.journals.ekb.eg/

MainTitle

Towards Ontology-Based web text Document Classification

Details

Type

Article

Created At

22 Jan 2023