Enhancing Arabic Text Mining Using Linguistic Factors

Article

Last updated: 27 Dec 2024

Abstract

The World Wide Web overwhelms people with an immense amount of widely
distributed, interconnected, rich and dynamic hypertext information. Text mining is
concerned with extracting knowledge from unstructured textual data. The most important
task in achieving this goal is finding the rules that relate specific words and phrases.
This research shows how Arabic morphology and Arabic synonyms, as linguistic factors,
can be used to extract the required knowledge from Arabic texts.
The contribution of this research is the design and implementation of a
system that combines morphology, synonyms, indexing and databases for text mining and
information retrieval, with different operating modes for morphology and synonyms.
The approach is based on preprocessing the Arabic text to convert it into a
semi-structured database. A suitable indexing method and an appropriate searching
mechanism are then used to extract the required information. The proposed model was
tested and showed promising results. The work also revealed a shortage of Arabic
computational linguistics tools, such as an Arabic lexicon tagged with semantic features.
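
The pipeline outlined in the abstract (preprocess text into a database, index it, then search with morphology and synonym modes switched on or off) can be illustrated with a minimal sketch. This is not the authors' system: the light stemmer, its prefix and suffix lists, the synonym group, the table layout and the sample sentences are all hypothetical and chosen only to make the idea concrete.

```python
import re
import sqlite3

# Hypothetical light stemmer: strips one common prefix and one common suffix.
# A real system would rely on a proper Arabic morphological analyzer.
PREFIXES = ("ال", "و")
SUFFIXES = ("ات", "ون", "ين", "ة")

# Hypothetical synonym group (car / vehicle), written as surface forms.
SYNONYM_GROUPS = [{"سيارة", "مركبة"}]

# Hypothetical sample documents.
DOCS = {
    1: "اشترى الرجل سيارة",
    2: "المركبة سريعة",
    3: "قرأ الطالب الكتاب",
}


def tokenize(text):
    """Split text into runs of Arabic letters (U+0621 to U+064A)."""
    return re.findall(r"[\u0621-\u064A]+", text)


def light_stem(word):
    """Remove at most one prefix and one suffix, keeping at least 3 letters."""
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= 3:
            word = word[len(prefix):]
            break
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            word = word[:-len(suffix)]
            break
    return word


def build_index(docs):
    """Store one row per token, with both its surface form and its stem."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE postings (doc_id INTEGER, surface TEXT, stem TEXT)")
    conn.execute("CREATE INDEX idx_surface ON postings (surface)")
    conn.execute("CREATE INDEX idx_stem ON postings (stem)")
    for doc_id, text in docs.items():
        conn.executemany(
            "INSERT INTO postings VALUES (?, ?, ?)",
            [(doc_id, tok, light_stem(tok)) for tok in tokenize(text)],
        )
    return conn


def search(conn, query, use_morphology=False, use_synonyms=False):
    """Return sorted ids of documents containing any (expanded) query term."""
    norm = light_stem if use_morphology else (lambda w: w)
    column = "stem" if use_morphology else "surface"  # fixed names, not user input
    terms = set()
    for tok in tokenize(query):
        key = norm(tok)
        terms.add(key)
        if use_synonyms:
            for group in SYNONYM_GROUPS:
                normalized = {norm(w) for w in group}
                if key in normalized:
                    terms |= normalized
    if not terms:
        return []
    marks = ",".join("?" for _ in terms)
    rows = conn.execute(
        f"SELECT DISTINCT doc_id FROM postings WHERE {column} IN ({marks}) "
        "ORDER BY doc_id",
        sorted(terms),
    )
    return [row[0] for row in rows]
```

With these toy inputs, an exact-match query for the plural definite form "السيارات" finds nothing, the morphology mode reduces it to the same stem as "سيارة" and finds document 1, and adding the synonym mode also retrieves document 2. Storing both the surface form and the stem in the same table is one simple way to let a single index serve every mode.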

DOI

10.21608/asc.2011.158213

Keywords

Data mining, Arabic Text Mining, Arabic Natural Language Processing, Information Retrieval, information extraction, Database, Indexing

Volume

5

Article Issue

1

Related Issue

23271

Issue Date

2011-06-01

Receive Date

2021-03-21

Publish Date

2011-06-01

Page Start

49

Page End

62

Print ISSN

1687-8515

Online ISSN

2682-3578

Link

https://asc.journals.ekb.eg/article_158213.html

Detail API

https://asc.journals.ekb.eg/service?article_code=158213

Order

4

Type

Original Article

Type Code

1,549

Publication Type

Journal

Publication Title

Journal of the ACS Advances in Computer Science

Publication Link

https://asc.journals.ekb.eg/

Details

Type

Article

Created At

23 Jan 2023