293686

ARABIC CORPUS of LIBRARY and INFORMATION SCIENCE: DESIGN and CONSTRUCTION

Article

Last updated: 04 Jan 2025

Abstract

This paper addresses the principal considerations in creating the Arabic Corpus of Library and Information Science, a specialized Arabic corpus of the academic genre. It discusses ten phases of creation: the rationale for the corpus, types of texts, sources of texts, legal approval, data collection, refining texts, revising texts, saving texts, coding texts, and, finally, the size of the corpus (357,485 tokens). Collecting the texts of the articles was the longest and most challenging phase of building the corpus, especially when the files were PDFs or images that software could not read with full accuracy. This challenge was overcome by considering several factors, which are explained in the discussion of that phase. The Arabic Corpus of Library and Information Science can play a significant role in addressing the salient features of the academic genre, including keyword identification, lexico-grammatical patterns, themes, topics, and index terms used in Library and Information Science. Furthermore, the steps followed in creating the corpus can guide the building of other corpora for any genre or language.
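The corpus size and keyword identification mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the token definition (a run of Arabic letters after stripping diacritics and tatweel) and the function names `tokenize` and `corpus_stats` are assumptions made for illustration only.

```python
import re
from collections import Counter

# Assumption: diacritics (U+064B..U+0652) and tatweel (U+0640) are removed
# before counting, so vocalized and unvocalized spellings are merged.
_DIACRITICS = re.compile(r"[\u064B-\u0652\u0640]")
# Assumption: a token is a maximal run of Arabic letters (U+0621..U+064A).
_TOKEN = re.compile(r"[\u0621-\u064A]+")


def tokenize(text: str) -> list[str]:
    """Strip diacritics and tatweel, then split into Arabic-letter tokens."""
    return _TOKEN.findall(_DIACRITICS.sub("", text))


def corpus_stats(texts):
    """Return (total token count, per-token frequency Counter) for the texts."""
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return sum(counts.values()), counts
```

Calling `corpus_stats` on the cleaned article texts gives a token total comparable in kind to the reported corpus size, and `counts.most_common(n)` gives a raw frequency list as a starting point for keyword work. A different token definition would give a different total.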

DOI

10.21608/ejle.2023.183529.1040

Keywords

Arabic Corpora, Arabic Natural Language Processing, Information Retrieval Systems, Indexing Arabic Texts, Arabic Information Extraction

Authors

First Name

Ayman

Last Name

Eddakrouri

Affiliation

Effat University, Jeddah, Kingdom of Saudi Arabia

Email

a.eldakroury@aucegypt.edu

City

Jeddah, Kingdom of Saudi Arabia

Orcid

0000-0002-6077-349X

Volume

10

Article Issue

1

Related Issue

40912

Issue Date

2023-04-01

Receive Date

2022-12-26

Publish Date

2023-04-01

Page Start

1

Page End

9

Print ISSN

2356-8208

Online ISSN

2356-8216

Link

https://ejle.journals.ekb.eg/article_293686.html

Detail API

https://ejle.journals.ekb.eg/service?article_code=293686

Order

1

Type

Original Article

Type Code

1,039

Publication Type

Journal

Publication Title

The Egyptian Journal of Language Engineering

Publication Link

https://ejle.journals.ekb.eg/

Details

Type

Article

Created At

24 Dec 2024