423606

Automatic Summarization Techniques for Arabic Text;

Article

Last updated: 27 Apr 2025

Subjects

-

Tags

-

Abstract

The fast growth of data has transformed text processing, making it challenging to extract key information efficiently. Text summarization techniques address this by reducing lengthy documents while retaining essential content. Automatic text summarization can be broadly categorized into two main types, extractive and abstractive summarization. In extractive summarization, the final summaries are constructed by selecting and extracting content directly from the source text. On the other hand, abstractive summarization takes a different approach. It aims to understand the source text and convey its core ideas in a more concise form using linguistic techniques. Arabic is spoken by over 300 million people and serves as the official language in 22 countries. There is a growing demand for effective Arabic summarization systems to facilitate efficient information processing and retrieval in the Arab-speaking world. Transformers revolutionize NLP by using self-attention to capture long-range dependencies and process input sequences at the same time, improving efficiency. In abstractive summarization,Transformers play an important role because they produce clear, logical summaries that go beyond simply selecting important passages and rewriting the text in a way that is human-like. In this paper, we present a comprehensive investigation of Arabic summarization datasets and techniques introduced to date, with a focus on fine-tuning and using pre-trained transformer models for Arabic summarization, such as AraT5 and AraBERT. We compare their performance using the ROUGE metric on the Wikilingua multi-sentence dataset and find that AraT5 outperforms AraBERT, showing its effectiveness in abstractive summarization tasks.

DOI

10.21608/ijicis.2025.375275.1389

Keywords

natural language processing, Text Summarization, abstractive Arabic summarization, Deep learning, and transformers

Authors

First Name

Karim

Last Name

Morsi Abd El-Salam

MiddleName

Mohamed

Affiliation

Faculty of Computer and Information Science Ain-Shams

Email

karim.mohamed.fcis@cis.asu.edu.eg

City

Cairo

Orcid

-

First Name

Fatma

Last Name

Naguib

MiddleName

-

Affiliation

Faculty of Computers and Information Sciences, Ain Shams University

Email

fatma_mohamed@cis.asu.edu.eg

City

-

Orcid

-

First Name

Wedad

Last Name

Hussein

MiddleName

-

Affiliation

Faculty of computer and information sciences, Ain Shams University

Email

wedad.hussein@cis.asu.edu.eg

City

-

Orcid

-

First Name

Rasha

Last Name

Ismail

MiddleName

-

Affiliation

Vice Dean for Postgraduate Studies & Research, Faculty of Computer and Information Sciences, Ain Shams University

Email

rashaismail@cis.asu.edu.eg

City

-

Orcid

0000-0003-3581-8112

Volume

25

Article Issue

1

Related Issue

55209

Issue Date

2025-03-01

Receive Date

2025-04-13

Publish Date

2025-03-31

Page Start

74

Page End

88

Print ISSN

1687-109X

Online ISSN

2535-1710

Link

https://ijicis.journals.ekb.eg/article_423606.html

Detail API

http://journals.ekb.eg?_action=service&article_code=423606

Order

423,606

Type

Original Article

Type Code

494

Publication Type

Journal

Publication Title

International Journal of Intelligent Computing and Information Sciences

Publication Link

https://ijicis.journals.ekb.eg/

MainTitle

Automatic Summarization Techniques for Arabic Text;

Details

Type

Article

Created At

27 Apr 2025