Beta
312008

A Hybrid Approach for Automatic Morphological Diacritization of Arabic Text

Article

Last updated: 05 Jan 2025

Subjects

-

Tags

-

Abstract

 Arabic Modern texts are commonly written without diacritization, which is a critical task for other Arabic
processing tasks as word sense disambiguation, automatic speech recognition, and text to speech, where word meaning
or pronunciation is decided based on the diacritic signs assigned to each letter. This paper presents a novel approach for automatic Arabic text diacritization using deep encode-decode recurrent neural networks that is followed by several text correction techniques, to improve the overall system output accuracy. Experimental results of the proposed system on Wikinews test set show superior performance and are competitive with those of the-state-of-the-art diacritization methods. Namely, our method achieves morphological diacritization Word Error Rate (WER) 3.85% and Diacritic Error Rate (DER) 1.12%
 

DOI

10.21608/mjcis.2018.312008

Keywords

Arabic Natural Language Processing, Automatic Morphological Diacritization, deep encode-decode recurrent neural networks

Authors

First Name

Hatem

Last Name

M Noaman

MiddleName

-

Affiliation

Computer Science Department, Mansoura University, Egypt

Email

-

City

-

Orcid

-

First Name

Shahenda

Last Name

S. Sarhan

MiddleName

-

Affiliation

Computer Science Department, Mansoura University, Egypt

Email

-

City

-

Orcid

-

First Name

M. A. A.

Last Name

Rashwan

MiddleName

-

Affiliation

Electronics and Communications Department, Cairo University, Egypt

Email

-

City

-

Orcid

-

Volume

14

Article Issue

2

Related Issue

42820

Issue Date

2018-12-01

Receive Date

2023-08-10

Publish Date

2018-12-01

Page Start

39

Page End

46

Print ISSN

2090-1666

Online ISSN

2090-1674

Link

https://mjcis.journals.ekb.eg/article_312008.html

Detail API

https://mjcis.journals.ekb.eg/service?article_code=312008

Order

312,008

Type

Original Research Articles.

Type Code

1,784

Publication Type

Journal

Publication Title

Mansoura Journal for Computer and Information Sciences

Publication Link

https://mjcis.journals.ekb.eg/

MainTitle

A Hybrid Approach for Automatic Morphological Diacritization of Arabic Text

Details

Type

Article

Created At

28 Dec 2024