Beta
59295

Modern Standard Arabic Grammar Automatic Extraction from Penn 1 Arabic Treebank Using Natural Language Toolkit

Article

Last updated: 24 Dec 2024

Subjects

-

Tags

-

Abstract

This paper presents a methodology for rule based bottom up parsing technique forModern Standard Arabic (MSA) in
Context Free Grammar (CFG) formalism in Phrase Structure Grammar (PSG) representation, where the grammar is
automatically extracted from a syntactically annotated corpus.The extracted grammar is used to build an automatic lexicon and
grammar rules module. Furthermore, the extracted CFG is further transformed into Probabilistic Context Free Grammar (PCFG)
that could be used in a hybrid approach, which is also calculated automatically. The used corpus is the Penn Arabic
Treebank(PATB)and algorithm implementation is performed with Natural Language Processing Toolkit (NLTK).The parser
showed that automatic extraction of grammar improved the grammar building phase in both coverage of structures and time
needed, but still needs further manual constrains addition. Automatic extraction of grammar is able to enhance rule based
grammar parsers and it will enable a new paradigm of statistically directed symbolic parsing.

DOI

10.21608/ejle.2018.59295

Keywords

Observational Based Grammar, Automatic Grammar Extraction- Rule Based Grammar – Enhancing Arabic Grammar Parsing, Statistically Directed Symbolic Parsing

Authors

First Name

Amira

Last Name

Abdelhalim

MiddleName

-

Affiliation

Phonetics and Linguistics Department, Faculty of Arts, Alexandria University

Email

amira.abdelhalim@yahoo.com

City

Alexandria, Egypt

Orcid

-

First Name

Sameh

Last Name

Alansary

MiddleName

-

Affiliation

Department of Phonetics and Linguistics and the head of Phonetics and Linguistics Department, Faculty of Arts, Alexandria University

Email

s.alansary@alexu.edu.eg

City

Alexandria, Egypt

Orcid

-

Volume

5

Article Issue

1

Related Issue

9001

Issue Date

2018-04-01

Receive Date

2018-01-09

Publish Date

2018-04-01

Page Start

1

Page End

10

Print ISSN

2356-8208

Online ISSN

2356-8216

Link

https://ejle.journals.ekb.eg/article_59295.html

Detail API

https://ejle.journals.ekb.eg/service?article_code=59295

Order

1

Type

Original Article

Type Code

1,039

Publication Type

Journal

Publication Title

The Egyptian Journal of Language Engineering

Publication Link

https://ejle.journals.ekb.eg/

MainTitle

Modern Standard Arabic Grammar Automatic Extraction from Penn 1 Arabic Treebank Using Natural Language Toolkit

Details

Type

Article

Created At

22 Jan 2023