Beta
59427

Toward Building a Comprehensive Phrase-based English-Arabic Statistical Machine Translation System

Article

Last updated: 24 Dec 2024

Subjects

-

Tags

-

Abstract

This paper explores a phrase-based statistical machine translation (PBSMT) pipeline for English-Arabic (En-Ar)
language pair. The work surveys the most recent experiments conducted to enhance Arabic machine translation in the En-Ar direction. It also focuses on free datasets and linguistically motivated ideas that enhance phrase-based En-Ar statistical machine translation (SMT) as it is as aims to use those only in order to build a large scale En-Ar SMT system. In addition, the paper highlights Arabic linguistic challenges in Machine Translation (MT) in general. This paper can be considered a guide for building an En-Ar PBSMT system. Furthermore, the presented pipeline can be generalized to any language pairs.

DOI

10.21608/ejle.2017.59427

Keywords

Machine Translation, Arabic Natural Language Processing, Phrase-based, Statistical machine translation

Authors

First Name

Sara

Last Name

Ebrahim

MiddleName

-

Affiliation

Scientific Computing Department, Faculty of Computer and Information Sciences (FCIS), Ain Shams University, Cairo, Egypt

Email

sara.elkafrawy@gmail.com

City

Cairo, Egypt

Orcid

-

First Name

Samha

Last Name

El-Beltagy

MiddleName

R.

Affiliation

Nile University (NU), Center for Informatics Science

Email

samhaa@computer.org

City

Giza, Egypt

Orcid

-

First Name

Doaa

Last Name

Hegazy

MiddleName

-

Affiliation

Scientific Computing Department, Faculty of Computer and Information Sciences (FCIS), Ain Shams University, Cairo, Egypt.

Email

doaa.hegazy@cis.asu.edu.eg

City

Cairo, Egypt

Orcid

-

First Name

Mostafa

Last Name

Mostafa

MiddleName

G.

Affiliation

Computer Science at the Faculty of Computer and Information Sciences (FCIS), Ain Shams University

Email

mgmostafa@cis.asu.edu.eg

City

Cairo, Egypt

Orcid

-

Volume

4

Article Issue

2

Related Issue

9019

Issue Date

2017-09-01

Receive Date

2017-05-12

Publish Date

2017-09-01

Page Start

10

Page End

26

Print ISSN

2356-8208

Online ISSN

2356-8216

Link

https://ejle.journals.ekb.eg/article_59427.html

Detail API

https://ejle.journals.ekb.eg/service?article_code=59427

Order

2

Type

Original Article

Type Code

1,039

Publication Type

Journal

Publication Title

The Egyptian Journal of Language Engineering

Publication Link

https://ejle.journals.ekb.eg/

MainTitle

Toward Building a Comprehensive Phrase-based English-Arabic Statistical Machine Translation System

Details

Type

Article

Created At

22 Jan 2023