78266

E-PROBCONS: Enhanced PROBCONS for Multiple Sequence Alignment

Article

Last updated: 04 Jan 2025

Subjects

-

Tags

-

Abstract

Abstract— the perfect alignment between three or more sequences of protein, RNA or DNA is a very difficult task in Bioinformatics. There are many techniques for alignment of multiple sequences. Many techniques enlarge speed and do not have a concern with the accuracy of the resulting alignment. However, other techniques heighten accuracy and do not have a concern with the speed. The vital goals of any technique are (a) reducing memory and execution time requirements, and (b) increasing the accuracy of multiple sequence alignment on large-scale datasets. PROBCONS is a multiple protein sequence alignment (MPSA) tool that achieves the most expected accuracy, but it has a time-consuming problem. To solve this problem and enlarging the accuracy of the MPSA, E-PROBCONS is proposed to enhance PROBCONS tool. E- PROBCONS cluster the large multiple protein sequences into structurally similar protein sequences. Then PROBCONS MPSA tool will be performed in parallel on the Amazon Elastic Cloud (EC2). The proposed approaches are more suitable for large-scale data sets and short sequences. Comparing with algorithms (e.g., PROBCONS, KALIGN, and HALIGN I), provided more than 50% improvement in terms of average sum of pairs alignment scores (SPscores) and reduce the execution time for producing the alignment result. The proposed approaches are implemented on big data framework Hadoop Map-Reduce platform in order to improve the scalability with different protein datasets.

DOI

10.21608/ijci.2020.15523.1002

Keywords

Bioinformatics, Multiple sequence alignment, Protein features, PROBCONS

Authors

First Name

Eman

Last Name

Mohamed

MiddleName

M

Affiliation

Computer Science Dept, Faculty of Computers and Information, Menoufia University, Egypt.

Email

eman.mohamed@ci.menofia.edu.eg

City

-

Orcid

-

First Name

Hamdy

Last Name

Mousa

MiddleName

-

Affiliation

Faculty of Computer and Information Menoufia University

Email

hamdimmm@hotmail.com

City

-

Orcid

0000-0001-9503-9124

First Name

arabi

Last Name

keshk

MiddleName

-

Affiliation

Faculty of Computers and Information, Menoufia University, Egypt

Email

arabikesk@yahoo.com

City

-

Orcid

-

Volume

7

Article Issue

1

Related Issue

17861

Issue Date

2020-10-01

Receive Date

2019-07-31

Publish Date

2020-10-01

Page Start

1

Page End

13

Print ISSN

1687-7853

Online ISSN

2735-3257

Link

https://ijci.journals.ekb.eg/article_78266.html

Detail API

https://ijci.journals.ekb.eg/service?article_code=78266

Order

1

Type

Original Article

Type Code

877

Publication Type

Journal

Publication Title

IJCI. International Journal of Computers and Information

Publication Link

https://ijci.journals.ekb.eg/

MainTitle

E-PROBCONS: Enhanced PROBCONS for Multiple Sequence Alignment

Details

Type

Article

Created At

22 Jan 2023