255648

Comparative Study on Feature Selection Methods for Protein

Article

Last updated: 03 Jan 2025

Subjects

-

Tags

-

Abstract

The automated and high-throughput identification of protein function is one of the main issues in computational biology. Predicting the protein's structure is a crucial step in this procedure. In recent years, a wide range of approaches for predicting protein structure has been put forth. They can be divided into two groups: database-based and sequence-based. The first is to identify the principles behind protein structure and attempts to extract valuable characteristics from amino acid sequences. The second one uses pre-existing public annotation databases for data mining. This study emphasizes the sequence-based method and makes use of the ability of amino acid sequences to predict protein activity. The amino acid composition approach, the amino acid tuple approach, and several optimization algorithms were compared. Different protein sequence data sets were used in our experiments. Five classifiers were tested in this research. The best accuracy is 98% using across 10-fold cross-validation. This represents the highest performance in the Human dataset.

DOI

10.21608/ijicis.2022.144051.1190

Keywords

Feature Selection, Protein Sequence, amino acid composition approach, Optimization, classification

Authors

First Name

Walaa

Last Name

Alkady

MiddleName

-

Affiliation

Faculty of computer and Information Sciences, Ain Shams University

Email

walaa.samir@cis.asu.edu.eg

City

Cairo

Orcid

-

First Name

Khaled

Last Name

ElBahnasy

MiddleName

-

Affiliation

Department Information System, Faculty of Computer and Information Sciences,Ain Shams University, Cairo, Egypt.

Email

khaled.bahnasy@cis.asu.edu.eg

City

-

Orcid

-

First Name

Walaa

Last Name

Gad

MiddleName

-

Affiliation

Information Systems Department, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt

Email

walaagad@cis.asu.edu.eg

City

cairo

Orcid

0000-0002-7816-3518

Volume

22

Article Issue

3

Related Issue

36337

Issue Date

2022-08-01

Receive Date

2022-06-11

Publish Date

2022-08-01

Page Start

109

Page End

123

Print ISSN

1687-109X

Online ISSN

2535-1710

Link

https://ijicis.journals.ekb.eg/article_255648.html

Detail API

https://ijicis.journals.ekb.eg/service?article_code=255648

Order

21

Type

Original Article

Type Code

494

Publication Type

Journal

Publication Title

International Journal of Intelligent Computing and Information Sciences

Publication Link

https://ijicis.journals.ekb.eg/

MainTitle

Comparative Study on Feature Selection Methods for Protein

Details

Type

Article

Created At

22 Jan 2023