348788

Integration of Deep Learning Models for Enhanced Classification of Viral DNA Sequences Across Specific Viruses and Viral Families

Article

Last updated: 03 Jan 2025

Subjects

-

Tags

-

Abstract

The field of genomic bioinformatics is continually challenged by the need for precise classification of viral DNA sequences. The challenge of accurately classifying viral sequences is crucial for the development of diagnostic and therapeutic strategies for any viral outbreaks. This study presents a comprehensive approach integrating two distinct deep learning models, namely the Genetic Algorithm (GA) optimized Convolutional Neural Networks (CNN) hybrid model and the CNN-Extreme Learning Machines (ELM) model aiming to enhance the classification of viral DNA sequences across specific viruses and viral families.
A comprehensive data preprocessing strategy is employed, wherein both datasets undergo k-mer, label, and one-hot vector encoding. This allows for a uniform and comparative analysis across different models and datasets. When the optimized GA-CNN is applied to the more generic viral family dataset, it demonstrates a good adaptability with an accuracy of 95.88% achieving a higher result than the CNN-ELM. In contrast, the CNN-ELM, when tested on the specific virus dataset, maintains robust feature extraction capabilities, faster training time but lower than the optimized GA-CNN model achieving an accuracy of 92.7%.
A comparative analysis of training times is also employed in this study. The CNN-ELM model shows a notable efficiency, with a 34% faster training time compared to the GA-CNN. Moreover, when both models are applied to the new generic dataset, a comparative study with other deep learning models is conducted. Remarkably, the GA-CNN outperforms other models, achieving the highest classification accuracy of 95.88%.

DOI

10.21608/ijicis.2024.279692.1332

Keywords

Genomic Bioinformatics, Viral DNA Classification, GA-CNN, CNN-ELM, Extreme Learning Machines

Authors

First Name

Ahmed

Last Name

El-Tohamy

MiddleName

Hesham

Affiliation

Information Systems Department ,Faculty of Computer and Information Sciences,. Ain Shams University

Email

ahmed.eltohamy@cis.asu.edu.eg

City

Cairo

Orcid

-

First Name

Huda

Last Name

Amin

MiddleName

-

Affiliation

Information Systems Department ,Faculty of Computer and Information Sciences, Ain Shams University

Email

huda_amin@cis.asu.edu.eg

City

Cairo

Orcid

0000-0001-5550-5717

First Name

Nagwa

Last Name

Badr

MiddleName

-

Affiliation

Department of Information Systems, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, 11566, Egypt

Email

nagwabadr@cis.asu.edu.eg

City

-

Orcid

0000-0002-5382-1385

Volume

24

Article Issue

1

Related Issue

46955

Issue Date

2024-03-01

Receive Date

2024-03-27

Publish Date

2024-03-31

Page Start

89

Page End

104

Print ISSN

1687-109X

Online ISSN

2535-1710

Link

https://ijicis.journals.ekb.eg/article_348788.html

Detail API

https://ijicis.journals.ekb.eg/service?article_code=348788

Order

348,788

Type

Original Article

Type Code

494

Publication Type

Journal

Publication Title

International Journal of Intelligent Computing and Information Sciences

Publication Link

https://ijicis.journals.ekb.eg/

MainTitle

Integration of Deep Learning Models for Enhanced Classification of Viral DNA Sequences Across Specific Viruses and Viral Families

Details

Type

Article

Created At

23 Dec 2024