Automatic Database Segmentation using Hybrid Spectrum -Visual Approach - Egyptian Knowledge Bank

195054

Automatic Database Segmentation using Hybrid Spectrum -Visual Approach

Article

Last updated: 04 Jan 2025

Subjects

Abstract

Nowadays automated segmentation of speech signals has been attracted many of researchers all-over the world, Many speech processing systems require segmentation of speech waveform into principal acoustic units. In this research, TIMIT DataBase (DB) is utilized to carry on this process and justify its operation or results. Thus, this paper presents a novel method of segmentation of speech phonemes, where the proposed strategy helps in the selection of appropriate feature extraction technique for speech segmentation. There are three main techniques of feature extraction used in our research; the first technique is the Mel Frequency Cepstral Coefficient (MFCC), the second technique is known by Best Tree Encoding (BTE), while the third is Image Normalized Encoder (INE), which is a hybrid technique between the Best Tree Image (BTI), and the Convolution Neural Network (CNN) ResNet-50. Then, data are trained using a hybrid model that consists of Hidden Markov Model (HMM), and Gaussian Mixture Model (GMM) to improve the performance of automatic speech recognition. The proposed model is tested and verified against the most widely used feature Mel Frequency Cepstral Coefficient (MFCC) plus delta and delta-delta coefficients (39 parameters) to evaluate its performance. This approach has the potential to be used in applications such as automatic speech recognition and automatic language identification. The experimental results show that BTE technique achieved the highest success rate (𝜂) (92.64%) than using the (INE) technique. However, the INE technique gives confusion success rate for Tr and NTr of values 97.1% and 99.1%, respectively.

DOI

10.21608/ejle.2021.89867.1024

Keywords

ASR, MFCC, BTE, CNN, HMM

Authors

View Authors

First Name

Manar

Last Name

Gbaily

MiddleName

Othman

Affiliation

Electrical Engineering Department, faculty of engineering, fayoum university,Egypt

Email

noragbaily@yahoo.com

City

fayoum

Orcid

Volume

Article Issue

Related Issue

27704

Issue Date

2021-09-01

Receive Date

2021-08-08

Publish Date

2021-09-01

Page Start

Page End

Print ISSN

2356-8208

Online ISSN

2356-8216

Article File

EJLE_Volume 8_Issue 2_Pages 28-43.pdf

PDF . 1.6MB

Link

https://ejle.journals.ekb.eg/article_195054.html

Detail API

https://ejle.journals.ekb.eg/service?article_code=195054

Order

Type

Original Article

Type Code

1,039

Publication Type

Journal

Publication Title

The Egyptian Journal of Language Engineering

Publication Link

https://ejle.journals.ekb.eg/

MainTitle

Automatic Database Segmentation using Hybrid Spectrum -Visual Approach

Details

Type

Article

Created At

22 Jan 2023

Subjects

Tags

Abstract

DOI

Keywords

Authors

First Name

Last Name

MiddleName

Affiliation

Email

City

Orcid

Volume

Article Issue

Related Issue

Issue Date

Receive Date

Publish Date

Page Start

Page End

Print ISSN

Online ISSN

Article File

EJLE_Volume 8_Issue 2_Pages 28-43.pdf

Link

Detail API

Order

Type

Type Code

Publication Type

Publication Title

Publication Link

MainTitle