Beta
60172

Automatic Speech Segmentation Using Hybrid Wavelet Features and HMM

Article

Last updated: 24 Dec 2024

Subjects

-

Tags

-

Abstract

In this research, a novel feature set is used to automatically segment speech signal. Automatic segmentation is very
useful especially for large database. A hybrid features model is created from wavelet packet analysis and mel-scale is used to train Hidden Markov Model (HMM) for phone boundary detection. HMM is implemented using the Hidden Markov Model Toolkit (HTK).The database (Ked-TIMIT) is used for result verifications and Mel Frequency Cepstral Coefficients (MFCC) is used as reference for evaluating the results of the proposed Hybrid model. The results are categorized for vowels, consonants and short phones. Phone duration and start location are used as metrics to evaluate the system success rate. Success rate of 74% is achieved for consonant detection, 72% for vowel detection and 58% for short phone detection. Using the simple metric that relies only on boundary locations but ignoring duration, the achieved results are 92.5% for consonant detection, 90% for vowel detection and 77.5% for short phoneme detection. In addition to boundary detection the proposed hybrid model is utilized to compare newly developed features called Mel scale Best Tree Encoding (Mel-BTE ) to the mostly used popular features MFCC along with all experiments using the same database. The relative results for Mel-BTE with respect to MFCC are 94.77% for consonant detection, 87.5% for vowel detection and 93.33% for short phoneme detection.

DOI

10.21608/ejle.2016.60172

Keywords

Mel scale, BTE, MFCC, HTK, Gaussian Mixture, Speech Segmentation

Authors

First Name

Amr

Last Name

Gody

MiddleName

M.

Affiliation

Electrical Engineering Department, Faculty of Engineering, Fayoum University

Email

amg00@fayoum.edu.eg

City

Fayoum

Orcid

0000-0003-2079-9860

First Name

Manal

Last Name

Shabaan

MiddleName

-

Affiliation

Electrical Engineering Department, Faculty of Engineering, Fayoum University

Email

manal.mohammed66@yahoo.com

City

Fayoum, Egypt

Orcid

-

First Name

Amr

Last Name

Saleh

MiddleName

-

Affiliation

Electrical Engineering Department, Faculty of Engineering, Fayoum University, Egypt

Email

aae00@fayoum.edu.eg

City

Fayoum, Egypt

Orcid

-

Volume

3

Article Issue

2

Related Issue

9130

Issue Date

2016-09-01

Receive Date

2016-05-11

Publish Date

2016-09-01

Page Start

1

Page End

13

Print ISSN

2356-8208

Online ISSN

2356-8216

Link

https://ejle.journals.ekb.eg/article_60172.html

Detail API

https://ejle.journals.ekb.eg/service?article_code=60172

Order

1

Type

Original Article

Type Code

1,039

Publication Type

Journal

Publication Title

The Egyptian Journal of Language Engineering

Publication Link

https://ejle.journals.ekb.eg/

MainTitle

Automatic Speech Segmentation Using Hybrid Wavelet Features and HMM

Details

Type

Article

Created At

22 Jan 2023