Optimal Entropy to enhance the structure of the Wavelet-Packets-Best-Tree for Automatic Speech Recognition - Egyptian Knowledge Bank

195052

Optimal Entropy to enhance the structure of the Wavelet-Packets-Best-Tree for Automatic Speech Recognition

Article

Last updated: 04 Jan 2025

Abstract

Best Tree Encoding (BTE) is a promising feature extraction technique for Automatic Speech Recognition (ASR) based on wavelet packet decomposition. This research introduces an enhancement of the Wavelet Packet Best Tree (WPBT) calculation. Standard BTE uses a mathematical model to encode the tree structure into a feature vector of 4 components. The best tree structure is computed using an entropy function; the standard version of BTE uses Shannon entropy. In this research, Shannon Entropy (SE), Renyi Entropy (RE), and Tsallis Entropy (TE) are each used to construct the best tree. The best tree is then encoded with the same mathematical model as in the standard 4-point BTE. To evaluate its performance, the proposed model is tested and verified against the most widely used feature, Mel Frequency Cepstral Coefficients (MFCC) plus delta and delta-delta coefficients (39 parameters). The TIMIT database is used in this research. All phones are divided into five classes: Vowels, Fricatives, Silences, Nasals, and Plosives. The acoustic model is implemented using Hidden Markov Models (HMMs) with the HMM Tool Kit (HTK) software; no language model is applied. The experiments show that BTE with Tsallis entropy yields the highest overall success rate, 75.85%, compared with 71.76% for MFCC. Because the BTE vector has only 4 components against the 39 components of the MFCC vector, BTE is a very promising feature vector for further research and development.
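The abstract names three entropy functions but does not state their formulas or parameter values. The following is a minimal sketch of the textbook definitions, applied to the normalized energies of a set of coefficients. The helper names, the energy normalization, and the default orders `alpha=2.0` and `q=2.0` are assumptions made for illustration, not the settings used in the article.

```python
import math


def normalize_energy(coeffs):
    """Map coefficients to a probability-like distribution of their energies.

    Assumption: each coefficient's energy is its square, divided by the
    total energy. Zero-energy entries are dropped.
    """
    energies = [c * c for c in coeffs]
    total = sum(energies)
    if total == 0:
        return []
    return [e / total for e in energies if e > 0]


def shannon_entropy(p):
    """Shannon entropy: -sum(p_i * ln p_i)."""
    return -sum(x * math.log(x) for x in p)


def renyi_entropy(p, alpha=2.0):
    """Renyi entropy of order alpha (alpha > 0, alpha != 1)."""
    if not p:
        return 0.0
    return math.log(sum(x ** alpha for x in p)) / (1.0 - alpha)


def tsallis_entropy(p, q=2.0):
    """Tsallis entropy of order q (q != 1)."""
    if not p:
        return 0.0
    return (1.0 - sum(x ** q for x in p)) / (q - 1.0)


if __name__ == "__main__":
    spread = normalize_energy([1.0, 1.0, 1.0, 1.0])        # energy spread evenly
    concentrated = normalize_energy([2.0, 0.0, 0.0, 0.0])  # energy in one coefficient
    for name, fn in [("Shannon", shannon_entropy),
                     ("Renyi", renyi_entropy),
                     ("Tsallis", tsallis_entropy)]:
        print(name, fn(spread), fn(concentrated))
```

All three measures are lowest when the energy is concentrated in a few coefficients, which is the property an entropy-based cost relies on when comparing candidate decompositions. Both the Renyi and the Tsallis entropy approach the Shannon entropy as their order approaches 1.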

DOI

10.21608/ejle.2021.80509.1019

Keywords

BTE, WPD, Shannon Entropy, Renyi entropy, Tsallis entropy

Authors

First Name

Fatma

Last Name

Abd El_latif Mousa

MiddleName

Mohammed

Affiliation

Mathematics and Physics Department, Faculty of Engineering, Fayoum University, Fayoum, Egypt

Email

fma06@fayoum.edu.eg

Volume

8

Article Issue

2

Related Issue

27704

Issue Date

2021-09-01

Receive Date

2021-06-13

Publish Date

2021-09-01

Page Start

1

Page End

15

Print ISSN

2356-8208

Online ISSN

2356-8216

Article File

EJLE_Volume 8_Issue 2_Pages 1-15.pdf

PDF, 1.7 MB

Link

https://ejle.journals.ekb.eg/article_195052.html

Detail API

https://ejle.journals.ekb.eg/service?article_code=195052

Type

Original Article

Type Code

1,039

Publication Type

Journal

Publication Title

The Egyptian Journal of Language Engineering

Publication Link

https://ejle.journals.ekb.eg/

MainTitle

Optimal Entropy to enhance the structure of the Wavelet-Packets-Best-Tree for Automatic Speech Recognition

Details

Type

Article

Created At

22 Jan 2023
