Beta
59890

Automatic Speech Annotation Using HMM based on Best Tree Encoding (BTE) Feature

Article

Last updated: 24 Dec 2024

Subjects

-

Tags

-

Abstract

Manual annotation for time-aligning a speech waveform against the corresponding phonetic sequence is a tedious and time consuming task. This paper aimed to introduce a completely automated phone recognition system based on Best Tree Encoding (BTE) 4-point speech feature. BTE is used to find phoneme boundaries along speech utterance. Comparison to Mel-frequency cepstral coefficients (MFCCs) speech feature in solving the same problem is provided. Hidden Markov Model (HMM) and Gaussian Mixtures are used for building the statistical models through this research. HTK software toolkit is utilized for implementation of the model. The System can identify spoken phone at 59.1% recognition rate based on MFCC and 22.92% recognition rate based on BTE. The current BTE vector is 4 components compared to 39 components of MFCC. This makes it very promising features vector, BTE with 4 components gives a comparable recognition success rate compared to the 39 components MFCC vector widely in the area of ASR.

DOI

10.21608/ejle.2014.59890

Keywords

BTE, MFCC, HTK, Gaussian Mixture, speech recognition

Authors

First Name

Amr

Last Name

Gody

MiddleName

-

Affiliation

Electrical Engineering Department, Faculty of Engineering, Fayoum University

Email

amg00@fayoum.edu.eg

City

Fayoum

Orcid

0000-0003-2079-9860

First Name

Rania

Last Name

Abul Seoud

MiddleName

-

Affiliation

Electrical Engineering Department, Faculty of Engineering, Fayoum University

Email

r-abulseoud@k-space.org

City

Fayoum, Egypt

Orcid

-

First Name

Mohamed

Last Name

Hassan

MiddleName

-

Affiliation

Electrical Engineering Department, Faculty of Engineering, Fayoum University

Email

mh1323@fayou.edu.eg

City

Fayoum, Egypt

Orcid

-

Volume

1

Article Issue

1

Related Issue

8935

Issue Date

2014-01-01

Receive Date

2013-07-17

Publish Date

2014-01-01

Page Start

55

Page End

62

Print ISSN

2356-8208

Online ISSN

2356-8216

Link

https://ejle.journals.ekb.eg/article_59890.html

Detail API

https://ejle.journals.ekb.eg/service?article_code=59890

Order

5

Type

Original Article

Type Code

1,039

Publication Type

Journal

Publication Title

The Egyptian Journal of Language Engineering

Publication Link

https://ejle.journals.ekb.eg/

MainTitle

Automatic Speech Annotation Using HMM based on Best Tree Encoding (BTE) Feature

Details

Type

Article

Created At

22 Jan 2023