34635

Investigating the effect of speech features and the number of HMM mixtures in the quality HMM-based synthesizers

Article

Last updated: 04 Jan 2025

Subjects

-

Tags

-

Abstract

Abstract:
A statistical parametric speech synthesis system based on hidden Markov models
(HMMs) has grown in popularity over the last few years. In this approach the system
simultaneously models spectrum, excitation, and duration of speech using contextdependent
HMMs and generates speech waveforms from the HMMs themselves. This
paper describes the HMM-based speech synthesis system and applies it to Arabic
language using small size training speech database as an example, and shows that the
resulting model database has the advantage of being small (can be less than 1MB).
Experiments show that using Mel-cepstral coefficients as spectral parameters of speech
waveforms for training gives better results than using LPC or PARCOR coefficients.
Experiments also show that increasing the number of Gaussian Mixtures with this
relatively small size training data has the disadvantage of poor generalization of HMMs
that leads to perceivable discontinuities and clicks in the synthesized speech.

DOI

10.21608/iceeng.2008.34635

Keywords

Hidden Markov Model (HMM), speech synthesis

Authors

First Name

M.

Last Name

Barakat

MiddleName

S.

Affiliation

Modern Academy.

Email

-

City

-

Orcid

-

First Name

M.

Last Name

Gadallah

MiddleName

E.

Affiliation

Military Technical College.

Email

-

City

-

Orcid

-

First Name

T.

Last Name

Nazmy

MiddleName

-

Affiliation

Faculty of computer and information sciences, Ain shams University.

Email

-

City

-

Orcid

-

First Name

T.

Last Name

El Arif

MiddleName

-

Affiliation

Faculty of computer and information sciences, Ain shams University.

Email

-

City

-

Orcid

-

Volume

6

Article Issue

6th International Conference on Electrical Engineering ICEENG 2008

Related Issue

5700

Issue Date

2008-05-01

Receive Date

2019-06-13

Publish Date

2008-05-01

Page Start

1

Page End

12

Print ISSN

2636-4433

Online ISSN

2636-4441

Link

https://iceeng.journals.ekb.eg/article_34635.html

Detail API

https://iceeng.journals.ekb.eg/service?article_code=34635

Order

182

Type

Original Article

Type Code

833

Publication Type

Journal

Publication Title

The International Conference on Electrical Engineering

Publication Link

https://iceeng.journals.ekb.eg/

MainTitle

Investigating the effect of speech features and the number of HMM mixtures in the quality HMM-based synthesizers

Details

Type

Article

Created At

22 Jan 2023