Speech Recognitionwith Speech Density Estimation by the Dirichlet Process Mixture

This paper shows a method for the modeling of speech signal distributions based on Dirichlet process mixtures (DPM) and the estimation of noise sequences based on particle filtering. In real situations, the speech recognition rate degrades miser ably because of the effect of environmental noises, re...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	2008 IEEE International Conference on Acoustics, Speech and Signal Processing s. 1553 - 1556
Hlavní autoři:	Ota, K., Duflos, E., Vanheeghe, P., Yanagida, M.
Médium:	Konferenční příspěvek
Jazyk:	angličtina japonština
Vydáno:	IEEE 01.03.2008
Témata:	Degradation Filtering Kalman filtering Particle filters Signal generators Signal processing Signal to noise ratio Speech enhancement Speech processing Speech recognition Stochastic processes Working environment noise
ISBN:	9781424414833, 1424414830
ISSN:	1520-6149
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	This paper shows a method for the modeling of speech signal distributions based on Dirichlet process mixtures (DPM) and the estimation of noise sequences based on particle filtering. In real situations, the speech recognition rate degrades miser ably because of the effect of environmental noises, reflected waves and so on. To improve the speech recognition rate, a technique for the estimation of noise sequences is necessary. In this paper, the distribution of the clean speech is modeled using the DPM instead of the traditional model, which is a Gaussian mixture model (GMM). Speech signal sequences are generated according to the mean and covariance generated from the DPM. Then, noise signal sequences are estimated with a particle filter. The proposed method using extended Kalman filter (EKF) can improve the speech recognition rate significantly in the low SNR region. Applying unscented Kalman filter (UKF), better results can be obtained in also the high SNR.
ISBN:	9781424414833 1424414830
ISSN:	1520-6149
DOI:	10.1109/ICASSP.2008.4517919