A Computationally Light Algorithm for Bayesian Speech Enhancement with SNR Marginalization

While speech enhancement has critically required the estimation of local time-varying SNR, it was recently shown that SNR can be marginalized in a Bayesian sense from the minimum-mean-square-error (MMSE) solution. Precisely, the local SNR is introduced as a stochastic variable and Bayesian integrati...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) s. 6209 - 6213
Hlavní autoři: Thaleiser, Stefan, Enzner, Gerald
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 01.05.2020
Témata:
ISSN:2379-190X
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:While speech enhancement has critically required the estimation of local time-varying SNR, it was recently shown that SNR can be marginalized in a Bayesian sense from the minimum-mean-square-error (MMSE) solution. Precisely, the local SNR is introduced as a stochastic variable and Bayesian integration can be approximately realized under consideration of a hyperprior distribution. In our paper, the proposed approach then takes the multimodal nature of the involved posterior distribution into account for speech inference. Specifically, the extrema of the posterior distribution, which can easily be obtained via differentiation, are combined according to their widths, heights and abscissa. The corresponding solution is not closed form, however, it is found within few iterations. This approach delivers a spectral weighting of noisy speech that simultaneously maximizes instrumental criteria of speech quality, specifically the segmental SNR, STOI score and PESQ.
ISSN:2379-190X
DOI:10.1109/ICASSP40776.2020.9054611