Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization

This paper presents a statistical method of single-channel speech enhancement that uses a variational autoencoder (VAE) as a prior distribution on clean speech. A standard approach to speech enhancement is to train a deep neural network (DNN) to take noisy speech as input and output clean speech. Al...

Bibliographic details
Published in:	2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 716-720
Main authors:	Bando, Yoshiaki; Mimura, Masato; Itoyama, Katsutoshi; Yoshii, Kazuyoshi; Kawahara, Tatsuya
Format:	Conference proceeding
Language:	English
Published:	IEEE, 1 April 2018
Subjects:	Bayes methods; Bayesian signal processing; Data models; Gaussian distribution; Noise measurement; Single-channel speech enhancement; Spectrogram; Speech enhancement; Training; variational autoencoder
ISSN:	2379-190X