Modular dynamic deep denoising autoencoder for speech enhancement

Deep Denoising Autoencoder (DDAE) is an effective method for noise reduction and speech enhancement. However, a single DDAE with a fixed number of frames for neural network input cannot extract contextual information sufficiently. It has also less generalization in unknown SNRs (signal-to-noise-rati...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:2017 7th International Conference on Computer and Knowledge Engineering (ICCKE) s. 254 - 259
Hlavní autoři: Safari, Razieh, Ahadi, Seyed Mohammad, Seyedin, Sanaz
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 01.10.2017
Témata:
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Deep Denoising Autoencoder (DDAE) is an effective method for noise reduction and speech enhancement. However, a single DDAE with a fixed number of frames for neural network input cannot extract contextual information sufficiently. It has also less generalization in unknown SNRs (signal-to-noise-ratio) and the enhanced output has some residual noise. In this paper, we use a modular model in which three DDAEs with different window lengths are stacked. Experimental results showes that our proposed architecture, namely modular dynamic deep denoising autoencoder (MD-DDAE) provides superior performance in comparison with the traditional DDAE models in different noisy conditions.
DOI:10.1109/ICCKE.2017.8167886