Multi-Task Autoencoder for Noise-Robust Speech Recognition

For speech recognition in noisy environments, we propose a multi-task autoencoder which estimates not only clean speech features but also noise features from noisy speech. We introduce the deSpeeching autoencoder, which excludes speech signals from noisy speech, and combine it with the conventional...

Full description

Saved in:
Bibliographic Details
Published in:2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pp. 5599 - 5603
Main Authors: Zhang, Haoyi, Liu, Conggui, Inoue, Nakamasa, Shinoda, Koichi
Format: Conference Proceeding
Language:English
Published: IEEE 01.04.2018
Subjects:
ISSN:2379-190X
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:For speech recognition in noisy environments, we propose a multi-task autoencoder which estimates not only clean speech features but also noise features from noisy speech. We introduce the deSpeeching autoencoder, which excludes speech signals from noisy speech, and combine it with the conventional denoising autoencoder to form a unified multi-task au-toencoder (MTAE). We evaluate it using the Aurora 2 dataset and CHIME 3 dataset. It reduced WER by 15.7% from the conventional denoising autoencoder in the Aurora 2 test set A.
ISSN:2379-190X
DOI:10.1109/ICASSP.2018.8461446