DNAE-GAN: Noise-free acoustic signal generator by integrating autoencoder and generative adversarial network

Linear predictive coding is an extremely effective voice generation method that operates through simple process. However, linear predictive coding–generated voices have limited variations and exhibit excessive noise. To resolve these problems, this article proposes an artificial intelligence model t...

Full description

Saved in:
Bibliographic Details
Published in:International journal of distributed sensor networks Vol. 16; no. 5; p. 155014772092352
Main Authors: Kuo, Ping-Huan, Lin, Ssu-Ting, Hu, Jun
Format: Journal Article
Language:English
Published: London, England SAGE Publications 01.05.2020
Wiley
Subjects:
ISSN:1550-1329, 1550-1477, 1550-1477
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Linear predictive coding is an extremely effective voice generation method that operates through simple process. However, linear predictive coding–generated voices have limited variations and exhibit excessive noise. To resolve these problems, this article proposes an artificial intelligence model that combines a denoise autoencoder with generative adversarial networks. This model generates voices with similar semantics through the random input from the latent space of generator. The experimental results indicate that voices generated exclusively by generative adversarial networks exhibit excessive noise. To solve this problem, a denoise autoencoder was connected to the generator for denoising. The experimental results prove the feasibility of the proposed voice generation method. In the future, this method can be applied in robots and voice generation applications to increase the humanistic language expression ability of robots and enable robots to demonstrate more humanistic and natural speaking performance.
ISSN:1550-1329
1550-1477
1550-1477
DOI:10.1177/1550147720923529