Transfer learning for denoising the echolocation clicks of finless porpoise (Neophocaena phocaenoides sunameri) using deep convolutional autoencoders

Ocean noise has a negative impact on the acoustic recordings of odontocetes' echolocation clicks. In this study, deep convolutional autoencoders (DCAEs) are presented to denoise the echolocation clicks of the finless porpoise (Neophocaena phocaenoides sunameri). A DCAE consists of an encoder ne...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of the Acoustical Society of America Vol. 150; no. 2; p. 1243
Main Authors: Yang, Wuyi, Chang, Wenlei, Song, Zhongchang, Zhang, Yu, Wang, Xianyan
Format: Journal Article
Language:English
Published: 01.08.2021
ISSN:1520-8524, 1520-8524
Online Access:Get more information
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Ocean noise has a negative impact on the acoustic recordings of odontocetes' echolocation clicks. In this study, deep convolutional autoencoders (DCAEs) are presented to denoise the echolocation clicks of the finless porpoise (Neophocaena phocaenoides sunameri). A DCAE consists of an encoder network and a decoder network. The encoder network is composed of convolutional layers and fully connected layers, whereas the decoder network consists of fully connected layers and transposed convolutional layers. The training scheme of the denoising autoencoder was applied to learn the DCAE parameters. In addition, transfer learning was employed to address the difficulty in collecting a large number of echolocation clicks that are free of ambient sea noise. Gabor functions were used to generate simulated clicks to pretrain the DCAEs; subsequently, the parameters of the DCAEs were fine-tuned using the echolocation clicks of the finless porpoise. The experimental results showed that a DCAE pretrained with simulated clicks achieved better denoising results than a DCAE trained only with echolocation clicks. Moreover, deep fully convolutional autoencoders, which are special DCAEs that do not contain fully connected layers, generally achieved better performance than the DCAEs that contain fully connected layers.Ocean noise has a negative impact on the acoustic recordings of odontocetes' echolocation clicks. In this study, deep convolutional autoencoders (DCAEs) are presented to denoise the echolocation clicks of the finless porpoise (Neophocaena phocaenoides sunameri). A DCAE consists of an encoder network and a decoder network. The encoder network is composed of convolutional layers and fully connected layers, whereas the decoder network consists of fully connected layers and transposed convolutional layers. The training scheme of the denoising autoencoder was applied to learn the DCAE parameters. In addition, transfer learning was employed to address the difficulty in collecting a large number of echolocation clicks that are free of ambient sea noise. Gabor functions were used to generate simulated clicks to pretrain the DCAEs; subsequently, the parameters of the DCAEs were fine-tuned using the echolocation clicks of the finless porpoise. The experimental results showed that a DCAE pretrained with simulated clicks achieved better denoising results than a DCAE trained only with echolocation clicks. Moreover, deep fully convolutional autoencoders, which are special DCAEs that do not contain fully connected layers, generally achieved better performance than the DCAEs that contain fully connected layers.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1520-8524
1520-8524
DOI:10.1121/10.0005887