Transfer learning for denoising the echolocation clicks of finless porpoise (Neophocaena phocaenoides sunameri) using deep convolutional autoencoders
Ocean noise has a negative impact on the acoustic recordings of odontocetes' echolocation clicks. In this study, deep convolutional autoencoders (DCAEs) are presented to denoise the echolocation clicks of the finless porpoise (Neophocaena phocaenoides sunameri). A DCAE consists of an encoder ne...
Saved in:
| Published in: | The Journal of the Acoustical Society of America Vol. 150; no. 2; p. 1243 |
|---|---|
| Main Authors: | , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
01.08.2021
|
| ISSN: | 1520-8524, 1520-8524 |
| Online Access: | Get more information |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Ocean noise has a negative impact on the acoustic recordings of odontocetes' echolocation clicks. In this study, deep convolutional autoencoders (DCAEs) are presented to denoise the echolocation clicks of the finless porpoise (Neophocaena phocaenoides sunameri). A DCAE consists of an encoder network and a decoder network. The encoder network is composed of convolutional layers and fully connected layers, whereas the decoder network consists of fully connected layers and transposed convolutional layers. The training scheme of the denoising autoencoder was applied to learn the DCAE parameters. In addition, transfer learning was employed to address the difficulty in collecting a large number of echolocation clicks that are free of ambient sea noise. Gabor functions were used to generate simulated clicks to pretrain the DCAEs; subsequently, the parameters of the DCAEs were fine-tuned using the echolocation clicks of the finless porpoise. The experimental results showed that a DCAE pretrained with simulated clicks achieved better denoising results than a DCAE trained only with echolocation clicks. Moreover, deep fully convolutional autoencoders, which are special DCAEs that do not contain fully connected layers, generally achieved better performance than the DCAEs that contain fully connected layers.Ocean noise has a negative impact on the acoustic recordings of odontocetes' echolocation clicks. In this study, deep convolutional autoencoders (DCAEs) are presented to denoise the echolocation clicks of the finless porpoise (Neophocaena phocaenoides sunameri). A DCAE consists of an encoder network and a decoder network. The encoder network is composed of convolutional layers and fully connected layers, whereas the decoder network consists of fully connected layers and transposed convolutional layers. The training scheme of the denoising autoencoder was applied to learn the DCAE parameters. In addition, transfer learning was employed to address the difficulty in collecting a large number of echolocation clicks that are free of ambient sea noise. Gabor functions were used to generate simulated clicks to pretrain the DCAEs; subsequently, the parameters of the DCAEs were fine-tuned using the echolocation clicks of the finless porpoise. The experimental results showed that a DCAE pretrained with simulated clicks achieved better denoising results than a DCAE trained only with echolocation clicks. Moreover, deep fully convolutional autoencoders, which are special DCAEs that do not contain fully connected layers, generally achieved better performance than the DCAEs that contain fully connected layers. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 1520-8524 1520-8524 |
| DOI: | 10.1121/10.0005887 |