Data-driven Non-uniform Filterbanks Based on F-ratio for Machine Anomalous Sound Detection

Anomalous sound detection (ASD) aims to detect unknown anomalous sounds emitted from a target machine. Most advanced ASD systems use a complicated neural-network- based detector with the log Mel spectrum as input. However, different types of machines have different vibration frequency regions depend...

Full description

Saved in:
Bibliographic Details
Published in:2023 31st European Signal Processing Conference (EUSIPCO) pp. 201 - 205
Main Authors: Li, Kai, Tran, Dung Kim, Lu, Xugang, Akagi, Masato, Unoki, Masashi
Format: Conference Proceeding
Language:English
Published: EURASIP 04.09.2023
Subjects:
ISSN:2076-1465
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Anomalous sound detection (ASD) aims to detect unknown anomalous sounds emitted from a target machine. Most advanced ASD systems use a complicated neural-network- based detector with the log Mel spectrum as input. However, different types of machines have different vibration frequency regions depending on their physical property. The Mel filterbank (FB), which has high resolution in low-frequency regions and low resolution in high frequency, may filter out discriminative information from some important frequency regions, particularly the high-frequency regions. We propose to quantify the frequency importance in ASD of seven types of machines using the Fisher's ratio (F-ratio). The quantified frequency importance is then used to design an ensemble of machine-wise non-uniform FBs and extract the log non-uniform spectrum (LNS). This LNS feature is input to an autoencoder NN-based detector for anomalous sound detection. Experimental results in the DCASE2022 Challenge Task 2 verify the correctness of the quantification results and the effectiveness of the proposed LNS. With a simple autoencoder-based detector, the performance in the averaged harmonic mean of the area under the ROC curve achieved a relative improvement of 9.22 and 5.60% in development and evaluation datasets, respectively.
ISSN:2076-1465
DOI:10.23919/EUSIPCO58844.2023.10289922