Data-driven Non-uniform Filterbanks Based on F-ratio for Machine Anomalous Sound Detection

Anomalous sound detection (ASD) aims to detect unknown anomalous sounds emitted from a target machine. Most advanced ASD systems use a complicated neural-network- based detector with the log Mel spectrum as input. However, different types of machines have different vibration frequency regions depend...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:2023 31st European Signal Processing Conference (EUSIPCO) s. 201 - 205
Hlavní autoři: Li, Kai, Tran, Dung Kim, Lu, Xugang, Akagi, Masato, Unoki, Masashi
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: EURASIP 04.09.2023
Témata:
ISSN:2076-1465
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Anomalous sound detection (ASD) aims to detect unknown anomalous sounds emitted from a target machine. Most advanced ASD systems use a complicated neural-network- based detector with the log Mel spectrum as input. However, different types of machines have different vibration frequency regions depending on their physical property. The Mel filterbank (FB), which has high resolution in low-frequency regions and low resolution in high frequency, may filter out discriminative information from some important frequency regions, particularly the high-frequency regions. We propose to quantify the frequency importance in ASD of seven types of machines using the Fisher's ratio (F-ratio). The quantified frequency importance is then used to design an ensemble of machine-wise non-uniform FBs and extract the log non-uniform spectrum (LNS). This LNS feature is input to an autoencoder NN-based detector for anomalous sound detection. Experimental results in the DCASE2022 Challenge Task 2 verify the correctness of the quantification results and the effectiveness of the proposed LNS. With a simple autoencoder-based detector, the performance in the averaged harmonic mean of the area under the ROC curve achieved a relative improvement of 9.22 and 5.60% in development and evaluation datasets, respectively.
ISSN:2076-1465
DOI:10.23919/EUSIPCO58844.2023.10289922