Density peaks clustering based on Gaussian fuzzy neighborhood with noise parameter

Bibliographic details
Published in: Expert systems with applications, vol. 255, p. 124782
Main authors: Waqas, Syed Muhammad; Khan, Sumra; Talpur, Kashif; Khan, Rizwan Ahmed
Format: Journal Article
Language: English
Published: Elsevier Ltd, 1 December 2024
ISSN:0957-4174
Description
Summary: Density peak clustering (DPC) is an effective clustering method known for its robustness, non-iterative nature, and hybrid approach. However, it is not without limitations: (a) the determination of the cutoff distance (dc) relies on human experience, which can significantly impact the clustering outcome; (b) DPC does not take into account the local structure of the data when computing local densities; (c) it employs a crisp kernel for density computation; (d) the performance of DPC is affected by chain reactions; and (e) DPC often struggles to handle noisy data. In order to address these limitations, this paper proposes a novel approach called DPC based on Gaussian fuzzy neighborhood with noise parameter (DPC-GFNN). The proposed method leverages a Gaussian fuzzy kernel to enhance the separation between clusters and mitigates the influence of outliers through an adjustable noise parameter (λ). DPC-GFNN utilizes a k-nearest neighbor graph based on density to label highly dense regions. This technique effectively avoids chain reactions by assigning accurate labels to points in border areas, enabling proper clustering of data with diverse shapes and densities. To evaluate the effectiveness of DPC-GFNN, a series of experiments is conducted on both real-world and synthetic datasets. The experimental results demonstrate that DPC-GFNN exhibits superior robustness and clustering accuracy compared to other modified variants of DPC including DPC based on k-nearest neighbors (DPC-KNN), improved DPC (IDPC), DPC based on density backbone and fuzzy neighborhood (DPC-DBFN), and DPC based on fuzzy weighted k-nearest neighbors (FKNN).
• Proposing an efficient DPC-GFNN that enhances clustering robustness.
• Gaussian fuzzy kernel improves cluster separation.
• Adjustable noise parameter mitigates outlier impact.
• Density-based k-NN graph prevents chain reactions.
• Comprehensive evaluation proves efficacy in comparison with other recent works.
DOI:10.1016/j.eswa.2024.124782
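
Illustrative note: the summary contrasts the crisp (cutoff) kernel of baseline DPC with a Gaussian kernel. The sketch below shows only the baseline DPC procedure (local density, distance to the nearest denser point, label propagation) with both kernel choices, so the difference in the density step is visible. It is not DPC-GFNN: the fuzzy neighborhood, the noise parameter λ and the density-based k-nearest neighbor labeling are not specified in this record and are not reproduced here. All function names, and the choice to pick centres by the product of density and distance, are assumptions made for illustration.

```python
import math


def pairwise_distances(points):
    """Full Euclidean distance matrix as a list of lists."""
    n = len(points)
    return [[math.dist(points[i], points[j]) for j in range(n)] for i in range(n)]


def local_density(dist, dc, kernel="gaussian"):
    """Local density rho_i for every point.

    cutoff:   number of other points closer than dc (crisp kernel)
    gaussian: sum over other points of exp(-(d_ij / dc)^2)
    """
    rho = []
    for i, row in enumerate(dist):
        others = [d for j, d in enumerate(row) if j != i]
        if kernel == "cutoff":
            rho.append(float(sum(1 for d in others if d < dc)))
        else:
            rho.append(sum(math.exp(-((d / dc) ** 2)) for d in others))
    return rho


def delta_and_parent(dist, rho):
    """delta_i is the distance to the nearest point that ranks denser.

    Ties in rho are broken by sort order. The densest point has no denser
    neighbour, so its delta is set to the largest distance in the matrix.
    """
    n = len(rho)
    order = sorted(range(n), key=lambda i: -rho[i])  # densest first
    delta = [0.0] * n
    parent = [-1] * n
    delta[order[0]] = max(max(row) for row in dist)
    for rank in range(1, n):
        i = order[rank]
        j = min(order[:rank], key=lambda k: dist[i][k])
        delta[i] = dist[i][j]
        parent[i] = j
    return delta, parent, order


def dpc(points, dc, n_clusters, kernel="gaussian"):
    """Baseline density peak clustering; returns one label per point."""
    dist = pairwise_distances(points)
    rho = local_density(dist, dc, kernel)
    delta, parent, order = delta_and_parent(dist, rho)

    # Assumption: the densest point is always a centre; the remaining
    # centres are the points with the largest rho * delta.
    rest = sorted(order[1:], key=lambda i: -rho[i] * delta[i])
    centres = [order[0]] + rest[: n_clusters - 1]

    labels = [-1] * len(points)
    for c, i in enumerate(centres):
        labels[i] = c
    # Visit points from densest to sparsest so each parent is labelled first.
    for i in order:
        if labels[i] == -1:
            labels[i] = labels[parent[i]]
    return labels


if __name__ == "__main__":
    blobs = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5),
             (10, 10), (10, 11), (11, 10), (11, 11), (10.5, 10.5)]
    print(dpc(blobs, dc=1.0, n_clusters=2, kernel="gaussian"))
    print(dpc(blobs, dc=1.0, n_clusters=2, kernel="cutoff"))
```

On the two well-separated groups in the usage example, both kernels place the first five points in one cluster and the last five in the other. The label-propagation loop, in which each point inherits the label of its nearest denser point, is the step where one wrong assignment can spread to every point that depends on it, which is the chain reaction the summary refers to.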