A novel density peaks clustering algorithm based on k nearest neighbors for improving assignment process

Bibliographic Details
Published in:Physica A Vol. 523; pp. 702 - 713
Main Authors: Jiang, Jianhua; Chen, Yujun; Meng, Xianqiu; Wang, Limin; Li, Keqin
Format: Journal Article
Language:English
Published: Elsevier B.V. 01.06.2019
ISSN:0378-4371, 1873-2119
Description
Summary: The Density Peaks Clustering (DPC) algorithm is a density-based clustering approach that can quickly find density peaks. However, DPC has a deficiency in its assignment process, which is likely to trigger a domino effect. In particular, it cannot process some non-spherical data sets such as Spiral. The research results indicate that the assignment process appears to be the most significant step in determining clustering performance. Therefore, we propose density peaks clustering based on k nearest neighbors (DPC-KNN), which aims to overcome this weakness of DPC. The proposed DPC-KNN integrates the idea of k nearest neighbors into the distance computation and the assignment process. Experimental results show that the DPC-KNN algorithm is more feasible and effective than K-means, DBSCAN and DPC.
• K nearest neighbors is adopted to solve the domino effect problem in density peaks clustering.
• The capability of aggregating some non-spherical clusters is enhanced effectively.
• Experimental results show that the DPC-KNN algorithm is more effective.
DOI:10.1016/j.physa.2019.03.012
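
Illustration: the summary attributes the domino effect to the assignment step of DPC. The record does not give the details of DPC-KNN, so the following is only a minimal sketch of the baseline DPC procedure as it is commonly described (local density, distance to the nearest denser point, then label propagation), not the authors' proposed method. The function name `dpc_baseline`, the Gaussian density kernel, and the parameters `d_c` and `n_clusters` are assumptions made for illustration.

```python
import numpy as np


def dpc_baseline(points, d_c, n_clusters):
    """Sketch of baseline DPC (NOT DPC-KNN); names and kernel are assumptions."""
    points = np.asarray(points, dtype=float)
    n = len(points)

    # Pairwise Euclidean distances between all points.
    dist = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)

    # Local density with a Gaussian kernel; subtract the self term exp(0) = 1.
    rho = np.exp(-(dist / d_c) ** 2).sum(axis=1) - 1.0

    # Point indices sorted by decreasing density.
    order = np.argsort(-rho, kind="stable")

    delta = np.zeros(n)        # distance to the nearest denser point
    parent = np.full(n, -1)    # index of that nearest denser point
    for rank, i in enumerate(order):
        if rank == 0:
            # The densest point has no denser neighbor.
            delta[i] = dist[i].max()
            continue
        denser = order[:rank]
        j = denser[np.argmin(dist[i, denser])]
        delta[i] = dist[i, j]
        parent[i] = j

    # Centers: largest rho * delta; the densest point is always a center.
    gamma = rho * delta
    gamma[order[0]] = np.inf
    centers = np.argsort(-gamma, kind="stable")[:n_clusters]

    labels = np.full(n, -1)
    labels[centers] = np.arange(len(centers))

    # Assignment step: each remaining point copies the label of its nearest
    # denser point. A single wrong label therefore propagates to every point
    # that follows it in the chain, which is the domino effect.
    for i in order:
        if labels[i] == -1:
            labels[i] = labels[parent[i]]
    return labels, rho, delta
```

Because each label is inherited through a single parent link, one poor link in a dense region can mislabel a whole branch. Per the summary, DPC-KNN brings k nearest neighbors into the distance computation and assignment to counter this; that modification is not reproduced here.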