A half-split grid clustering algorithm by simulating cell division

Bibliographic Details
Published in: 2014 International Joint Conference on Neural Networks (IJCNN), pp. 2183-2189
Main Authors: Wenxiang Dou, Jinglu Hu
Format: Conference Proceeding
Language: English, Japanese
Published: IEEE 01.07.2014
ISSN: 2161-4393
Description
Summary: Clustering, an important data mining technique, has two main approaches: data-based similarity clustering and space-based density grid clustering. The latter has an advantage over the former on larger datasets and on datasets with multiple shapes and densities. However, because existing grid-based methods partition the space globally, they perform worse when the densities of the clusters differ greatly. In this paper, we propose a novel algorithm that produces an appropriate grid space in regions of different density by simulating the cell division process. The time complexity of the algorithm is O(n), where n is the number of points in the dataset. The proposed algorithm is applied to the popular Chameleon datasets and to our synthetic datasets with large density differences. The results show that our algorithm is effective in any multi-density situation and has scalability for space optimization problems.
DOI: 10.1109/IJCNN.2014.6889720
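
Illustrative note: the summary above does not spell out the splitting rule, so the following is only a rough sketch of the general idea of density-adaptive half-splitting, not the authors' algorithm. It assumes 2-D points and a hypothetical rule in which a grid cell is cut in half along its longer side whenever it holds more than `max_points` points. The names `half_split`, `max_points`, and `max_depth` are invented for this example, and the sketch makes no claim about matching the O(n) bound stated in the summary.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Cell = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def half_split(points: List[Point], cell: Cell,
               max_points: int = 4, max_depth: int = 16
               ) -> List[Tuple[Cell, List[Point]]]:
    """Recursively halve a cell until it is sparse enough (hypothetical rule).

    Returns the leaf cells together with the points that fall inside each.
    """
    x0, y0, x1, y1 = cell
    # Stop when the cell is sparse enough or the depth budget is used up.
    if len(points) <= max_points or max_depth == 0:
        return [(cell, points)]

    if (x1 - x0) >= (y1 - y0):
        # Split along x: the cell is at least as wide as it is tall.
        mid = (x0 + x1) / 2.0
        first = [p for p in points if p[0] < mid]
        second = [p for p in points if p[0] >= mid]
        halves = [((x0, y0, mid, y1), first), ((mid, y0, x1, y1), second)]
    else:
        # Split along y.
        mid = (y0 + y1) / 2.0
        first = [p for p in points if p[1] < mid]
        second = [p for p in points if p[1] >= mid]
        halves = [((x0, y0, x1, mid), first), ((x0, mid, x1, y1), second)]

    leaves: List[Tuple[Cell, List[Point]]] = []
    for sub_cell, sub_points in halves:
        leaves.extend(half_split(sub_points, sub_cell, max_points, max_depth - 1))
    return leaves


# A dense 5x5 cluster near the origin plus three scattered points.
dense = [(0.01 * i, 0.01 * j) for i in range(5) for j in range(5)]
sparse = [(0.9, 0.9), (0.6, 0.3), (0.3, 0.8)]
leaves = half_split(dense + sparse, (0.0, 0.0, 1.0, 1.0))
```

In this toy run the cells covering the dense cluster end up much smaller than the cell holding the scattered points, which is the behavior the summary attributes to a locally adapted grid as opposed to a single global partition.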