A half-split grid clustering algorithm by simulating cell division
| Published in: | 2014 International Joint Conference on Neural Networks (IJCNN) pp. 2183 - 2189 |
|---|---|
| Main Authors: | , |
| Format: | Conference Proceeding |
| Language: | English, Japanese |
| Published: | IEEE, 01.07.2014 |
| Subjects: | |
| ISSN: | 2161-4393 |
| Summary: | Clustering, an important data mining technique, has two main families of methods: data-based similarity clustering and space-based density grid clustering. The latter has an advantage over the former on large datasets that contain clusters of multiple shapes and densities. However, because existing grid-based methods partition the space globally, they perform worse when cluster densities differ greatly. In this paper, we propose a novel algorithm that produces an appropriate grid spacing in regions of different density by simulating the cell division process. The time complexity of the algorithm is O(n), where n is the number of points in the dataset. The proposed algorithm is applied to the popular Chameleon datasets and to our synthetic datasets with large density differences. The results show that our algorithm is effective in any multi-density situation and is scalable to space optimization problems. |
|---|---|
| DOI: | 10.1109/IJCNN.2014.6889720 |
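
The summary describes a grid that adapts to local density by repeatedly halving cells, but this record does not give the split rule. The following is therefore only a minimal sketch of the general idea, not the authors' algorithm: it assumes 2-D points, alternates the split axis, and halves a cell whenever it holds more than `max_points` points. The names `half_split`, `cell_area`, `max_points`, and `max_depth` are hypothetical, and the clustering step that would merge neighbouring dense cells is not shown.

```python
def half_split(points, bounds, max_points=4, max_depth=8, depth=0, axis=0):
    """Recursively halve a 2-D cell until it holds at most max_points points.

    Assumed rule (not taken from the paper): split at the cell midpoint,
    alternating between the x axis and the y axis.
    Returns a list of (bounds, points) leaf cells, where
    bounds = (xmin, ymin, xmax, ymax).
    """
    if len(points) <= max_points or depth >= max_depth:
        return [(bounds, points)]

    xmin, ymin, xmax, ymax = bounds
    if axis == 0:
        mid = (xmin + xmax) / 2
        low_bounds, high_bounds = (xmin, ymin, mid, ymax), (mid, ymin, xmax, ymax)
    else:
        mid = (ymin + ymax) / 2
        low_bounds, high_bounds = (xmin, ymin, xmax, mid), (xmin, mid, xmax, ymax)

    # Each point goes to exactly one daughter cell.
    low = [p for p in points if p[axis] < mid]
    high = [p for p in points if p[axis] >= mid]

    next_axis = 1 - axis
    return (half_split(low, low_bounds, max_points, max_depth, depth + 1, next_axis)
            + half_split(high, high_bounds, max_points, max_depth, depth + 1, next_axis))


def cell_area(bounds):
    """Area of a cell given as (xmin, ymin, xmax, ymax)."""
    xmin, ymin, xmax, ymax = bounds
    return (xmax - xmin) * (ymax - ymin)


# Usage: one dense cluster near the origin plus a few sparse points.
coords = [0.1, 0.3, 0.6, 0.8]
dense = [(x, y) for x in coords for y in coords]
sparse = [(6.0, 6.0), (7.0, 2.0), (2.0, 7.0)]
cells = half_split(dense + sparse, (0.0, 0.0, 8.0, 8.0))
for bounds, pts in cells:
    if pts:
        print(bounds, len(pts), cell_area(bounds))
```

Under these assumptions, cells covering the dense cluster end up much smaller than cells covering the sparse points, which is the behaviour a single global grid cannot provide. Note that this naive version rescans the points at every level, so it is not a demonstration of the O(n) bound stated in the summary.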