Performance-enhanced rough k-means clustering algorithm

Customer segmentation (CS) is the most critical application in the field of customer relationship management that primarily depends on clustering algorithms. Rough k -means (R K M) clustering algorithm is widely adopted in the literature for achieving CS objective. However, the R K M has certain lim...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Soft computing (Berlin, Germany) Ročník 25; číslo 2; s. 1595 - 1616
Hlavní autoři: Sivaguru, M., Punniyamoorthy, M.
Médium: Journal Article
Jazyk:angličtina
Vydáno: Berlin/Heidelberg Springer Berlin Heidelberg 01.01.2021
Témata:
ISSN:1432-7643, 1433-7479
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Customer segmentation (CS) is the most critical application in the field of customer relationship management that primarily depends on clustering algorithms. Rough k -means (R K M) clustering algorithm is widely adopted in the literature for achieving CS objective. However, the R K M has certain limitations that prevent its successful application to CS. First, it is sensitive to random initial cluster centers. Second, it uses default values for parameters w l and w u used in calculating cluster centers. To address these limitations, a new initialization method is proposed in this study. The proposed initialization mitigates the problems associated with the random choice of initial cluster centers to achieve stable clustering results. A weight optimization scheme for w l and w u is proposed in this study. This scheme helps to estimate suitable weights for w l and w u by counting the number of data points present in clusters. Extensive experiments were carried out by using several benchmark datasets to assess the performance of these proposed methods in comparison with the existing algorithm. The results reveal that the proposed methods have improved the performance of the R K M algorithm, which is validated by the evaluation metrics, namely convergence speed, clustering accuracy, Davies–Bouldin (DB) index, within/total ( W / T ) clustering error index and statistical significance t test. Further, the results are compared with other promising clustering algorithms to show its advantage. A CS framework that shows the utility of these proposed methods in the application domain is also proposed. Finally, it is demonstrated through a case study in a retail supermarket.
ISSN:1432-7643
1433-7479
DOI:10.1007/s00500-020-05247-2