Accelerating EM Missing Data Filling Algorithm Based on the K-Means

In the whole process of data mining, the EM algorithm is widely applied to dealing with incomplete data for its numerical stability, simplicity of implementation, reliable global convergence. the main disadvantage of the EM is slow convergence speed, the algorithm is highly dependent on the initial...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:2018 4th Annual International Conference on Network and Information Systems for Computers (ICNISC) S. 401 - 406
Hauptverfasser: Hua-Yan, SUN, Ye-Li, Li, Yun-Fei, Zi, Xu, Han
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: IEEE 01.04.2018
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In the whole process of data mining, the EM algorithm is widely applied to dealing with incomplete data for its numerical stability, simplicity of implementation, reliable global convergence. the main disadvantage of the EM is slow convergence speed, the algorithm is highly dependent on the initial value of the option, In this paper, the clustering results use K-means algorithm as the initial scope of EM algorithm, according to the different choice of different characteristics of mining purposes, then use incremental EM algorithm (IEM) step by step EM iterative refinement repeatedly, it obtains the optimal value of filling missing data quickly and efficiently. it is concluded that the optimal value of filling missing data experimental results show that the algorithm of this paper to speed up the convergence rate, strengthened the stability of clustering, data filling effect is remarkable. Keywords-recommendation systems; collaborative filtering; fuzzy equivalence; cause-effect clustering; threshold value; weighting
DOI:10.1109/ICNISC.2018.00088