FuzzStream: Fuzzy data stream clustering based on the online-offline framework

Systems capable of generating data quickly and continuously, known as data streams, are a reality today and tend to increase. Due to the nature of data streams, unsupervised learning, such as clustering algorithms, is appropriate. In addition, techniques derived from fuzzy set theory can be useful a...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:IEEE International Fuzzy Systems conference proceedings s. 1 - 6
Hlavní autoři: de Abreu Lopes, Priscilla, de Arruda Camargo, Heloisa
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 01.07.2017
Témata:
ISSN:1558-4739
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Systems capable of generating data quickly and continuously, known as data streams, are a reality today and tend to increase. Due to the nature of data streams, unsupervised learning, such as clustering algorithms, is appropriate. In addition, techniques derived from fuzzy set theory can be useful and add flexibility to the learning process. Fuzzy clustering algorithms for data streams found in the literature are based on chunks, which require the definition of several parameters besides presenting the drawback of overly reducing the summarization of data. An approach to Data Stream clustering that overpasses some of the limitations of chunk-based algorithms is the one called Online-Offline Framework. This framework comprises two phases: summarization and clustering. To the best of our knowledge, there is not a fuzzy version of this framework. The objective of this work is to propose a fuzzy version for the Online-Offline Framework, called FuzzStream, whose main component is a summarization structure and its corresponding maintenance algorithm to be used in the online phase. The well known Weighted Fuzzy C-Means clustering algorithm is used in the offline phase. Experiments show that our proposal is a promising approach to deal with data streams and presents benefits with relation to the classic version.
ISSN:1558-4739
DOI:10.1109/FUZZ-IEEE.2017.8015698