Heterogeneous Data Clustering Considering Multiple User-provided Constraints

Clustering on heterogeneous networks which consist of multi-typed objects and links has proved to be a useful technique in many scenarios. Although numerous clustering methods have achieved remarkable success, current clustering methods for heterogeneous networks tend to consider only internal infor...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:International journal of computers, communications & control Ročník 14; číslo 2; s. 170 - 182
Hlavní autor: Huang, Yue
Médium: Journal Article
Jazyk:angličtina
Vydáno: Oradea Agora University of Oradea 01.04.2019
Témata:
ISSN:1841-9836, 1841-9844
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Clustering on heterogeneous networks which consist of multi-typed objects and links has proved to be a useful technique in many scenarios. Although numerous clustering methods have achieved remarkable success, current clustering methods for heterogeneous networks tend to consider only internal information of the dataset. In order to utilize background domain knowledge, we propose a general framework for clustering heterogeneous data considering multiple user-provided constrains. Specifically, we summarize that three types of manual constraints on the object can be used to guide the clustering process. Then we propose the User- HeteClus algorithm to solve the key issues in the case of star-structure heterogeneous data, which incorporating the user constraint into similarity measurement between central objects. Experiments on a real-world dataset show the effectiveness of the proposed algorithm.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1841-9836
1841-9844
DOI:10.15837/ijccc.2019.2.3419