Medoid based semi-supervised fuzzy clustering algorithms for multi-view relational data

In this paper we present two novel families of semi-supervised multi-view clustering algorithms for relational data, providing relevance weights for views and cluster representatives. The families differ by their cluster representative approach: set-medoids and vector set-medoids. Each family posses...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Fuzzy sets and systems Ročník 469; s. 108630
Hlavní autoři:	P. Branco, Diogo P., T. de Carvalho, Francisco de A.
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	Elsevier B.V 15.10.2023
Témata:	Multi-view data Pairwise constraints Relational data Semi-supervised fuzzy clustering Set-medoids Weighted views Pairwise constraints Relational data Set-medoids Weighted views Multi-view data Semi-supervised fuzzy clustering
ISSN:	0165-0114
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	In this paper we present two novel families of semi-supervised multi-view clustering algorithms for relational data, providing relevance weights for views and cluster representatives. The families differ by their cluster representative approach: set-medoids and vector set-medoids. Each family possesses four models, referred to as variants, differing by their weighting approach and the constraints imposed on those weights. Semi-supervision is attained through the use of must-link and cannot-link pairwise constraints. The algorithms perform the clustering task through the minimisation of an objective function; such minimisation is achieved through the iteration of three steps until convergence, namely: search for the best representatives, computation of the best relevance weights, and computation of the best fuzzy partition. Experiments performed on several real datasets show that those algorithms have greater robustness than other semi-supervised multi-view fuzzy clustering algorithms when handling the pairwise constraints and provide better interpretability of the results. Moreover, those experiments reveal that one family may perform better than the other depending on the data being considered; this can also be observed for the variants within each family. Furthermore, the proposed algorithms were tested in a real multi-view dataset where a synthetic noise view was added; it was observed that those algorithms managed to maintain a good level of quality in the resulting partition as well as discriminating the noise view by assigning it a lower relevance weight.
ISSN:	0165-0114
DOI:	10.1016/j.fss.2023.108630