Distributed data clustering over networks
•Clustering on multiple sources of distributed data.•Non-convex optimization in multi-agent networks with time-varying connectivity;•Leveraging dynamic consensus for diffuse information in the network.•Consensus-based Expectation-Maximization applied to Gaussian mixture models. [Display omitted] In...
Uloženo v:
| Vydáno v: | Pattern recognition Ročník 93; s. 603 - 620 |
|---|---|
| Hlavní autoři: | , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Elsevier Ltd
01.09.2019
|
| Témata: | |
| ISSN: | 0031-3203, 1873-5142 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | •Clustering on multiple sources of distributed data.•Non-convex optimization in multi-agent networks with time-varying connectivity;•Leveraging dynamic consensus for diffuse information in the network.•Consensus-based Expectation-Maximization applied to Gaussian mixture models.
[Display omitted]
In this paper, we consider the problem of distributed unsupervised clustering, where training data is partitioned over a set of agents, whose interaction happens over a sparse, but connected, communication network. To solve this problem, we recast the well known Expectation Maximization method in a distributed setting, exploiting a recently proposed algorithmic framework for in-network non-convex optimization. The resulting algorithm, termed as Expectation Maximization Consensus, exploits successive local convexifications to split the computation among agents, while hinging on dynamic consensus to diffuse information over the network in real-time. Convergence to local solutions of the distributed clustering problem is then established. Experimental results on well-known datasets illustrate that the proposed method performs better than other distributed Expectation-Maximization clustering approaches, while the method is faster than a centralized Expectation-Maximization procedure and achieves a comparable performance in terms of cluster validity indexes. The latter ones achieve good values in absolute range scales and prove the quality of the obtained clustering results, which compare favorably with other methods in the literature. |
|---|---|
| ISSN: | 0031-3203 1873-5142 |
| DOI: | 10.1016/j.patcog.2019.04.021 |