A two-phase streaming edge partitioning algorithm for large-scale uncertain graphs
Uncertain graphs, which model uncertainty in edges, are used in various domains such as social networks, biological networks, and recommendation systems. As the scale of these graphs continues to grow, efficient distributed analysis becomes crucial, hinging on effective graph partitioning. However,...
Uloženo v:
| Vydáno v: | Cluster computing Ročník 28; číslo 11; s. 746 |
|---|---|
| Hlavní autoři: | , , , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
New York
Springer US
01.10.2025
Springer Nature B.V |
| Témata: | |
| ISSN: | 1386-7857, 1573-7543 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | Uncertain graphs, which model uncertainty in edges, are used in various domains such as social networks, biological networks, and recommendation systems. As the scale of these graphs continues to grow, efficient distributed analysis becomes crucial, hinging on effective graph partitioning. However, existing partitioning algorithms are primarily designed for deterministic graphs, which do not account for edge uncertainties. When applied to uncertain graphs, these methods often result in too many replicated vertices and imbalanced partition loads. This paper presents a streaming edge partitioning algorithm TPEP (Two-Phase Edge Partitioning) to partition the large-scale uncertain graphs. Firstly, TPEP formulates the partitioning problem as an optimization problem of minimizing the number of replicated vertices, using the expected number of edges to evaluate partition load. Then, TPEP uses a score function to guide the streaming partitioning process, where the score function consists of the ratio of replicated vertices, load balance, and the number of vertices within the same cluster. To compare the degrees of two vertices, the concept of degree advantage is introduced. Experimental results indicate that TPEP outperforms existing algorithms by achieving an average reduction of 34.3% in replicated vertices, an average reduction of 33.1% in load balance, and the optimal edge probability distribution. |
|---|---|
| Bibliografie: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 1386-7857 1573-7543 |
| DOI: | 10.1007/s10586-025-05338-5 |