Performance Modelling and Cost Effective Execution for Distributed Graph Processing on Configurable VMs

Graph Processing has been widely used to capture complex data dependency and uncover relationship insights. Due to the ever-growing graph scale and algorithm complexity, distributed graph processing has become more and more popular. In this paper, we investigate how to balance performance and cost f...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) s. 74 - 83
Hlavní autoři: Li, Zengxiang, Zhang, Bowen, Ren, Shen, Liu, Yong, Qin, Zheng, Goh, Rick Siow Mong, Gurusamy, Mohan
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: Piscataway, NJ, USA IEEE Press 14.05.2017
IEEE
Edice:ACM Conferences
Témata:
ISBN:9781509066100, 1509066101
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Graph Processing has been widely used to capture complex data dependency and uncover relationship insights. Due to the ever-growing graph scale and algorithm complexity, distributed graph processing has become more and more popular. In this paper, we investigate how to balance performance and cost for large scale graph processing on configurable virtual machines (VMs). We analyze the system architecture and implementation details of a Pregel-like distributed graph processing framework and develop a system-aware model to predict the execution time. Consequently, cost effective execution scenarios are recommended by selecting a certain number of VMs with specified capability subject to the predefined resource price and user preference. Experiments using synthetic and real world graphs have verified that system-aware model can achieve much higher prediction accuracy than popular machine-learning models which treat graph processing framework as a black box. As a result, the recommended execution scenarios have comparable cost efficiency to the optimal scenarios.
ISBN:9781509066100
1509066101
DOI:10.1109/CCGRID.2017.85