AdapPF: Self-Adaptive Scrape Interval for Monitoring in Geo-Distributed Cluster Federations

Uloženo v:
Podrobná bibliografie
Název: AdapPF: Self-Adaptive Scrape Interval for Monitoring in Geo-Distributed Cluster Federations
Autoři: Huang, Chih-Kai, Pierre, Guillaume
Přispěvatelé: Design and Implementation of Autonomous Distributed Systems (MYRIADS), Inria Rennes – Bretagne Atlantique, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-SYSTÈMES LARGE ÉCHELLE (IRISA-D1), Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA), Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT)-Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom Paris (IMT)-Institut Mines-Télécom Paris (IMT), IEEE, Grid5000
Zdroj: ISCC 2023 - 28th IEEE Symposium on Computers and Communications ; https://inria.hal.science/hal-04103309 ; ISCC 2023 - 28th IEEE Symposium on Computers and Communications, IEEE, Jul 2023, Tunis, Tunisia. pp.1-7, ⟨10.1109/ISCC58397.2023.10218080⟩ ; https://2023.ieee-iscc.org/
Informace o vydavateli: HAL CCSD
IEEE
Rok vydání: 2023
Sbírka: Université de Rennes 1: Publications scientifiques (HAL)
Témata: Monitoring, Geo-distributed cluster federations, Prometheus, Self-adaptive, Fog computing, [INFO]Computer Science [cs]
Geografické téma: Tunis, Tunisia
Popis: International audience ; Monitoring plays a vital role in geo-distributed cluster federation environments to accurately schedule applications across geographically dispersed computing resources. However, using a fixed frequency for collecting monitoring data from clusters may waste network bandwidth and is not necessary for ensuring accurate scheduling. In this paper, we propose Adaptive Prometheus Federation (AdapPF), an extension of the widelyused open-source monitoring tool, Prometheus, and its feature, Prometheus Federation. AdapPF aims to dynamically adjust the collection frequency of monitoring data for each cluster in geodistributed cluster federations. Based on actual deployment in the geo-distributed Grid'5000 testbed, our evaluations demonstrate that AdapPF can achieve comparable results to Prometheus Federation with 5-seconds scrape interval while reducing crosscluster network traffic by 36%.
Druh dokumentu: conference object
Jazyk: English
DOI: 10.1109/ISCC58397.2023.10218080
Dostupnost: https://inria.hal.science/hal-04103309
https://inria.hal.science/hal-04103309v1/document
https://inria.hal.science/hal-04103309v1/file/ISCC2023_CR.pdf
https://doi.org/10.1109/ISCC58397.2023.10218080
Rights: http://creativecommons.org/licenses/by/ ; info:eu-repo/semantics/OpenAccess
Přístupové číslo: edsbas.180BD570
Databáze: BASE
Popis
Abstrakt:International audience ; Monitoring plays a vital role in geo-distributed cluster federation environments to accurately schedule applications across geographically dispersed computing resources. However, using a fixed frequency for collecting monitoring data from clusters may waste network bandwidth and is not necessary for ensuring accurate scheduling. In this paper, we propose Adaptive Prometheus Federation (AdapPF), an extension of the widelyused open-source monitoring tool, Prometheus, and its feature, Prometheus Federation. AdapPF aims to dynamically adjust the collection frequency of monitoring data for each cluster in geodistributed cluster federations. Based on actual deployment in the geo-distributed Grid'5000 testbed, our evaluations demonstrate that AdapPF can achieve comparable results to Prometheus Federation with 5-seconds scrape interval while reducing crosscluster network traffic by 36%.
DOI:10.1109/ISCC58397.2023.10218080