Checkpointing vs. Supervision Resilience Approaches for Dynamic Independent Tasks

With the advent of exascale computing, issues such as application irregularity and permanent hardware failure are growing in importance. Irregularity is often addressed by task-based parallel programming coupled with work stealing. At the task level, resilience can be provided by two principal appro...

Bibliographic details
Published in: 2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp. 556-565
Main authors: Posner, Jonas; Reitz, Mia; Fohry, Claudia
Format: Conference proceeding
Language: English
Published: IEEE, 1 June 2021
Online access: Full text