Data Transfer Schemes in the High-Level Communication Library LAIK

With the trend towards more heterogeneity, programming applications on modern HPC systems is getting more complex. Programming models with uniform data transfer concepts across physical memory spaces, be it attached to host CPUs, accelerators, or across nodes, can help significantly. Task-based mode...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:2025 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) s. 187 - 196
Hlavní autoři: Weidendorfer, Josef, Neef, Lukas, Hubinger, Robert, Raoofy, Amir
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 03.06.2025
Témata:
ISSN:2995-066X
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:With the trend towards more heterogeneity, programming applications on modern HPC systems is getting more complex. Programming models with uniform data transfer concepts across physical memory spaces, be it attached to host CPUs, accelerators, or across nodes, can help significantly. Task-based models offer this flexibility, but need a complete rewrite of typical hybrid MPI and OpenMP code. LAIK is a high-level communication library that represents communications as explicit cache coherence enforcement on top of logical global address spaces, ensuring affinity of data and computation for high performance. LAIK enables code porting in small incremental steps since it keeps the same procedural SPMD style as MPI code and, hence, is usable together with MPI, cooperatively. In this paper, we introduce various data transfer schemes implemented as communication backends in LAIK. We mainly focus on communication backends that leverage shared memory transfer schemes via System-V IPC to take advantage of hardware cache coherence. We evaluate the performance of these schemes against the MPI backend in the implementation of LAIK. Finally, we show that a 0-copy scheme, implemented as a backend in LAIK, can provide similar performance to OpenMP while using the same LAIK programming model that works for node- and cluster-level communication.
ISSN:2995-066X
DOI:10.1109/IPDPSW66978.2025.00035