PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems

Uloženo v:
Podrobná bibliografie
Název: PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems
Autoři: Xu Fu
Informace o vydavateli: Zenodo
Rok vydání: 2023
Sbírka: Zenodo
Témata: Sparse LU, Regular 2D block, Distributed platforms, Heterogeneous systems
Popis: Sparse direct solvers play a vital role in large-scale high performance scientific and engineering computations. Existing distributed sparse direct methods employ the multifrontal/supernodal patterns to aggregate columns of near-identical forms and to exploit dense basic linear algebra subprograms (BLAS) for computation. However, such data layout may bring more unevenness when the structure of the input matrix is not ideal, and using dense BLAS may waste many floating point operations on zero fill-ins, accordingly. We in this paper propose a new sparse direct solver called PanguLU. Unlike the multifrontal/supernodal layout, our work relies on simpler regular 2D blocking and stores the blocks in their sparse forms to avoid any extra fill-ins. According to the sparse patterns of the blocks, a variety of block-wise sparse BLAS methods are developed and selected for higher efficiency on local GPUs. To make PanguLU more scalable, we also adjust the mapping of blocks to processes for overall more balanced workload, and propose a synchronisation-free communication strategy considering the dependencies among different sub-tasks to reduce overall latency overhead.
Druh dokumentu: conference object
Jazyk: unknown
Relation: https://zenodo.org/records/8203219; oai:zenodo.org:8203219; https://doi.org/10.5281/zenodo.8203219
DOI: 10.5281/zenodo.8203219
Dostupnost: https://doi.org/10.5281/zenodo.8203219
https://zenodo.org/records/8203219
Rights: Creative Commons Attribution 4.0 International ; cc-by-4.0 ; https://creativecommons.org/licenses/by/4.0/legalcode
Přístupové číslo: edsbas.7F651D1C
Databáze: BASE
Popis
Abstrakt:Sparse direct solvers play a vital role in large-scale high performance scientific and engineering computations. Existing distributed sparse direct methods employ the multifrontal/supernodal patterns to aggregate columns of near-identical forms and to exploit dense basic linear algebra subprograms (BLAS) for computation. However, such data layout may bring more unevenness when the structure of the input matrix is not ideal, and using dense BLAS may waste many floating point operations on zero fill-ins, accordingly. We in this paper propose a new sparse direct solver called PanguLU. Unlike the multifrontal/supernodal layout, our work relies on simpler regular 2D blocking and stores the blocks in their sparse forms to avoid any extra fill-ins. According to the sparse patterns of the blocks, a variety of block-wise sparse BLAS methods are developed and selected for higher efficiency on local GPUs. To make PanguLU more scalable, we also adjust the mapping of blocks to processes for overall more balanced workload, and propose a synchronisation-free communication strategy considering the dependencies among different sub-tasks to reduce overall latency overhead.
DOI:10.5281/zenodo.8203219