Lattice QCD Calculation and Optimization on ARM Processors

Bibliographic details
Published in: Ji suan ji ke xue, Vol. 50, No. 6, pp. 52-57
Main authors: Sun, Wei; Bi, Yujiang; Cheng, Yaodong
Format: Journal Article
Language: Chinese
Published: Chongqing, Guojia Kexue Jishu Bu, 01.06.2023
Editorial office of Computer Science
ISSN: 1002-137X
Description
Summary: Lattice quantum chromodynamics (lattice QCD) is one of the most important applications of large-scale parallel computing in high energy physics. Research in this field usually consumes a large amount of computing resources, and its core is the solution of large-scale sparse linear equations. Based on the domestic Kunpeng 920 ARM processor, this paper studies the hot spot of lattice QCD calculation, the Dslash, which is run on up to 64 nodes (6,144 cores) and shows linear scalability. Based on the roofline performance analysis model, we find that lattice QCD is a typical memory-bound application, and that compressing the 3×3 complex unitary matrices in Dslash based on their symmetry improves the performance of Dslash by 22%. For solving large-scale sparse linear equations, we also explore the usual Krylov subspace iterative algorithms such as BiCGStab and the newly developed state-of-the-art multigrid algorithm on the same ARM processor, and find that in the practical physics calculation the multigri
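The summary mentions compressing the 3×3 complex unitary matrices used in Dslash but does not say which scheme the authors apply. As an illustration only, the sketch below assumes the widely used two-row scheme for SU(3) matrices (unitary with determinant 1): only the first two rows are stored (12 real numbers instead of 18), and the third row is rebuilt as the complex conjugate of their cross product. The function names and the sample matrix are hypothetical and are not taken from the paper.

```python
import cmath
import math


def matmul(x, y):
    """Multiply two 3x3 complex matrices stored as nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def example_su3():
    """Build a sample SU(3) matrix (hypothetical test data) from a
    diagonal phase matrix with unit determinant and two real rotations."""
    alpha, beta, t1, t2 = 0.3, -1.1, 0.7, 0.4
    phases = [[cmath.exp(1j * alpha), 0, 0],
              [0, cmath.exp(1j * beta), 0],
              [0, 0, cmath.exp(-1j * (alpha + beta))]]
    rot12 = [[math.cos(t1), math.sin(t1), 0],
             [-math.sin(t1), math.cos(t1), 0],
             [0, 0, 1]]
    rot23 = [[1, 0, 0],
             [0, math.cos(t2), math.sin(t2)],
             [0, -math.sin(t2), math.cos(t2)]]
    return matmul(matmul(phases, rot12), rot23)


def compress(u):
    """Keep only the first two rows: 2 rows x 3 complex = 12 real numbers."""
    return [list(u[0]), list(u[1])]


def reconstruct(rows):
    """Rebuild row 2 as conj(row0 x row1); valid only for SU(3) matrices."""
    a, b = rows
    c = [
        (a[1] * b[2] - a[2] * b[1]).conjugate(),
        (a[2] * b[0] - a[0] * b[2]).conjugate(),
        (a[0] * b[1] - a[1] * b[0]).conjugate(),
    ]
    return [list(a), list(b), c]


def max_abs_diff(x, y):
    """Largest element-wise absolute difference between two 3x3 matrices."""
    return max(abs(x[i][j] - y[i][j]) for i in range(3) for j in range(3))


u = example_su3()
stored = compress(u)
error = max_abs_diff(u, reconstruct(stored))
reals_full = 3 * 3 * 2                         # 18 real numbers per full matrix
reals_stored = len(stored) * len(stored[0]) * 2  # 12 real numbers per compressed matrix
```

Under this assumption, each matrix read from memory shrinks by one third at the cost of a few extra floating-point operations, which is the kind of trade that tends to pay off in a memory-bound kernel. The 22% figure quoted in the summary is the authors' measurement and is not derived from this sketch.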
DOI: 10.11896/jsjkx.230200159