Lattice QCD Calculation and Optimization on ARM Processors

Lattice quantum chromodynamics(lattice QCD) is one of the most important applications of large-scale parallel computing in high energy physics, researches in this field usually consume a large amount of computing resources, and its core is to solve the large scale sparse linear equations.Based on th...

Full description

Saved in:
Bibliographic Details
Published in:Ji suan ji ke xue Vol. 50; no. 6; pp. 52 - 57
Main Authors: Sun, Wei, Bi, Yujiang, Cheng, Yaodong
Format: Journal Article
Language:Chinese
Published: Chongqing Guojia Kexue Jishu Bu 01.06.2023
Editorial office of Computer Science
Subjects:
ISSN:1002-137X
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Lattice quantum chromodynamics(lattice QCD) is one of the most important applications of large-scale parallel computing in high energy physics, researches in this field usually consume a large amount of computing resources, and its core is to solve the large scale sparse linear equations.Based on the domestic Kunpeng 920 ARM processor, this paper studies the hot spot of lattice QCD calculation, the Dslash, which is applied on up to 64 nodes(6 144 cores) and show the linear scalability.Based on the roofline performance analysis model, we find that lattice QCD is a typical memory bound application, and by using the compression of 3×3 complex unitary matrices in Dslash based on symmetry, we can improve the performance of Dslash by 22%.For the solving of large scale sparse linear equations, we also explore the usual Krylov subspace iterative algorithm such as BiCGStab and the newly developed state-of-art multigrid algorithm on the same ARM processor, and find that in the practical physics calculation the multigri
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1002-137X
DOI:10.11896/jsjkx.230200159