A Parallel Direct Finite Element Solver Empowered by Machine Learning for Solving Large-Scale Electromagnetic Problems

In this communication, a parallel direct finite element solver is proposed for solving complex electromagnetic problems. Our proposed solver features adaptive fill-reducing ordering and highly assembled lazy offloading (HALO) factorization. In our adaptive fill-reducing ordering approach, we propose...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:IEEE transactions on antennas and propagation Ročník 73; číslo 4; s. 2690 - 2695
Hlavní autori: Wang, Junjie, Zuo, Sheng, Zhao, Xunwang, Tian, Min, Lin, Zhongchao, Zhang, Yu
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: New York IEEE 01.04.2025
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Predmet:
ISSN:0018-926X, 1558-2221
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:In this communication, a parallel direct finite element solver is proposed for solving complex electromagnetic problems. Our proposed solver features adaptive fill-reducing ordering and highly assembled lazy offloading (HALO) factorization. In our adaptive fill-reducing ordering approach, we propose a graph convolutional network (GCN) model that adaptively selects the optimal fill-reducing ordering method for each electromagnetic problem. Furthermore, we propose a graph sampling training algorithm to enhance the scalability of the GCN model. In our HALO factorization approach, we focus on small irregular level-1 BLAS (BLAS-1) and level-2 BLAS (BLAS-2) subproblems for the supernode factorization and panel update steps, aggregate unstructured L and U factors stored in vector format into structured dense blocks stored in matrix format, assemble these small irregular BLAS-1 and BLAS-2 subproblems into highly assembled level-3 BLAS (BLAS-3) problems that are well-suited for the parallelism of various dedicated manycore accelerators, and propose a lazy offloading technique to achieve high computational efficiency and low data transfer overhead. Our numerical results demonstrate that our proposed method exhibits high accuracy and significant performance improvement for a variety of complex electromagnetic problems and arbitrarily shaped radiating and coupling objects. Moreover, our proposed solver can achieve a <inline-formula> <tex-math notation="LaTeX">4.25 \times </tex-math></inline-formula> speedup in comparison to the state-of-the-art swSuperLU on Sunway manycore computing systems, which is an unprecedented performance gain for the direct finite element solver on dedicated manycore accelerators.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0018-926X
1558-2221
DOI:10.1109/TAP.2025.3541883