A multi-GPU parallel optimization model for the preconditioned conjugate gradient algorithm

•An adaptive optimization model for the PCG algorithm on multiple GPUs is proposed.•A performance estimation method is proposed to accurately estimate the execution time of SpMV kernels.•The vector-operation and inner-product decision trees can be automatically constructed. In this study, we present...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Parallel computing Ročník 63; s. 1 - 16
Hlavní autoři: Gao, Jiaquan, Zhou, Yuanshen, He, Guixia, Xia, Yifei
Médium: Journal Article
Jazyk:angličtina
Vydáno: Elsevier B.V 01.04.2017
Témata:
ISSN:0167-8191, 1872-7336
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:•An adaptive optimization model for the PCG algorithm on multiple GPUs is proposed.•A performance estimation method is proposed to accurately estimate the execution time of SpMV kernels.•The vector-operation and inner-product decision trees can be automatically constructed. In this study, we present a novel optimization model that can automatically and rapidly generate an optimally parallel preconditioned conjugate gradient (PCG) algorithm for any given linear system on a specific multi-graphics processing unit (GPU) platform. For our proposed model, there are the following novelties: (1) a profile-based performance model for each one of the main components of the PCG algorithm, including the vector operation, inner product, and sparse matrix-vector multiplication (SpMV), is suggested, and (2) our model is general, independent of the problems, and only dependent on the resources of devices, and (3) our model is extensible. For a vector operation kernel, or inner product kernel, or SpMV kernel that is not included in our framework, once its performance model is successfully constructed, it can be incorporated into our framework. Our model is constructed only once for each type of GPU. The experiments validate the high efficiency of our proposed model.
ISSN:0167-8191
1872-7336
DOI:10.1016/j.parco.2017.04.003