The split-and-merge method in general purpose computation on GPUs


Bibliographic details
Published in: Parallel Computing, Volume 38, Issue 6-7, pp. 277-288
Main authors: Argüello, F., Heras, D.B., Bóo, M., Lamas-Rodríguez, J.
Format: Journal Article
Language: English
Published: Elsevier B.V., 1 June 2012
ISSN: 0167-8191, 1872-7336
Description
Summary:
► The split-and-merge method was applied to the CUDA programming model.
► The data dependency problem at the block boundaries was solved.
► Some parameters can be adjusted to achieve a higher efficiency.
► The method can be easily extended to the two-dimensional case.
The split-and-merge method is an algorithm design paradigm sometimes used in the field of parallel computing. It is applied to multilevel algorithms such as the wavelet transforms and some tridiagonal system solvers. In this paper we present the application of the method in the context of general purpose computation on GPUs. The split-and-merge method allows us to efficiently use the CUDA parallel programming model, where a multithreaded program is partitioned into blocks of threads that execute independently from each other. Thus we can solve the data dependency problem at the block boundaries and efficiently take advantage of the memory hierarchy of the GPU. The results obtained show a significant acceleration compared with the direct implementation of the algorithms on the GPU.
DOI: 10.1016/j.parco.2012.03.003