Accelerating the SVD bi-diagonalization of a batch of small matrices using GPUs

•A batched BLAS design based on device function and big-tile setting was proposed on the GPU.•The batched BLAS approach is used to optimize solving the batched bi-diagonalization problem.•For the first time, batched BLAS in HIP code on AMD platform was presented and compared against NVIDIA CUDA plat...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Journal of computational science Ročník 26; číslo C; s. 237 - 245
Hlavní autoři:	Dong, Tingxing, Haidar, Azzam, Tomov, Stanimire, Dongarra, Jack
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	Netherlands Elsevier B.V 01.05.2018 Elsevier
Témata:	Batched Eigenvalue and singular value problems Hardware accelerators Numerical linear algebra Two-sided factorization algorithms Two-sided factorization algorithms Batched Hardware accelerators Numerical linear algebra Eigenvalue and singular value problems
ISSN:	1877-7503, 1877-7511
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Buďte první, kdo okomentuje tento záznam!