Accelerating the SVD bi-diagonalization of a batch of small matrices using GPUs

•A batched BLAS design based on device function and big-tile setting was proposed on the GPU.•The batched BLAS approach is used to optimize solving the batched bi-diagonalization problem.•For the first time, batched BLAS in HIP code on AMD platform was presented and compared against NVIDIA CUDA plat...

Full description

Saved in:

Bibliographic Details
Published in:	Journal of computational science Vol. 26; no. C; pp. 237 - 245
Main Authors:	Dong, Tingxing, Haidar, Azzam, Tomov, Stanimire, Dongarra, Jack
Format:	Journal Article
Language:	English
Published:	Netherlands Elsevier B.V 01.05.2018 Elsevier
Subjects:	Batched Eigenvalue and singular value problems Hardware accelerators Numerical linear algebra Two-sided factorization algorithms Two-sided factorization algorithms Batched Hardware accelerators Numerical linear algebra Eigenvalue and singular value problems
ISSN:	1877-7503, 1877-7511
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Be the first to leave a comment!