Accelerating the SVD bi-diagonalization of a batch of small matrices using GPUs

•A batched BLAS design based on device function and big-tile setting was proposed on the GPU.•The batched BLAS approach is used to optimize solving the batched bi-diagonalization problem.•For the first time, batched BLAS in HIP code on AMD platform was presented and compared against NVIDIA CUDA plat...

Full description

Saved in:
Bibliographic Details
Published in:Journal of computational science Vol. 26; no. C; pp. 237 - 245
Main Authors: Dong, Tingxing, Haidar, Azzam, Tomov, Stanimire, Dongarra, Jack
Format: Journal Article
Language:English
Published: Netherlands Elsevier B.V 01.05.2018
Elsevier
Subjects:
ISSN:1877-7503, 1877-7511
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first