Auto-tuning 3-D FFT library for CUDA GPUs

Existing implementations of FFTs on GPUs are optimized for specific transform sizes, such as powers of two, and exhibit unstable, peaky performance, i.e., they do not perform as well on other sizes that appear in practice. Our new auto-tuning 3-D FFT on CUDA generates high-performance CUDA kernels for FFTs...

Bibliographic Details
Published in: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, pp. 1-10
Main Authors: Nukada, Akira; Matsuoka, Satoshi
Format: Conference Proceeding
Language: English
Published: New York, NY, USA: ACM, 14 November 2009
Series: ACM Conferences
ISBN: 1605587443, 9781605587448
ISSN: 2167-4329