Auto-tuning 3-D FFT library for CUDA GPUs
Existing implementations of FFTs on GPUs are optimized for specific transform sizes like powers of two, and exhibit unstable and peaky performance i.e., do not perform as well in other sizes that appear in practice. Our new auto-tuning 3-D FFT on CUDA generates high performance CUDA kernels for FFTs...
Saved in:
| Published in: | Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis pp. 1 - 10 |
|---|---|
| Main Authors: | , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
New York, NY, USA
ACM
14.11.2009
|
| Series: | ACM Conferences |
| Subjects: | |
| ISBN: | 1605587443, 9781605587448 |
| ISSN: | 2167-4329 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!

