Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning
We are witnessing a rapid evolution of HPC node architectures and on-chip parallelism as power and cooling constraints limit increases in microprocessor clock speeds. In this work, we demonstrate a hierarchical approach towards effectively extracting performance for a variety of emerging multicore-b...
| Published in: | 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) pp. 1 - 12 |
|---|---|
| Format: | Conference Proceeding |
| Language: | English |
| Published: | New York, NY, USA; ACM; 12.11.2011; IEEE |
| Series: | ACM Conferences |
| ISBN: | 145030771X, 9781450307710 |
| ISSN: | 2167-4329 |

