Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning

We are witnessing a rapid evolution of HPC node architectures and on-chip parallelism as power and cooling constraints limit increases in microprocessor clock speeds. In this work, we demonstrate a hierarchical approach towards effectively extracting performance for a variety of emerging multicore-b...

Bibliographic Details
Published in: 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC), pp. 1-12
Main Authors: Williams, Samuel, Oliker, Leonid, Carter, Jonathan, Shalf, John
Format: Conference Proceeding
Language: English
Published: New York, NY, USA: ACM; IEEE, 12.11.2011
Series: ACM Conferences
ISBN: 145030771X, 9781450307710
ISSN: 2167-4329