A design methodology for domain-optimized power-efficient supercomputing

As power has become the pre-eminent design constraint for future HPC systems, computational efficiency is being emphasized over simply peak performance. Recently, static benchmark codes have been used to find a power efficient architecture. Unfortunately, because compilers generate sub-optimal code,...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis S. 1 - 12
Hauptverfasser: Mohiyuddin, Marghoob, Murphy, Mark, Oliker, Leonid, Shalf, John, Wawrzynek, John, Williams, Samuel
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: New York, NY, USA ACM 14.11.2009
Schriftenreihe:ACM Conferences
Schlagworte:
ISBN:1605587443, 9781605587448
ISSN:2167-4329
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract As power has become the pre-eminent design constraint for future HPC systems, computational efficiency is being emphasized over simply peak performance. Recently, static benchmark codes have been used to find a power efficient architecture. Unfortunately, because compilers generate sub-optimal code, benchmark performance can be a poor indicator of the performance potential of architecture design points. Therefore, we present hardware/software cotuning as a novel approach for system design, in which traditional architecture space exploration is tightly coupled with software auto-tuning for delivering substantial improvements in area and power efficiency. We demonstrate the proposed methodology by exploring the parameter space of a Tensilica-based multi-processor running three of the most heavily used kernels in scientific computing, each with widely varying micro-architectural requirements: sparse matrix vector multiplication, stencil-based computations, and general matrix-matrix multiplication. Results demonstrate that co-tuning significantly improves hardware area and energy efficiency -- a key driver for next generation of HPC system design.
AbstractList As power has become the pre-eminent design constraint for future HPC systems, computational efficiency is being emphasized over simply peak performance. Recently, static benchmark codes have been used to find a power efficient architecture. Unfortunately, because compilers generate sub-optimal code, benchmark performance can be a poor indicator of the performance potential of architecture design points. Therefore, we present hardware/software cotuning as a novel approach for system design, in which traditional architecture space exploration is tightly coupled with software auto-tuning for delivering substantial improvements in area and power efficiency. We demonstrate the proposed methodology by exploring the parameter space of a Tensilica-based multi-processor running three of the most heavily used kernels in scientific computing, each with widely varying micro-architectural requirements: sparse matrix vector multiplication, stencil-based computations, and general matrix-matrix multiplication. Results demonstrate that co-tuning significantly improves hardware area and energy efficiency -- a key driver for next generation of HPC system design.
Author Mohiyuddin, Marghoob
Wawrzynek, John
Murphy, Mark
Shalf, John
Williams, Samuel
Oliker, Leonid
Author_xml – sequence: 1
  givenname: Marghoob
  surname: Mohiyuddin
  fullname: Mohiyuddin, Marghoob
  organization: University of California at Berkeley, Berkeley, CA and Lawrence Berkeley National Laboratory, Berkeley, CA
– sequence: 2
  givenname: Mark
  surname: Murphy
  fullname: Murphy, Mark
  organization: University of California at Berkeley, Berkeley, CA
– sequence: 3
  givenname: Leonid
  surname: Oliker
  fullname: Oliker, Leonid
  organization: Lawrence Berkeley National Laboratory, Berkeley, CA
– sequence: 4
  givenname: John
  surname: Shalf
  fullname: Shalf, John
  organization: Lawrence Berkeley National Laboratory, Berkeley, CA
– sequence: 5
  givenname: John
  surname: Wawrzynek
  fullname: Wawrzynek, John
  organization: University of California at Berkeley, Berkeley, CA
– sequence: 6
  givenname: Samuel
  surname: Williams
  fullname: Williams, Samuel
  organization: Lawrence Berkeley National Laboratory, Berkeley, CA
BookMark eNqNkDtPwzAUhY0oEqV0ZmDJyJLiG7-SsaqAIlVigdny47oYmjhKUqHy62lpB0bucnT1HZ3huyKjJjVIyA3QGQAX9yAFp6Ka_aYqzsgVSCpEqThn53-fERkXIFXOWVFdkmnff9D9lVCwUozJcp557OO6yWoc3pNPm7TeZSF1mU-1iU2e2iHW8Rt91qYv7HIMIbqIzZD12xY7l-p2O8RmfU0ugtn0OD3lhLw9Prwulvnq5el5MV_lhoEYci44U4wFdMEwZqGyaLk3aKxnVXAUmATFnQOQgWLJq1DuuffGhMoqqdiE3B53IyLqtou16XZaMiWEONC7IzWu1jalz14D1Qdh-iRMn4Ttq7N_VrXtIgb2A7Ofals
CODEN IEEPAD
ContentType Conference Proceeding
Copyright 2009 ACM
Copyright_xml – notice: 2009 ACM
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1145/1654059.1654072
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Xplore Digital Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore Digital Library
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 1605587443
9781605587448
EndPage 12
ExternalDocumentID 6375557
Genre orig-research
GroupedDBID 6IE
6IF
6IL
6IN
AAJGR
AARBI
ACM
ADPZR
ALMA_UNASSIGNED_HOLDINGS
APO
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
GUFHI
OCL
RIE
RIL
6IH
6IK
AAWTH
ABLEC
ADZIZ
CHZPO
IEGSK
IPLJI
ID FETCH-LOGICAL-a315t-4543733fecfa33b19beb4daeabd39fc0136174cc116f0e849f8b4dddaaf9b7673
IEDL.DBID RIE
ISBN 1605587443
9781605587448
ISICitedReferencesCount 0
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000320136800044&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 2167-4329
IngestDate Wed Jul 30 06:14:25 EDT 2025
Wed Jan 31 06:45:54 EST 2024
Wed Jan 31 06:45:55 EST 2024
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
License Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org
LinkModel DirectLink
MeetingName SC '09: International Conference for High Performance Computing, Networking, Storage and Analysis
MergedId FETCHMERGED-LOGICAL-a315t-4543733fecfa33b19beb4daeabd39fc0136174cc116f0e849f8b4dddaaf9b7673
OpenAccessLink https://www.osti.gov/servlets/purl/1407081
PageCount 12
ParticipantIDs ieee_primary_6375557
acm_books_10_1145_1654059_1654072
acm_books_10_1145_1654059_1654072_brief
PublicationCentury 2000
PublicationDate 20091114
2009-Nov.
PublicationDateYYYYMMDD 2009-11-14
2009-11-01
PublicationDate_xml – month: 11
  year: 2009
  text: 20091114
  day: 14
PublicationDecade 2000
PublicationPlace New York, NY, USA
PublicationPlace_xml – name: New York, NY, USA
PublicationSeriesTitle ACM Conferences
PublicationTitle Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
PublicationTitleAbbrev SUPERC
PublicationYear 2009
Publisher ACM
Publisher_xml – name: ACM
SSID ssj0000812385
ssj0003204180
Score 1.4766873
Snippet As power has become the pre-eminent design constraint for future HPC systems, computational efficiency is being emphasized over simply peak performance....
SourceID ieee
acm
SourceType Publisher
StartPage 1
SubjectTerms Codes
Computer architecture
Computer systems organization -- Architectures -- Distributed architectures -- Grid computing
Computer systems organization -- Architectures -- Parallel architectures -- Multicore architectures
Computer systems organization -- Architectures -- Serial architectures -- Superscalar architectures
Computing methodologies -- Modeling and simulation -- Simulation types and techniques -- Massively parallel and high-performance simulations
Costs
Energy efficiency
General and reference -- Cross-computing tools and techniques -- Design
General and reference -- Cross-computing tools and techniques -- Performance
Hardware
Hardware -- Electronic design automation -- Modeling and parameter extraction
Kernel
Optimization
Software
Software and its engineering -- Software organization and properties -- Software system structures -- Distributed systems organizing principles -- Grid computing
System analysis and design
Tuning
Title A design methodology for domain-optimized power-efficient supercomputing
URI https://ieeexplore.ieee.org/document/6375557
WOSCitedRecordID wos000320136800044&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3LSgMxFL20xYWrqq1YX0QQ3Dht00wmk6WIpavShUJ3Qx43MIt2Sh8u_HqTdKwIgriaJ5PhMJObe5NzDsA9SiqF82mqENwlvpfME21UmmjudOqUz7lwGM0mxHSaz-dy1oDHAxcGEePiM-yH3TiXbyuzC6WyQcYE51w0oSlEtudqHeopPrT56MMPx2w0TGk0ThtFaW82krWyD035IHB4_MCiH7dBHbipzOKHwUqML-P2_97sBLrfRD0yO4SgU2jg8gzaX04NpP5xOzB5IjYu1SB7x-hYSyd-vEpstVDlMql8z7EoP9CSVbBNSzAqS_g2yWa3wrWJT_RNdOFt_PL6PElqD4VEMcq3QdKcCcYcGqcY01Rq1KlVqLRl0pmg2OZzEmMozdwQ81S63F-3VikntcgEO4fWslriBRCdog_m3DhDdSrRapsjqqGyZiS0proHdx7EIiQHm2LPd-ZFDXRRA92Dhz_vKfS6RNeDToC5WO1FN4oa4cvfT1_BcZzmiSTBa2ht1zu8gSPzvi0369v4pXwCGT65og
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1JSwMxFH7UBfTk0op1jSB4cdqmmTSTo4ilYi09VOhtyPICPXShiwd_vUk6VgRBPM3KZPiYyct7yfd9ALcoqRTOp6lCcJf4XjJLtFFpornTqVM-58JGNJsQvV42HMp-Ce43XBhEjIvPsBZ241y-nZpVKJXVW0xwzsUW7ATnrIKttamo-ODm4w_fHLNmI6XROq0Zxb1ZUxbaPjTl9cDi8UOLWtwGfeAtZcY_LFZihGkf_O_dDqHyTdUj_U0QOoISTo7h4MurgRS_bhk6D8TGxRpk7Rkdq-nEj1iJnY7VaJJMfd8xHn2gJbNgnJZg1JbwbZLFaoZzE5_om6jAW_tp8NhJCheFRDHKl0HUnAnGHBqnGNNUatSpVai0ZdKZoNnmsxJjKG25BmapdJm_bq1STmrREuwEtifTCZ4C0Sn6cM6NM1SnEq22GaJqKGuaQmuqq3DjQcxDerDI14xnnhdA5wXQVbj7855cz0foqlAOMOeztexGXiB89vvpa9jrDF67efe593IO-3HSJ1IGL2B7OV_hJeya9-VoMb-KX80n9Xa86w
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+Conference+on+High+Performance+Computing+Networking%2C+Storage+and+Analysis&rft.atitle=A+design+methodology+for+domain-optimized+power-efficient+supercomputing&rft.au=Mohiyuddin%2C+Marghoob&rft.au=Murphy%2C+Mark&rft.au=Oliker%2C+Leonid&rft.au=Shalf%2C+John&rft.series=ACM+Conferences&rft.date=2009-11-14&rft.pub=ACM&rft.isbn=1605587443&rft.spage=1&rft.epage=12&rft_id=info:doi/10.1145%2F1654059.1654072
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2167-4329&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2167-4329&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2167-4329&client=summon