A design methodology for domain-optimized power-efficient supercomputing
As power has become the pre-eminent design constraint for future HPC systems, computational efficiency is being emphasized over simply peak performance. Recently, static benchmark codes have been used to find a power efficient architecture. Unfortunately, because compilers generate sub-optimal code,...
Gespeichert in:
| Veröffentlicht in: | Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis S. 1 - 12 |
|---|---|
| Hauptverfasser: | , , , , , |
| Format: | Tagungsbericht |
| Sprache: | Englisch |
| Veröffentlicht: |
New York, NY, USA
ACM
14.11.2009
|
| Schriftenreihe: | ACM Conferences |
| Schlagworte: | |
| ISBN: | 1605587443, 9781605587448 |
| ISSN: | 2167-4329 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Abstract | As power has become the pre-eminent design constraint for future HPC systems, computational efficiency is being emphasized over simply peak performance. Recently, static benchmark codes have been used to find a power efficient architecture. Unfortunately, because compilers generate sub-optimal code, benchmark performance can be a poor indicator of the performance potential of architecture design points. Therefore, we present hardware/software cotuning as a novel approach for system design, in which traditional architecture space exploration is tightly coupled with software auto-tuning for delivering substantial improvements in area and power efficiency. We demonstrate the proposed methodology by exploring the parameter space of a Tensilica-based multi-processor running three of the most heavily used kernels in scientific computing, each with widely varying micro-architectural requirements: sparse matrix vector multiplication, stencil-based computations, and general matrix-matrix multiplication. Results demonstrate that co-tuning significantly improves hardware area and energy efficiency -- a key driver for next generation of HPC system design. |
|---|---|
| AbstractList | As power has become the pre-eminent design constraint for future HPC systems, computational efficiency is being emphasized over simply peak performance. Recently, static benchmark codes have been used to find a power efficient architecture. Unfortunately, because compilers generate sub-optimal code, benchmark performance can be a poor indicator of the performance potential of architecture design points. Therefore, we present hardware/software cotuning as a novel approach for system design, in which traditional architecture space exploration is tightly coupled with software auto-tuning for delivering substantial improvements in area and power efficiency. We demonstrate the proposed methodology by exploring the parameter space of a Tensilica-based multi-processor running three of the most heavily used kernels in scientific computing, each with widely varying micro-architectural requirements: sparse matrix vector multiplication, stencil-based computations, and general matrix-matrix multiplication. Results demonstrate that co-tuning significantly improves hardware area and energy efficiency -- a key driver for next generation of HPC system design. |
| Author | Mohiyuddin, Marghoob Wawrzynek, John Murphy, Mark Shalf, John Williams, Samuel Oliker, Leonid |
| Author_xml | – sequence: 1 givenname: Marghoob surname: Mohiyuddin fullname: Mohiyuddin, Marghoob organization: University of California at Berkeley, Berkeley, CA and Lawrence Berkeley National Laboratory, Berkeley, CA – sequence: 2 givenname: Mark surname: Murphy fullname: Murphy, Mark organization: University of California at Berkeley, Berkeley, CA – sequence: 3 givenname: Leonid surname: Oliker fullname: Oliker, Leonid organization: Lawrence Berkeley National Laboratory, Berkeley, CA – sequence: 4 givenname: John surname: Shalf fullname: Shalf, John organization: Lawrence Berkeley National Laboratory, Berkeley, CA – sequence: 5 givenname: John surname: Wawrzynek fullname: Wawrzynek, John organization: University of California at Berkeley, Berkeley, CA – sequence: 6 givenname: Samuel surname: Williams fullname: Williams, Samuel organization: Lawrence Berkeley National Laboratory, Berkeley, CA |
| BookMark | eNqNkDtPwzAUhY0oEqV0ZmDJyJLiG7-SsaqAIlVigdny47oYmjhKUqHy62lpB0bucnT1HZ3huyKjJjVIyA3QGQAX9yAFp6Ka_aYqzsgVSCpEqThn53-fERkXIFXOWVFdkmnff9D9lVCwUozJcp557OO6yWoc3pNPm7TeZSF1mU-1iU2e2iHW8Rt91qYv7HIMIbqIzZD12xY7l-p2O8RmfU0ugtn0OD3lhLw9Prwulvnq5el5MV_lhoEYci44U4wFdMEwZqGyaLk3aKxnVXAUmATFnQOQgWLJq1DuuffGhMoqqdiE3B53IyLqtou16XZaMiWEONC7IzWu1jalz14D1Qdh-iRMn4Ttq7N_VrXtIgb2A7Ofals |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| Copyright | 2009 ACM |
| Copyright_xml | – notice: 2009 ACM |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1145/1654059.1654072 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore Digital Library IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore Digital Library url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISBN | 1605587443 9781605587448 |
| EndPage | 12 |
| ExternalDocumentID | 6375557 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IF 6IL 6IN AAJGR AARBI ACM ADPZR ALMA_UNASSIGNED_HOLDINGS APO BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK GUFHI OCL RIE RIL 6IH 6IK AAWTH ABLEC ADZIZ CHZPO IEGSK IPLJI |
| ID | FETCH-LOGICAL-a315t-4543733fecfa33b19beb4daeabd39fc0136174cc116f0e849f8b4dddaaf9b7673 |
| IEDL.DBID | RIE |
| ISBN | 1605587443 9781605587448 |
| ISICitedReferencesCount | 0 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000320136800044&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 2167-4329 |
| IngestDate | Wed Jul 30 06:14:25 EDT 2025 Wed Jan 31 06:45:54 EST 2024 Wed Jan 31 06:45:55 EST 2024 |
| IsDoiOpenAccess | false |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| License | Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org |
| LinkModel | DirectLink |
| MeetingName | SC '09: International Conference for High Performance Computing, Networking, Storage and Analysis |
| MergedId | FETCHMERGED-LOGICAL-a315t-4543733fecfa33b19beb4daeabd39fc0136174cc116f0e849f8b4dddaaf9b7673 |
| OpenAccessLink | https://www.osti.gov/servlets/purl/1407081 |
| PageCount | 12 |
| ParticipantIDs | ieee_primary_6375557 acm_books_10_1145_1654059_1654072 acm_books_10_1145_1654059_1654072_brief |
| PublicationCentury | 2000 |
| PublicationDate | 20091114 2009-Nov. |
| PublicationDateYYYYMMDD | 2009-11-14 2009-11-01 |
| PublicationDate_xml | – month: 11 year: 2009 text: 20091114 day: 14 |
| PublicationDecade | 2000 |
| PublicationPlace | New York, NY, USA |
| PublicationPlace_xml | – name: New York, NY, USA |
| PublicationSeriesTitle | ACM Conferences |
| PublicationTitle | Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis |
| PublicationTitleAbbrev | SUPERC |
| PublicationYear | 2009 |
| Publisher | ACM |
| Publisher_xml | – name: ACM |
| SSID | ssj0000812385 ssj0003204180 |
| Score | 1.4766873 |
| Snippet | As power has become the pre-eminent design constraint for future HPC systems, computational efficiency is being emphasized over simply peak performance.... |
| SourceID | ieee acm |
| SourceType | Publisher |
| StartPage | 1 |
| SubjectTerms | Codes Computer architecture Computer systems organization -- Architectures -- Distributed architectures -- Grid computing Computer systems organization -- Architectures -- Parallel architectures -- Multicore architectures Computer systems organization -- Architectures -- Serial architectures -- Superscalar architectures Computing methodologies -- Modeling and simulation -- Simulation types and techniques -- Massively parallel and high-performance simulations Costs Energy efficiency General and reference -- Cross-computing tools and techniques -- Design General and reference -- Cross-computing tools and techniques -- Performance Hardware Hardware -- Electronic design automation -- Modeling and parameter extraction Kernel Optimization Software Software and its engineering -- Software organization and properties -- Software system structures -- Distributed systems organizing principles -- Grid computing System analysis and design Tuning |
| Title | A design methodology for domain-optimized power-efficient supercomputing |
| URI | https://ieeexplore.ieee.org/document/6375557 |
| WOSCitedRecordID | wos000320136800044&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3LSgMxFL20xYWrqq1YX0QQ3Dht00wmk6WIpavShUJ3Qx43MIt2Sh8u_HqTdKwIgriaJ5PhMJObe5NzDsA9SiqF82mqENwlvpfME21UmmjudOqUz7lwGM0mxHSaz-dy1oDHAxcGEePiM-yH3TiXbyuzC6WyQcYE51w0oSlEtudqHeopPrT56MMPx2w0TGk0ThtFaW82krWyD035IHB4_MCiH7dBHbipzOKHwUqML-P2_97sBLrfRD0yO4SgU2jg8gzaX04NpP5xOzB5IjYu1SB7x-hYSyd-vEpstVDlMql8z7EoP9CSVbBNSzAqS_g2yWa3wrWJT_RNdOFt_PL6PElqD4VEMcq3QdKcCcYcGqcY01Rq1KlVqLRl0pmg2OZzEmMozdwQ81S63F-3VikntcgEO4fWslriBRCdog_m3DhDdSrRapsjqqGyZiS0proHdx7EIiQHm2LPd-ZFDXRRA92Dhz_vKfS6RNeDToC5WO1FN4oa4cvfT1_BcZzmiSTBa2ht1zu8gSPzvi0369v4pXwCGT65og |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1JSwMxFH7UBfTk0op1jSB4cdqmmTSTo4ilYi09VOhtyPICPXShiwd_vUk6VgRBPM3KZPiYyct7yfd9ALcoqRTOp6lCcJf4XjJLtFFpornTqVM-58JGNJsQvV42HMp-Ce43XBhEjIvPsBZ241y-nZpVKJXVW0xwzsUW7ATnrIKttamo-ODm4w_fHLNmI6XROq0Zxb1ZUxbaPjTl9cDi8UOLWtwGfeAtZcY_LFZihGkf_O_dDqHyTdUj_U0QOoISTo7h4MurgRS_bhk6D8TGxRpk7Rkdq-nEj1iJnY7VaJJMfd8xHn2gJbNgnJZg1JbwbZLFaoZzE5_om6jAW_tp8NhJCheFRDHKl0HUnAnGHBqnGNNUatSpVai0ZdKZoNnmsxJjKG25BmapdJm_bq1STmrREuwEtifTCZ4C0Sn6cM6NM1SnEq22GaJqKGuaQmuqq3DjQcxDerDI14xnnhdA5wXQVbj7855cz0foqlAOMOeztexGXiB89vvpa9jrDF67efe593IO-3HSJ1IGL2B7OV_hJeya9-VoMb-KX80n9Xa86w |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+Conference+on+High+Performance+Computing+Networking%2C+Storage+and+Analysis&rft.atitle=A+design+methodology+for+domain-optimized+power-efficient+supercomputing&rft.au=Mohiyuddin%2C+Marghoob&rft.au=Murphy%2C+Mark&rft.au=Oliker%2C+Leonid&rft.au=Shalf%2C+John&rft.series=ACM+Conferences&rft.date=2009-11-14&rft.pub=ACM&rft.isbn=1605587443&rft.spage=1&rft.epage=12&rft_id=info:doi/10.1145%2F1654059.1654072 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2167-4329&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2167-4329&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2167-4329&client=summon |

