Multi-core acceleration of chemical kinetics for simulation and prediction
This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA GPUs using CUDA, the Cell Broadband Engine, and Intel Quad-Core Xeon CPUs. A comparative performance analysis for each platform in double and...
Uložené v:
| Vydané v: | Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis s. 1 - 11 |
|---|---|
| Hlavní autori: | , , , |
| Médium: | Konferenčný príspevok.. |
| Jazyk: | English |
| Vydavateľské údaje: |
New York, NY, USA
ACM
14.11.2009
|
| Edícia: | ACM Conferences |
| Predmet: | |
| ISBN: | 1605587443, 9781605587448 |
| ISSN: | 2167-4329 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Abstract | This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA GPUs using CUDA, the Cell Broadband Engine, and Intel Quad-Core Xeon CPUs. A comparative performance analysis for each platform in double and single precision on coarse and fine grids is presented. Platform-specific design and optimization is discussed in a mechanism-agnostic way, permitting the optimization of many chemical mechanisms. The implementation of a three-stage Rosenbrock solver for SIMD architectures is discussed. When used as a template mechanism in the the Kinetic PreProcessor, the multi-core implementation enables the automatic optimization and porting of many chemical mechanisms on a variety of multi-core platforms. Speedups of 5.5x in single precision and 2.7x in double precision are observed when compared to eight Xeon cores. Compared to the serial implementation, the maximum observed speedup is 41.1x in single precision. |
|---|---|
| AbstractList | This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA GPUs using CUDA, the Cell Broadband Engine, and Intel Quad-Core Xeon CPUs. A comparative performance analysis for each platform in double and single precision on coarse and fine grids is presented. Platform-specific design and optimization is discussed in a mechanism-agnostic way, permitting the optimization of many chemical mechanisms. The implementation of a three-stage Rosenbrock solver for SIMD architectures is discussed. When used as a template mechanism in the the Kinetic PreProcessor, the multi-core implementation enables the automatic optimization and porting of many chemical mechanisms on a variety of multi-core platforms. Speedups of 5.5x in single precision and 2.7x in double precision are observed when compared to eight Xeon cores. Compared to the serial implementation, the maximum observed speedup is 41.1x in single precision. |
| Author | Vachharajani, Manish Michalakes, John Linford, John C. Sandu, Adrian |
| Author_xml | – sequence: 1 givenname: John C. surname: Linford fullname: Linford, John C. organization: Virginia Polytechnic Institute and State University, Blacksburg, VA – sequence: 2 givenname: John surname: Michalakes fullname: Michalakes, John organization: National Center for Atmospheric Research, Boulder, CO – sequence: 3 givenname: Manish surname: Vachharajani fullname: Vachharajani, Manish organization: University of Colorado at Boulder, Boulder, CO – sequence: 4 givenname: Adrian surname: Sandu fullname: Sandu, Adrian organization: Virginia Polytechnic Institute and State University, Blacksburg, VA |
| BookMark | eNqNkDtPAzEQhI0IEklITUHjkubC-by270oUER4KooHa8mMtTO4R-S4F_54LSUHJNqPR7KxW34xM2q5FQq5ZvmQMxB2TAnJRLX9VqjMyYzIXolQA_PyvmZBpwaTKgBfVJVn0_Vc-TskKXoopeXnd10PMXJeQGuewxmSG2LW0C9R9YhOdqek2tjhE19PQJdrHZl8fd0zr6S6hj-5gr8hFMHWPi5POycf64X31lG3eHp9X95vMcMaHDLwVOH7AQxWck2CERx-qIvcSlFQI1qItwcmCCYAAjIOXVaUstxI8Rz4nN8e7ERH1LsXGpG8tuRJCFmO6PKbGNdp23bbXLNcHZPqETJ-QaZsihrFw-88C_wHh6Wmr |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| Copyright | 2009 ACM |
| Copyright_xml | – notice: 2009 ACM |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1145/1654059.1654067 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Xplore: IEL IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore: IEL url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISBN | 1605587443 9781605587448 |
| EndPage | 11 |
| ExternalDocumentID | 6375562 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IF 6IL 6IN AAJGR AARBI ACM ADPZR ALMA_UNASSIGNED_HOLDINGS APO BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK GUFHI OCL RIE RIL 6IH 6IK AAWTH ABLEC ADZIZ CHZPO IEGSK IPLJI |
| ID | FETCH-LOGICAL-a313t-4db5e0003f9fcc64a5dedf920d64767e4bbeb84c621544f4134d6997b3b64d3e3 |
| IEDL.DBID | RIE |
| ISBN | 1605587443 9781605587448 |
| ISICitedReferencesCount | 10 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000320136800049&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 2167-4329 |
| IngestDate | Wed Jul 30 06:14:30 EDT 2025 Wed Jan 31 06:45:55 EST 2024 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Keywords | multi-core open-MP chemical kinetics NVIDIA CUDA kinetic preprocessor atmospheric modeling cell broadband engine |
| Language | English |
| License | Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org |
| LinkModel | DirectLink |
| MeetingName | SC '09: International Conference for High Performance Computing, Networking, Storage and Analysis |
| MergedId | FETCHMERGED-LOGICAL-a313t-4db5e0003f9fcc64a5dedf920d64767e4bbeb84c621544f4134d6997b3b64d3e3 |
| PageCount | 11 |
| ParticipantIDs | acm_books_10_1145_1654059_1654067 ieee_primary_6375562 acm_books_10_1145_1654059_1654067_brief |
| PublicationCentury | 2000 |
| PublicationDate | 2009-11-14 |
| PublicationDateYYYYMMDD | 2009-11-14 |
| PublicationDate_xml | – month: 11 year: 2009 text: 2009-11-14 day: 14 |
| PublicationDecade | 2000 |
| PublicationPlace | New York, NY, USA |
| PublicationPlace_xml | – name: New York, NY, USA |
| PublicationSeriesTitle | ACM Conferences |
| PublicationTitle | Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis |
| PublicationTitleAbbrev | SUPERC |
| PublicationYear | 2009 |
| Publisher | ACM |
| Publisher_xml | – name: ACM |
| SSID | ssj0000812385 ssj0003204180 |
| Score | 1.5723205 |
| Snippet | This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA... |
| SourceID | ieee acm |
| SourceType | Publisher |
| StartPage | 1 |
| SubjectTerms | Applied computing -- Physical sciences and engineering Atmospheric modeling cell broadband engine chemical kinetics Chemical reactions Chemicals Codes Computational modeling Computer architecture Computer systems organization -- Architectures -- Parallel architectures -- Multiple instruction, multiple data Computing methodologies -- Concurrent computing methodologies -- Concurrent programming languages Graphics processing units kinetic preprocessor Kinetic theory Mathematical models multi-core NVIDIA CUDA open-MP Software and its engineering -- Software notations and tools -- General programming languages -- Language types -- Concurrent programming languages Vectors |
| Title | Multi-core acceleration of chemical kinetics for simulation and prediction |
| URI | https://ieeexplore.ieee.org/document/6375562 |
| WOSCitedRecordID | wos000320136800049&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8NAGFxq8eCpaivWFysIXkybZB_JnsUiIqUHld7CPr6FIk1L2vr73d3EiCCIp2RDDmE2yXz7mBmEboiUnKo8jozMhRuggImUUdRLn2LIiWPgkNbw9pxNp_l8LmYddNdqYQAgbD6DkT8Na_lmpXd-qmzMScaY_-HuZRmvtVrtfIqjNsc-rG2TNKZJCE5Lg7U3SUXj7JNQNvYaHldYjMKRB2rSyx8BK4FfJr3_PdkhGnwL9fCspaAj1IHyGPW-khpw8-H20VPQ2UbeshJLrR3V1B2PVxbrxjIAv7uC05s2Y1fH4s1i2QR7YVkavK78go5vDtDr5OHl_jFqUhQiSRKyjahRDDwgVlitOZXMgLEijQ2nGc-AKgUqp5qn3pjHOlKjhguRKaI4NQTICeqWqxJOEWZSWiKUtkYz6kpJlcQaNKUUcsltmgzRtYOx8MODTVErnlnRQF00UA_R7Z_3FKpagB2ivge6WNe2G0WD8dnvl8_RQVjo8Tv06AXqbqsdXKJ9_bFdbKqr8K58AjrUt-M |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3LSsQwFL2ICrryjeMzguDGatvcpO1alFHHwYWKu5LHDQwyMzIPv98kU0cEQVy1KV2Uk7bn5nHOATjlSknUZZpYVVZ-gEI20VZjkD6lVHLPwDGt4aVTdLvl62v1uADncy0MEcXNZ3QRTuNavh2aaZgqu5S8ECL8cJcEYp7O1FrzGRVPbp5_xLzN8xSzGJ2WR3NvnleNt0-G4jKoeHxpcRGPMpKT6f-IWIkMc7P2v2dbh-1vqR57nJPQBizQYBPWvrIaWPPpbsFdVNomwbSSKWM82cy6ng0dM41pAHvzJWewbWa-kmXjXr-J9mJqYNn7KCzphOY2PN9cP121kyZHIVE845MErRYUAHGVM0aiEpasq_LUSixkQag16RKNzIM1j_O0hlZWVaG5lmg58R1YHAwHtAtMKOV4pY2zRqAvJnWWGjKISKWSLs9acOJhrMMAYVzPNM-ibqCuG6hbcPbnPbUe9ci1YCsAXb_PjDfqBuO93y8fw0r76aFTd2679_uwGpd9wn49PIDFyWhKh7BsPia98egovjef_ZW7Kg |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+Conference+on+High+Performance+Computing+Networking%2C+Storage+and+Analysis&rft.atitle=Multi-core+acceleration+of+chemical+kinetics+for+simulation+and+prediction&rft.au=Linford%2C+John+C.&rft.au=Michalakes%2C+John&rft.au=Vachharajani%2C+Manish&rft.au=Sandu%2C+Adrian&rft.date=2009-11-14&rft.pub=ACM&rft.issn=2167-4329&rft.spage=1&rft.epage=11&rft_id=info:doi/10.1145%2F1654059.1654067&rft.externalDocID=6375562 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2167-4329&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2167-4329&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2167-4329&client=summon |

