Multi-core acceleration of chemical kinetics for simulation and prediction

This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA GPUs using CUDA, the Cell Broadband Engine, and Intel Quad-Core Xeon CPUs. A comparative performance analysis for each platform in double and...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis s. 1 - 11
Hlavní autori: Linford, John C., Michalakes, John, Vachharajani, Manish, Sandu, Adrian
Médium: Konferenčný príspevok..
Jazyk:English
Vydavateľské údaje: New York, NY, USA ACM 14.11.2009
Edícia:ACM Conferences
Predmet:
ISBN:1605587443, 9781605587448
ISSN:2167-4329
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA GPUs using CUDA, the Cell Broadband Engine, and Intel Quad-Core Xeon CPUs. A comparative performance analysis for each platform in double and single precision on coarse and fine grids is presented. Platform-specific design and optimization is discussed in a mechanism-agnostic way, permitting the optimization of many chemical mechanisms. The implementation of a three-stage Rosenbrock solver for SIMD architectures is discussed. When used as a template mechanism in the the Kinetic PreProcessor, the multi-core implementation enables the automatic optimization and porting of many chemical mechanisms on a variety of multi-core platforms. Speedups of 5.5x in single precision and 2.7x in double precision are observed when compared to eight Xeon cores. Compared to the serial implementation, the maximum observed speedup is 41.1x in single precision.
AbstractList This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA GPUs using CUDA, the Cell Broadband Engine, and Intel Quad-Core Xeon CPUs. A comparative performance analysis for each platform in double and single precision on coarse and fine grids is presented. Platform-specific design and optimization is discussed in a mechanism-agnostic way, permitting the optimization of many chemical mechanisms. The implementation of a three-stage Rosenbrock solver for SIMD architectures is discussed. When used as a template mechanism in the the Kinetic PreProcessor, the multi-core implementation enables the automatic optimization and porting of many chemical mechanisms on a variety of multi-core platforms. Speedups of 5.5x in single precision and 2.7x in double precision are observed when compared to eight Xeon cores. Compared to the serial implementation, the maximum observed speedup is 41.1x in single precision.
Author Vachharajani, Manish
Michalakes, John
Linford, John C.
Sandu, Adrian
Author_xml – sequence: 1
  givenname: John C.
  surname: Linford
  fullname: Linford, John C.
  organization: Virginia Polytechnic Institute and State University, Blacksburg, VA
– sequence: 2
  givenname: John
  surname: Michalakes
  fullname: Michalakes, John
  organization: National Center for Atmospheric Research, Boulder, CO
– sequence: 3
  givenname: Manish
  surname: Vachharajani
  fullname: Vachharajani, Manish
  organization: University of Colorado at Boulder, Boulder, CO
– sequence: 4
  givenname: Adrian
  surname: Sandu
  fullname: Sandu, Adrian
  organization: Virginia Polytechnic Institute and State University, Blacksburg, VA
BookMark eNqNkDtPAzEQhI0IEklITUHjkubC-by270oUER4KooHa8mMtTO4R-S4F_54LSUHJNqPR7KxW34xM2q5FQq5ZvmQMxB2TAnJRLX9VqjMyYzIXolQA_PyvmZBpwaTKgBfVJVn0_Vc-TskKXoopeXnd10PMXJeQGuewxmSG2LW0C9R9YhOdqek2tjhE19PQJdrHZl8fd0zr6S6hj-5gr8hFMHWPi5POycf64X31lG3eHp9X95vMcMaHDLwVOH7AQxWck2CERx-qIvcSlFQI1qItwcmCCYAAjIOXVaUstxI8Rz4nN8e7ERH1LsXGpG8tuRJCFmO6PKbGNdp23bbXLNcHZPqETJ-QaZsihrFw-88C_wHh6Wmr
CODEN IEEPAD
ContentType Conference Proceeding
Copyright 2009 ACM
Copyright_xml – notice: 2009 ACM
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1145/1654059.1654067
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore: IEL
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore: IEL
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 1605587443
9781605587448
EndPage 11
ExternalDocumentID 6375562
Genre orig-research
GroupedDBID 6IE
6IF
6IL
6IN
AAJGR
AARBI
ACM
ADPZR
ALMA_UNASSIGNED_HOLDINGS
APO
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
GUFHI
OCL
RIE
RIL
6IH
6IK
AAWTH
ABLEC
ADZIZ
CHZPO
IEGSK
IPLJI
ID FETCH-LOGICAL-a313t-4db5e0003f9fcc64a5dedf920d64767e4bbeb84c621544f4134d6997b3b64d3e3
IEDL.DBID RIE
ISBN 1605587443
9781605587448
ISICitedReferencesCount 10
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000320136800049&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 2167-4329
IngestDate Wed Jul 30 06:14:30 EDT 2025
Wed Jan 31 06:45:55 EST 2024
IsPeerReviewed false
IsScholarly false
Keywords multi-core
open-MP
chemical kinetics
NVIDIA CUDA
kinetic preprocessor
atmospheric modeling
cell broadband engine
Language English
License Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org
LinkModel DirectLink
MeetingName SC '09: International Conference for High Performance Computing, Networking, Storage and Analysis
MergedId FETCHMERGED-LOGICAL-a313t-4db5e0003f9fcc64a5dedf920d64767e4bbeb84c621544f4134d6997b3b64d3e3
PageCount 11
ParticipantIDs acm_books_10_1145_1654059_1654067
ieee_primary_6375562
acm_books_10_1145_1654059_1654067_brief
PublicationCentury 2000
PublicationDate 2009-11-14
PublicationDateYYYYMMDD 2009-11-14
PublicationDate_xml – month: 11
  year: 2009
  text: 2009-11-14
  day: 14
PublicationDecade 2000
PublicationPlace New York, NY, USA
PublicationPlace_xml – name: New York, NY, USA
PublicationSeriesTitle ACM Conferences
PublicationTitle Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
PublicationTitleAbbrev SUPERC
PublicationYear 2009
Publisher ACM
Publisher_xml – name: ACM
SSID ssj0000812385
ssj0003204180
Score 1.5723205
Snippet This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA...
SourceID ieee
acm
SourceType Publisher
StartPage 1
SubjectTerms Applied computing -- Physical sciences and engineering
Atmospheric modeling
cell broadband engine
chemical kinetics
Chemical reactions
Chemicals
Codes
Computational modeling
Computer architecture
Computer systems organization -- Architectures -- Parallel architectures -- Multiple instruction, multiple data
Computing methodologies -- Concurrent computing methodologies -- Concurrent programming languages
Graphics processing units
kinetic preprocessor
Kinetic theory
Mathematical models
multi-core
NVIDIA CUDA
open-MP
Software and its engineering -- Software notations and tools -- General programming languages -- Language types -- Concurrent programming languages
Vectors
Title Multi-core acceleration of chemical kinetics for simulation and prediction
URI https://ieeexplore.ieee.org/document/6375562
WOSCitedRecordID wos000320136800049&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8NAGFxq8eCpaivWFysIXkybZB_JnsUiIqUHld7CPr6FIk1L2vr73d3EiCCIp2RDDmE2yXz7mBmEboiUnKo8jozMhRuggImUUdRLn2LIiWPgkNbw9pxNp_l8LmYddNdqYQAgbD6DkT8Na_lmpXd-qmzMScaY_-HuZRmvtVrtfIqjNsc-rG2TNKZJCE5Lg7U3SUXj7JNQNvYaHldYjMKRB2rSyx8BK4FfJr3_PdkhGnwL9fCspaAj1IHyGPW-khpw8-H20VPQ2UbeshJLrR3V1B2PVxbrxjIAv7uC05s2Y1fH4s1i2QR7YVkavK78go5vDtDr5OHl_jFqUhQiSRKyjahRDDwgVlitOZXMgLEijQ2nGc-AKgUqp5qn3pjHOlKjhguRKaI4NQTICeqWqxJOEWZSWiKUtkYz6kpJlcQaNKUUcsltmgzRtYOx8MODTVErnlnRQF00UA_R7Z_3FKpagB2ivge6WNe2G0WD8dnvl8_RQVjo8Tv06AXqbqsdXKJ9_bFdbKqr8K58AjrUt-M
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3LSsQwFL2ICrryjeMzguDGatvcpO1alFHHwYWKu5LHDQwyMzIPv98kU0cEQVy1KV2Uk7bn5nHOATjlSknUZZpYVVZ-gEI20VZjkD6lVHLPwDGt4aVTdLvl62v1uADncy0MEcXNZ3QRTuNavh2aaZgqu5S8ECL8cJcEYp7O1FrzGRVPbp5_xLzN8xSzGJ2WR3NvnleNt0-G4jKoeHxpcRGPMpKT6f-IWIkMc7P2v2dbh-1vqR57nJPQBizQYBPWvrIaWPPpbsFdVNomwbSSKWM82cy6ng0dM41pAHvzJWewbWa-kmXjXr-J9mJqYNn7KCzphOY2PN9cP121kyZHIVE845MErRYUAHGVM0aiEpasq_LUSixkQag16RKNzIM1j_O0hlZWVaG5lmg58R1YHAwHtAtMKOV4pY2zRqAvJnWWGjKISKWSLs9acOJhrMMAYVzPNM-ibqCuG6hbcPbnPbUe9ci1YCsAXb_PjDfqBuO93y8fw0r76aFTd2679_uwGpd9wn49PIDFyWhKh7BsPia98egovjef_ZW7Kg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+Conference+on+High+Performance+Computing+Networking%2C+Storage+and+Analysis&rft.atitle=Multi-core+acceleration+of+chemical+kinetics+for+simulation+and+prediction&rft.au=Linford%2C+John+C.&rft.au=Michalakes%2C+John&rft.au=Vachharajani%2C+Manish&rft.au=Sandu%2C+Adrian&rft.date=2009-11-14&rft.pub=ACM&rft.issn=2167-4329&rft.spage=1&rft.epage=11&rft_id=info:doi/10.1145%2F1654059.1654067&rft.externalDocID=6375562
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2167-4329&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2167-4329&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2167-4329&client=summon