Realizing Joint Extreme-Scale Simulations on Multiple Supercomputers - Two Superfacility Case Studies

High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) prov...

Bibliographic details
Published in: SC24: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1-17
Main authors: Pollinger, Theresa; Craen, Alexander Van; Offenhauser, Philipp; Pfluger, Dirk
Format: Conference paper
Language: English
Publication details: IEEE, 17 November 2024
Abstract High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) provides a straightforward way of performing such simulations while alleviating the curse of dimensionality. Recent work demonstrated the potential of the CT to join multiple systems simultaneously to perform a single high-dimensional simulation. This paper shows how to extend this to three or more systems and addresses some remaining challenges: load balancing on heterogeneous hardware; utilizing compression to maximize the communication bandwidth; efficient I/O management through hardware mapping; and improving memory utilization through algorithmic optimizations. Combining these contributions, we demonstrate the feasibility of the CT for extreme-scale Superfacility scenarios of 46 trillion DOF on two systems and 35 trillion DOF on three systems. Scenarios at these resolutions would be intractable with full-grid solvers (>1,000 nonillion DOF each).
Author Offenhauser, Philipp
Pollinger, Theresa
Pfluger, Dirk
Craen, Alexander Van
Author_xml – sequence: 1
  givenname: Theresa
  orcidid: 0000-0002-0186-4340
  surname: Pollinger
  fullname: Pollinger, Theresa
  organization: Universität Stuttgart,Chair for Scientific Computing,Germany
– sequence: 2
  givenname: Alexander Van
  orcidid: 0000-0002-3336-7226
  surname: Craen
  fullname: Craen, Alexander Van
  organization: Universität Stuttgart,Chair for Scientific Computing,Germany
– sequence: 3
  givenname: Philipp
  orcidid: 0009-0001-1674-7980
  surname: Offenhauser
  fullname: Offenhauser, Philipp
  organization: Hewlett Packard Enterprise (HPE), Herrenberger Straße 140,Böblingen,Germany,71034
– sequence: 4
  givenname: Dirk
  orcidid: 0000-0002-4360-0212
  surname: Pfluger
  fullname: Pfluger, Dirk
  organization: Universität Stuttgart,Chair for Scientific Computing,Germany
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SC41406.2024.00104
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350352917
EndPage 17
ExternalDocumentID 10793145
Genre orig-research
GroupedDBID 6IE
6IL
ACM
ALMA_UNASSIGNED_HOLDINGS
APO
CBEJK
LHSKQ
RIE
RIL
ID FETCH-LOGICAL-a254t-6e88754243cfabf2d0f473870b39cee31c28fcb706e20186a90c20ffbc11435e3
IEDL.DBID RIE
ISICitedReferencesCount 0
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001414891300027&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Thu May 29 05:57:37 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a254t-6e88754243cfabf2d0f473870b39cee31c28fcb706e20186a90c20ffbc11435e3
ORCID 0000-0002-4360-0212
0000-0002-0186-4340
0009-0001-1674-7980
0000-0002-3336-7226
PageCount 17
ParticipantIDs ieee_primary_10793145
PublicationCentury 2000
PublicationDate 2024-Nov.-17
PublicationDateYYYYMMDD 2024-11-17
PublicationDate_xml – month: 11
  year: 2024
  text: 2024-Nov.-17
  day: 17
PublicationDecade 2020
PublicationTitle SC24: International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssib057737303
Score 1.8896245
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Analytical models
Bandwidth
combination technique
Computational modeling
coupling HPC systems
Couplings
file transfer
Hardware
High performance computing
higher-dimensional simulation
large scale
Load management
Load modeling
Memory management
multi-level methods
Optimization
plasma turbulence
Title Realizing Joint Extreme-Scale Simulations on Multiple Supercomputers-Two Superfacility Case Studies
URI https://ieeexplore.ieee.org/document/10793145
WOSCitedRecordID wos001414891300027&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
linkProvider IEEE