Realizing Joint Extreme-Scale Simulations on Multiple Supercomputers-Two Superfacility Case Studies

High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) prov...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SC24: International Conference for High Performance Computing, Networking, Storage and Analysis S. 1 - 17
Hauptverfasser: Pollinger, Theresa, Craen, Alexander Van, Offenhauser, Philipp, Pfluger, Dirk
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: IEEE 17.11.2024
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) provides a straightforward way of performing such simulations while alleviating the curse of dimensionality. Recent work demonstrated the potential of the CT to join multiple systems simultaneously to perform a single high-dimensional simulation. This paper shows how to extend this to three or more systems and addresses some remaining challenges: load balancing on heterogeneous hardware; utilizing compression to maximize the communication bandwidth; efficient I/O management through hardware mapping; and improving memory utilization through algorithmic optimizations. Combining these contributions, we demonstrate the feasibility of the CT for extreme-scale Superfacility scenarios of 46 trillion DOF on two systems and 35 trillion DOF on three systems. Scenarios at these resolutions would be intractable with full-grid solvers (\gt1,000 nonillion DOF each).
AbstractList High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) provides a straightforward way of performing such simulations while alleviating the curse of dimensionality. Recent work demonstrated the potential of the CT to join multiple systems simultaneously to perform a single high-dimensional simulation. This paper shows how to extend this to three or more systems and addresses some remaining challenges: load balancing on heterogeneous hardware; utilizing compression to maximize the communication bandwidth; efficient I/O management through hardware mapping; and improving memory utilization through algorithmic optimizations. Combining these contributions, we demonstrate the feasibility of the CT for extreme-scale Superfacility scenarios of 46 trillion DOF on two systems and 35 trillion DOF on three systems. Scenarios at these resolutions would be intractable with full-grid solvers (\gt1,000 nonillion DOF each).
Author Offenhauser, Philipp
Pollinger, Theresa
Pfluger, Dirk
Craen, Alexander Van
Author_xml – sequence: 1
  givenname: Theresa
  orcidid: 0000-0002-0186-4340
  surname: Pollinger
  fullname: Pollinger, Theresa
  organization: Universität Stuttgart,Chair for Scientific Computing,Germany
– sequence: 2
  givenname: Alexander Van
  orcidid: 0000-0002-3336-7226
  surname: Craen
  fullname: Craen, Alexander Van
  organization: Universität Stuttgart,Chair for Scientific Computing,Germany
– sequence: 3
  givenname: Philipp
  orcidid: 0009-0001-1674-7980
  surname: Offenhauser
  fullname: Offenhauser, Philipp
  organization: Hewlett Packard Enterprise (HPE), Herrenberger Straße 140,Böblingen,Germany,71034
– sequence: 4
  givenname: Dirk
  orcidid: 0000-0002-4360-0212
  surname: Pfluger
  fullname: Pfluger, Dirk
  organization: Universität Stuttgart,Chair for Scientific Computing,Germany
BookMark eNotjN1KwzAYQCMoqHMvIF70BTq_5Eua5lLK_GMi2Hk90uyLBPpHk6Lz6VXm1YHD4Vyy037oibFrDivOwdzWleQSipUAIVcAHOQJWxptSlSAShiuz9kyxtCA0ho1Al4w90a2Dd-h_8ieh9CnbP2VJuoor51tKatDN7c2haGP2dBnL3Obwvjn55EmN3TjnGiK-fZzOCpvXWhDOmSVjb9VmveB4hU787aNtPzngr3fr7fVY755fXiq7ja5FUqmvKCy1EoKic7bxos9eKmx1NCgcUTInSi9azQUJICXhTXgBHjfOM4lKsIFuzl-AxHtxil0djrsOGiDXCr8ASPYV_Y
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SC41406.2024.00104
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library (IEL) (UW System Shared)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350352917
EndPage 17
ExternalDocumentID 10793145
Genre orig-research
GroupedDBID 6IE
6IL
ACM
ALMA_UNASSIGNED_HOLDINGS
APO
CBEJK
LHSKQ
RIE
RIL
ID FETCH-LOGICAL-a254t-6e88754243cfabf2d0f473870b39cee31c28fcb706e20186a90c20ffbc11435e3
IEDL.DBID RIE
ISICitedReferencesCount 0
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001414891300027&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Thu May 29 05:57:37 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a254t-6e88754243cfabf2d0f473870b39cee31c28fcb706e20186a90c20ffbc11435e3
ORCID 0000-0002-4360-0212
0000-0002-0186-4340
0009-0001-1674-7980
0000-0002-3336-7226
PageCount 17
ParticipantIDs ieee_primary_10793145
PublicationCentury 2000
PublicationDate 2024-Nov.-17
PublicationDateYYYYMMDD 2024-11-17
PublicationDate_xml – month: 11
  year: 2024
  text: 2024-Nov.-17
  day: 17
PublicationDecade 2020
PublicationTitle SC24: International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssib057737303
Score 1.8896245
Snippet High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Analytical models
Bandwidth
combination technique
Computational modeling
coupling HPC systems
Couplings
file transfer
Hardware
High performance computing
higher-dimensional simulation
large scale
Load management
Load modeling
Memory management
multi-level methods
Optimization
plasma turbulence
Title Realizing Joint Extreme-Scale Simulations on Multiple Supercomputers-Two Superfacility Case Studies
URI https://ieeexplore.ieee.org/document/10793145
WOSCitedRecordID wos001414891300027&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELagYmACRBFveWAN-JHYyVy1QkhUFSlSt8p2zlIGkqpNef16zmkKLAxs0Q1JdI-c7XzffYTcaBmGnFkW8cKzKLYGsOYwINaLRBYsYSBcKzahx-N0NssmHVm95cIAQAs-g9tw2f7LL2q3DkdlWOGYTTxOdsmu1mpD1tomT6K1xGyVW2IMy-7yQYzbh4BDEGFENg9ibL8kVNoOMjr457MPSf-Hi0cn313miOxAdUzcE67vyk800Ie6rBo6fG_CQV-Uo8uB5uVLp8q1onVFHzvQIM3XC1i6TsdhFU3f6o3JGxcgsh90gD2NdtDCPnkeDaeD-6iTS4gM7vKaSAF-MJJYxNJ5g74umI-1xHq0MsOXlNyJ1DurmQLs-qkyGXOCeW8dD4smkCekV9UVnBIaO6EMcGWVCgPObJpak2mO99SQFr44I_3gofliMxFjvnXO-R_2C7IfghA4fFxfkl6zXMMV2XOvTblaXrdx_AK4GqB-
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELagIMEEiCLeeGAN2LETJ3PVqkBbVaRIbFXsnKUMJFWb8vr1nNMUWBjYohuS6B452_m--wi5VsINOdPM45llntQpYM1hQLT1A5GxgIFvarEJNRpFz8_xuCGr11wYAKjBZ3DjLut_-Vlplu6oDCscs4nLYJNsBVL6bEXXWqdPoJTAfBVragyLb5OOxA2EQyL4bkg2d3Jsv0RU6h7S2_vn0_dJ-4eNR8fffeaAbEBxSMwjrvDyTzTQ-zIvKtp9r9xRn5eg04Em-Uujy7WgZUGHDWyQJssZzE2j5LDwJm_lymRT40CyH7SDXY024MI2eep1J52-1wgmeCnu8yovBPxkBNKXwtgUvZ0xK5XAitQixpcU3PiRNVqxELDvR2EaM-Mza7XhbtkE4oi0irKAY0Kl8cMUeKjD0I0401Gk01hxvKeCKLPZCWk7D01nq5kY07VzTv-wX5Gd_mQ4mA7uRg9nZNcFxDH6uDonrWq-hAuybV6rfDG_rGP6BT4po8U
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC24%3A+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=Realizing+Joint+Extreme-Scale+Simulations+on+Multiple+Supercomputers-Two+Superfacility+Case+Studies&rft.au=Pollinger%2C+Theresa&rft.au=Craen%2C+Alexander+Van&rft.au=Offenhauser%2C+Philipp&rft.au=Pfluger%2C+Dirk&rft.date=2024-11-17&rft.pub=IEEE&rft.spage=1&rft.epage=17&rft_id=info:doi/10.1109%2FSC41406.2024.00104&rft.externalDocID=10793145