Realizing Joint Extreme-Scale Simulations on Multiple Supercomputers-Two Superfacility Case Studies

High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) prov...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:SC24: International Conference for High Performance Computing, Networking, Storage and Analysis s. 1 - 17
Hlavní autoři: Pollinger, Theresa, Craen, Alexander Van, Offenhauser, Philipp, Pfluger, Dirk
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 17.11.2024
Témata:
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) provides a straightforward way of performing such simulations while alleviating the curse of dimensionality. Recent work demonstrated the potential of the CT to join multiple systems simultaneously to perform a single high-dimensional simulation. This paper shows how to extend this to three or more systems and addresses some remaining challenges: load balancing on heterogeneous hardware; utilizing compression to maximize the communication bandwidth; efficient I/O management through hardware mapping; and improving memory utilization through algorithmic optimizations. Combining these contributions, we demonstrate the feasibility of the CT for extreme-scale Superfacility scenarios of 46 trillion DOF on two systems and 35 trillion DOF on three systems. Scenarios at these resolutions would be intractable with full-grid solvers (\gt1,000 nonillion DOF each).
AbstractList High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) provides a straightforward way of performing such simulations while alleviating the curse of dimensionality. Recent work demonstrated the potential of the CT to join multiple systems simultaneously to perform a single high-dimensional simulation. This paper shows how to extend this to three or more systems and addresses some remaining challenges: load balancing on heterogeneous hardware; utilizing compression to maximize the communication bandwidth; efficient I/O management through hardware mapping; and improving memory utilization through algorithmic optimizations. Combining these contributions, we demonstrate the feasibility of the CT for extreme-scale Superfacility scenarios of 46 trillion DOF on two systems and 35 trillion DOF on three systems. Scenarios at these resolutions would be intractable with full-grid solvers (\gt1,000 nonillion DOF each).
Author Offenhauser, Philipp
Pollinger, Theresa
Pfluger, Dirk
Craen, Alexander Van
Author_xml – sequence: 1
  givenname: Theresa
  orcidid: 0000-0002-0186-4340
  surname: Pollinger
  fullname: Pollinger, Theresa
  organization: Universität Stuttgart,Chair for Scientific Computing,Germany
– sequence: 2
  givenname: Alexander Van
  orcidid: 0000-0002-3336-7226
  surname: Craen
  fullname: Craen, Alexander Van
  organization: Universität Stuttgart,Chair for Scientific Computing,Germany
– sequence: 3
  givenname: Philipp
  orcidid: 0009-0001-1674-7980
  surname: Offenhauser
  fullname: Offenhauser, Philipp
  organization: Hewlett Packard Enterprise (HPE), Herrenberger Straße 140,Böblingen,Germany,71034
– sequence: 4
  givenname: Dirk
  orcidid: 0000-0002-4360-0212
  surname: Pfluger
  fullname: Pfluger, Dirk
  organization: Universität Stuttgart,Chair for Scientific Computing,Germany
BookMark eNotjN1KwzAYQCMoqHMvIF70BTq_5Eua5lLK_GMi2Hk90uyLBPpHk6Lz6VXm1YHD4Vyy037oibFrDivOwdzWleQSipUAIVcAHOQJWxptSlSAShiuz9kyxtCA0ho1Al4w90a2Dd-h_8ieh9CnbP2VJuoor51tKatDN7c2haGP2dBnL3Obwvjn55EmN3TjnGiK-fZzOCpvXWhDOmSVjb9VmveB4hU787aNtPzngr3fr7fVY755fXiq7ja5FUqmvKCy1EoKic7bxos9eKmx1NCgcUTInSi9azQUJICXhTXgBHjfOM4lKsIFuzl-AxHtxil0djrsOGiDXCr8ASPYV_Y
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SC41406.2024.00104
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Xplore
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350352917
EndPage 17
ExternalDocumentID 10793145
Genre orig-research
GroupedDBID 6IE
6IL
ACM
ALMA_UNASSIGNED_HOLDINGS
APO
CBEJK
LHSKQ
RIE
RIL
ID FETCH-LOGICAL-a254t-6e88754243cfabf2d0f473870b39cee31c28fcb706e20186a90c20ffbc11435e3
IEDL.DBID RIE
ISICitedReferencesCount 0
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001414891300027&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Thu May 29 05:57:37 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a254t-6e88754243cfabf2d0f473870b39cee31c28fcb706e20186a90c20ffbc11435e3
ORCID 0000-0002-4360-0212
0000-0002-0186-4340
0009-0001-1674-7980
0000-0002-3336-7226
PageCount 17
ParticipantIDs ieee_primary_10793145
PublicationCentury 2000
PublicationDate 2024-Nov.-17
PublicationDateYYYYMMDD 2024-11-17
PublicationDate_xml – month: 11
  year: 2024
  text: 2024-Nov.-17
  day: 17
PublicationDecade 2020
PublicationTitle SC24: International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssib057737303
Score 1.8896245
Snippet High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Analytical models
Bandwidth
combination technique
Computational modeling
coupling HPC systems
Couplings
file transfer
Hardware
High performance computing
higher-dimensional simulation
large scale
Load management
Load modeling
Memory management
multi-level methods
Optimization
plasma turbulence
Title Realizing Joint Extreme-Scale Simulations on Multiple Supercomputers-Two Superfacility Case Studies
URI https://ieeexplore.ieee.org/document/10793145
WOSCitedRecordID wos001414891300027&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELVoxcAEiCK-5YHVYMdOnMxVK4REVZEidats9yxlIKnatHz8es5pCiwMbNEtic53fmfn3j1CbuPUGuHNnMlYcoYR4pk1KmY2wKcGBCnlG7EJPRql02k2bsnqDRcGAJrmM7gLj82__Hnl1uGqDDMco0mouEM6WidbstYueGKtJUar3BFjeHaf9xUeH0IfQhRGZIsgxvZLQqVBkOHhP999RHo_XDw6_kaZY7IH5Qlxz1jfFZ9ooI9VUdZ08F6Hiz6Wo8uB5sVrq8q1olVJn9qmQZqvF7B0rY7Dik3eqq3JGxdaZD9oHzGNtq2FPfIyHEz6D6yVS2AGT3k1SwA3jFhFSjpvrI_m3CstMR-tzPAjpXBR6p3VPAFE_TQxGXcR9946EYomkKekW1YlnBEqFeY6bpPNvK_UJCkHnTkb6jHOjUjOSS94aLbYTsSY7Zxz8Yf9khyERQgcPqGvSLderuGa7LtNXayWN806fgEqZZ7v
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELWgIMEEiCLKpwdWgxPbcTJXrQq0VUWK1K2yXVvKQFK1KV-_nnOaAgsDW3RLovOd39m5dw-hGxFrFTg1I0wwSiBCHNGKC6I9fEoLIMVdJTYhh8N4MklGNVm94sJYa6vmM3vrH6t_-bPCrPxVGWQ4RFPAxTbaEZyHdE3X2oSPkJJBvLINNYYmd2mbwwHCdyKEfkh24OXYfomoVBjSPfjn2w9R84eNh0ffOHOEtmx-jMwTVHjZJxjwQ5HlJe68l_6qj6TgdIvT7KXW5VriIseDum0Qp6u5XZhayWFJxm_F2uSU8U2yH7gNqIbr5sImeu52xu0eqQUTiIJzXkkiC1uG4CFnxintwhl1XDLISM0S-EgWmDB2RksaWcD9OFIJNSF1TpvAl02WnaBGXuT2FGHGIdtho6wmfsUqiqmVidG-IqNUBVELNb2HpvP1TIzpxjlnf9iv0V5vPOhP-_fDx3O07xfEM_oCeYEa5WJlL9GueS2z5eKqWtMvnbOiNg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC24%3A+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=Realizing+Joint+Extreme-Scale+Simulations+on+Multiple+Supercomputers-Two+Superfacility+Case+Studies&rft.au=Pollinger%2C+Theresa&rft.au=Craen%2C+Alexander+Van&rft.au=Offenhauser%2C+Philipp&rft.au=Pfluger%2C+Dirk&rft.date=2024-11-17&rft.pub=IEEE&rft.spage=1&rft.epage=17&rft_id=info:doi/10.1109%2FSC41406.2024.00104&rft.externalDocID=10793145