Realizing Joint Extreme-Scale Simulations on Multiple Supercomputers-Two Superfacility Case Studies
High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) prov...
| Published in: | SC24: International Conference for High Performance Computing, Networking, Storage and Analysis pp. 1 - 17 |
|---|---|
| Main Authors: | Pollinger, Theresa; Craen, Alexander Van; Offenhauser, Philipp; Pfluger, Dirk |
| Format: | Conference Proceeding |
| Language: | English |
| Published: | IEEE, 17.11.2024 |
| Abstract | High-dimensional grid-based simulations serve as both a tool and a challenge in researching various domains. The main challenge of these approaches is the well-known curse of dimensionality, amplified by the need for fine resolutions in high-fidelity applications. The combination technique (CT) provides a straightforward way of performing such simulations while alleviating the curse of dimensionality. Recent work demonstrated the potential of the CT to join multiple systems simultaneously to perform a single high-dimensional simulation. This paper shows how to extend this to three or more systems and addresses some remaining challenges: load balancing on heterogeneous hardware; utilizing compression to maximize the communication bandwidth; efficient I/O management through hardware mapping; and improving memory utilization through algorithmic optimizations. Combining these contributions, we demonstrate the feasibility of the CT for extreme-scale Superfacility scenarios of 46 trillion DOF on two systems and 35 trillion DOF on three systems. Scenarios at these resolutions would be intractable with full-grid solvers (>1,000 nonillion DOF each). |
|---|---|
| Author | Offenhauser, Philipp; Pollinger, Theresa; Pfluger, Dirk; Craen, Alexander Van |
| Author_xml | 1. Pollinger, Theresa (ORCID 0000-0002-0186-4340), Universität Stuttgart, Chair for Scientific Computing, Germany; 2. Craen, Alexander Van (ORCID 0000-0002-3336-7226), Universität Stuttgart, Chair for Scientific Computing, Germany; 3. Offenhauser, Philipp (ORCID 0009-0001-1674-7980), Hewlett Packard Enterprise (HPE), Herrenberger Straße 140, 71034 Böblingen, Germany; 4. Pfluger, Dirk (ORCID 0000-0002-4360-0212), Universität Stuttgart, Chair for Scientific Computing, Germany |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| DOI | 10.1109/SC41406.2024.00104 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| EISBN | 9798350352917 |
| EndPage | 17 |
| ExternalDocumentID | 10793145 |
| Genre | orig-research |
| ISICitedReferencesCount | 0 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| ORCID | 0000-0002-4360-0212 0000-0002-0186-4340 0009-0001-1674-7980 0000-0002-3336-7226 |
| PageCount | 17 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-Nov.-17 |
| PublicationDateYYYYMMDD | 2024-11-17 |
| PublicationDate_xml | – month: 11 year: 2024 text: 2024-Nov.-17 day: 17 |
| PublicationDecade | 2020 |
| PublicationTitle | SC24: International Conference for High Performance Computing, Networking, Storage and Analysis |
| PublicationTitleAbbrev | SC |
| PublicationYear | 2024 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| StartPage | 1 |
| SubjectTerms | Analytical models; Bandwidth; combination technique; Computational modeling; coupling HPC systems; Couplings; file transfer; Hardware; High performance computing; higher-dimensional simulation; large scale; Load management; Load modeling; Memory management; multi-level methods; Optimization; plasma turbulence |
| Title | Realizing Joint Extreme-Scale Simulations on Multiple Supercomputers-Two Superfacility Case Studies |
| URI | https://ieeexplore.ieee.org/document/10793145 |