Towards an Optimized Heterogeneous Distributed Task Scheduler in OpenMP Cluster
This paper addresses the challenges of optimizing task scheduling for a distributed, task-based execution model in OpenMP for cluster computing environments. Traditional OpenMP implementations are primarily designed for shared-memory parallelism and offer limited control over task scheduling. Howeve...
Uloženo v:
| Vydáno v: | SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis s. 1894 - 1903 |
|---|---|
| Hlavní autoři: | , , , , , |
| Médium: | Konferenční příspěvek |
| Jazyk: | angličtina |
| Vydáno: |
IEEE
17.11.2024
|
| Témata: | |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | This paper addresses the challenges of optimizing task scheduling for a distributed, task-based execution model in OpenMP for cluster computing environments. Traditional OpenMP implementations are primarily designed for shared-memory parallelism and offer limited control over task scheduling. However, improved scheduling mechanisms are critical to achieving performance and portability in distributed and heterogeneous environments. OpenMP Cluster (OMPC) was introduced to overcome these limitations, extending OpenMP with the Heterogeneous Earliest Finish Time (HEFT) task scheduling algorithm tailored for large-scale systems. To improve scheduling and enable better system utilization, the runtime system must resolve challenges such as changes in the application balance, amount of parallelism, and varying communication latencies.This work presents three key contributions: first, the refactoring of the OMPC runtime to unify task scheduling across devices and hosts; second, the optimization of the HEFT-based scheduling algorithm to ensure efficient task execution in distributed environments; and third, an extensive evaluation of Work Stealing and HEFT scheduling mechanisms in real-world clusters. While the HEFT implementation in OMPC is not fully optimized, this work provides a significant step toward improving distributed task scheduling in cluster computing, offering insights and incremental advancements that support the development of scalable and high-performance applications. Results show improvements of up to 24% in scheduling time while opening up to more extensions in the scheduling methods. |
|---|---|
| AbstractList | This paper addresses the challenges of optimizing task scheduling for a distributed, task-based execution model in OpenMP for cluster computing environments. Traditional OpenMP implementations are primarily designed for shared-memory parallelism and offer limited control over task scheduling. However, improved scheduling mechanisms are critical to achieving performance and portability in distributed and heterogeneous environments. OpenMP Cluster (OMPC) was introduced to overcome these limitations, extending OpenMP with the Heterogeneous Earliest Finish Time (HEFT) task scheduling algorithm tailored for large-scale systems. To improve scheduling and enable better system utilization, the runtime system must resolve challenges such as changes in the application balance, amount of parallelism, and varying communication latencies.This work presents three key contributions: first, the refactoring of the OMPC runtime to unify task scheduling across devices and hosts; second, the optimization of the HEFT-based scheduling algorithm to ensure efficient task execution in distributed environments; and third, an extensive evaluation of Work Stealing and HEFT scheduling mechanisms in real-world clusters. While the HEFT implementation in OMPC is not fully optimized, this work provides a significant step toward improving distributed task scheduling in cluster computing, offering insights and incremental advancements that support the development of scalable and high-performance applications. Results show improvements of up to 24% in scheduling time while opening up to more extensions in the scheduling methods. |
| Author | Neveu, Remy Leite, Gustavo Diaz, Jose M. Monsalve Yviquel, Herve Araujo, Guido Ceccato, Rodrigo |
| Author_xml | – sequence: 1 givenname: Remy surname: Neveu fullname: Neveu, Remy organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil – sequence: 2 givenname: Rodrigo surname: Ceccato fullname: Ceccato, Rodrigo email: rodrigo.ceccato@ic.unicamp.br organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil – sequence: 3 givenname: Gustavo surname: Leite fullname: Leite, Gustavo organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil – sequence: 4 givenname: Guido surname: Araujo fullname: Araujo, Guido organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil – sequence: 5 givenname: Jose M. Monsalve surname: Diaz fullname: Diaz, Jose M. Monsalve organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil – sequence: 6 givenname: Herve surname: Yviquel fullname: Yviquel, Herve email: hyviquel@unicamp.br organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil |
| BookMark | eNotj8tKxDAYRiMoqGOfQBd5gda_ubZLqZcRRipMxeWQNH802GmHpkX06a3o6lsczgfnnBz3Q4-EXOaQ5TmU19vqVXEmIGPARAbAeHlEklKXBZfApZSCn5IkxmBBgSwEFPKM1M3waUYXqelpfZjCPnyjo2uccBzesMdhjvQ2xGkMdp4W0pj4QbftO7q5w5GGXwv7p2dadXNcpAty4k0XMfnfFXm5v2uqdbqpHx6rm01qmFRTmjNlvbRY6rZlaFtpCs80Gq25b10hvFPelsoJpQVfcM78kqCtW3KEssBX5OrvNyDi7jCGvRm_djkUDJRU_AfCa1Di |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/SCW63240.2024.00239 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 9798350355543 |
| EndPage | 1903 |
| ExternalDocumentID | 10820656 |
| Genre | orig-research |
| GrantInformation_xml | – fundername: Petrobras funderid: 10.13039/501100004225 |
| GroupedDBID | 6IE 6IL ACM ALMA_UNASSIGNED_HOLDINGS CBEJK RIE RIL |
| ID | FETCH-LOGICAL-a256t-126bf5be97cc2ebc5a8f27ea773fcd84fd6fb96d46743bc512f0357bd97946b03 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 1 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001451792300197&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Wed Aug 27 01:59:34 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-a256t-126bf5be97cc2ebc5a8f27ea773fcd84fd6fb96d46743bc512f0357bd97946b03 |
| PageCount | 10 |
| ParticipantIDs | ieee_primary_10820656 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-Nov.-17 |
| PublicationDateYYYYMMDD | 2024-11-17 |
| PublicationDate_xml | – month: 11 year: 2024 text: 2024-Nov.-17 day: 17 |
| PublicationDecade | 2020 |
| PublicationTitle | SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis |
| PublicationTitleAbbrev | SC-W |
| PublicationYear | 2024 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssib060584085 |
| Score | 1.8971257 |
| Snippet | This paper addresses the challenges of optimizing task scheduling for a distributed, task-based execution model in OpenMP for cluster computing environments.... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1894 |
| SubjectTerms | Cluster computing Distributed architectures Dynamic scheduling Iterative methods Large-scale systems Optimal scheduling Parallel processing Parallel programming Parallel systems Resource management Runtime Scheduling algorithms Scheduling and task partitioning |
| Title | Towards an Optimized Heterogeneous Distributed Task Scheduler in OpenMP Cluster |
| URI | https://ieeexplore.ieee.org/document/10820656 |
| WOSCitedRecordID | wos001451792300197&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELZoxcAEiCLe8sAaSJzEjudC1YW2UoPoVvlxlipKitqEgV-Pz02BhYHN8kOWzs-z7_s-Qm5Bu1xnTkQaIe4ZT4zfBxWPeOyKRECeZrEOYhNiNCpmMzlpweoBCwMAIfgM7jAZ_vLtyjT4VOZXOLKN57xDOkKILVhrN3nwew_ZulpmoSSW99P-C5KRx94LZMiRzVAR_JeGSjhCBof_7PyI9H7AeHTyfcwckz2oTsi4DOGuG6oqOvar_m3xCZYOMbZl5acEeH-ePiAnLspZ-ZJSbV7p1A-QbZawpgtsBdXThPaXDVIl9Mjz4LHsD6NWGyFS_pJSRwnjaGWQwhgG2uSqcEyAEiJ1xhaZs9xpyS2KiaS-OGEuTnOhrURGeR2np6RbrSo4I9Q7IVBI5SsYk1kmtREFWNBKydT5quekh9aYv2_pL-Y7Q1z8kX9JDtDgCNhLxBXp1usGrsm--agXm_VNGLQvuh-Z_g |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1JTwIxFG4UTfSkRoy7PXgdneksnZ5RgpEtYYzcSJfXhAiDAcaDv96-AurFg7emS5q8rq993_cRcgvKpiqxPFAIcU-ySLt9UGZBFto84pDGSai82ATvdvPhUPTXYHWPhQEAH3wGd5j0f_lmpit8KnMrHNnG02yb7KRJwqIVXGszffCDD_m61txCUSjuB41XpCMPnR_IkCWboSb4LxUVf4g0D_7Z_SGp_8DxaP_7oDkiW1Aek17hA14XVJa059b9dPwJhrYwumXmJgU4j54-ICsuClq5kkIu3ujADZGpJjCnY2wFZadPG5MKyRLq5KX5WDRawVodIZDumrIMIpahnUFwrRkoncrcMg6S89hqkyfWZFaJzKCcSOyKI2bDOOXKCOSUV2F8QmrlrIRTQp0bArmQroLWiWFCaZ6DASWliK2rekbqaI3R-4oAY7QxxPkf-Tdkr1V02qP2U_f5guyj8RG-F_FLUlvOK7giu_pjOV7Mr_0AfgHrWJ1F |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC24-W%3A+Workshops+of+the+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=Towards+an+Optimized+Heterogeneous+Distributed+Task+Scheduler+in+OpenMP+Cluster&rft.au=Neveu%2C+Remy&rft.au=Ceccato%2C+Rodrigo&rft.au=Leite%2C+Gustavo&rft.au=Araujo%2C+Guido&rft.date=2024-11-17&rft.pub=IEEE&rft.spage=1894&rft.epage=1903&rft_id=info:doi/10.1109%2FSCW63240.2024.00239&rft.externalDocID=10820656 |