Towards an Optimized Heterogeneous Distributed Task Scheduler in OpenMP Cluster

This paper addresses the challenges of optimizing task scheduling for a distributed, task-based execution model in OpenMP for cluster computing environments. Traditional OpenMP implementations are primarily designed for shared-memory parallelism and offer limited control over task scheduling. Howeve...

Full description

Saved in:
Bibliographic Details
Published in:SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis pp. 1894 - 1903
Main Authors: Neveu, Remy, Ceccato, Rodrigo, Leite, Gustavo, Araujo, Guido, Diaz, Jose M. Monsalve, Yviquel, Herve
Format: Conference Proceeding
Language:English
Published: IEEE 17.11.2024
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract This paper addresses the challenges of optimizing task scheduling for a distributed, task-based execution model in OpenMP for cluster computing environments. Traditional OpenMP implementations are primarily designed for shared-memory parallelism and offer limited control over task scheduling. However, improved scheduling mechanisms are critical to achieving performance and portability in distributed and heterogeneous environments. OpenMP Cluster (OMPC) was introduced to overcome these limitations, extending OpenMP with the Heterogeneous Earliest Finish Time (HEFT) task scheduling algorithm tailored for large-scale systems. To improve scheduling and enable better system utilization, the runtime system must resolve challenges such as changes in the application balance, amount of parallelism, and varying communication latencies.This work presents three key contributions: first, the refactoring of the OMPC runtime to unify task scheduling across devices and hosts; second, the optimization of the HEFT-based scheduling algorithm to ensure efficient task execution in distributed environments; and third, an extensive evaluation of Work Stealing and HEFT scheduling mechanisms in real-world clusters. While the HEFT implementation in OMPC is not fully optimized, this work provides a significant step toward improving distributed task scheduling in cluster computing, offering insights and incremental advancements that support the development of scalable and high-performance applications. Results show improvements of up to 24% in scheduling time while opening up to more extensions in the scheduling methods.
AbstractList This paper addresses the challenges of optimizing task scheduling for a distributed, task-based execution model in OpenMP for cluster computing environments. Traditional OpenMP implementations are primarily designed for shared-memory parallelism and offer limited control over task scheduling. However, improved scheduling mechanisms are critical to achieving performance and portability in distributed and heterogeneous environments. OpenMP Cluster (OMPC) was introduced to overcome these limitations, extending OpenMP with the Heterogeneous Earliest Finish Time (HEFT) task scheduling algorithm tailored for large-scale systems. To improve scheduling and enable better system utilization, the runtime system must resolve challenges such as changes in the application balance, amount of parallelism, and varying communication latencies.This work presents three key contributions: first, the refactoring of the OMPC runtime to unify task scheduling across devices and hosts; second, the optimization of the HEFT-based scheduling algorithm to ensure efficient task execution in distributed environments; and third, an extensive evaluation of Work Stealing and HEFT scheduling mechanisms in real-world clusters. While the HEFT implementation in OMPC is not fully optimized, this work provides a significant step toward improving distributed task scheduling in cluster computing, offering insights and incremental advancements that support the development of scalable and high-performance applications. Results show improvements of up to 24% in scheduling time while opening up to more extensions in the scheduling methods.
Author Neveu, Remy
Leite, Gustavo
Diaz, Jose M. Monsalve
Yviquel, Herve
Araujo, Guido
Ceccato, Rodrigo
Author_xml – sequence: 1
  givenname: Remy
  surname: Neveu
  fullname: Neveu, Remy
  organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil
– sequence: 2
  givenname: Rodrigo
  surname: Ceccato
  fullname: Ceccato, Rodrigo
  email: rodrigo.ceccato@ic.unicamp.br
  organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil
– sequence: 3
  givenname: Gustavo
  surname: Leite
  fullname: Leite, Gustavo
  organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil
– sequence: 4
  givenname: Guido
  surname: Araujo
  fullname: Araujo, Guido
  organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil
– sequence: 5
  givenname: Jose M. Monsalve
  surname: Diaz
  fullname: Diaz, Jose M. Monsalve
  organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil
– sequence: 6
  givenname: Herve
  surname: Yviquel
  fullname: Yviquel, Herve
  email: hyviquel@unicamp.br
  organization: Universidade Estadual de Campinas (UNICAMP),Instituto de Computação,Campinas,Brazil
BookMark eNotj8tKxDAYRiMoqGOfQBd5gda_ubZLqZcRRipMxeWQNH802GmHpkX06a3o6lsczgfnnBz3Q4-EXOaQ5TmU19vqVXEmIGPARAbAeHlEklKXBZfApZSCn5IkxmBBgSwEFPKM1M3waUYXqelpfZjCPnyjo2uccBzesMdhjvQ2xGkMdp4W0pj4QbftO7q5w5GGXwv7p2dadXNcpAty4k0XMfnfFXm5v2uqdbqpHx6rm01qmFRTmjNlvbRY6rZlaFtpCs80Gq25b10hvFPelsoJpQVfcM78kqCtW3KEssBX5OrvNyDi7jCGvRm_djkUDJRU_AfCa1Di
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SCW63240.2024.00239
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350355543
EndPage 1903
ExternalDocumentID 10820656
Genre orig-research
GrantInformation_xml – fundername: Petrobras
  funderid: 10.13039/501100004225
GroupedDBID 6IE
6IL
ACM
ALMA_UNASSIGNED_HOLDINGS
CBEJK
RIE
RIL
ID FETCH-LOGICAL-a256t-126bf5be97cc2ebc5a8f27ea773fcd84fd6fb96d46743bc512f0357bd97946b03
IEDL.DBID RIE
ISICitedReferencesCount 1
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001451792300197&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 01:59:34 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a256t-126bf5be97cc2ebc5a8f27ea773fcd84fd6fb96d46743bc512f0357bd97946b03
PageCount 10
ParticipantIDs ieee_primary_10820656
PublicationCentury 2000
PublicationDate 2024-Nov.-17
PublicationDateYYYYMMDD 2024-11-17
PublicationDate_xml – month: 11
  year: 2024
  text: 2024-Nov.-17
  day: 17
PublicationDecade 2020
PublicationTitle SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC-W
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssib060584085
Score 1.8971257
Snippet This paper addresses the challenges of optimizing task scheduling for a distributed, task-based execution model in OpenMP for cluster computing environments....
SourceID ieee
SourceType Publisher
StartPage 1894
SubjectTerms Cluster computing
Distributed architectures
Dynamic scheduling
Iterative methods
Large-scale systems
Optimal scheduling
Parallel processing
Parallel programming
Parallel systems
Resource management
Runtime
Scheduling algorithms
Scheduling and task partitioning
Title Towards an Optimized Heterogeneous Distributed Task Scheduler in OpenMP Cluster
URI https://ieeexplore.ieee.org/document/10820656
WOSCitedRecordID wos001451792300197&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ3PT8IwFMcbIR48qRGjoqYHr9W13db1jBIOCiRg5Eb64zUh4jDAPPjX2zdAvXjwtqxZlrz25a3r-36-hNyYwnhnfMqcCYKlXlhmY5VhzmWKp04FntRC4UfV7xeTiR5uxeq1FgYA6uYzuMXL-izfL1yFv8pihiNtPMsbpKFUvhFr7RYPHu8hrWtLFuKJvht1XhBGnsRdoEBGtkBH8F8eKnUJ6R7-8-VHpPUjxqPD7zJzTPagPCGDcd3uuqKmpIOY9W-zT_C0h70ti7gkIO7n6T0ycdHOKo6MzeqVjuIE-WoOSzrDp6B8GtLOvEJUQos8dx_GnR7beiMwEz9S1oyL3IbMglbOCbAuM0UQCoxSMjhfpMHnwerco5mIjMNchERmynqNRHmbyFPSLBclnBEqJTe8AKu98jGjjdVJblSRWa2tdEV6TloYjen7Bn8x3QXi4o_7bXKAAUfBHleXpLleVnBF9t3HerZaXteT9gVdiJoI
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ27T8MwEMYtKEgwAaKINx5YA7HzcDwXqiL6khpEt8qPs1TRpqhtGPjr8aUtsDCwRbGiSGefLo7v-32E3KpMWaNsHBjleBBbrgPtq0xgTCJYbIRjYSUUbotuNxsOZX8tVq-0MABQNZ_BHV5WZ_l2Zkr8VeYzHGnjSbpNdpI45uFKrrVZPnjAh7yuNVuIhfJ-0HhFHHno94EcKdkcPcF_uahURaR58M_XH5L6jxyP9r8LzRHZguKY9PKq4XVBVUF7Pu-n40-wtIXdLTO_KMDv6OkDUnHR0MqP5GrxRgd-imw5gTkd41NQdPq0MSkRllAnL83HvNEK1u4IgfKfKcuA8VS7RIMUxnDQJlGZ4wKUEJEzNoudTZ2WqUU7kcgPM-7CKBHaSmTK6zA6IbViVsApoVHEFMtASyusz2mlZZgqkSVaSh2ZLD4jdYzG6H0FwBhtAnH-x_0bstfKO-1R-6n7fEH2Mfgo32PiktSW8xKuyK75WI4X8-tqAr8AkSOdTw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC24-W%3A+Workshops+of+the+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=Towards+an+Optimized+Heterogeneous+Distributed+Task+Scheduler+in+OpenMP+Cluster&rft.au=Neveu%2C+Remy&rft.au=Ceccato%2C+Rodrigo&rft.au=Leite%2C+Gustavo&rft.au=Araujo%2C+Guido&rft.date=2024-11-17&rft.pub=IEEE&rft.spage=1894&rft.epage=1903&rft_id=info:doi/10.1109%2FSCW63240.2024.00239&rft.externalDocID=10820656