Hierarchical management of extreme-scale task-based applications

Uložené v:
Podrobná bibliografia
Názov: Hierarchical management of extreme-scale task-based applications
Autori: Lordan Gomis, Francesc, Puigdemunt Schmolling, Gabriel, Vergés Boncompte, Pere, Conejero Bañón, Francisco Javier, Ejarque Artigas, Jorge, Badia Sala, Rosa Maria
Prispievatelia: Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors, Barcelona Supercomputing Center
Informácie o vydavateľovi: Springer Cham
Rok vydania: 2023
Zbierka: Universitat Politècnica de Catalunya, BarcelonaTech: UPCommons - Global access to UPC knowledge
Predmety: Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors, Electronic data processing -- Distributed processing, High performance computing, Distributed systems, Exascale, Task-based, Programming model, Workflow, Hierarchy, Runtime system, Peer-to-peer, Decentralized management, Processament distribuït de dades, Càlcul intensiu (Informàtica)
Popis: The scale and heterogeneity of exascale systems increment the complexity of programming applications exploiting them. Task-based approaches with support for nested tasks are a good-fitting model for them because of the flexibility lying in the task concept. Resembling the hierarchical organization of the hardware, this paper proposes establishing a hierarchy in the application workflow for mapping coarse-grain tasks to the broader hardware components and finer-grain tasks to the lowest levels of the resource hierarchy to benefit from lower-latency and higher-bandwidth communications and exploiting locality. Building on a proposed mechanism to encapsulate within the task the management of its finer-grain parallelism, the paper presents a hierarchical peer-to-peer engine orchestrating the execution of workflow hierarchies with fully-decentralized management. The tests conducted on the MareNostrum 4 supercomputer using a prototype implementation prove the validity of the proposal supporting the execution of up to 707,653 tasks using 2,400 cores and achieving speedups of up to 106 times faster than executions of a single workflow and centralized management. ; This work has been supported by the Spanish Government (PID2019-107255GB), by MCIN/AEI /10.13039/501100011033 (CEX2021-001148-S), by the Departament de Recerca i Universitats de la Generalitat de Catalunya to the Research Group MPiEDist (2021 SGR 00412), and by the European Commission through the Horizon Europe Research and Innovation program under Grant Agreements 101070177 (ICOS project) and 101016577 (AI-Sprint project). ; Peer Reviewed ; Postprint (author's final draft)
Druh dokumentu: conference object
Popis súboru: 14 p.; application/pdf
Jazyk: English
Relation: https://link.springer.com/chapter/10.1007/978-3-031-39698-4_8; info:eu-repo/grantAgreement/EC/HE/101070177/EU/Towards a functional continuum operating system/ICOS; info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-107255GB-C21/ES/BSC - COMPUTACION DE ALTAS PRESTACIONES VIII/; http://hdl.handle.net/2117/393224
DOI: 10.1007/978-3-031-39698-4_8
Dostupnosť: http://hdl.handle.net/2117/393224
https://doi.org/10.1007/978-3-031-39698-4_8
Rights: Open Access
Prístupové číslo: edsbas.FDAC4E24
Databáza: BASE
FullText Text:
  Availability: 0
CustomLinks:
  – Url: http://hdl.handle.net/2117/393224#
    Name: EDS - BASE (s4221598)
    Category: fullText
    Text: View record from BASE
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Gomis%20L
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edsbas
DbLabel: BASE
An: edsbas.FDAC4E24
RelevancyScore: 954
AccessLevel: 3
PubType: Conference
PubTypeId: conference
PreciseRelevancyScore: 953.929138183594
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Hierarchical management of extreme-scale task-based applications
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Lordan+Gomis%2C+Francesc%22">Lordan Gomis, Francesc</searchLink><br /><searchLink fieldCode="AR" term="%22Puigdemunt+Schmolling%2C+Gabriel%22">Puigdemunt Schmolling, Gabriel</searchLink><br /><searchLink fieldCode="AR" term="%22Vergés+Boncompte%2C+Pere%22">Vergés Boncompte, Pere</searchLink><br /><searchLink fieldCode="AR" term="%22Conejero+Bañón%2C+Francisco+Javier%22">Conejero Bañón, Francisco Javier</searchLink><br /><searchLink fieldCode="AR" term="%22Ejarque+Artigas%2C+Jorge%22">Ejarque Artigas, Jorge</searchLink><br /><searchLink fieldCode="AR" term="%22Badia+Sala%2C+Rosa+Maria%22">Badia Sala, Rosa Maria</searchLink>
– Name: Author
  Label: Contributors
  Group: Au
  Data: Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors<br />Barcelona Supercomputing Center
– Name: Publisher
  Label: Publisher Information
  Group: PubInfo
  Data: Springer Cham
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2023
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: Universitat Politècnica de Catalunya, BarcelonaTech: UPCommons - Global access to UPC knowledge
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Àrees+temàtiques+de+la+UPC%3A%3AInformàtica%3A%3AArquitectura+de+computadors%22">Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors</searchLink><br /><searchLink fieldCode="DE" term="%22Electronic+data+processing+--+Distributed+processing%22">Electronic data processing -- Distributed processing</searchLink><br /><searchLink fieldCode="DE" term="%22High+performance+computing%22">High performance computing</searchLink><br /><searchLink fieldCode="DE" term="%22Distributed+systems%22">Distributed systems</searchLink><br /><searchLink fieldCode="DE" term="%22Exascale%22">Exascale</searchLink><br /><searchLink fieldCode="DE" term="%22Task-based%22">Task-based</searchLink><br /><searchLink fieldCode="DE" term="%22Programming+model%22">Programming model</searchLink><br /><searchLink fieldCode="DE" term="%22Workflow%22">Workflow</searchLink><br /><searchLink fieldCode="DE" term="%22Hierarchy%22">Hierarchy</searchLink><br /><searchLink fieldCode="DE" term="%22Runtime+system%22">Runtime system</searchLink><br /><searchLink fieldCode="DE" term="%22Peer-to-peer%22">Peer-to-peer</searchLink><br /><searchLink fieldCode="DE" term="%22Decentralized+management%22">Decentralized management</searchLink><br /><searchLink fieldCode="DE" term="%22Processament+distribuït+de+dades%22">Processament distribuït de dades</searchLink><br /><searchLink fieldCode="DE" term="%22Càlcul+intensiu+%28Informàtica%29%22">Càlcul intensiu (Informàtica)</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: The scale and heterogeneity of exascale systems increment the complexity of programming applications exploiting them. Task-based approaches with support for nested tasks are a good-fitting model for them because of the flexibility lying in the task concept. Resembling the hierarchical organization of the hardware, this paper proposes establishing a hierarchy in the application workflow for mapping coarse-grain tasks to the broader hardware components and finer-grain tasks to the lowest levels of the resource hierarchy to benefit from lower-latency and higher-bandwidth communications and exploiting locality. Building on a proposed mechanism to encapsulate within the task the management of its finer-grain parallelism, the paper presents a hierarchical peer-to-peer engine orchestrating the execution of workflow hierarchies with fully-decentralized management. The tests conducted on the MareNostrum 4 supercomputer using a prototype implementation prove the validity of the proposal supporting the execution of up to 707,653 tasks using 2,400 cores and achieving speedups of up to 106 times faster than executions of a single workflow and centralized management. ; This work has been supported by the Spanish Government (PID2019-107255GB), by MCIN/AEI /10.13039/501100011033 (CEX2021-001148-S), by the Departament de Recerca i Universitats de la Generalitat de Catalunya to the Research Group MPiEDist (2021 SGR 00412), and by the European Commission through the Horizon Europe Research and Innovation program under Grant Agreements 101070177 (ICOS project) and 101016577 (AI-Sprint project). ; Peer Reviewed ; Postprint (author's final draft)
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: conference object
– Name: Format
  Label: File Description
  Group: SrcInfo
  Data: 14 p.; application/pdf
– Name: Language
  Label: Language
  Group: Lang
  Data: English
– Name: NoteTitleSource
  Label: Relation
  Group: SrcInfo
  Data: https://link.springer.com/chapter/10.1007/978-3-031-39698-4_8; info:eu-repo/grantAgreement/EC/HE/101070177/EU/Towards a functional continuum operating system/ICOS; info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-107255GB-C21/ES/BSC - COMPUTACION DE ALTAS PRESTACIONES VIII/; http://hdl.handle.net/2117/393224
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.1007/978-3-031-39698-4_8
– Name: URL
  Label: Availability
  Group: URL
  Data: http://hdl.handle.net/2117/393224<br />https://doi.org/10.1007/978-3-031-39698-4_8
– Name: Copyright
  Label: Rights
  Group: Cpyrght
  Data: Open Access
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsbas.FDAC4E24
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.FDAC4E24
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1007/978-3-031-39698-4_8
    Languages:
      – Text: English
    Subjects:
      – SubjectFull: Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
        Type: general
      – SubjectFull: Electronic data processing -- Distributed processing
        Type: general
      – SubjectFull: High performance computing
        Type: general
      – SubjectFull: Distributed systems
        Type: general
      – SubjectFull: Exascale
        Type: general
      – SubjectFull: Task-based
        Type: general
      – SubjectFull: Programming model
        Type: general
      – SubjectFull: Workflow
        Type: general
      – SubjectFull: Hierarchy
        Type: general
      – SubjectFull: Runtime system
        Type: general
      – SubjectFull: Peer-to-peer
        Type: general
      – SubjectFull: Decentralized management
        Type: general
      – SubjectFull: Processament distribuït de dades
        Type: general
      – SubjectFull: Càlcul intensiu (Informàtica)
        Type: general
    Titles:
      – TitleFull: Hierarchical management of extreme-scale task-based applications
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Lordan Gomis, Francesc
      – PersonEntity:
          Name:
            NameFull: Puigdemunt Schmolling, Gabriel
      – PersonEntity:
          Name:
            NameFull: Vergés Boncompte, Pere
      – PersonEntity:
          Name:
            NameFull: Conejero Bañón, Francisco Javier
      – PersonEntity:
          Name:
            NameFull: Ejarque Artigas, Jorge
      – PersonEntity:
          Name:
            NameFull: Badia Sala, Rosa Maria
      – PersonEntity:
          Name:
            NameFull: Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
      – PersonEntity:
          Name:
            NameFull: Barcelona Supercomputing Center
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 01
              Type: published
              Y: 2023
          Identifiers:
            – Type: issn-locals
              Value: edsbas
            – Type: issn-locals
              Value: edsbas.oa
ResultId 1