General Purpose Task-Dependence Management Hardware for Task-Based Dataflow Programming Models
Task-based programming models such as OpenMP, IntelTBB and OmpSs offer the possibility of expressing dependences among tasks to drive their execution at runtime. Managing these dependences introduces noticeable overheads when targeting fine-grained tasks, diminishing the potential speedups or even i...
Gespeichert in:
| Veröffentlicht in: | Proceedings - IEEE International Parallel and Distributed Processing Symposium S. 244 - 253 |
|---|---|
| Hauptverfasser: | , , , , , , |
| Format: | Tagungsbericht |
| Sprache: | Englisch |
| Veröffentlicht: |
IEEE
01.05.2017
|
| Schlagworte: | |
| ISSN: | 1530-2075 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Abstract | Task-based programming models such as OpenMP, IntelTBB and OmpSs offer the possibility of expressing dependences among tasks to drive their execution at runtime. Managing these dependences introduces noticeable overheads when targeting fine-grained tasks, diminishing the potential speedups or even introducing performance losses. To overcome this drawback, we present a general purpose hardware accelerator, Picos++, to manage the inter-task dependences efficiently in both time and energy. Our design also includes a novel nested task support. To this end, a new hardware/software co-design is presented to overcome the fact that nested tasks with dependences could result in system deadlocks due to the limited amount of resources in hardware task dependence managers. In this paper we describe a detailed implementation of this design and evaluate a parallel task-based programming model using Picos++ in a Linux embedded system with two ARM Cortex-A9 and a FPGA. The scalability and energy consumption of the real system implemented have been studied and compared against a software runtime. Even in a system limited to 2 threads, using Picos++ results in more than 1.8x speedup and 40% of energy savings in the most demanding parallelizations of real benchmarks. As a matter of fact, a hardware task dependence manager should be able to achieve much higher speedup and provide more energy savings with more threads. |
|---|---|
| AbstractList | Task-based programming models such as OpenMP, IntelTBB and OmpSs offer the possibility of expressing dependences among tasks to drive their execution at runtime. Managing these dependences introduces noticeable overheads when targeting fine-grained tasks, diminishing the potential speedups or even introducing performance losses. To overcome this drawback, we present a general purpose hardware accelerator, Picos++, to manage the inter-task dependences efficiently in both time and energy. Our design also includes a novel nested task support. To this end, a new hardware/software co-design is presented to overcome the fact that nested tasks with dependences could result in system deadlocks due to the limited amount of resources in hardware task dependence managers. In this paper we describe a detailed implementation of this design and evaluate a parallel task-based programming model using Picos++ in a Linux embedded system with two ARM Cortex-A9 and a FPGA. The scalability and energy consumption of the real system implemented have been studied and compared against a software runtime. Even in a system limited to 2 threads, using Picos++ results in more than 1.8x speedup and 40% of energy savings in the most demanding parallelizations of real benchmarks. As a matter of fact, a hardware task dependence manager should be able to achieve much higher speedup and provide more energy savings with more threads. |
| Author | Vidal, Miquel Bosch, Jaume Xubin Tan Ayguade, Eduard Alvarez, Carlos Jimenez-Gonzalez, Daniel Valero, Mateo |
| Author_xml | – sequence: 1 surname: Xubin Tan fullname: Xubin Tan email: xubin.tan@bsc.es organization: Barcelona Supercomput. Center, Univ. Politεcnica de Catalunya, Barcelona, Spain – sequence: 2 givenname: Jaume surname: Bosch fullname: Bosch, Jaume email: jbosch@bsc.es organization: Barcelona Supercomput. Center, Univ. Politεcnica de Catalunya, Barcelona, Spain – sequence: 3 givenname: Miquel surname: Vidal fullname: Vidal, Miquel email: mvidal@bsc.es organization: Barcelona Supercomput. Center, Univ. Politεcnica de Catalunya, Barcelona, Spain – sequence: 4 givenname: Carlos surname: Alvarez fullname: Alvarez, Carlos email: eduard@bsc.es organization: Barcelona Supercomput. Center, Univ. Politεcnica de Catalunya, Barcelona, Spain – sequence: 5 givenname: Daniel surname: Jimenez-Gonzalez fullname: Jimenez-Gonzalez, Daniel email: djimenez@ac.upc.edu organization: Barcelona Supercomput. Center, Univ. Politεcnica de Catalunya, Barcelona, Spain – sequence: 6 givenname: Eduard surname: Ayguade fullname: Ayguade, Eduard organization: Barcelona Supercomput. Center, Univ. Politεcnica de Catalunya, Barcelona, Spain – sequence: 7 givenname: Mateo surname: Valero fullname: Valero, Mateo email: mateo.valero@bsc.es organization: Barcelona Supercomput. Center, Univ. Politεcnica de Catalunya, Barcelona, Spain |
| BookMark | eNotj8FOAjEURavRRECWrtz0Bwb7Ou20XSookECcRNaSx_QNQWc6pB1D_HtJcHVzk3NPcofsJnSBGHsAMQEQ7mlZzsqPiRRgJspesbEzFnRui9yBKq7Z4FxEJoXRd2yY0pcQUuTKDdjnnAJFbHj5E49dIr7B9J3N6EjBU6iIrzHgnloKPV9g9CeMxOsuXrgXTOT5DHusm-7Ey9jtI7btIez5uvPUpHt2W2OTaPyfI7Z5e91MF9nqfb6cPq-ySoLpM-WNqnIrrawdeKkVkCChKwHF-ciuxt2uqr1V4LSXlSu8Pc9Ik5amACHzEXu8aA9EtD3GQ4vxd2tcYQBU_gciaFRK |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/IPDPS.2017.48 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library (IEL) (UW System Shared) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISBN | 9781538639146 1538639149 |
| EISSN | 1530-2075 |
| EndPage | 253 |
| ExternalDocumentID | 7967114 |
| Genre | orig-research |
| GroupedDBID | 29O 6IE 6IF 6IH 6IK 6IL 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI OCL RIE RIL |
| ID | FETCH-LOGICAL-c217t-4d74c38282f91d2541e0e05c016978bfabbcfd84195d2c96d8217e5e52761023 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 9 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000427044800026&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Wed Aug 27 02:19:53 EDT 2025 |
| IsDoiOpenAccess | false |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c217t-4d74c38282f91d2541e0e05c016978bfabbcfd84195d2c96d8217e5e52761023 |
| OpenAccessLink | https://recercat.cat/handle/2072/291863 |
| PageCount | 10 |
| ParticipantIDs | ieee_primary_7967114 |
| PublicationCentury | 2000 |
| PublicationDate | 2017-May |
| PublicationDateYYYYMMDD | 2017-05-01 |
| PublicationDate_xml | – month: 05 year: 2017 text: 2017-May |
| PublicationDecade | 2010 |
| PublicationTitle | Proceedings - IEEE International Parallel and Distributed Processing Symposium |
| PublicationTitleAbbrev | IPDPS |
| PublicationYear | 2017 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0020349 ssib030101683 |
| Score | 1.6920213 |
| Snippet | Task-based programming models such as OpenMP, IntelTBB and OmpSs offer the possibility of expressing dependences among tasks to drive their execution at... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 244 |
| SubjectTerms | Discrete cosine transforms Energy saving Fine- grained task parallelism Hardware Instruction sets Nested tasks Parallel processing Picos++ Hardware Accelerator Programming Runtime Task-based dataflow programming models |
| Title | General Purpose Task-Dependence Management Hardware for Task-Based Dataflow Programming Models |
| URI | https://ieeexplore.ieee.org/document/7967114 |
| WOSCitedRecordID | wos000427044800026&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELVKxcBUoEV8ywMjbhM3tuMVSgUSqiLRoROV4w8JAQ1KWvr38Tlpy8DCFkVJHJ2TXJ7v3XsI3UjHWW4jR1SU5CRhwhFJPVjhw8QoqsDuNujMPovJJJ3NZNZCt9teGGttIJ_ZPmyGWr4p9AqWygZCchGDa_WeELzu1do8O8OglZZuv8IUdFd2mpqDp2yUvQCTS_TB6ueXk0pIJOPO_27hEPV2HXk42-aaI9Syi2PU2Vgy4OYN7aLXRkYaZz6ARWXxVFXvZNQ43fqL7OguGIr2a1Va7P9b6-PufEozeKTA0rdYw4DA3fr0I2LwTPuoemg6fpjeP5LGQoFojzWWJDEi0UOPqqiTsfFgMLaRjZgGDRaR5k7luXYmTWLJDNWSm9SfZpllVHAQdThB7UWxsKcIO-FopJw21Cc0wzwwzCkzMqWhspfwM9SFaM2_apGMeROo8793X6ADmIuaOXiJ2styZa_Qvv5evlXldZjZH-gko24 |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PT8IwFG4ImugJFYy_7cGjg62063pVJBCRLHEHTpKuPxIjMsNA_n37tgEevHhblm1dXre9fX3f-z6E7oQNWWp860mfph5l3HqCOLASdqmWRILdbaEzO-LjcTSZiLiG7re9MMaYgnxm2rBZ1PJ1plawVNbhIuQBuFbvMUqJX3ZrbZ6ebqGWFm2_wwSUV3aqmp1h3ItfgcvF22D288tLpUgl_cb_buIItXY9eTjeZptjVDPzE9TYmDLg6h1tordKSBrHLoRZbnAi8w-vV3nduovsCC8YyvZruTDY_bmWxz24pKZxT4Kpb7aGAYG99elGxOCaNstbKOk_JY8DrzJR8JRDG0uPak5V1-EqYkWgHRwMjG98pkCFhUeplWmqrI5oIJgmSoQ6cqcZZhjhIcg6nKL6PJubM4Qtt8SXVmniUppmDhqmhGkRkaK2R8Nz1IRoTb9KmYxpFaiLv3ffooNB8jKajobj50t0CPNS8givUH25WJlrtK--l-_54qaY5R-Xhaa1 |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+-+IEEE+International+Parallel+and+Distributed+Processing+Symposium&rft.atitle=General+Purpose+Task-Dependence+Management+Hardware+for+Task-Based+Dataflow+Programming+Models&rft.au=Xubin+Tan&rft.au=Bosch%2C+Jaume&rft.au=Vidal%2C+Miquel&rft.au=Alvarez%2C+Carlos&rft.date=2017-05-01&rft.pub=IEEE&rft.eissn=1530-2075&rft.spage=244&rft.epage=253&rft_id=info:doi/10.1109%2FIPDPS.2017.48&rft.externalDocID=7967114 |