On the adequacy of lightweight thread approaches for high-level parallel programming models
High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system,...
Saved in:
| Published in: | Future generation computer systems Vol. 84; no. C; pp. 22 - 31 |
|---|---|
| Main Authors: | , , , , , |
| Format: | Journal Article Publication |
| Language: | English |
| Published: |
United States
Elsevier B.V
01.07.2018
Elsevier |
| Subjects: | |
| ISSN: | 0167-739X, 1872-7115 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system, improving coding productivity. The implementations of OpenMP usually rely on POSIX threads (pthreads), offering excellent performance for coarse-grained parallelism and a perfect match with the current hardware. OmpSs is a task oriented PM based on an ad hoc runtime solution called Nanos++; it is the precursor of the tasking parallelism in the OpenMP tasking specification. A recent trend in runtimes and applications points to leveraging massive on-node parallelism in conjunction with fine-grained and dynamic scheduling paradigms. In this paper we analyze the behavior of the OpenMP and OmpSs PMs on top of the recently emerged Generic Lightweight Threads (GLT) API. GLT exposes a common API for lightweight thread (LWT) libraries that offers the possibility of running the same application over different native LWT solutions. We describe the design details of those high-level PMs implemented on top of GLT and analyze different scenarios in order to assess where the use of LWTs may benefit application performance. Our work reveals those scenarios where LWTs overperform pthread-based solutions and compares the performance between an ad hoc solution and a generic implementation.
•Design and implementation of OpenMP and OmpSs on top of lightweight threads.•Analysis of the relationship between programming models and lightweight threads.•Performance evaluation in different OpenMP and OmpSs scenarios. |
|---|---|
| AbstractList | High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system, improving coding productivity. The implementations of OpenMP usually rely on POSIX threads (pthreads), offering excellent performance for coarse-grained parallelism and a perfect match with the current hardware. OmpSs is a task oriented PM based on an ad hoc runtime solution called Nanos++; it is the precursor of the tasking parallelism in the OpenMP tasking specification. A recent trend in runtimes and applications points to leveraging massive on-node parallelism in conjunction with fine-grained and dynamic scheduling paradigms. In this paper we analyze the behavior of the OpenMP and OmpSs PMs on top of the recently emerged Generic Lightweight Threads (GLT) API. GLT exposes a common API for lightweight thread (LWT) libraries that offers the possibility of running the same application over different native LWT solutions. We describe the design details of those high-level PMs implemented on top of GLT and analyze different scenarios in order to assess where the use of LWTs may benefit application performance. Our work reveals those scenarios where LWTs overperform pthread-based solutions and compares the performance between an ad hoc solution and a generic implementation.
•Design and implementation of OpenMP and OmpSs on top of lightweight threads.•Analysis of the relationship between programming models and lightweight threads.•Performance evaluation in different OpenMP and OmpSs scenarios. High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system, improving coding productivity. The implementations of OpenMP usually rely on POSIX threads (pthreads), offering excellent performance for coarse-grained parallelism and a perfect match with the current hardware. OmpSs is a task oriented PM based on an ad hoc runtime solution called Nanos++; it is the precursor of the tasking parallelism in the OpenMP tasking specification. A recent trend in runtimes and applications points to leveraging massive on-node parallelism in conjunction with fine-grained and dynamic scheduling paradigms. In this paper we analyze the behavior of the OpenMP and OmpSs PMs on top of the recently emerged Generic Lightweight Threads (GLT) API. GLT exposses a common API for lightweight thread (LWT) libraries that offers the possibility of running the same application over different native LWT solutions. We describe the design details of those high-level PMs implemented on top of GLT and analyze different scenarios in order to assess where the use of LWTs may benefit application performance. Our work reveals those scenarios where LWTs overperform pthread-based solutions and compares the performance between an ad hoc solution and a generic implementation. High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system, improving coding productivity. The implementations of OpenMP usually rely on POSIX threads (pthreads), offering excellent performance for coarse-grained parallelism and a perfect match with the current hardware. OmpSs is a task oriented PM based on an ad hoc runtime solution called Nanos++; it is the precursor of the tasking parallelism in the OpenMP tasking specification. A recent trend in runtimes and applications points to leveraging massive on-node parallelism in conjunction with fine-grained and dynamic scheduling paradigms. In this paper we analyze the behavior of the OpenMP and OmpSs PMs on top of the recently emerged Generic Lightweight Threads (GLT) API. GLT exposes a common API for lightweight thread (LWT) libraries that offers the possibility of running the same application over different native LWT solutions. We describe the design details of those high-level PMs implemented on top of GLT and analyze different scenarios in order to assess where the use of LWTs may benefit application performance. Our work reveals those scenarios where LWTs overperform pthread-based solutions and compares the performance between an ad hoc solution and a generic implementation. The researchers from the Universitat Jaume I de Castelló were supported by project TIN2014-53495-R of the MINECO, Spain and FEDER, Spain, the Generalitat Valenciana fellowship programme, Spain Vali+d 2015. Antonio J. Peña is cofinanced by the Spanish Ministry of Economy and Competitiveness, Spain under Juan de la Cierva fellowship number IJCI-2015-23266. This work was partially supported by the U.S. Dept. of Energy, Office of Science, Office of Advanced Scientific Computing Research (SC-21), under contract DE-AC02-06CH11357. We gratefully acknowledge Enrique S. Quintana-Ortí (Universitat Jaume I) and Sangmin Seo (Samsung Corp.) for their advice in this work and the computing resources provided and operated by the Joint Laboratory for System Evaluation (JLSE) at Argonne National Laboratory. Peer Reviewed |
| Author | Castelló, Adrián Balaji, Pavan Sala, Kevin Beltran, Vicenç Mayo, Rafael Peña, Antonio J. |
| Author_xml | – sequence: 1 givenname: Adrián surname: Castelló fullname: Castelló, Adrián email: adcastel@uji.es organization: Universitat Jaume I de Castelló, 12071 Castelló de la Plana, Spain – sequence: 2 givenname: Rafael surname: Mayo fullname: Mayo, Rafael email: mayo@uji.es organization: Universitat Jaume I de Castelló, 12071 Castelló de la Plana, Spain – sequence: 3 givenname: Kevin orcidid: 0000-0001-8233-1185 surname: Sala fullname: Sala, Kevin email: ksala@bsc.es organization: Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain – sequence: 4 givenname: Vicenç surname: Beltran fullname: Beltran, Vicenç email: vbeltran@bsc.es organization: Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain – sequence: 5 givenname: Pavan surname: Balaji fullname: Balaji, Pavan email: balaji@anl.gov organization: Argonne National Laboratory, Lemont, IL, USA – sequence: 6 givenname: Antonio J. orcidid: 0000-0002-3575-4617 surname: Peña fullname: Peña, Antonio J. email: antonio.pena@bsc.es organization: Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain |
| BackLink | https://www.osti.gov/servlets/purl/1465734$$D View this record in Osti.gov |
| BookMark | eNqFkE1r3DAQhkVJoZuPf9CD6N2uxpItu4dCCU0bCOTSQCEHoY_xWovX2krahPz7ymyg0EN70EjzzjyD5j0nZ0tYkJD3wGpg0H3c1eMxHyPWDYO-Zk1dxDdkA71sKgnQnpFNUWQl-fDzHTlPaccYA8lhQx7vF5onpNrhr6O2LzSMdPbbKT_jGkstonZUHw4xaDthomOIdCq1asYnnOlBRz3P6yOGbdT7vV-2dB8czumSvB31nPDq9b4gDzdff1x_r-7uv91ef7mrrGggV3LEnonOORia1gxGcI3GGCEMdG2LTS-dlQOwQbROw8D56LgElGg7oztj-AX5cJobUvYqWZ_RTjYsC9qsQHSt5KI0wanJpqNVES1Gq7MK2v9J1tMw2SjOBj70hRGvTAwpRRzVIfq9ji8KmFqdVzt1cl6tzivWqCIW7NNfWPmSzj4sOWo__w_-fIKLgfjkMa4L4WLR-bju44L_94Df8GSleA |
| CitedBy_id | crossref_primary_10_1109_TC_2020_2970706 crossref_primary_10_3390_e25101444 crossref_primary_10_1007_s10586_019_02927_z |
| Cites_doi | 10.1109/ICPP.2017.15 10.1109/CLUSTER.2016.12 10.1007/s10766-010-0136-3 10.1007/978-3-319-64203-1_34 10.1109/IPPS.1996.508060 10.1109/99.660313 10.1002/cpe.1174 10.1002/cpe.3341 10.1007/s11432-016-5588-7 10.1142/S0129626411000151 10.1109/IPDPS.2008.4536359 |
| ContentType | Journal Article Publication |
| Contributor | Barcelona Supercomputing Center |
| Contributor_xml | – sequence: 1 fullname: Barcelona Supercomputing Center |
| Copyright | 2018 Elsevier B.V. Attribution-NonCommercial 4.0 Spain info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc/3.0/es |
| Copyright_xml | – notice: 2018 Elsevier B.V. – notice: Attribution-NonCommercial 4.0 Spain info:eu-repo/semantics/openAccess <a href="http://creativecommons.org/licenses/by-nc/3.0/es/">http://creativecommons.org/licenses/by-nc/3.0/es/</a> |
| CorporateAuthor | Argonne National Lab. (ANL), Argonne, IL (United States) |
| CorporateAuthor_xml | – name: Argonne National Lab. (ANL), Argonne, IL (United States) |
| DBID | AAYXX CITATION XX2 OIOZB OTOTI |
| DOI | 10.1016/j.future.2018.02.016 |
| DatabaseName | CrossRef Recercat OSTI.GOV - Hybrid OSTI.GOV |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1872-7115 |
| EndPage | 31 |
| ExternalDocumentID | 1465734 oai_recercat_cat_2072_309398 10_1016_j_future_2018_02_016 S0167739X17328819 |
| GroupedDBID | --K --M -~X .DC .~1 0R~ 1B1 1~. 1~5 29H 4.4 457 4G. 5GY 5VS 7-5 71M 8P~ 9JN AACTN AAEDT AAEDW AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AAXUO AAYFN ABBOA ABFNM ABJNI ABMAC ABXDB ABYKQ ACDAQ ACGFS ACNNM ACRLP ACZNC ADBBV ADEZE ADJOM ADMUD AEBSH AEKER AFKWA AFTJW AGHFR AGUBO AGYEJ AHHHB AHZHX AIALX AIEXJ AIKHN AITUG AJBFU AJOXV ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD ASPBG AVWKF AXJTR AZFZN BKOJK BLXMC CS3 EBS EFJIC EFLBG EJD EO8 EO9 EP2 EP3 F5P FDB FEDTE FGOYB FIRID FNPLU FYGXN G-Q G8K GBLVA GBOLZ HLZ HVGLF HZ~ IHE J1W KOM LG9 M41 MO0 MS~ N9A O-L O9- OAUVE OZT P-8 P-9 PC. Q38 R2- RIG ROL RPZ SBC SDF SDG SES SEW SPC SPCBC SSV SSZ T5K UHS WUQ XPP ZMT ~G- 9DU AATTM AAXKI AAYWO AAYXX ABDPE ABWVN ACLOT ACRPL ADNMO AEIPS AFJKZ AGQPQ AIIUN ANKPU APXCP CITATION EFKBS ~HD XX2 AALMO ABPIF OIOZB OTOTI |
| ID | FETCH-LOGICAL-c421t-7fe8046dd1925b9b43aebbb44b1655e287dc7910945da1933fd371e7ec6ba6bb3 |
| ISICitedReferencesCount | 6 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000430521600003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0167-739X |
| IngestDate | Fri May 19 01:11:24 EDT 2023 Fri Nov 07 13:51:05 EST 2025 Sat Nov 29 07:27:03 EST 2025 Tue Nov 18 22:39:01 EST 2025 Fri Feb 23 02:47:14 EST 2024 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | C |
| Keywords | Programming models POSIX threads OpenMP Lightweight threads OmpSs GLT |
| Language | English |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c421t-7fe8046dd1925b9b43aebbb44b1655e287dc7910945da1933fd371e7ec6ba6bb3 |
| Notes | USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) Spanish Ministerio de Economia y Competitividad (MINECO) AC02-06CH11357; SC-21 |
| ORCID | 0000-0001-8233-1185 0000-0002-3575-4617 0000000235754617 0000000182331185 |
| OpenAccessLink | https://recercat.cat/handle/2072/309398 |
| PageCount | 10 |
| ParticipantIDs | osti_scitechconnect_1465734 csuc_recercat_oai_recercat_cat_2072_309398 crossref_primary_10_1016_j_future_2018_02_016 crossref_citationtrail_10_1016_j_future_2018_02_016 elsevier_sciencedirect_doi_10_1016_j_future_2018_02_016 |
| PublicationCentury | 2000 |
| PublicationDate | 2018-07-01 |
| PublicationDateYYYYMMDD | 2018-07-01 |
| PublicationDate_xml | – month: 07 year: 2018 text: 2018-07-01 day: 01 |
| PublicationDecade | 2010 |
| PublicationPlace | United States |
| PublicationPlace_xml | – name: United States |
| PublicationTitle | Future generation computer systems |
| PublicationYear | 2018 |
| Publisher | Elsevier B.V Elsevier |
| Publisher_xml | – name: Elsevier B.V – name: Elsevier |
| References | PGI Compilers & Tools Kale, Krishnan (b11) 1993 Intel OpenMP Runtime Library A. Castelló, A.J. Peña, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A review of lightweight thread approaches for high performance computing, in: Proceedings of the IEEE International Conference on Cluster Computing, Taipei, Taiwan, 2016. BOLT: A Lightning-Fast OpenMP Implementation Broquedis, Furmento, Goglin, Wacrenier, Namyst (b30) 2010; 38 Programming with Solaris Threads A. Castelló, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A.J. Peña, GLT: A unified API for lightweight thread libraries, in: Proceedings of the IEEE International European Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, 2017. Seo, Amer, Balaji, Bordage, Bosilca, Brooks, Carns, Castelló, Genet, Herault, Iwasaki, Jindal, Kale, Krishnamoorthy, Lifflander, Lu, Meneses, Snir, Sun, Taura, Beckman (b14) 2017; PP BSC, Mercurium CloverLeaf miniapp Microsoft MSDN Library, Fibers. Clang project L.V. Kalé, M.A. Bhandarkar, N. Jagathesan, S. Krishnan, J. Yelon, Converse: An interoperable framework for parallel programming, in: Proceedings of the 10th International Parallel Processing Symposium, IPPS, 1996, pp. 212–217. Duran, Ayguadé, Badia, Labarta, Martinell, Martorell, Planas (b20) 2011; 21 Dagum, Menon (b4) 1998; 5 TOP500 Supercomputer Sites, (June 2016). Nakashima, Taura (b12) 2014; vol. 8665 A. Castelló, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A.J. Peña, GLTO: On the adequacy of lightweight thread approaches for OpenMP implementations, in: Proceedings of the International Conference on Parallel Processing, Bristol, UK, 2017. Liao, Hernandez, Chapman, Chen, Zheng (b25) 2007; 19 Tanaka, Taura, Sato, Yonezawa (b29) 2000 D. Stein, D. Shah, Implementing lightweight threads, in: USENIX Summer, 1992. Hadjidoukas, Dimakopoulos (b28) 2007 BSC, The OmpSs Programming Model LLVM project Chang, Hwu (b21) 1989 OpenMP Architecture Review Board, OpenMP Application Programming Interface Version 4.5, (Nov. 2015). BSC, Nanos . Intel Math Kernel Library K.B. Wheeler, R.C. Murphy, D. Thain, Qthreads: An API for programming with millions of lightweight threads, in: Proceedings of Workshop on Multithreaded Architectures and Applications, 2008. Fu, Liao, Yang, Wang, Song, Huang, Yang, Xue, Liu, Qiao, Zhao, Yin, Hou, Zhang, Ge, Zhang, Wang, Zhou, Yang (b1) 2016; 59 Generic Lightweight Threads API Olivier, Porterfield, Wheeler, Prins (b31) 2011 Pthreads API Aliaga, Anzt, Castillo, Fernández, León, Pérez, Quintana-Ortí (b34) 2015; 27 BSC, Bsc Application Repository 10.1016/j.future.2018.02.016_b5 10.1016/j.future.2018.02.016_b2 10.1016/j.future.2018.02.016_b3 10.1016/j.future.2018.02.016_b8 10.1016/j.future.2018.02.016_b9 Seo (10.1016/j.future.2018.02.016_b14) 2017; PP 10.1016/j.future.2018.02.016_b6 10.1016/j.future.2018.02.016_b7 Liao (10.1016/j.future.2018.02.016_b25) 2007; 19 10.1016/j.future.2018.02.016_b33 10.1016/j.future.2018.02.016_b10 10.1016/j.future.2018.02.016_b32 10.1016/j.future.2018.02.016_b16 10.1016/j.future.2018.02.016_b15 10.1016/j.future.2018.02.016_b36 10.1016/j.future.2018.02.016_b13 10.1016/j.future.2018.02.016_b35 Dagum (10.1016/j.future.2018.02.016_b4) 1998; 5 10.1016/j.future.2018.02.016_b19 10.1016/j.future.2018.02.016_b18 10.1016/j.future.2018.02.016_b17 Chang (10.1016/j.future.2018.02.016_b21) 1989 Nakashima (10.1016/j.future.2018.02.016_b12) 2014; vol. 8665 Olivier (10.1016/j.future.2018.02.016_b31) 2011 Duran (10.1016/j.future.2018.02.016_b20) 2011; 21 Kale (10.1016/j.future.2018.02.016_b11) 1993 Hadjidoukas (10.1016/j.future.2018.02.016_b28) 2007 10.1016/j.future.2018.02.016_b23 10.1016/j.future.2018.02.016_b22 10.1016/j.future.2018.02.016_b27 10.1016/j.future.2018.02.016_b26 10.1016/j.future.2018.02.016_b24 Tanaka (10.1016/j.future.2018.02.016_b29) 2000 Broquedis (10.1016/j.future.2018.02.016_b30) 2010; 38 Fu (10.1016/j.future.2018.02.016_b1) 2016; 59 Aliaga (10.1016/j.future.2018.02.016_b34) 2015; 27 |
| References_xml | – reference: TOP500 Supercomputer Sites, (June 2016). – reference: BOLT: A Lightning-Fast OpenMP Implementation, – volume: 19 start-page: 2317 year: 2007 end-page: 2332 ident: b25 article-title: OpenUH: an optimizing, portable OpenMP compiler publication-title: Concurr. Comput.: Pract. Exper. – reference: A. Castelló, A.J. Peña, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A review of lightweight thread approaches for high performance computing, in: Proceedings of the IEEE International Conference on Cluster Computing, Taipei, Taiwan, 2016. – reference: L.V. Kalé, M.A. Bhandarkar, N. Jagathesan, S. Krishnan, J. Yelon, Converse: An interoperable framework for parallel programming, in: Proceedings of the 10th International Parallel Processing Symposium, IPPS, 1996, pp. 212–217. – start-page: 100 year: 2000 end-page: 112 ident: b29 article-title: Performance evaluation of OpenMP applications with nested parallelism publication-title: International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers – reference: Clang project, – reference: A. Castelló, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A.J. Peña, GLTO: On the adequacy of lightweight thread approaches for OpenMP implementations, in: Proceedings of the International Conference on Parallel Processing, Bristol, UK, 2017. – reference: Microsoft MSDN Library, Fibers. – reference: BSC, Nanos++, – reference: Intel OpenMP Runtime Library, – start-page: 662 year: 2007 end-page: 671 ident: b28 article-title: Nested parallelism in the OMPIOpenmP/C compiler publication-title: European Conference on Parallel Processing – volume: 5 start-page: 46 year: 1998 end-page: 55 ident: b4 article-title: OpenMP: an industry standard API for shared-memory programming publication-title: IEEE Comput. Sci. Eng. – reference: Generic Lightweight Threads API, – volume: 59 start-page: 072001 year: 2016 ident: b1 article-title: The Sunway TaihuLight supercomputer: system and applications publication-title: Sci. China Inf. Sci. – reference: . – reference: BSC, Mercurium, – reference: K.B. Wheeler, R.C. Murphy, D. Thain, Qthreads: An API for programming with millions of lightweight threads, in: Proceedings of Workshop on Multithreaded Architectures and Applications, 2008. – start-page: 246 year: 1989 end-page: 257 ident: b21 article-title: Inline function expansion for compiling C programs publication-title: Procedings of the ACM SIGPLAN 1989 Conference on Programming Language Design and Implementation – reference: Intel Math Kernel Library, – reference: LLVM project, – reference: BSC, Bsc Application Repository, – reference: . – volume: vol. 8665 start-page: 222 year: 2014 end-page: 238 ident: b12 article-title: MassiveThreads: A thread library for high productivity languages publication-title: Concurrent Objects and beyond – year: 1993 ident: b11 article-title: CHARM++: A Portable Concurrent Object Oriented System Based on C++, Vol. 28 – reference: CloverLeaf miniapp, – volume: 21 start-page: 173 year: 2011 end-page: 193 ident: b20 article-title: OmpSs: A proposal for programming heterogeneous multi-core architectures publication-title: Parallel Process. Lett. – start-page: 49 year: 2011 end-page: 56 ident: b31 article-title: Scheduling task parallelism on multi-socket multicore systems publication-title: Proceedings of the 1st International Workshop on Runtime and Operating Systems for Supercomputers – volume: PP year: 2017 ident: b14 article-title: Argobots: A lightweight low-level threading and tasking framework publication-title: IEEE Trans. Parallel Distrib. Syst. – reference: BSC, The OmpSs Programming Model, – reference: D. Stein, D. Shah, Implementing lightweight threads, in: USENIX Summer, 1992. – reference: Pthreads API, – reference: OpenMP Architecture Review Board, OpenMP Application Programming Interface Version 4.5, (Nov. 2015). – reference: A. Castelló, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A.J. Peña, GLT: A unified API for lightweight thread libraries, in: Proceedings of the IEEE International European Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, 2017. – reference: PGI Compilers & Tools, – reference: Programming with Solaris Threads, – volume: 38 start-page: 418 year: 2010 end-page: 439 ident: b30 article-title: ForestGOMP: an efficient OpenMP environment for NUMA architectures publication-title: Int. J. Parallel Program. – volume: 27 start-page: 885 year: 2015 end-page: 904 ident: b34 article-title: Unveiling the performance-energy trade-off in iterative linear system solvers for multithreaded processors publication-title: Concurr. Comput.: Pract. Exp. – ident: 10.1016/j.future.2018.02.016_b22 doi: 10.1109/ICPP.2017.15 – ident: 10.1016/j.future.2018.02.016_b26 – ident: 10.1016/j.future.2018.02.016_b36 – ident: 10.1016/j.future.2018.02.016_b18 – ident: 10.1016/j.future.2018.02.016_b27 doi: 10.1109/CLUSTER.2016.12 – ident: 10.1016/j.future.2018.02.016_b32 – start-page: 246 year: 1989 ident: 10.1016/j.future.2018.02.016_b21 article-title: Inline function expansion for compiling C programs – volume: 38 start-page: 418 issue: 5 year: 2010 ident: 10.1016/j.future.2018.02.016_b30 article-title: ForestGOMP: an efficient OpenMP environment for NUMA architectures publication-title: Int. J. Parallel Program. doi: 10.1007/s10766-010-0136-3 – year: 1993 ident: 10.1016/j.future.2018.02.016_b11 – ident: 10.1016/j.future.2018.02.016_b16 doi: 10.1007/978-3-319-64203-1_34 – ident: 10.1016/j.future.2018.02.016_b5 – ident: 10.1016/j.future.2018.02.016_b7 – ident: 10.1016/j.future.2018.02.016_b3 – ident: 10.1016/j.future.2018.02.016_b23 – start-page: 49 year: 2011 ident: 10.1016/j.future.2018.02.016_b31 article-title: Scheduling task parallelism on multi-socket multicore systems – start-page: 662 year: 2007 ident: 10.1016/j.future.2018.02.016_b28 article-title: Nested parallelism in the OMPIOpenmP/C compiler – ident: 10.1016/j.future.2018.02.016_b9 doi: 10.1109/IPPS.1996.508060 – ident: 10.1016/j.future.2018.02.016_b10 – volume: vol. 8665 start-page: 222 year: 2014 ident: 10.1016/j.future.2018.02.016_b12 article-title: MassiveThreads: A thread library for high productivity languages – volume: PP issue: 99 year: 2017 ident: 10.1016/j.future.2018.02.016_b14 article-title: Argobots: A lightweight low-level threading and tasking framework publication-title: IEEE Trans. Parallel Distrib. Syst. – ident: 10.1016/j.future.2018.02.016_b33 – ident: 10.1016/j.future.2018.02.016_b19 – ident: 10.1016/j.future.2018.02.016_b35 – ident: 10.1016/j.future.2018.02.016_b17 – ident: 10.1016/j.future.2018.02.016_b15 – ident: 10.1016/j.future.2018.02.016_b8 – volume: 5 start-page: 46 issue: 1 year: 1998 ident: 10.1016/j.future.2018.02.016_b4 article-title: OpenMP: an industry standard API for shared-memory programming publication-title: IEEE Comput. Sci. Eng. doi: 10.1109/99.660313 – volume: 19 start-page: 2317 issue: 18 year: 2007 ident: 10.1016/j.future.2018.02.016_b25 article-title: OpenUH: an optimizing, portable OpenMP compiler publication-title: Concurr. Comput.: Pract. Exper. doi: 10.1002/cpe.1174 – start-page: 100 year: 2000 ident: 10.1016/j.future.2018.02.016_b29 article-title: Performance evaluation of OpenMP applications with nested parallelism – volume: 27 start-page: 885 issue: 4 year: 2015 ident: 10.1016/j.future.2018.02.016_b34 article-title: Unveiling the performance-energy trade-off in iterative linear system solvers for multithreaded processors publication-title: Concurr. Comput.: Pract. Exp. doi: 10.1002/cpe.3341 – volume: 59 start-page: 072001 issue: 7 year: 2016 ident: 10.1016/j.future.2018.02.016_b1 article-title: The Sunway TaihuLight supercomputer: system and applications publication-title: Sci. China Inf. Sci. doi: 10.1007/s11432-016-5588-7 – volume: 21 start-page: 173 issue: 02 year: 2011 ident: 10.1016/j.future.2018.02.016_b20 article-title: OmpSs: A proposal for programming heterogeneous multi-core architectures publication-title: Parallel Process. Lett. doi: 10.1142/S0129626411000151 – ident: 10.1016/j.future.2018.02.016_b13 doi: 10.1109/IPDPS.2008.4536359 – ident: 10.1016/j.future.2018.02.016_b6 – ident: 10.1016/j.future.2018.02.016_b24 – ident: 10.1016/j.future.2018.02.016_b2 |
| SSID | ssj0001731 |
| Score | 2.2442765 |
| Snippet | High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism.... |
| SourceID | osti csuc crossref elsevier |
| SourceType | Open Access Repository Enrichment Source Index Database Publisher |
| StartPage | 22 |
| SubjectTerms | Dynamic programming GLT Informàtica Lightweight threads MATHEMATICS AND COMPUTING OmpSs OpenMP POSIX Threads Programació dinàmica Programming models Àrees temàtiques de la UPC |
| Title | On the adequacy of lightweight thread approaches for high-level parallel programming models |
| URI | https://dx.doi.org/10.1016/j.future.2018.02.016 https://recercat.cat/handle/2072/309398 https://www.osti.gov/servlets/purl/1465734 |
| Volume | 84 |
| WOSCitedRecordID | wos000430521600003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals 2021 customDbUrl: eissn: 1872-7115 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001731 issn: 0167-739X databaseCode: AIEXJ dateStart: 19950201 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3db9MwELdKxwMvfCPKAPmBJ6agJk5i53GrNgESG0IDVeLBsh1H2pSlI23H-BP4r7mznTQDxOCBh0Zpmo_K9_P57nL3O0JepJprZSuY3-AsRzATTaRZJaIpL61CE5_HlWs2wQ8PxXxevB-Nvne1MBc1bxpxeVmc_1dRwzEQNpbO_oO4-5vCAdgHocMWxA7bvxL8kU9cVKX9ssZe7gusMwYP_KsLgmJjHhBrzyVuHR_DDrIWRzUmEO0gGXhd445P3TrDYIJrmLMcWrIHjowEOzDbACITGkQEdujeWJ-pJZaq4Cv5PeZ0UdmeuBf0cY_Md-qbi9l-UFWXxI9hH1X7ijVYvftT92y9an3Y9hNoucbdiQ-jF7HoM11DSC2s_8MIJ2huzlx_XVigvFYWHNyA2Nd9dmrbd5YL8Jz9djXwgYnTV56eBfP4hCNojX8i3_beUJpnnKU3yFbCswJ0_Nbum_35235Zj3lobhn-X1eH6ZIFf33CFTtnbJZrc8XcGS9Agw8smeO75HZwQeiuh849MrLNfXKna-9Bg7Z_QD4fNRSQRDsk0UVFB0iiHkl0gyQKSKIbJNEOSXSAJOqR9JB8PNg_nr2OQi-OyKRJvIp4ZcU0zcsSPIJMFzplymqt01THeZZZmNOl4WB6FmlWKnAKWFUyHltuTa5VrjV7RMbNorGPCWU53CuumEhMkZoiFjmcDSMqGANv32YTwrqRkyYQ1WO_lFp2GYmn0o-3xPGW00TCwQmJ-qvOPVHLNee_RKFIsCtsa9RKIs96_wU_yZQnEhMFCjEhvBOdDJaptzglQO2ax2yjpPEqJGU2mL0GlwWwPfnjr9vk1mbKPCXjVbu2z8hNc7E6WbbPAz5_AD0pt8I |
| linkProvider | Elsevier |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=On+the+adequacy+of+lightweight+thread+approaches+for+high-level+parallel+programming+models&rft.jtitle=Future+generation+computer+systems&rft.au=Castell%C3%B3%2C+Adri%C3%A1n&rft.au=Mayo%2C+Rafael&rft.au=Sala%2C+Kevin&rft.au=Beltran%2C+Vicen%C3%A7&rft.date=2018-07-01&rft.pub=Elsevier&rft.issn=0167-739X&rft.eissn=1872-7115&rft.volume=84&rft.issue=C&rft_id=info:doi/10.1016%2Fj.future.2018.02.016&rft.externalDocID=1465734 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-739X&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-739X&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-739X&client=summon |