On the adequacy of lightweight thread approaches for high-level parallel programming models

High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system,...

Full description

Saved in:
Bibliographic Details
Published in:Future generation computer systems Vol. 84; no. C; pp. 22 - 31
Main Authors: Castelló, Adrián, Mayo, Rafael, Sala, Kevin, Beltran, Vicenç, Balaji, Pavan, Peña, Antonio J.
Format: Journal Article Publication
Language:English
Published: United States Elsevier B.V 01.07.2018
Elsevier
Subjects:
ISSN:0167-739X, 1872-7115
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system, improving coding productivity. The implementations of OpenMP usually rely on POSIX threads (pthreads), offering excellent performance for coarse-grained parallelism and a perfect match with the current hardware. OmpSs is a task oriented PM based on an ad hoc runtime solution called Nanos++; it is the precursor of the tasking parallelism in the OpenMP tasking specification. A recent trend in runtimes and applications points to leveraging massive on-node parallelism in conjunction with fine-grained and dynamic scheduling paradigms. In this paper we analyze the behavior of the OpenMP and OmpSs PMs on top of the recently emerged Generic Lightweight Threads (GLT) API. GLT exposes a common API for lightweight thread (LWT) libraries that offers the possibility of running the same application over different native LWT solutions. We describe the design details of those high-level PMs implemented on top of GLT and analyze different scenarios in order to assess where the use of LWTs may benefit application performance. Our work reveals those scenarios where LWTs overperform pthread-based solutions and compares the performance between an ad hoc solution and a generic implementation. •Design and implementation of OpenMP and OmpSs on top of lightweight threads.•Analysis of the relationship between programming models and lightweight threads.•Performance evaluation in different OpenMP and OmpSs scenarios.
AbstractList High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system, improving coding productivity. The implementations of OpenMP usually rely on POSIX threads (pthreads), offering excellent performance for coarse-grained parallelism and a perfect match with the current hardware. OmpSs is a task oriented PM based on an ad hoc runtime solution called Nanos++; it is the precursor of the tasking parallelism in the OpenMP tasking specification. A recent trend in runtimes and applications points to leveraging massive on-node parallelism in conjunction with fine-grained and dynamic scheduling paradigms. In this paper we analyze the behavior of the OpenMP and OmpSs PMs on top of the recently emerged Generic Lightweight Threads (GLT) API. GLT exposes a common API for lightweight thread (LWT) libraries that offers the possibility of running the same application over different native LWT solutions. We describe the design details of those high-level PMs implemented on top of GLT and analyze different scenarios in order to assess where the use of LWTs may benefit application performance. Our work reveals those scenarios where LWTs overperform pthread-based solutions and compares the performance between an ad hoc solution and a generic implementation. •Design and implementation of OpenMP and OmpSs on top of lightweight threads.•Analysis of the relationship between programming models and lightweight threads.•Performance evaluation in different OpenMP and OmpSs scenarios.
High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system, improving coding productivity. The implementations of OpenMP usually rely on POSIX threads (pthreads), offering excellent performance for coarse-grained parallelism and a perfect match with the current hardware. OmpSs is a task oriented PM based on an ad hoc runtime solution called Nanos++; it is the precursor of the tasking parallelism in the OpenMP tasking specification. A recent trend in runtimes and applications points to leveraging massive on-node parallelism in conjunction with fine-grained and dynamic scheduling paradigms. In this paper we analyze the behavior of the OpenMP and OmpSs PMs on top of the recently emerged Generic Lightweight Threads (GLT) API. GLT exposses a common API for lightweight thread (LWT) libraries that offers the possibility of running the same application over different native LWT solutions. We describe the design details of those high-level PMs implemented on top of GLT and analyze different scenarios in order to assess where the use of LWTs may benefit application performance. Our work reveals those scenarios where LWTs overperform pthread-based solutions and compares the performance between an ad hoc solution and a generic implementation.
High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: the complexity of the hardware is hidden by the underlying runtime system, improving coding productivity. The implementations of OpenMP usually rely on POSIX threads (pthreads), offering excellent performance for coarse-grained parallelism and a perfect match with the current hardware. OmpSs is a task oriented PM based on an ad hoc runtime solution called Nanos++; it is the precursor of the tasking parallelism in the OpenMP tasking specification. A recent trend in runtimes and applications points to leveraging massive on-node parallelism in conjunction with fine-grained and dynamic scheduling paradigms. In this paper we analyze the behavior of the OpenMP and OmpSs PMs on top of the recently emerged Generic Lightweight Threads (GLT) API. GLT exposes a common API for lightweight thread (LWT) libraries that offers the possibility of running the same application over different native LWT solutions. We describe the design details of those high-level PMs implemented on top of GLT and analyze different scenarios in order to assess where the use of LWTs may benefit application performance. Our work reveals those scenarios where LWTs overperform pthread-based solutions and compares the performance between an ad hoc solution and a generic implementation. The researchers from the Universitat Jaume I de Castelló were supported by project TIN2014-53495-R of the MINECO, Spain and FEDER, Spain, the Generalitat Valenciana fellowship programme, Spain Vali+d 2015. Antonio J. Peña is cofinanced by the Spanish Ministry of Economy and Competitiveness, Spain under Juan de la Cierva fellowship number IJCI-2015-23266. This work was partially supported by the U.S. Dept. of Energy, Office of Science, Office of Advanced Scientific Computing Research (SC-21), under contract DE-AC02-06CH11357. We gratefully acknowledge Enrique S. Quintana-Ortí (Universitat Jaume I) and Sangmin Seo (Samsung Corp.) for their advice in this work and the computing resources provided and operated by the Joint Laboratory for System Evaluation (JLSE) at Argonne National Laboratory. Peer Reviewed
Author Castelló, Adrián
Balaji, Pavan
Sala, Kevin
Beltran, Vicenç
Mayo, Rafael
Peña, Antonio J.
Author_xml – sequence: 1
  givenname: Adrián
  surname: Castelló
  fullname: Castelló, Adrián
  email: adcastel@uji.es
  organization: Universitat Jaume I de Castelló, 12071 Castelló de la Plana, Spain
– sequence: 2
  givenname: Rafael
  surname: Mayo
  fullname: Mayo, Rafael
  email: mayo@uji.es
  organization: Universitat Jaume I de Castelló, 12071 Castelló de la Plana, Spain
– sequence: 3
  givenname: Kevin
  orcidid: 0000-0001-8233-1185
  surname: Sala
  fullname: Sala, Kevin
  email: ksala@bsc.es
  organization: Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
– sequence: 4
  givenname: Vicenç
  surname: Beltran
  fullname: Beltran, Vicenç
  email: vbeltran@bsc.es
  organization: Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
– sequence: 5
  givenname: Pavan
  surname: Balaji
  fullname: Balaji, Pavan
  email: balaji@anl.gov
  organization: Argonne National Laboratory, Lemont, IL, USA
– sequence: 6
  givenname: Antonio J.
  orcidid: 0000-0002-3575-4617
  surname: Peña
  fullname: Peña, Antonio J.
  email: antonio.pena@bsc.es
  organization: Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
BackLink https://www.osti.gov/servlets/purl/1465734$$D View this record in Osti.gov
BookMark eNqFkE1r3DAQhkVJoZuPf9CD6N2uxpItu4dCCU0bCOTSQCEHoY_xWovX2krahPz7ymyg0EN70EjzzjyD5j0nZ0tYkJD3wGpg0H3c1eMxHyPWDYO-Zk1dxDdkA71sKgnQnpFNUWQl-fDzHTlPaccYA8lhQx7vF5onpNrhr6O2LzSMdPbbKT_jGkstonZUHw4xaDthomOIdCq1asYnnOlBRz3P6yOGbdT7vV-2dB8czumSvB31nPDq9b4gDzdff1x_r-7uv91ef7mrrGggV3LEnonOORia1gxGcI3GGCEMdG2LTS-dlQOwQbROw8D56LgElGg7oztj-AX5cJobUvYqWZ_RTjYsC9qsQHSt5KI0wanJpqNVES1Gq7MK2v9J1tMw2SjOBj70hRGvTAwpRRzVIfq9ji8KmFqdVzt1cl6tzivWqCIW7NNfWPmSzj4sOWo__w_-fIKLgfjkMa4L4WLR-bju44L_94Df8GSleA
CitedBy_id crossref_primary_10_1109_TC_2020_2970706
crossref_primary_10_3390_e25101444
crossref_primary_10_1007_s10586_019_02927_z
Cites_doi 10.1109/ICPP.2017.15
10.1109/CLUSTER.2016.12
10.1007/s10766-010-0136-3
10.1007/978-3-319-64203-1_34
10.1109/IPPS.1996.508060
10.1109/99.660313
10.1002/cpe.1174
10.1002/cpe.3341
10.1007/s11432-016-5588-7
10.1142/S0129626411000151
10.1109/IPDPS.2008.4536359
ContentType Journal Article
Publication
Contributor Barcelona Supercomputing Center
Contributor_xml – sequence: 1
  fullname: Barcelona Supercomputing Center
Copyright 2018 Elsevier B.V.
Attribution-NonCommercial 4.0 Spain info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc/3.0/es
Copyright_xml – notice: 2018 Elsevier B.V.
– notice: Attribution-NonCommercial 4.0 Spain info:eu-repo/semantics/openAccess <a href="http://creativecommons.org/licenses/by-nc/3.0/es/">http://creativecommons.org/licenses/by-nc/3.0/es/</a>
CorporateAuthor Argonne National Lab. (ANL), Argonne, IL (United States)
CorporateAuthor_xml – name: Argonne National Lab. (ANL), Argonne, IL (United States)
DBID AAYXX
CITATION
XX2
OIOZB
OTOTI
DOI 10.1016/j.future.2018.02.016
DatabaseName CrossRef
Recercat
OSTI.GOV - Hybrid
OSTI.GOV
DatabaseTitle CrossRef
DatabaseTitleList


DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1872-7115
EndPage 31
ExternalDocumentID 1465734
oai_recercat_cat_2072_309398
10_1016_j_future_2018_02_016
S0167739X17328819
GroupedDBID --K
--M
-~X
.DC
.~1
0R~
1B1
1~.
1~5
29H
4.4
457
4G.
5GY
5VS
7-5
71M
8P~
9JN
AACTN
AAEDT
AAEDW
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AAXUO
AAYFN
ABBOA
ABFNM
ABJNI
ABMAC
ABXDB
ABYKQ
ACDAQ
ACGFS
ACNNM
ACRLP
ACZNC
ADBBV
ADEZE
ADJOM
ADMUD
AEBSH
AEKER
AFKWA
AFTJW
AGHFR
AGUBO
AGYEJ
AHHHB
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJBFU
AJOXV
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
ASPBG
AVWKF
AXJTR
AZFZN
BKOJK
BLXMC
CS3
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
F5P
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-Q
G8K
GBLVA
GBOLZ
HLZ
HVGLF
HZ~
IHE
J1W
KOM
LG9
M41
MO0
MS~
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
PC.
Q38
R2-
RIG
ROL
RPZ
SBC
SDF
SDG
SES
SEW
SPC
SPCBC
SSV
SSZ
T5K
UHS
WUQ
XPP
ZMT
~G-
9DU
AATTM
AAXKI
AAYWO
AAYXX
ABDPE
ABWVN
ACLOT
ACRPL
ADNMO
AEIPS
AFJKZ
AGQPQ
AIIUN
ANKPU
APXCP
CITATION
EFKBS
~HD
XX2
AALMO
ABPIF
OIOZB
OTOTI
ID FETCH-LOGICAL-c421t-7fe8046dd1925b9b43aebbb44b1655e287dc7910945da1933fd371e7ec6ba6bb3
ISICitedReferencesCount 6
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000430521600003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0167-739X
IngestDate Fri May 19 01:11:24 EDT 2023
Fri Nov 07 13:51:05 EST 2025
Sat Nov 29 07:27:03 EST 2025
Tue Nov 18 22:39:01 EST 2025
Fri Feb 23 02:47:14 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue C
Keywords Programming models
POSIX threads
OpenMP
Lightweight threads
OmpSs
GLT
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c421t-7fe8046dd1925b9b43aebbb44b1655e287dc7910945da1933fd371e7ec6ba6bb3
Notes USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Spanish Ministerio de Economia y Competitividad (MINECO)
AC02-06CH11357; SC-21
ORCID 0000-0001-8233-1185
0000-0002-3575-4617
0000000235754617
0000000182331185
OpenAccessLink https://recercat.cat/handle/2072/309398
PageCount 10
ParticipantIDs osti_scitechconnect_1465734
csuc_recercat_oai_recercat_cat_2072_309398
crossref_primary_10_1016_j_future_2018_02_016
crossref_citationtrail_10_1016_j_future_2018_02_016
elsevier_sciencedirect_doi_10_1016_j_future_2018_02_016
PublicationCentury 2000
PublicationDate 2018-07-01
PublicationDateYYYYMMDD 2018-07-01
PublicationDate_xml – month: 07
  year: 2018
  text: 2018-07-01
  day: 01
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Future generation computer systems
PublicationYear 2018
Publisher Elsevier B.V
Elsevier
Publisher_xml – name: Elsevier B.V
– name: Elsevier
References PGI Compilers & Tools
Kale, Krishnan (b11) 1993
Intel OpenMP Runtime Library
A. Castelló, A.J. Peña, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A review of lightweight thread approaches for high performance computing, in: Proceedings of the IEEE International Conference on Cluster Computing, Taipei, Taiwan, 2016.
BOLT: A Lightning-Fast OpenMP Implementation
Broquedis, Furmento, Goglin, Wacrenier, Namyst (b30) 2010; 38
Programming with Solaris Threads
A. Castelló, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A.J. Peña, GLT: A unified API for lightweight thread libraries, in: Proceedings of the IEEE International European Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, 2017.
Seo, Amer, Balaji, Bordage, Bosilca, Brooks, Carns, Castelló, Genet, Herault, Iwasaki, Jindal, Kale, Krishnamoorthy, Lifflander, Lu, Meneses, Snir, Sun, Taura, Beckman (b14) 2017; PP
BSC, Mercurium
CloverLeaf miniapp
Microsoft MSDN Library, Fibers.
Clang project
L.V. Kalé, M.A. Bhandarkar, N. Jagathesan, S. Krishnan, J. Yelon, Converse: An interoperable framework for parallel programming, in: Proceedings of the 10th International Parallel Processing Symposium, IPPS, 1996, pp. 212–217.
Duran, Ayguadé, Badia, Labarta, Martinell, Martorell, Planas (b20) 2011; 21
Dagum, Menon (b4) 1998; 5
TOP500 Supercomputer Sites, (June 2016).
Nakashima, Taura (b12) 2014; vol. 8665
A. Castelló, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A.J. Peña, GLTO: On the adequacy of lightweight thread approaches for OpenMP implementations, in: Proceedings of the International Conference on Parallel Processing, Bristol, UK, 2017.
Liao, Hernandez, Chapman, Chen, Zheng (b25) 2007; 19
Tanaka, Taura, Sato, Yonezawa (b29) 2000
D. Stein, D. Shah, Implementing lightweight threads, in: USENIX Summer, 1992.
Hadjidoukas, Dimakopoulos (b28) 2007
BSC, The OmpSs Programming Model
LLVM project
Chang, Hwu (b21) 1989
OpenMP Architecture Review Board, OpenMP Application Programming Interface Version 4.5, (Nov. 2015).
BSC, Nanos
.
Intel Math Kernel Library
K.B. Wheeler, R.C. Murphy, D. Thain, Qthreads: An API for programming with millions of lightweight threads, in: Proceedings of Workshop on Multithreaded Architectures and Applications, 2008.
Fu, Liao, Yang, Wang, Song, Huang, Yang, Xue, Liu, Qiao, Zhao, Yin, Hou, Zhang, Ge, Zhang, Wang, Zhou, Yang (b1) 2016; 59
Generic Lightweight Threads API
Olivier, Porterfield, Wheeler, Prins (b31) 2011
Pthreads API
Aliaga, Anzt, Castillo, Fernández, León, Pérez, Quintana-Ortí (b34) 2015; 27
BSC, Bsc Application Repository
10.1016/j.future.2018.02.016_b5
10.1016/j.future.2018.02.016_b2
10.1016/j.future.2018.02.016_b3
10.1016/j.future.2018.02.016_b8
10.1016/j.future.2018.02.016_b9
Seo (10.1016/j.future.2018.02.016_b14) 2017; PP
10.1016/j.future.2018.02.016_b6
10.1016/j.future.2018.02.016_b7
Liao (10.1016/j.future.2018.02.016_b25) 2007; 19
10.1016/j.future.2018.02.016_b33
10.1016/j.future.2018.02.016_b10
10.1016/j.future.2018.02.016_b32
10.1016/j.future.2018.02.016_b16
10.1016/j.future.2018.02.016_b15
10.1016/j.future.2018.02.016_b36
10.1016/j.future.2018.02.016_b13
10.1016/j.future.2018.02.016_b35
Dagum (10.1016/j.future.2018.02.016_b4) 1998; 5
10.1016/j.future.2018.02.016_b19
10.1016/j.future.2018.02.016_b18
10.1016/j.future.2018.02.016_b17
Chang (10.1016/j.future.2018.02.016_b21) 1989
Nakashima (10.1016/j.future.2018.02.016_b12) 2014; vol. 8665
Olivier (10.1016/j.future.2018.02.016_b31) 2011
Duran (10.1016/j.future.2018.02.016_b20) 2011; 21
Kale (10.1016/j.future.2018.02.016_b11) 1993
Hadjidoukas (10.1016/j.future.2018.02.016_b28) 2007
10.1016/j.future.2018.02.016_b23
10.1016/j.future.2018.02.016_b22
10.1016/j.future.2018.02.016_b27
10.1016/j.future.2018.02.016_b26
10.1016/j.future.2018.02.016_b24
Tanaka (10.1016/j.future.2018.02.016_b29) 2000
Broquedis (10.1016/j.future.2018.02.016_b30) 2010; 38
Fu (10.1016/j.future.2018.02.016_b1) 2016; 59
Aliaga (10.1016/j.future.2018.02.016_b34) 2015; 27
References_xml – reference: TOP500 Supercomputer Sites, (June 2016).
– reference: BOLT: A Lightning-Fast OpenMP Implementation,
– volume: 19
  start-page: 2317
  year: 2007
  end-page: 2332
  ident: b25
  article-title: OpenUH: an optimizing, portable OpenMP compiler
  publication-title: Concurr. Comput.: Pract. Exper.
– reference: A. Castelló, A.J. Peña, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A review of lightweight thread approaches for high performance computing, in: Proceedings of the IEEE International Conference on Cluster Computing, Taipei, Taiwan, 2016.
– reference: L.V. Kalé, M.A. Bhandarkar, N. Jagathesan, S. Krishnan, J. Yelon, Converse: An interoperable framework for parallel programming, in: Proceedings of the 10th International Parallel Processing Symposium, IPPS, 1996, pp. 212–217.
– start-page: 100
  year: 2000
  end-page: 112
  ident: b29
  article-title: Performance evaluation of OpenMP applications with nested parallelism
  publication-title: International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
– reference: Clang project,
– reference: A. Castelló, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A.J. Peña, GLTO: On the adequacy of lightweight thread approaches for OpenMP implementations, in: Proceedings of the International Conference on Parallel Processing, Bristol, UK, 2017.
– reference: Microsoft MSDN Library, Fibers.
– reference: BSC, Nanos++,
– reference: Intel OpenMP Runtime Library,
– start-page: 662
  year: 2007
  end-page: 671
  ident: b28
  article-title: Nested parallelism in the OMPIOpenmP/C compiler
  publication-title: European Conference on Parallel Processing
– volume: 5
  start-page: 46
  year: 1998
  end-page: 55
  ident: b4
  article-title: OpenMP: an industry standard API for shared-memory programming
  publication-title: IEEE Comput. Sci. Eng.
– reference: Generic Lightweight Threads API,
– volume: 59
  start-page: 072001
  year: 2016
  ident: b1
  article-title: The Sunway TaihuLight supercomputer: system and applications
  publication-title: Sci. China Inf. Sci.
– reference: .
– reference: BSC, Mercurium,
– reference: K.B. Wheeler, R.C. Murphy, D. Thain, Qthreads: An API for programming with millions of lightweight threads, in: Proceedings of Workshop on Multithreaded Architectures and Applications, 2008.
– start-page: 246
  year: 1989
  end-page: 257
  ident: b21
  article-title: Inline function expansion for compiling C programs
  publication-title: Procedings of the ACM SIGPLAN 1989 Conference on Programming Language Design and Implementation
– reference: Intel Math Kernel Library,
– reference: LLVM project,
– reference: BSC, Bsc Application Repository,
– reference: .
– volume: vol. 8665
  start-page: 222
  year: 2014
  end-page: 238
  ident: b12
  article-title: MassiveThreads: A thread library for high productivity languages
  publication-title: Concurrent Objects and beyond
– year: 1993
  ident: b11
  article-title: CHARM++: A Portable Concurrent Object Oriented System Based on C++, Vol. 28
– reference: CloverLeaf miniapp,
– volume: 21
  start-page: 173
  year: 2011
  end-page: 193
  ident: b20
  article-title: OmpSs: A proposal for programming heterogeneous multi-core architectures
  publication-title: Parallel Process. Lett.
– start-page: 49
  year: 2011
  end-page: 56
  ident: b31
  article-title: Scheduling task parallelism on multi-socket multicore systems
  publication-title: Proceedings of the 1st International Workshop on Runtime and Operating Systems for Supercomputers
– volume: PP
  year: 2017
  ident: b14
  article-title: Argobots: A lightweight low-level threading and tasking framework
  publication-title: IEEE Trans. Parallel Distrib. Syst.
– reference: BSC, The OmpSs Programming Model,
– reference: D. Stein, D. Shah, Implementing lightweight threads, in: USENIX Summer, 1992.
– reference: Pthreads API,
– reference: OpenMP Architecture Review Board, OpenMP Application Programming Interface Version 4.5, (Nov. 2015).
– reference: A. Castelló, S. Seo, R. Mayo, P. Balaji, E.S. Quintana-Ortí, A.J. Peña, GLT: A unified API for lightweight thread libraries, in: Proceedings of the IEEE International European Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, 2017.
– reference: PGI Compilers & Tools,
– reference: Programming with Solaris Threads,
– volume: 38
  start-page: 418
  year: 2010
  end-page: 439
  ident: b30
  article-title: ForestGOMP: an efficient OpenMP environment for NUMA architectures
  publication-title: Int. J. Parallel Program.
– volume: 27
  start-page: 885
  year: 2015
  end-page: 904
  ident: b34
  article-title: Unveiling the performance-energy trade-off in iterative linear system solvers for multithreaded processors
  publication-title: Concurr. Comput.: Pract. Exp.
– ident: 10.1016/j.future.2018.02.016_b22
  doi: 10.1109/ICPP.2017.15
– ident: 10.1016/j.future.2018.02.016_b26
– ident: 10.1016/j.future.2018.02.016_b36
– ident: 10.1016/j.future.2018.02.016_b18
– ident: 10.1016/j.future.2018.02.016_b27
  doi: 10.1109/CLUSTER.2016.12
– ident: 10.1016/j.future.2018.02.016_b32
– start-page: 246
  year: 1989
  ident: 10.1016/j.future.2018.02.016_b21
  article-title: Inline function expansion for compiling C programs
– volume: 38
  start-page: 418
  issue: 5
  year: 2010
  ident: 10.1016/j.future.2018.02.016_b30
  article-title: ForestGOMP: an efficient OpenMP environment for NUMA architectures
  publication-title: Int. J. Parallel Program.
  doi: 10.1007/s10766-010-0136-3
– year: 1993
  ident: 10.1016/j.future.2018.02.016_b11
– ident: 10.1016/j.future.2018.02.016_b16
  doi: 10.1007/978-3-319-64203-1_34
– ident: 10.1016/j.future.2018.02.016_b5
– ident: 10.1016/j.future.2018.02.016_b7
– ident: 10.1016/j.future.2018.02.016_b3
– ident: 10.1016/j.future.2018.02.016_b23
– start-page: 49
  year: 2011
  ident: 10.1016/j.future.2018.02.016_b31
  article-title: Scheduling task parallelism on multi-socket multicore systems
– start-page: 662
  year: 2007
  ident: 10.1016/j.future.2018.02.016_b28
  article-title: Nested parallelism in the OMPIOpenmP/C compiler
– ident: 10.1016/j.future.2018.02.016_b9
  doi: 10.1109/IPPS.1996.508060
– ident: 10.1016/j.future.2018.02.016_b10
– volume: vol. 8665
  start-page: 222
  year: 2014
  ident: 10.1016/j.future.2018.02.016_b12
  article-title: MassiveThreads: A thread library for high productivity languages
– volume: PP
  issue: 99
  year: 2017
  ident: 10.1016/j.future.2018.02.016_b14
  article-title: Argobots: A lightweight low-level threading and tasking framework
  publication-title: IEEE Trans. Parallel Distrib. Syst.
– ident: 10.1016/j.future.2018.02.016_b33
– ident: 10.1016/j.future.2018.02.016_b19
– ident: 10.1016/j.future.2018.02.016_b35
– ident: 10.1016/j.future.2018.02.016_b17
– ident: 10.1016/j.future.2018.02.016_b15
– ident: 10.1016/j.future.2018.02.016_b8
– volume: 5
  start-page: 46
  issue: 1
  year: 1998
  ident: 10.1016/j.future.2018.02.016_b4
  article-title: OpenMP: an industry standard API for shared-memory programming
  publication-title: IEEE Comput. Sci. Eng.
  doi: 10.1109/99.660313
– volume: 19
  start-page: 2317
  issue: 18
  year: 2007
  ident: 10.1016/j.future.2018.02.016_b25
  article-title: OpenUH: an optimizing, portable OpenMP compiler
  publication-title: Concurr. Comput.: Pract. Exper.
  doi: 10.1002/cpe.1174
– start-page: 100
  year: 2000
  ident: 10.1016/j.future.2018.02.016_b29
  article-title: Performance evaluation of OpenMP applications with nested parallelism
– volume: 27
  start-page: 885
  issue: 4
  year: 2015
  ident: 10.1016/j.future.2018.02.016_b34
  article-title: Unveiling the performance-energy trade-off in iterative linear system solvers for multithreaded processors
  publication-title: Concurr. Comput.: Pract. Exp.
  doi: 10.1002/cpe.3341
– volume: 59
  start-page: 072001
  issue: 7
  year: 2016
  ident: 10.1016/j.future.2018.02.016_b1
  article-title: The Sunway TaihuLight supercomputer: system and applications
  publication-title: Sci. China Inf. Sci.
  doi: 10.1007/s11432-016-5588-7
– volume: 21
  start-page: 173
  issue: 02
  year: 2011
  ident: 10.1016/j.future.2018.02.016_b20
  article-title: OmpSs: A proposal for programming heterogeneous multi-core architectures
  publication-title: Parallel Process. Lett.
  doi: 10.1142/S0129626411000151
– ident: 10.1016/j.future.2018.02.016_b13
  doi: 10.1109/IPDPS.2008.4536359
– ident: 10.1016/j.future.2018.02.016_b6
– ident: 10.1016/j.future.2018.02.016_b24
– ident: 10.1016/j.future.2018.02.016_b2
SSID ssj0001731
Score 2.2442765
Snippet High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism....
SourceID osti
csuc
crossref
elsevier
SourceType Open Access Repository
Enrichment Source
Index Database
Publisher
StartPage 22
SubjectTerms Dynamic programming
GLT
Informàtica
Lightweight threads
MATHEMATICS AND COMPUTING
OmpSs
OpenMP
POSIX Threads
Programació dinàmica
Programming models
Àrees temàtiques de la UPC
Title On the adequacy of lightweight thread approaches for high-level parallel programming models
URI https://dx.doi.org/10.1016/j.future.2018.02.016
https://recercat.cat/handle/2072/309398
https://www.osti.gov/servlets/purl/1465734
Volume 84
WOSCitedRecordID wos000430521600003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1872-7115
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0001731
  issn: 0167-739X
  databaseCode: AIEXJ
  dateStart: 19950201
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3db9MwELdKxwMvfCPKAPmBJ6agJk5i53GrNgESG0IDVeLBsh1H2pSlI23H-BP4r7mznTQDxOCBh0Zpmo_K9_P57nL3O0JepJprZSuY3-AsRzATTaRZJaIpL61CE5_HlWs2wQ8PxXxevB-Nvne1MBc1bxpxeVmc_1dRwzEQNpbO_oO4-5vCAdgHocMWxA7bvxL8kU9cVKX9ssZe7gusMwYP_KsLgmJjHhBrzyVuHR_DDrIWRzUmEO0gGXhd445P3TrDYIJrmLMcWrIHjowEOzDbACITGkQEdujeWJ-pJZaq4Cv5PeZ0UdmeuBf0cY_Md-qbi9l-UFWXxI9hH1X7ijVYvftT92y9an3Y9hNoucbdiQ-jF7HoM11DSC2s_8MIJ2huzlx_XVigvFYWHNyA2Nd9dmrbd5YL8Jz9djXwgYnTV56eBfP4hCNojX8i3_beUJpnnKU3yFbCswJ0_Nbum_35235Zj3lobhn-X1eH6ZIFf33CFTtnbJZrc8XcGS9Agw8smeO75HZwQeiuh849MrLNfXKna-9Bg7Z_QD4fNRSQRDsk0UVFB0iiHkl0gyQKSKIbJNEOSXSAJOqR9JB8PNg_nr2OQi-OyKRJvIp4ZcU0zcsSPIJMFzplymqt01THeZZZmNOl4WB6FmlWKnAKWFUyHltuTa5VrjV7RMbNorGPCWU53CuumEhMkZoiFjmcDSMqGANv32YTwrqRkyYQ1WO_lFp2GYmn0o-3xPGW00TCwQmJ-qvOPVHLNee_RKFIsCtsa9RKIs96_wU_yZQnEhMFCjEhvBOdDJaptzglQO2ax2yjpPEqJGU2mL0GlwWwPfnjr9vk1mbKPCXjVbu2z8hNc7E6WbbPAz5_AD0pt8I
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=On+the+adequacy+of+lightweight+thread+approaches+for+high-level+parallel+programming+models&rft.jtitle=Future+generation+computer+systems&rft.au=Castell%C3%B3%2C+Adri%C3%A1n&rft.au=Mayo%2C+Rafael&rft.au=Sala%2C+Kevin&rft.au=Beltran%2C+Vicen%C3%A7&rft.date=2018-07-01&rft.pub=Elsevier&rft.issn=0167-739X&rft.eissn=1872-7115&rft.volume=84&rft.issue=C&rft_id=info:doi/10.1016%2Fj.future.2018.02.016&rft.externalDocID=1465734
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-739X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-739X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-739X&client=summon