Locality-Aware Automatic Parallelization for GPGPU with OpenHMPP Directives

Published in: International Journal of Parallel Programming, Volume 44, Issue 3, pp. 620-643
Main authors: Andión, José M.; Arenaz, Manuel; Bodin, François; Rodríguez, Gabriel; Touriño, Juan
Format: Journal Article
Language: English
Publication details: New York: Springer US, 1 June 2016
Springer Nature B.V
ISSN: 0885-7458, 1573-7640
Abstract: The use of GPUs for general-purpose computation has increased dramatically in recent years, driven by rising demand for computing power and by the high computing capacity that GPUs offer at low cost. New programming models have therefore been developed to integrate these accelerators with high-level programming languages, giving rise to heterogeneous computing systems. Unfortunately, this heterogeneity is also exposed to the programmer, which complicates its exploitation. This paper presents a new technique that automatically rewrites sequential programs into parallel counterparts targeting GPU-based heterogeneous systems. The original source code is analyzed through domain-independent computational kernels, which hide the complexity of the implementation details by presenting a non-statement-based, high-level, hierarchical representation of the application. Next, a locality-aware technique based on standard compiler transformations is applied to the original code through OpenHMPP directives. Two representative case studies from scientific applications have been selected: the three-dimensional discrete convolution and the single-precision general matrix multiplication. The effectiveness of the technique is corroborated by a performance evaluation on NVIDIA GPUs.
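As an illustration of what a locality-oriented "standard compiler transformation" can look like on one of the two case studies, the sketch below applies loop tiling to a general matrix multiplication (C = alpha*A*B + beta*C). This is a minimal sketch under stated assumptions: the abstract does not say which transformations the paper uses, so tiling is chosen here only as a well-known example; the function names `gemm_naive` and `gemm_tiled` and the `tile` parameter are hypothetical; and the code is plain sequential Python, with no OpenHMPP directives or GPU code shown.

```python
def gemm_naive(alpha, a, b, beta, c):
    """Return alpha * A @ B + beta * C using the textbook triple loop."""
    m, k_dim, n = len(a), len(b), len(b[0])
    # Start from beta * C, then accumulate alpha * (A @ B) into it.
    out = [[beta * c[i][j] for j in range(n)] for i in range(m)]
    for i in range(m):
        for j in range(n):
            acc = 0.0
            for k in range(k_dim):
                acc += a[i][k] * b[k][j]
            out[i][j] += alpha * acc
    return out


def gemm_tiled(alpha, a, b, beta, c, tile=2):
    """Same computation, iterating over tile x tile blocks (loop tiling).

    Each block of A and B is reused while it is "hot", which is the kind of
    data reuse a locality-aware transformation tries to expose.
    The tile size is an illustrative assumption, not a value from the paper.
    """
    m, k_dim, n = len(a), len(b), len(b[0])
    out = [[beta * c[i][j] for j in range(n)] for i in range(m)]
    for ii in range(0, m, tile):
        for jj in range(0, n, tile):
            for kk in range(0, k_dim, tile):
                # Work on one block; min() handles sizes not divisible by tile.
                for i in range(ii, min(ii + tile, m)):
                    for j in range(jj, min(jj + tile, n)):
                        acc = 0.0
                        for k in range(kk, min(kk + tile, k_dim)):
                            acc += a[i][k] * b[k][j]
                        out[i][j] += alpha * acc
    return out
```

Both functions compute the same result; only the iteration order differs. On a GPU, a tile would typically correspond to the data a group of threads shares in fast on-chip memory, but that mapping is outside the scope of this sketch.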
Issue Title: Special Issue on High-Level Parallel Programming and Applications
Authors and affiliations:
1. José M. Andión (jandion@udc.es), Dep. de Electrónica e Sistemas, Universidade da Coruña
2. Manuel Arenaz, Dep. de Electrónica e Sistemas, Universidade da Coruña
3. François Bodin, Institut de Recherche en Informatique et Systèmes Aléatoires
4. Gabriel Rodríguez, Dep. de Electrónica e Sistemas, Universidade da Coruña
5. Juan Touriño, Dep. de Electrónica e Sistemas, Universidade da Coruña
CODEN: IJPPE5
Copyright: Springer Science+Business Media New York 2015; Springer Science+Business Media New York 2016
DOI: 10.1007/s10766-015-0362-9
Discipline: Computer Science
Open access: yes (DOI open access: no)
Peer reviewed: yes
Scholarly: yes
Keywords: Heterogeneous systems; Automatic parallelization; Domain-independent kernel; Locality; GPGPU; OpenHMPP
Open access link: http://hdl.handle.net/2183/20902
Subject terms: Analysis; Automation; Case studies; Computation; Computer programming; Computer Science; Computing costs; Demand; Heterogeneity; Optimization techniques; Parallel processing; Performance evaluation; Processor Architectures; Programming languages; Software; Software Engineering/Programming and Operating Systems; Source code; Studies; Theory of Computation; Three dimensional; Transformations