Locality-Aware Automatic Parallelization for GPGPU with OpenHMPP Directives
The use of GPUs for general purpose computation has increased dramatically in the past years due to the rising demands of computing power and their tremendous computing capacity at low cost. Hence, new programming models have been developed to integrate these accelerators with high-level programming...
Uložené v:
| Vydané v: | International journal of parallel programming Ročník 44; číslo 3; s. 620 - 643 |
|---|---|
| Hlavní autori: | , , , , |
| Médium: | Journal Article |
| Jazyk: | English |
| Vydavateľské údaje: |
New York
Springer US
01.06.2016
Springer Nature B.V |
| Predmet: | |
| ISSN: | 0885-7458, 1573-7640 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Abstract | The use of GPUs for general purpose computation has increased dramatically in the past years due to the rising demands of computing power and their tremendous computing capacity at low cost. Hence, new programming models have been developed to integrate these accelerators with high-level programming languages, giving place to heterogeneous computing systems. Unfortunately, this heterogeneity is also exposed to the programmer complicating its exploitation. This paper presents a new technique to automatically rewrite sequential programs into a parallel counterpart targeting GPU-based heterogeneous systems. The original source code is analyzed through domain-independent computational kernels, which hide the complexity of the implementation details by presenting a non-statement-based, high-level, hierarchical representation of the application. Next, a locality-aware technique based on standard compiler transformations is applied to the original code through OpenHMPP directives. Two representative case studies from scientific applications have been selected: the three-dimensional discrete convolution and the simple-precision general matrix multiplication. The effectiveness of our technique is corroborated by a performance evaluation on NVIDIA GPUs. |
|---|---|
| AbstractList | The use of GPUs for general purpose computation has increased dramatically in the past years due to the rising demands of computing power and their tremendous computing capacity at low cost. Hence, new programming models have been developed to integrate these accelerators with high-level programming languages, giving place to heterogeneous computing systems. Unfortunately, this heterogeneity is also exposed to the programmer complicating its exploitation. This paper presents a new technique to automatically rewrite sequential programs into a parallel counterpart targeting GPU-based heterogeneous systems. The original source code is analyzed through domain-independent computational kernels, which hide the complexity of the implementation details by presenting a non-statement-based, high-level, hierarchical representation of the application. Next, a locality-aware technique based on standard compiler transformations is applied to the original code through OpenHMPP directives. Two representative case studies from scientific applications have been selected: the three-dimensional discrete convolution and the simple-precision general matrix multiplication. The effectiveness of our technique is corroborated by a performance evaluation on NVIDIA GPUs. Issue Title: Special Issue on High-Level Parallel Programming and Applications The use of GPUs for general purpose computation has increased dramatically in the past years due to the rising demands of computing power and their tremendous computing capacity at low cost. Hence, new programming models have been developed to integrate these accelerators with high-level programming languages, giving place to heterogeneous computing systems. Unfortunately, this heterogeneity is also exposed to the programmer complicating its exploitation. This paper presents a new technique to automatically rewrite sequential programs into a parallel counterpart targeting GPU-based heterogeneous systems. The original source code is analyzed through domain-independent computational kernels, which hide the complexity of the implementation details by presenting a non-statement-based, high-level, hierarchical representation of the application. Next, a locality-aware technique based on standard compiler transformations is applied to the original code through OpenHMPP directives. Two representative case studies from scientific applications have been selected: the three-dimensional discrete convolution and the simple-precision general matrix multiplication. The effectiveness of our technique is corroborated by a performance evaluation on NVIDIA GPUs. |
| Author | Touriño, Juan Bodin, François Andión, José M. Arenaz, Manuel Rodríguez, Gabriel |
| Author_xml | – sequence: 1 givenname: José M. surname: Andión fullname: Andión, José M. email: jandion@udc.es organization: Dep. de Electrónica e Sistemas, Universidade da Coruña – sequence: 2 givenname: Manuel surname: Arenaz fullname: Arenaz, Manuel organization: Dep. de Electrónica e Sistemas, Universidade da Coruña – sequence: 3 givenname: François surname: Bodin fullname: Bodin, François organization: Institut de Recherche en Informatique et Systèmes Aléatoires – sequence: 4 givenname: Gabriel surname: Rodríguez fullname: Rodríguez, Gabriel organization: Dep. de Electrónica e Sistemas, Universidade da Coruña – sequence: 5 givenname: Juan surname: Touriño fullname: Touriño, Juan organization: Dep. de Electrónica e Sistemas, Universidade da Coruña |
| BookMark | eNp9kE1LAzEQhoNUsFV_gLcFL15WJ5vNxx6LH1WsuAc9hzSd1ch2U5PUUn-9q_Uggp6GgecZ3nlHZND5Dgk5onBKAeRZpCCFyIHyHJgo8mqHDCmXLJeihAEZglI8lyVXe2QU4wsAVFKpIbmdemtalzb5eG0CZuNV8guTnM1qE0zbYuve-9V3WeNDNqkn9WO2duk5u19id31X19mFC2iTe8N4QHYb00Y8_J775PHq8uH8Op_eT27Ox9PcsqpIfaLGQFE0jAsLqmGylHLOoaIz5DNm6JzNhBUMgaMoRYGqsraSUgGdS4GVZfvkZHt3GfzrCmPSCxcttq3p0K-ipgoUpQxK2qPHv9AXvwpdn05TqZjijBWip-SWssHHGLDR1qWvr1MwrtUU9GfJeluy7kvWnyXrqjfpL3MZ3MKEzb9OsXViz3ZPGH5k-lP6ABUxjpY |
| CODEN | IJPPE5 |
| CitedBy_id | crossref_primary_10_1016_j_comcom_2020_04_012 crossref_primary_10_1145_3084540 |
| Cites_doi | 10.1145/1391956.1391959 10.1109/TPDS.2010.62 10.1007/s00450-011-0160-6 10.1002/cpe.2917 10.1002/cpe.1173 10.1145/2400682.2400713 10.1109/TPDS.2011.311 10.1109/TPDS.2012.160 10.1109/JPROC.2008.917757 10.1016/j.parco.2013.04.003 10.1145/582034.582089 10.1109/SC.2012.51 10.1145/220346.220352 10.1145/1375581.1375595 10.1145/1735688.1735697 10.1145/2259016.2259038 10.1109/InPar.2012.6339595 10.1007/BFb0038658 10.1007/978-3-642-11970-5_14 10.1109/SC.2010.36 10.1145/1993498.1993516 10.1109/IPDPS.2004.1303015 |
| ContentType | Journal Article |
| Copyright | Springer Science+Business Media New York 2015 Springer Science+Business Media New York 2016 |
| Copyright_xml | – notice: Springer Science+Business Media New York 2015 – notice: Springer Science+Business Media New York 2016 |
| DBID | AAYXX CITATION 3V. 7SC 7WY 7WZ 7XB 87Z 8AL 8FD 8FE 8FG 8FK 8FL 8G5 ABUWG AFKRA ARAPS AZQEC BENPR BEZIV BGLVJ CCPQU DWQXO FRNLG F~G GNUQQ GUQSH HCIFZ JQ2 K60 K6~ K7- L.- L.0 L7M L~C L~D M0C M0N M2O MBDVC P5Z P62 PHGZM PHGZT PKEHL PQBIZ PQBZA PQEST PQGLB PQQKQ PQUKI Q9U |
| DOI | 10.1007/s10766-015-0362-9 |
| DatabaseName | CrossRef ProQuest Central (Corporate) Computer and Information Systems Abstracts ProQuest ABI/INFORM Collection ABI/INFORM Global (PDF only) ProQuest Central (purchase pre-March 2016) ABI/INFORM Collection Computing Database (Alumni Edition) Technology Research Database ProQuest SciTech Collection ProQuest Technology Collection ProQuest Central (Alumni) (purchase pre-March 2016) ABI/INFORM Collection (Alumni Edition) Research Library (Alumni Edition) ProQuest Central (Alumni) ProQuest Central UK/Ireland Advanced Technologies & Computer Science Collection ProQuest Central Essentials ProQuest Central Business Premium Collection Technology Collection ProQuest One Community College ProQuest Central Korea Business Premium Collection (Alumni) ABI/INFORM Global (Corporate) ProQuest Central Student Research Library Prep SciTech Premium Collection ProQuest Computer Science Collection ProQuest Business Collection (Alumni Edition) ProQuest Business Collection Computer Science Database ABI/INFORM Professional Advanced ABI/INFORM Professional Standard Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional ABI/INFORM Global Computing Database Research Library Research Library (Corporate) Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Premium ProQuest One Academic ProQuest One Academic Middle East (New) ProQuest One Business (OCUL) ProQuest One Business (Alumni) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic (retired) ProQuest One Academic UKI Edition ProQuest Central Basic |
| DatabaseTitle | CrossRef ABI/INFORM Global (Corporate) ProQuest Business Collection (Alumni Edition) ProQuest One Business Research Library Prep Computer Science Database ProQuest Central Student Technology Collection Technology Research Database Computer and Information Systems Abstracts – Academic ProQuest One Academic Middle East (New) ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection Computer and Information Systems Abstracts ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College Research Library (Alumni Edition) ABI/INFORM Complete ProQuest Central ABI/INFORM Professional Advanced ProQuest One Applied & Life Sciences ABI/INFORM Professional Standard ProQuest Central Korea ProQuest Research Library ProQuest Central (New) Advanced Technologies Database with Aerospace ABI/INFORM Complete (Alumni Edition) Advanced Technologies & Aerospace Collection Business Premium Collection ABI/INFORM Global ProQuest Computing ABI/INFORM Global (Alumni Edition) ProQuest Central Basic ProQuest Computing (Alumni Edition) ProQuest One Academic Eastern Edition ProQuest Technology Collection ProQuest SciTech Collection ProQuest Business Collection Computer and Information Systems Abstracts Professional Advanced Technologies & Aerospace Database ProQuest One Academic UKI Edition ProQuest One Business (Alumni) ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni) Business Premium Collection (Alumni) |
| DatabaseTitleList | ABI/INFORM Global (Corporate) Computer and Information Systems Abstracts |
| Database_xml | – sequence: 1 dbid: BENPR name: ProQuest Central url: https://www.proquest.com/central sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1573-7640 |
| EndPage | 643 |
| ExternalDocumentID | 4033358311 10_1007_s10766_015_0362_9 |
| Genre | Feature |
| GroupedDBID | -4Z -59 -5G -BR -EM -Y2 -~C -~X .4S .86 .DC .VR 06D 0R~ 0VY 199 1N0 2.D 203 28- 29J 2J2 2JN 2JY 2KG 2LR 2P1 2VQ 2~H 30V 3V. 4.4 406 408 409 40D 40E 5GY 5QI 5VS 67Z 6NX 78A 7WY 8FE 8FG 8FL 8G5 8TC 8UJ 95- 95. 95~ 96X AAAVM AABHQ AACDK AAHNG AAIAL AAJBT AAJKR AANZL AAOBN AARHV AARTL AASML AATNV AATVU AAUYE AAWCG AAYIU AAYJJ AAYQN AAYTO AAYZH ABAKF ABBBX ABBXA ABDBF ABDPE ABDZT ABECU ABFSI ABFTD ABFTV ABHLI ABHQN ABJNI ABJOX ABKCH ABKTR ABMNI ABMQK ABNWP ABQBU ABQSL ABSXP ABTAH ABTEG ABTHY ABTKH ABTMW ABULA ABUWG ABWNU ABXPI ACAOD ACBXY ACDTI ACGFO ACGFS ACHSB ACHXU ACIHN ACKNC ACMDZ ACMLO ACNCT ACOKC ACOMO ACPIV ACREN ACUHS ACZOJ ADHIR ADINQ ADKNI ADKPE ADMLS ADRFC ADTPH ADURQ ADYFF ADYOE ADZKW AEAQA AEBTG AEFIE AEFQL AEGAL AEGNC AEJHL AEJRE AEKMD AEMSY AENEX AEOHA AEPYU AESKC AETLH AEVLU AEXYK AFBBN AFEXP AFGCZ AFKRA AFLOW AFQWF AFWTZ AFYQB AFZKB AGAYW AGDGC AGGDS AGJBK AGMZJ AGQEE AGQMX AGRTI AGWIL AGWZB AGYKE AHAVH AHBYD AHKAY AHSBF AHYZX AIAKS AIGIU AIIXL AILAN AITGF AJBLW AJRNO AJZVZ ALMA_UNASSIGNED_HOLDINGS ALWAN AMKLP AMTXH AMXSW AMYLF AOCGG ARAPS ARCSS ARMRJ AXYYD AYJHY AZFZN AZQEC B-. B0M BA0 BBWZM BDATZ BENPR BEZIV BGLVJ BGNMA BKOMP BPHCQ BSONS CAG CCPQU COF CS3 CSCUP DDRTE DL5 DNIVK DPUIP DU5 DWQXO E.L EAD EAP EAS EBLON EBS EDO EIOEI EJD EMK EPL ESBYG ESX FEDTE FERAY FFXSO FIGPU FINBP FNLPD FRNLG FRRFC FSGXE FWDCC GGCAI GGRSB GJIRD GNUQQ GNWQR GQ6 GQ7 GQ8 GROUPED_ABI_INFORM_COMPLETE GROUPED_ABI_INFORM_RESEARCH GUQSH GXS H13 HCIFZ HF~ HG5 HG6 HMJXF HQYDN HRMNR HVGLF HZ~ H~9 I-F I09 IHE IJ- IKXTQ ITM IWAJR IXC IZIGR IZQ I~X I~Z J-C J0Z JBSCW JCJTX JZLTJ K60 K6V K6~ K7- KDC KOV KOW LAK LLZTM M0C M0N M2O M4Y MA- MS~ N2Q NB0 NDZJH NPVJJ NQJWS NU0 O9- O93 O9G O9I O9J OAM OVD P19 P62 P9O PF0 PQBIZ PQBZA PQQKQ PROAC PT4 PT5 Q2X QOK QOS R89 R9I RHV RNI RNS ROL RPX RSV RZC RZE RZK S16 S1Z S26 S27 S28 S3B SAP SCJ SCLPG SCO SDH SDM SHX SISQX SJYHP SNE SNPRN SNX SOHCF SOJ SPISZ SRMVM SSLCW STPWE SZN T13 T16 TAE TEORI TN5 TSG TSK TSV TUC TUS U2A U5U UG4 UOJIU UTJUX UZXMN VC2 VFIZW VXZ W23 W48 WH7 WK8 YLTOR Z45 Z7R Z7X Z81 Z83 Z88 Z8R Z8W Z92 ZMTXR ZY4 ~8M ~EX AAPKM AAYXX ABBRH ABDBE ABFSG ABRTQ ACSTC ADHKG AEZWR AFDZB AFFHD AFHIU AFOHR AGQPQ AHPBZ AHWEU AIXLP ATHPR AYFIA CITATION PHGZM PHGZT PQGLB 7SC 7XB 8AL 8FD 8FK JQ2 L.- L.0 L7M L~C L~D MBDVC PKEHL PQEST PQUKI Q9U |
| ID | FETCH-LOGICAL-c392t-76fa022f356c08f37477d5091be5b3a1d3b6c63e05e6462e89cc977801d76e9c3 |
| IEDL.DBID | RSV |
| ISICitedReferencesCount | 8 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000374897200013&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0885-7458 |
| IngestDate | Sun Nov 09 12:45:16 EST 2025 Tue Nov 04 22:00:45 EST 2025 Sat Nov 29 01:59:42 EST 2025 Tue Nov 18 21:33:38 EST 2025 Fri Feb 21 02:37:21 EST 2025 |
| IsDoiOpenAccess | false |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 3 |
| Keywords | Heterogeneous systems Automatic parallelization Domain-independent kernel Locality GPGPU OpenHMPP |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c392t-76fa022f356c08f37477d5091be5b3a1d3b6c63e05e6462e89cc977801d76e9c3 |
| Notes | SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 |
| OpenAccessLink | http://hdl.handle.net/2183/20902 |
| PQID | 1783853326 |
| PQPubID | 48389 |
| PageCount | 24 |
| ParticipantIDs | proquest_miscellaneous_1808113041 proquest_journals_1783853326 crossref_citationtrail_10_1007_s10766_015_0362_9 crossref_primary_10_1007_s10766_015_0362_9 springer_journals_10_1007_s10766_015_0362_9 |
| PublicationCentury | 2000 |
| PublicationDate | 20160600 2016-6-00 20160601 |
| PublicationDateYYYYMMDD | 2016-06-01 |
| PublicationDate_xml | – month: 6 year: 2016 text: 20160600 |
| PublicationDecade | 2010 |
| PublicationPlace | New York |
| PublicationPlace_xml | – name: New York |
| PublicationTitle | International journal of parallel programming |
| PublicationTitleAbbrev | Int J Parallel Prog |
| PublicationYear | 2016 |
| Publisher | Springer US Springer Nature B.V |
| Publisher_xml | – name: Springer US – name: Springer Nature B.V |
| References | Owens, Houston, Luebke, Green, Stone, Phillips (CR29) 2008; 96 Andrade, Arenaz, Fraguela, Touriño, Doallo (CR2) 2007; 19 CR19 CR17 CR16 CR15 CR37 CR14 CR36 CR35 CR12 CR11 CR32 Arenaz, Touriño, Doallo (CR5) 2008; 30 CR31 CR30 Andión, Arenaz, Rodríguez, Touriño (CR1) 2013; 39 Han, Abdelrahman (CR13) 2011; 22 CR4 CR3 CR6 Christen, Schenk, Burkhart (CR10) 2011; 26 CR7 CR28 CR9 CR27 CR26 CR25 Bodin, Bihan (CR8) 2009; 17 CR24 CR23 CR22 CR21 CR20 Zhang, Mueller (CR38) 2013; 24 Viñas, Lobeiras, Fraguela, Arenaz, Amor, García, Castro, Doallo (CR34) 2013; 25 Verdoolaege, Juega, Cohen, Gómez, Tenllado, Catthoor (CR33) 2013; 9 Kurzak, Tomov, Dongarra (CR18) 2012; 23 362_CR31 M Viñas (362_CR34) 2013; 25 362_CR30 362_CR11 S Verdoolaege (362_CR33) 2013; 9 362_CR32 J Owens (362_CR29) 2008; 96 362_CR24 362_CR23 362_CR26 362_CR25 J Kurzak (362_CR18) 2012; 23 362_CR28 M Christen (362_CR10) 2011; 26 362_CR27 D Andrade (362_CR2) 2007; 19 362_CR3 TD Han (362_CR13) 2011; 22 362_CR20 362_CR4 362_CR22 362_CR6 362_CR21 362_CR7 362_CR9 JM Andión (362_CR1) 2013; 39 F Bodin (362_CR8) 2009; 17 M Arenaz (362_CR5) 2008; 30 Y Zhang (362_CR38) 2013; 24 362_CR35 362_CR12 362_CR15 362_CR37 362_CR14 362_CR36 362_CR17 362_CR16 362_CR19 |
| References_xml | – volume: 30 start-page: 32:1 issue: 6 year: 2008 end-page: 32:56 ident: CR5 article-title: XARK: an extensible framework for automatic recognition of computational kernels publication-title: ACM Trans. Program. Lang. Syst. doi: 10.1145/1391956.1391959 – ident: CR22 – volume: 22 start-page: 78 issue: 1 year: 2011 end-page: 90 ident: CR13 article-title: hiCUDA: High-level GPGPU programming publication-title: IEEE Trans. Parallel Distrib. Syst. doi: 10.1109/TPDS.2010.62 – ident: CR4 – ident: CR14 – ident: CR16 – ident: CR37 – ident: CR12 – ident: CR30 – ident: CR35 – ident: CR6 – ident: CR25 – ident: CR27 – ident: CR23 – volume: 26 start-page: 205 issue: 3–4 year: 2011 end-page: 210 ident: CR10 article-title: Automatic code generation and tuning for stencil kernels on modern shared memory architectures publication-title: Comp. Sci. Res. Dev. doi: 10.1007/s00450-011-0160-6 – ident: CR21 – ident: CR19 – volume: 25 start-page: 1153 issue: 8 year: 2013 end-page: 1169 ident: CR34 article-title: A multi-GPU shallow-water simulation with transport of contaminants publication-title: Concurr. Comput. Pract. Exp. doi: 10.1002/cpe.2917 – ident: CR3 – ident: CR15 – volume: 19 start-page: 2407 issue: 18 year: 2007 end-page: 2423 ident: CR2 article-title: Automated and accurate cache behavior analysis for codes with irregular access patterns publication-title: Concurr. Comput. Pract. Exp. doi: 10.1002/cpe.1173 – ident: CR17 – ident: CR31 – ident: CR11 – volume: 9 start-page: 54:1 issue: 4 year: 2013 end-page: 54:23 ident: CR33 article-title: Polyhedral parallel code generation for CUDA publication-title: ACM Trans. Archit. Code Optim. doi: 10.1145/2400682.2400713 – ident: CR9 – volume: 23 start-page: 2045 issue: 11 year: 2012 end-page: 2057 ident: CR18 article-title: Autotuning GEMM kernels for the Fermi GPU publication-title: IEEE Trans. Parallel Distrib. Syst. doi: 10.1109/TPDS.2011.311 – ident: CR32 – volume: 24 start-page: 417 issue: 3 year: 2013 end-page: 427 ident: CR38 article-title: Autogeneration and autotuning of 3D stencil codes on homogeneous and heterogeneous GPU clusters publication-title: IEEE Trans. Parallel Distrib. Syst. doi: 10.1109/TPDS.2012.160 – ident: CR36 – ident: CR7 – volume: 96 start-page: 879 issue: 5 year: 2008 end-page: 899 ident: CR29 article-title: GPU computing publication-title: Proc. IEEE doi: 10.1109/JPROC.2008.917757 – volume: 17 start-page: 325 issue: 4 year: 2009 end-page: 336 ident: CR8 article-title: Heterogeneous multicore parallel programming for graphics processing units publication-title: Sci. Program. – ident: CR28 – ident: CR26 – ident: CR24 – ident: CR20 – volume: 39 start-page: 442 issue: 9 year: 2013 end-page: 460 ident: CR1 article-title: A novel compiler support for automatic parallelization on multicore systems publication-title: Parallel Comput. doi: 10.1016/j.parco.2013.04.003 – ident: 362_CR24 – volume: 96 start-page: 879 issue: 5 year: 2008 ident: 362_CR29 publication-title: Proc. IEEE doi: 10.1109/JPROC.2008.917757 – ident: 362_CR22 – volume: 24 start-page: 417 issue: 3 year: 2013 ident: 362_CR38 publication-title: IEEE Trans. Parallel Distrib. Syst. doi: 10.1109/TPDS.2012.160 – volume: 30 start-page: 32:1 issue: 6 year: 2008 ident: 362_CR5 publication-title: ACM Trans. Program. Lang. Syst. doi: 10.1145/1391956.1391959 – volume: 39 start-page: 442 issue: 9 year: 2013 ident: 362_CR1 publication-title: Parallel Comput. doi: 10.1016/j.parco.2013.04.003 – volume: 26 start-page: 205 issue: 3–4 year: 2011 ident: 362_CR10 publication-title: Comp. Sci. Res. Dev. doi: 10.1007/s00450-011-0160-6 – ident: 362_CR19 doi: 10.1145/582034.582089 – ident: 362_CR21 doi: 10.1109/SC.2012.51 – ident: 362_CR28 – ident: 362_CR26 – ident: 362_CR30 – volume: 22 start-page: 78 issue: 1 year: 2011 ident: 362_CR13 publication-title: IEEE Trans. Parallel Distrib. Syst. doi: 10.1109/TPDS.2010.62 – volume: 17 start-page: 325 issue: 4 year: 2009 ident: 362_CR8 publication-title: Sci. Program. – ident: 362_CR37 doi: 10.1145/220346.220352 – ident: 362_CR7 – ident: 362_CR9 doi: 10.1145/1375581.1375595 – ident: 362_CR36 doi: 10.1145/1735688.1735697 – ident: 362_CR15 – ident: 362_CR32 – ident: 362_CR16 doi: 10.1145/2259016.2259038 – ident: 362_CR12 doi: 10.1109/InPar.2012.6339595 – ident: 362_CR23 – volume: 19 start-page: 2407 issue: 18 year: 2007 ident: 362_CR2 publication-title: Concurr. Comput. Pract. Exp. doi: 10.1002/cpe.1173 – ident: 362_CR25 – ident: 362_CR27 – ident: 362_CR31 – ident: 362_CR11 doi: 10.1007/BFb0038658 – volume: 9 start-page: 54:1 issue: 4 year: 2013 ident: 362_CR33 publication-title: ACM Trans. Archit. Code Optim. doi: 10.1145/2400682.2400713 – ident: 362_CR3 – ident: 362_CR6 doi: 10.1007/978-3-642-11970-5_14 – ident: 362_CR14 – volume: 23 start-page: 2045 issue: 11 year: 2012 ident: 362_CR18 publication-title: IEEE Trans. Parallel Distrib. Syst. doi: 10.1109/TPDS.2011.311 – ident: 362_CR20 doi: 10.1109/SC.2010.36 – volume: 25 start-page: 1153 issue: 8 year: 2013 ident: 362_CR34 publication-title: Concurr. Comput. Pract. Exp. doi: 10.1002/cpe.2917 – ident: 362_CR35 – ident: 362_CR17 doi: 10.1145/1993498.1993516 – ident: 362_CR4 doi: 10.1109/IPDPS.2004.1303015 |
| SSID | ssj0009788 |
| Score | 2.0962272 |
| Snippet | The use of GPUs for general purpose computation has increased dramatically in the past years due to the rising demands of computing power and their tremendous... Issue Title: Special Issue on High-Level Parallel Programming and Applications The use of GPUs for general purpose computation has increased dramatically in... |
| SourceID | proquest crossref springer |
| SourceType | Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 620 |
| SubjectTerms | Analysis Automation Case studies Computation Computer programming Computer Science Computing costs Demand Heterogeneity Optimization techniques Parallel processing Performance evaluation Processor Architectures Programming languages Software Software Engineering/Programming and Operating Systems Source code Studies Theory of Computation Three dimensional Transformations |
| SummonAdditionalLinks | – databaseName: ABI/INFORM Global dbid: M0C link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3JSgQxEC3cDl7cxXEjgiclOG3WPskgLuBCHxS8NeksIMiMzqK_b6UnPaOCXjx3uhP6pVIvqUo9gEMmgtDSCKqywCnnQVMjQkUt98rljhkeaqRv1f29fnrKi3TgNkhplc2aWC_UrmfjGflJpjRD14Js4-z1jUbVqBhdTRIaszAfmU1M6btrn0-L7qpadxINCcfChW6imuOrc0rGvbSg9cWh_LtfmpLNH_HR2u1cLv93wCuwlAgn6YxnyCrM-O4aLDdiDiTZ9jrc3Eavhpycdj5M35POaNiry7mSwvSj4MpLurFJkOaSq-KqeCTxEJfElJTru6Igk9VzsAGPlxcP59c0KS1Qi_xoSJUMBp15YELatg4M9xjKRSpReVExkzlWSSuZbwsvuTz1OrcWiSN6N6ekzy3bhLlur-u3gHhmfAinlRU5585qg3selzkdjK1EUKEF7eY_lzaVIY9qGC_ltIByhKZEaMoITZm34Gjyyuu4BsdfjXcbOMpkjoNyikULDiaP0ZBidMR0fW-EbaIGCXp0nrXguAH9yyd-63D77w53YBE5lhxnl-3C3LA_8nuwYN-Hz4P-fj1fPwG5ye-J priority: 102 providerName: ProQuest |
| Title | Locality-Aware Automatic Parallelization for GPGPU with OpenHMPP Directives |
| URI | https://link.springer.com/article/10.1007/s10766-015-0362-9 https://www.proquest.com/docview/1783853326 https://www.proquest.com/docview/1808113041 |
| Volume | 44 |
| WOSCitedRecordID | wos000374897200013&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVPQU databaseName: ABI/INFORM Global customDbUrl: eissn: 1573-7640 dateEnd: 20171231 omitProxy: false ssIdentifier: ssj0009788 issn: 0885-7458 databaseCode: M0C dateStart: 19970201 isFulltext: true titleUrlDefault: https://search.proquest.com/abiglobal providerName: ProQuest – providerCode: PRVPQU databaseName: Advanced Technologies & Aerospace Database customDbUrl: eissn: 1573-7640 dateEnd: 20171231 omitProxy: false ssIdentifier: ssj0009788 issn: 0885-7458 databaseCode: P5Z dateStart: 19970201 isFulltext: true titleUrlDefault: https://search.proquest.com/hightechjournals providerName: ProQuest – providerCode: PRVPQU databaseName: Computer Science Database customDbUrl: eissn: 1573-7640 dateEnd: 20171231 omitProxy: false ssIdentifier: ssj0009788 issn: 0885-7458 databaseCode: K7- dateStart: 19970201 isFulltext: true titleUrlDefault: http://search.proquest.com/compscijour providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest ABI/INFORM Collection customDbUrl: eissn: 1573-7640 dateEnd: 20171231 omitProxy: false ssIdentifier: ssj0009788 issn: 0885-7458 databaseCode: 7WY dateStart: 19970201 isFulltext: true titleUrlDefault: https://www.proquest.com/abicomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: eissn: 1573-7640 dateEnd: 20171231 omitProxy: false ssIdentifier: ssj0009788 issn: 0885-7458 databaseCode: BENPR dateStart: 19970201 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: Research Library customDbUrl: eissn: 1573-7640 dateEnd: 20171231 omitProxy: false ssIdentifier: ssj0009788 issn: 0885-7458 databaseCode: M2O dateStart: 19970201 isFulltext: true titleUrlDefault: https://search.proquest.com/pqrl providerName: ProQuest – providerCode: PRVAVX databaseName: SpringerLINK Contemporary 1997-Present customDbUrl: eissn: 1573-7640 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0009788 issn: 0885-7458 databaseCode: RSV dateStart: 19970101 isFulltext: true titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22 providerName: Springer Nature |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3da9swED-adg97Wdt9sPQjqLCnDoEdffoxLf2AtKlJ163bi5FlCQYlGXHS_fs7OXaSlq7QvgiMZVmcdLqfdLr7AXxhwgstjaAq9pxy7jU1wufUcqeKpGCG-2qkL9RgoG9vk7SO4y6b2-6NS7JaqVeC3ZQMu19Bq1CfpAUbaO104GsYXn9fZtpVFdkkag92gAvduDKfauKhMVoizEdO0crWnG6-qpdb8K6GlqQ3nwvbsOZG72GzoW0gtRZ_gP5FsF-Ivmnvr5k40ptNx1XiVpKaSaBWuatjMwkCWnKWnqU3JBzXknD55PwyTclinSw_ws3pybfjc1pzKlCLSGhKlfQGzbZnQtpIe4a7CVUE0JA7kTMTFyyXVjIXCSe57DqdWIsQEe1YoaRLLPsE66PxyH0G4phx3ndzKxLOC6sN7m6KuNDe2Fx45dsQNcLNbJ1wPPBe3GXLVMlBWBkKKwvCypI2HC4--TPPtvFc5b1mxLJa8cosVpohAkFQ2oaDxWtUmeAHMSM3nmGdwDaCtpvHbfjajOJKE__74c6Lau_CWwRXcn6tbA_Wp5OZ24c39n76u5x0oKV-_OzAxtHJIB3iU19RLC-j41B2r7BMxa9ONbP_AaNy61Q |
| linkProvider | Springer Nature |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1LTxRBEK4gmuhFEDWuILYJXjQddrafcyBko8KSXTZzgITb2NOPxITswj4g_Cl-I9Xz2AUTuXHwPD3d6anX11Nd9QHsMBGElkZQlQROOQ-aGhEKarlXLnXM8FBKeqCGQ312lmYrcNvUwsRrlY1PLB21G9v4j3w3UZphaEG0sX9xSSNrVMyuNhQalVr0_c01Htmme0c_Ub5fO52DXyc_erRmFaAWscCMKhkMBq7AhLRtHRjiaeVi2Cy8KJhJHCuklcy3hZdcdrxOrUWQhJ7cKelTy3DeZ_CcM62iXfUVXTb5VSXPJRou7p0L3WRRq1I9JePZXdCyUCl9GAeX4PavfGwZ5g7W_rcPtA6va0BNupUFvIEVP9qAtYasgtS-6y30BzFq45mDdq_NxJPufDYu29WSzEwiocx5XZFKEMaTw-wwOyXxJzWJV256x1lGFtFh-g5On2RL72F1NB75D0A8Mz6ETmFFyrmz2uCZziVOB2MLEVRoQbuRa27rNuuR7eM8XzaIjqqQoyrkURXytAXfFq9cVD1GHhu81Yg_r93NNF_KvgVfFo_RUcTsjxn58RzHRI4VRCw8acH3RsnuTfGvBT8-vuBneNk7OR7kg6NhfxNeIZ6U1U26LVidTeb-E7ywV7M_08l2aSsEfj-17t0BYUxKrg |
| linkToPdf | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1JTxsxFH6iUFVcCt1EKKWuRC-tLDLxOgeEokIAJURzKBLqZerxIlVCCWQp6l_rr-vzLElBKjcOPY_H1ow_v_c9vw1gj4kgtDSCqiRwynnQ1IhQUMu9cqljhodypwdqONSXl2m2Ar-bXJgYVtnIxFJQu7GNd-T7idIMVQuyjf1Qh0VkR73D6xsaO0hFT2vTTqOCSN__ukXzbXpwdoR7_bHT6R1__XJK6w4D1CIvmFElg0ElFpiQtq0DQ26tXFShhRcFM4ljhbSS-bbwksuO16m1SJhQqjslfWoZzvsE1hTamDGcMBPflgV_VdnzEg8x_gcudONRrdL2lIx2vKBl0lJ6Vycuie4932yp8nob__PP2oTnNdEm3epkvIAVP3oJG00TC1LLtFfQH0RtjrYI7d6aiSfd-WxclrElmZnERjNXdaYqQXpPTrKT7ILEy2sSQ3FOz7OMLLTG9DVcPMonvYHV0Xjkt4B4ZnwIncKKlHNntUFbzyVOB2MLEVRoQbvZ49zW5ddjF5CrfFk4OsIiR1jkERZ52oJPi1euq9ojDw3eaaCQ12Jomi9x0IIPi8coQKJXyIz8eI5jYu8VZDI8acHnBnB_TfGvBbcfXvA9PEPI5YOzYf8trCPNlFWA3Q6sziZz_w6e2p-zH9PJbnlsCHx_bOj9AfaPU9I |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Locality-Aware+Automatic+Parallelization+for+GPGPU+with+OpenHMPP+Directives&rft.jtitle=International+journal+of+parallel+programming&rft.au=Andion%2C+Jose+M&rft.au=Arenaz%2C+Manuel&rft.au=Bodin%2C+Francois&rft.au=Rodriguez%2C+Gabriel&rft.date=2016-06-01&rft.issn=0885-7458&rft.eissn=1573-7640&rft.volume=44&rft.issue=3&rft.spage=620&rft.epage=643&rft_id=info:doi/10.1007%2Fs10766-015-0362-9&rft.externalDBID=NO_FULL_TEXT |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0885-7458&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0885-7458&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0885-7458&client=summon |