N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation

SUMMARY: With the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches ma...

Bibliographic Details
Published in:	Concurrency and computation Vol. 26; no. 4; pp. 972 - 986
Main Authors:	Goli, Mehdi; González-Vélez, Horacio
Format:	Journal Article
Language:	English
Published:	Blackwell Publishing Ltd 25.03.2014
Subjects:	algorithmic skeletons; Algorithms; Complexity; Computation; Devices; general-purpose computing on graphics processing units; GPU; Graphics processing units; Hardware; parallel computing; Performance evaluation; Serials; structured parallelism
ISSN:	1532-0626, 1532-0634

Abstract	SUMMARY: With the emergence of general‐purpose computation on graphics processing units (GPUs), high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture (CUDA) and Open Computing Language (OpenCL) platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on GPU hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute‐force, all‐pairs N‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over the serial version on a Tesla M2050, with lower implementation complexity than low‐level CUDA programming. Copyright © 2013 John Wiley & Sons, Ltd.
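Illustration	The brute‐force, all‐pairs N‐body algorithm named in the abstract can be sketched as follows. This is a minimal serial sketch in plain C++ and not the authors' code: the `Body` and `Accel` types, the function names, and the `softening` parameter are all hypothetical, the gravitational constant is taken as 1, and no FastFlow, SkePU or Thrust call is used. It is meant only to show why the computation costs O(N^2) and where the parallelism lies.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical types for illustration; not taken from the paper.
struct Body {
    double x, y, z;  // position
    double mass;
};

struct Accel {
    double ax, ay, az;
};

// Acceleration on body i due to every other body: O(N) work per body,
// hence O(N^2) for the whole system. `softening` is an assumed parameter
// that keeps the denominator non-zero when two bodies coincide; G = 1.
Accel acceleration_on(std::size_t i, const std::vector<Body>& bodies,
                      double softening) {
    Accel a{0.0, 0.0, 0.0};
    for (std::size_t j = 0; j < bodies.size(); ++j) {
        if (j == i) continue;  // a body exerts no force on itself
        const double dx = bodies[j].x - bodies[i].x;
        const double dy = bodies[j].y - bodies[i].y;
        const double dz = bodies[j].z - bodies[i].z;
        const double dist2 = dx * dx + dy * dy + dz * dz + softening * softening;
        const double inv_dist3 = 1.0 / (dist2 * std::sqrt(dist2));
        a.ax += bodies[j].mass * dx * inv_dist3;
        a.ay += bodies[j].mass * dy * inv_dist3;
        a.az += bodies[j].mass * dz * inv_dist3;
    }
    return a;
}

// Each iteration of the outer loop is independent of the others, so it has
// the shape of a "map" over bodies. That independent outer loop is the part
// a skeletal framework would be expected to run in parallel.
std::vector<Accel> all_pairs(const std::vector<Body>& bodies, double softening) {
    std::vector<Accel> out(bodies.size());
    for (std::size_t i = 0; i < bodies.size(); ++i) {
        out[i] = acceleration_on(i, bodies, softening);
    }
    return out;
}
```

For two bodies of mass 1 and 2 placed one unit apart on the x axis, with zero softening, the sketch gives accelerations of 2 and -1 along x, as expected from the inverse-square law with G = 1.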
Author	González-Vélez, Horacio; Goli, Mehdi
Author_xml	– sequence: 1 givenname: Mehdi surname: Goli fullname: Goli, Mehdi email: Correspondence to: Mehdi Goli, Robert Gordon University, IDEAS Research Institute, St Andrew Street, Aberdeen AB25 1HG, Scotland, United Kingdom., m.goli@rgu.ac.uk organization: IDEAS Research Institute, Robert Gordon University, Aberdeen, UK – sequence: 2 givenname: Horacio surname: González-Vélez fullname: González-Vélez, Horacio organization: Cloud Competency Centre, National College of Ireland, Dublin, Ireland
CitedBy_id	crossref_primary_10_1002_cpe_3077 crossref_primary_10_1016_j_future_2013_12_038 crossref_primary_10_1007_s10766_016_0419_4 crossref_primary_10_1007_s10766_017_0511_4
Cites_doi	10.1142/S0129626405002209 10.1145/1863482.1863487 10.1887/0852743920 10.1007/978‐3‐642‐23397‐5_17 10.1063/1.1704156 10.1002/cpe.1860 10.1002/spe.1026 10.1038/324446a0 10.1111/j.1467‐8659.2007.01012.x
ContentType	Journal Article
Copyright	Copyright © 2013 John Wiley & Sons, Ltd.
Copyright_xml	– notice: Copyright © 2013 John Wiley & Sons, Ltd.
DBID	BSCLL AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D
DOI	10.1002/cpe.3076
DatabaseName	Istex CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional
DatabaseTitle	CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional
DatabaseTitleList	CrossRef Computer and Information Systems Abstracts
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
EISSN	1532-0634
EndPage	986
ExternalDocumentID	10_1002_cpe_3076 CPE3076 ark_67375_WNG_PB5W8QMJ_K
Genre	article
GrantInformation_xml	– fundername: EU's Seventh Framework Programme funderid: 288570 – fundername: European Commission: FP7 STREP “ParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore Systems”; funderid: 288570
ID	FETCH-LOGICAL-c3366-a15e97bbe538af95d056886724b5854b53eb514db29c0edf5cf4a157adbb0c2f3
IEDL.DBID	DRFUL
ISICitedReferencesCount	4
ISICitedReferencesURI	http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000331358200011&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN	1532-0626
IngestDate	Wed Oct 01 09:44:51 EDT 2025 Sat Nov 29 01:41:13 EST 2025 Tue Nov 18 20:06:00 EST 2025 Wed Jan 22 16:47:55 EST 2025 Tue Nov 11 03:33:23 EST 2025
IsPeerReviewed	true
IsScholarly	true
Issue	4
Language	English
License	http://onlinelibrary.wiley.com/termsAndConditions#vor
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c3366-a15e97bbe538af95d056886724b5854b53eb514db29c0edf5cf4a157adbb0c2f3
Notes	istex:16AE1610BB93CFFEE7FDB6DDBD860E707611989F ArticleID:CPE3076 European Commission: FP7 STREP “ParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore Systems”; - No. 288570 EU's Seventh Framework Programme - No. 288570 ark:/67375/WNG-PB5W8QMJ-K ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23
PQID	1520928993
PQPubID	23500
PageCount	15
ParticipantIDs	proquest_miscellaneous_1520928993 crossref_citationtrail_10_1002_cpe_3076 crossref_primary_10_1002_cpe_3076 wiley_primary_10_1002_cpe_3076_CPE3076 istex_primary_ark_67375_WNG_PB5W8QMJ_K
PublicationCentury	2000
PublicationDate	25 March 2014
PublicationDateYYYYMMDD	2014-03-25
PublicationDate_xml	– month: 03 year: 2014 text: 25 March 2014 day: 25
PublicationDecade	2010
PublicationTitle	Concurrency and computation
PublicationTitleAlternate	Concurrency Computat.: Pract. Exper
PublicationYear	2014
Publisher	Blackwell Publishing Ltd
Publisher_xml	– name: Blackwell Publishing Ltd
SSID	ssj0011031
Score	2.038398
SourceID	proquest crossref wiley istex
SourceType	Aggregation Database Enrichment Source Index Database Publisher
StartPage	972
Title	N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation
URI	https://api.istex.fr/ark:/67375/WNG-PB5W8QMJ-K/fulltext.pdf https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fcpe.3076 https://www.proquest.com/docview/1520928993
Volume	26
WOSCitedRecordID	wos000331358200011&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVWIB databaseName: Wiley Online Library Full Collection 2020 customDbUrl: eissn: 1532-0634 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0011031 issn: 1532-0626 databaseCode: DRFUL dateStart: 20010101 isFulltext: true titleUrlDefault: https://onlinelibrary.wiley.com providerName: Wiley-Blackwell
linkProvider	Wiley-Blackwell
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=N%E2%80%90body+computations+using+skeletal+frameworks+on+multicore+CPU%2Fgraphics+processing+unit+architectures%3A+an+empirical+performance+evaluation&rft.jtitle=Concurrency+and+computation&rft.au=Goli%2C+Mehdi&rft.au=Gonz%C3%A1lez%E2%80%93V%C3%A9lez%2C+Horacio&rft.date=2014-03-25&rft.issn=1532-0626&rft.eissn=1532-0634&rft.volume=26&rft.issue=4&rft.spage=972&rft.epage=986&rft_id=info:doi/10.1002%2Fcpe.3076&rft.externalDBID=10.1002%252Fcpe.3076&rft.externalDocID=CPE3076