N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation
SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches ma...
Saved in:
| Published in: | Concurrency and computation Vol. 26; no. 4; pp. 972 - 986 |
|---|---|
| Main Authors: | , |
| Format: | Journal Article |
| Language: | English |
| Published: |
Blackwell Publishing Ltd
25.03.2014
|
| Subjects: | |
| ISSN: | 1532-0626, 1532-0634 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs N‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd. |
|---|---|
| AbstractList | With the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs
N
‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd. With the emergence of general-purpose computation on graphics processing units, high-level approaches that hide the conceptual complexity of the low-level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade-off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all-pairs N-body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed-up over serial version with a Tesla M2050 with lower implementation complexity than low-level Compute Unified Device Architecture programming. Copyright copyright 2013 John Wiley & Sons, Ltd. SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs N‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd. |
| Author | González-Vélez, Horacio Goli, Mehdi |
| Author_xml | – sequence: 1 givenname: Mehdi surname: Goli fullname: Goli, Mehdi email: Correspondence to: Mehdi Goli, Robert Gordon University, IDEAS Research Institute, St Andrew Street, Aberdeen AB25 1HG, Scotland, United Kingdom., m.goli@rgu.ac.uk organization: IDEAS Research Institute, Robert Gordon University, Aberdeen, UK – sequence: 2 givenname: Horacio surname: González-Vélez fullname: González-Vélez, Horacio organization: Cloud Competency Centre, National College of Ireland, Dublin, Ireland |
| BookMark | eNp1kMtO4zAUhi3ESFwGiUfwkk2KHde5sIMKGKADRQKxtBznBEwdO9jOQF-Bpyalo46Qho2PF9__n6NvB21aZwGhfUpGlJD0UHUwYiTPNtA25SxNSMbGm-t_mm2hnRCeCaGUMLqN3q-TytULrFzb9VFG7WzAfdD2EYc5GIjS4MbLFl6dnwfsLG57E7VyHvBkdn_46GX3pFXAnXcKwmewtzpi6dWTjqBi7yEcYWkxtJ32Wg2FHfjG-VZaBRj-SNN_7v2JfjTSBNj7O3fR_dnp3eRXMr05v5gcTxPFWJYlknIo86oCzgrZlLwmPCuKLE_HFS_48DCoOB3XVVoqAnXDVTMeMrmsq4qotGG76GDVO5z80kOIotVBgTHSguuDoDwlZVqUJRvQ0QpV3oXgoRFKryRFL7URlIildDFIF0vp_7rXgc7rVvrF_9Bkhb5qA4tvOTGZnX7ldYjwtualn4ssZzkXD9fnYnbCH4rb35fiin0AIfyl2g |
| CitedBy_id | crossref_primary_10_1002_cpe_3077 crossref_primary_10_1016_j_future_2013_12_038 crossref_primary_10_1007_s10766_016_0419_4 crossref_primary_10_1007_s10766_017_0511_4 |
| Cites_doi | 10.1142/S0129626405002209 10.1145/1863482.1863487 10.1887/0852743920 10.1007/978‐3‐642‐23397‐5_17 10.1063/1.1704156 10.1002/cpe.1860 10.1002/spe.1026 10.1038/324446a0 10.1111/j.1467‐8659.2007.01012.x |
| ContentType | Journal Article |
| Copyright | Copyright © 2013 John Wiley & Sons, Ltd. |
| Copyright_xml | – notice: Copyright © 2013 John Wiley & Sons, Ltd. |
| DBID | BSCLL AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D |
| DOI | 10.1002/cpe.3076 |
| DatabaseName | Istex CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
| DatabaseTitle | CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
| DatabaseTitleList | CrossRef Computer and Information Systems Abstracts |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1532-0634 |
| EndPage | 986 |
| ExternalDocumentID | 10_1002_cpe_3076 CPE3076 ark_67375_WNG_PB5W8QMJ_K |
| Genre | article |
| GrantInformation_xml | – fundername: EU's Seventh Framework Programme funderid: 288570 – fundername: European Commission: FP7 STREP ŞParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore SystemsŤ; funderid: 288570 |
| GroupedDBID | .3N .DC .GA .Y3 05W 0R~ 10A 1L6 1OC 33P 3SF 3WU 4.4 50Y 50Z 51W 51X 52M 52N 52O 52P 52S 52T 52U 52W 52X 5GY 5VS 66C 702 7PT 8-0 8-1 8-3 8-4 8-5 8UM 930 A03 AAESR AAEVG AAHQN AAMNL AANHP AANLZ AAONW AASGY AAXRX AAYCA AAZKR ABCQN ABCUV ABEML ABIJN ACAHQ ACBWZ ACCZN ACPOU ACRPL ACSCC ACXBN ACXQS ACYXJ ADBBV ADEOM ADIZJ ADKYN ADMGS ADMLS ADNMO ADOZA ADXAS ADZMN AEIGN AEIMD AEUYR AEYWJ AFBPY AFFPM AFGKR AFWVQ AGQPQ AGYGG AHBTC AITYG AIURR AJXKR ALMA_UNASSIGNED_HOLDINGS ALVPJ AMBMR AMYDB ATUGU AUFTA AZBYB BAFTC BDRZF BFHJK BHBCM BMNLL BROTX BRXPI BSCLL BY8 CS3 D-E D-F DCZOG DPXWK DR2 DRFUL DRSTM EBS EJD F00 F01 F04 F5P G-S G.N GNP GODZA HGLYW HHY HZ~ IX1 JPC KQQ LATKE LAW LC2 LC3 LEEKS LH4 LITHE LOXES LP6 LP7 LUTES LW6 LYRES MEWTI MK4 MRFUL MRSTM MSFUL MSSTM MXFUL MXSTM N04 N05 N9A O66 O9- OIG P2W P2X P4D PQQKQ Q.N Q11 QB0 QRW R.K ROL RX1 SUPJJ TN5 UB1 V2E W8V W99 WBKPD WIH WIK WOHZO WQJ WXSBR WYISQ WZISG XG1 XV2 ~IA ~WT AAHHS ACCFJ ADZOD AEEZP AEQDE AEUQT AFPWT AIWBW AJBDE ALUQN RWI WRC AAYXX CITATION O8X 7SC 8FD JQ2 L7M L~C L~D |
| ID | FETCH-LOGICAL-c3366-a15e97bbe538af95d056886724b5854b53eb514db29c0edf5cf4a157adbb0c2f3 |
| IEDL.DBID | DRFUL |
| ISICitedReferencesCount | 4 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000331358200011&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1532-0626 |
| IngestDate | Wed Oct 01 09:44:51 EDT 2025 Sat Nov 29 01:41:13 EST 2025 Tue Nov 18 20:06:00 EST 2025 Wed Jan 22 16:47:55 EST 2025 Tue Nov 11 03:33:23 EST 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 4 |
| Language | English |
| License | http://onlinelibrary.wiley.com/termsAndConditions#vor |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c3366-a15e97bbe538af95d056886724b5854b53eb514db29c0edf5cf4a157adbb0c2f3 |
| Notes | istex:16AE1610BB93CFFEE7FDB6DDBD860E707611989F ArticleID:CPE3076 European Commission: FP7 STREP ŞParaPhrase: Parallel Patterns for Adaptive Heterogeneous Multicore SystemsŤ; - No. 288570 EU's Seventh Framework Programme - No. 288570 ark:/67375/WNG-PB5W8QMJ-K ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
| PQID | 1520928993 |
| PQPubID | 23500 |
| PageCount | 15 |
| ParticipantIDs | proquest_miscellaneous_1520928993 crossref_citationtrail_10_1002_cpe_3076 crossref_primary_10_1002_cpe_3076 wiley_primary_10_1002_cpe_3076_CPE3076 istex_primary_ark_67375_WNG_PB5W8QMJ_K |
| PublicationCentury | 2000 |
| PublicationDate | 25 March 2014 |
| PublicationDateYYYYMMDD | 2014-03-25 |
| PublicationDate_xml | – month: 03 year: 2014 text: 25 March 2014 day: 25 |
| PublicationDecade | 2010 |
| PublicationTitle | Concurrency and computation |
| PublicationTitleAlternate | Concurrency Computat.: Pract. Exper |
| PublicationYear | 2014 |
| Publisher | Blackwell Publishing Ltd |
| Publisher_xml | – name: Blackwell Publishing Ltd |
| References | Hockney R, Eastwood J. Computer Simulation Using Particles. Taylor & Francis: Abingdon, 1988. Nyland L, Harris M, Prins J. Fast N-body simulation with CUDA. GPU Gems 2007; 3:677-695. McGuire JB. Study of exactly soluble one-dimensional N-body problems. Journal of Mathematical Physics 1964; 5(5):622-636. DOI: 10.1063/1.1704156. González-Vélez H, Leyton M. A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers. Software-Practice and Experience 2010; 40(12):1135-1160. DOI: 10.1002/spe.1026. Bischof H, Gorlatch S, Leshchinskiy R, Müller J. Data parallelism in C++ template programs: a Barnes-Hut case study. Parallel Processing Letters 2005; 15(3):257-272. DOI: 10.1142/S0129626405002209. Barnes J, Hut P. A hierarchical O(N log N) force-calculation algorithm. Nature 1986; 324:446-449. DOI: 10.1038/324446a0. Malik M, Li T, Sharif U, Shahid R, El-Ghazawi T, Newby G. Productivity of GPUs under different programming paradigms. Concurrency and Computation: Practice and Experience 2012; 24(2):179-191. DOI: 10.1002/cpe.1860. Owens JD, Luebke D, Govindaraju N, Harris M, Krüger J, Lefohn AE, Purcell TJ. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 2007; 26(1):80-113. DOI: 10.1111/j.1467-8659.2007.01012.x. Cole M. Algorithmic Skeletons: Structured Management of Parallel Computation, Research Monographs in Parallel and Distributed Computing. MIT Press/Pitman: London, 1989. 2011; 6853 1986; 324 2012 2011 2010 1964; 5 1987 2009 2007; 3 2005; 15 2012; 24 2010; 40 2007; 26 1989 1988 e_1_2_8_17_1 e_1_2_8_18_1 e_1_2_8_19_1 e_1_2_8_13_1 e_1_2_8_14_1 e_1_2_8_16_1 e_1_2_8_3_1 e_1_2_8_2_1 e_1_2_8_5_1 e_1_2_8_4_1 e_1_2_8_7_1 e_1_2_8_6_1 e_1_2_8_9_1 Bell N (e_1_2_8_11_1) 2011 Cole M (e_1_2_8_8_1) 1989 e_1_2_8_10_1 e_1_2_8_12_1 Nyland L (e_1_2_8_15_1) 2007; 3 |
| References_xml | – reference: Cole M. Algorithmic Skeletons: Structured Management of Parallel Computation, Research Monographs in Parallel and Distributed Computing. MIT Press/Pitman: London, 1989. – reference: Hockney R, Eastwood J. Computer Simulation Using Particles. Taylor & Francis: Abingdon, 1988. – reference: Owens JD, Luebke D, Govindaraju N, Harris M, Krüger J, Lefohn AE, Purcell TJ. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 2007; 26(1):80-113. DOI: 10.1111/j.1467-8659.2007.01012.x. – reference: González-Vélez H, Leyton M. A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers. Software-Practice and Experience 2010; 40(12):1135-1160. DOI: 10.1002/spe.1026. – reference: Malik M, Li T, Sharif U, Shahid R, El-Ghazawi T, Newby G. Productivity of GPUs under different programming paradigms. Concurrency and Computation: Practice and Experience 2012; 24(2):179-191. DOI: 10.1002/cpe.1860. – reference: Bischof H, Gorlatch S, Leshchinskiy R, Müller J. Data parallelism in C++ template programs: a Barnes-Hut case study. Parallel Processing Letters 2005; 15(3):257-272. DOI: 10.1142/S0129626405002209. – reference: Barnes J, Hut P. A hierarchical O(N log N) force-calculation algorithm. Nature 1986; 324:446-449. DOI: 10.1038/324446a0. – reference: Nyland L, Harris M, Prins J. Fast N-body simulation with CUDA. GPU Gems 2007; 3:677-695. – reference: McGuire JB. Study of exactly soluble one-dimensional N-body problems. Journal of Mathematical Physics 1964; 5(5):622-636. DOI: 10.1063/1.1704156. – volume: 324 start-page: 446 year: 1986 end-page: 449 article-title: A hierarchical ( log ) force‐calculation algorithm publication-title: Nature – year: 2011 – year: 2009 – volume: 15 start-page: 257 issue: 3 year: 2005 end-page: 272 article-title: Data parallelism in C++ template programs: a Barnes‐Hut case study publication-title: Parallel Processing Letters – volume: 5 start-page: 622 issue: 5 year: 1964 end-page: 636 article-title: Study of exactly soluble one‐dimensional N‐body problems publication-title: Journal of Mathematical Physics – start-page: 359 year: 2011 end-page: 371 – volume: 6853 start-page: 170 year: 2011 end-page: 181 – volume: 3 start-page: 677 year: 2007 end-page: 695 article-title: Fast N‐body simulation with CUDA publication-title: GPU Gems – year: 1987 – year: 1988 – year: 1989 – volume: 40 start-page: 1135 issue: 12 year: 2010 end-page: 1160 article-title: A survey of algorithmic skeleton frameworks: high‐level structured parallel programming enablers publication-title: Software–Practice and Experience – volume: 24 start-page: 179 issue: 2 year: 2012 end-page: 191 article-title: Productivity of GPUs under different programming paradigms publication-title: Concurrency and Computation: Practice and Experience – volume: 26 start-page: 80 issue: 1 year: 2007 end-page: 113 article-title: A survey of general‐purpose computation on graphics hardware publication-title: Computer Graphics Forum – start-page: 5 year: 2010 end-page: 14 – year: 2012 – start-page: 359 volume-title: GPU Computing Gems year: 2011 ident: e_1_2_8_11_1 – volume: 3 start-page: 677 year: 2007 ident: e_1_2_8_15_1 article-title: Fast N‐body simulation with CUDA publication-title: GPU Gems – ident: e_1_2_8_14_1 doi: 10.1142/S0129626405002209 – ident: e_1_2_8_10_1 doi: 10.1145/1863482.1863487 – ident: e_1_2_8_17_1 – ident: e_1_2_8_19_1 – ident: e_1_2_8_5_1 doi: 10.1887/0852743920 – ident: e_1_2_8_16_1 doi: 10.1007/978‐3‐642‐23397‐5_17 – ident: e_1_2_8_2_1 doi: 10.1063/1.1704156 – ident: e_1_2_8_18_1 – ident: e_1_2_8_12_1 – ident: e_1_2_8_7_1 doi: 10.1002/cpe.1860 – volume-title: Algorithmic Skeletons: Structured Management of Parallel Computation year: 1989 ident: e_1_2_8_8_1 – ident: e_1_2_8_9_1 doi: 10.1002/spe.1026 – ident: e_1_2_8_3_1 doi: 10.1038/324446a0 – ident: e_1_2_8_4_1 – ident: e_1_2_8_13_1 – ident: e_1_2_8_6_1 doi: 10.1111/j.1467‐8659.2007.01012.x |
| SSID | ssj0011031 |
| Score | 2.038398 |
| Snippet | SUMMARYWith the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the... With the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level... With the emergence of general-purpose computation on graphics processing units, high-level approaches that hide the conceptual complexity of the low-level... |
| SourceID | proquest crossref wiley istex |
| SourceType | Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 972 |
| SubjectTerms | algorithmic skeletons Algorithms Complexity Computation Devices general-purpose computing on graphics processing units GPU Graphics processing units Hardware parallel computing Performance evaluation Serials structured parallelism |
| Title | N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation |
| URI | https://api.istex.fr/ark:/67375/WNG-PB5W8QMJ-K/fulltext.pdf https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fcpe.3076 https://www.proquest.com/docview/1520928993 |
| Volume | 26 |
| WOSCitedRecordID | wos000331358200011&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVWIB databaseName: Wiley Online Library Full Collection 2020 customDbUrl: eissn: 1532-0634 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0011031 issn: 1532-0626 databaseCode: DRFUL dateStart: 20010101 isFulltext: true titleUrlDefault: https://onlinelibrary.wiley.com providerName: Wiley-Blackwell |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LT9wwELbaXQ69AOUhHi0yEqKndLOOnQc3urCtgK62VVdws_wKWgHZaMMiuPUHcOA38ksYO8kuSCBV6iU5ZJxE4xnPZ3v8DUI7KokMDXXqJeA8HkQI5QkqfC8lirJIaxak1BWbiHq9-Ows6VdZlfYsTMkPMV1ws57hxmvr4EIWrRlpqMrNVzDQ8D1qEjBb2kDNg9_dwcl0D8EWMCjZUonnA26vqWd90qrbvghGTavX2xdI8zledQGnu_A_v7qI5iuYifdLu_iI3plsCS3UJRxw5dHL6L73-PdBjvQdVu5ZuX6HbTb8OS4uICYBOMdpncFV4FGGXQ6iZb_Enf6g5Rivh6rAeXnkwDacwDiBn29RFHtYZNhc5UPHSILz2XEFPOMbX0GD7uGfzg-vKtDgqSAIQ0-0mUkiKQ2MmiJNmAY0FcdhRKiEWQhcAiMBkGlJEuUbnTKVUmgTCS2lr0garKJGNsrMGsLtUMSxUExSFVMZiUQANlRKgwaZAVS0jr7UPcVVxV5ui2hc8pJ3mXBQMrdKXkfbU8m8ZOx4RWbXdfZUQIwvbIZbxPhp7zvvf2On8a-fR_wYXlZbAwe_s5spIjOjScHbNn_Izlbh13Zd57_5Nd7pH9r7xr8KbqIPgMuoTXUj7BNqXI8n5jOaUzfXw2K8Vdn5ExSQBrY |
| linkProvider | Wiley-Blackwell |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lb9NAEB6VBAkulFdFeS4SKicTx971g54gNBSaWgE1am-rfbmKWhwrbqpy4wf0wG_klzC7tpNWAgmJi33wrG3Nzux8uzv7DcArlcaGRjr3UnQeDyOE8gQVvpcHirJYaxbm1BWbiLMsOTpKx2uw3Z6Fqfkhlgtu1jPceG0d3C5I91asoao0b9BCoxvQpWhFrAPdD1-Hk9FyE8FWMKjpUgPPR-Decs_6Qa9tey0ada1iL65BzauA1UWc4fp__etduNMATfKutox7sGaK-7DeFnEgjU8_gMvs14-fcqa_E-We1St4xObDH5PqBKMSwnOStzlcFZkVxGUhWv5LMhhPeo7zeqoqUtaHDmzDBY4U5OomRfWWiIKYb-XUcZKQcnVggawYxx_CZLhzMNj1mhINngrDKPJEn5k0ltLguCnylGnEU0kSxQGVOA_BS2gkQjItg1T5RudM5RTbxEJL6asgDzegU8wK8whIPxJJIhSTVCVUxiIViA6V0qhBZhAXbcLrtqu4avjLbRmNU14zLwcclcytkjfh5VKyrDk7_iCz5Xp7KSDmJzbHLWb8MPvIx-_ZYfJl_zPfw5e15sDR8-x2iijMbFHxvs0gsvNV_LUt1_t__RofjHfs_fG_Cr6AW7sH-yM--pTtPYHbiNKoTXwL2FPonM0X5hncVOdn02r-vDH637bWCqY |
| linkToPdf | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lb9QwEB6VLkJcKK-K8jQSKqew2cTOA06w7fJoiQJi1d4sP9GqkI02XQQ3fgAHfiO_hLGT7LYSSEhckkPGSTSe8Xy2x98APFJ5amiibZCj8wQYIVQgqAgDGynKUq1ZbKkvNpEWRXZ8nJcb8Kw_C9PyQ6wW3Jxn-PHaObiptR2uWUNVbZ6ghSYXYEBZnqBXDvbeT6aHq00EV8GgpUuNghCBe889G0bDvu25aDRwiv16DmqeBaw-4ky2_utfr8KVDmiS561lXIMNU12Hrb6IA-l8-gb8KH59_ynn-htR_lm7gkdcPvxH0pxgVEJ4Tmyfw9WQeUV8FqLjvyTjcjr0nNcz1ZC6PXTgGi5xpCBnNymap0RUxHyuZ56ThNTrAwtkzTh-E6aT_Q_jV0FXoiFQcZwkgRgxk6dSGhw3hc2ZRjyVZUkaUYnzELzERiIk0zLKVWi0ZcpSbJMKLWWoIhtvw2Y1r8wtIKNEZJlQTFKVUZmKXCA6VEqjBplBXLQDj_uu4qrjL3dlND7xlnk54qhk7pS8Aw9XknXL2fEHmV3f2ysBsThxOW4p40fFS16-YEfZu7dv-AG-rDcHjp7ntlNEZebLho9cBpGbr-Kv7fre_-vX-Ljcd_fb_yr4AC6VexN--Lo4uAOXEaRRl_cWsbuwebpYmntwUX05nTWL-53N_wY1cAoh |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=N%E2%80%90body+computations+using+skeletal+frameworks+on+multicore+CPU%2Fgraphics+processing+unit+architectures%3A+an+empirical+performance+evaluation&rft.jtitle=Concurrency+and+computation&rft.au=Goli%2C+Mehdi&rft.au=Gonz%C3%A1lez%E2%80%93V%C3%A9lez%2C+Horacio&rft.date=2014-03-25&rft.issn=1532-0626&rft.eissn=1532-0634&rft.volume=26&rft.issue=4&rft.spage=972&rft.epage=986&rft_id=info:doi/10.1002%2Fcpe.3076&rft.externalDBID=10.1002%252Fcpe.3076&rft.externalDocID=CPE3076 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1532-0626&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1532-0626&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1532-0626&client=summon |