OpenMP compiler for distributed memory architectures

OpenMP is an emerging industry standard for shared memory architectures. While OpenMP has advantages on its ease of use and incremental programming, message passing is today still the most widely-used programming model for distributed memory architectures. How to effectively extend OpenMP to distrib...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Science China. Information sciences Jg. 53; H. 5; S. 932 - 944
Hauptverfasser: Wang, Jue, Hu, ChangJun, Zhang, JiLin, Li, JianJiang
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Heidelberg SP Science China Press 01.05.2010
Springer Nature B.V
Schlagworte:
ISSN:1674-733X, 1869-1919
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract OpenMP is an emerging industry standard for shared memory architectures. While OpenMP has advantages on its ease of use and incremental programming, message passing is today still the most widely-used programming model for distributed memory architectures. How to effectively extend OpenMP to distributed memory architectures has been a hot spot. This paper proposes an OpenMP system, called KLCoMP, for distributed memory architectures. Based on the "partially replicating shared arrays" memory model, we propose an algorithm for shared array recognition based on the inter-procedural analysis, optimization technique based on the producer/consumer relationship, and communication generation technique for nonlinear references. We evaluate the performance on nine benchmarks which cover computational fluid dynamics, integer sorting, molecular dynamics, earthquake simulation, and computational chemistry. The average scalability achieved by KLCoMP version is close to that achieved by MPI version. We compare the performance of our translated programs with that of versions generated for Omni+SCASH, LLCoMP, and OpenMP(Purdue), and find that parallel applications (especially, irregular applications) translated by KLCoMP can achieve more effective performance than other versions.
AbstractList OpenMP is an emerging industry standard for shared memory architectures. While OpenMP has advantages on its ease of use and incremental programming, message passing is today still the most widely-used programming model for distributed memory architectures. How to effectively extend OpenMP to distributed memory architectures has been a hot spot. This paper proposes an OpenMP system, called KLCoMP, for distributed memory architectures. Based on the "partially replicating shared arrays" memory model, we propose an algorithm for shared array recognition based on the inter-procedural analysis, optimization technique based on the producer/consumer relationship, and communication generation technique for nonlinear references. We evaluate the performance on nine benchmarks which cover computational fluid dynamics, integer sorting, molecular dynamics, earthquake simulation, and computational chemistry. The average scalability achieved by KLCoMP version is close to that achieved by MPI version. We compare the performance of our translated programs with that of versions generated for Omni+SCASH, LLCoMP, and OpenMP(Purdue), and find that parallel applications (especially, irregular applications) translated by KLCoMP can achieve more effective performance than other versions.
OpenMP is an emerging industry standard for shared memory architectures. While OpenMP has advantages on its ease of use and incremental programming, message passing is today still the most widely-used programming model for distributed memory architectures. How to effectively extend OpenMP to distributed memory architectures has been a hot spot. This paper proposes an OpenMP system, called KLCoMP, for distributed memory architectures. Based on the "partially replicating shared arrays" memory model, we propose an algorithm for shared array recognition based on the inter-procedural analysis, optimization technique based on the producer/consumer relationship, and communication generation technique for nonlinear references. We evaluate the performance on nine benchmarks which cover computational fluid dynamics, integer sorting, molecular dynamics, earthquake simulation, and computational chemistry. The average scalability achieved by KLCoMP version is close to that achieved by MPI version. We compare the performance of our translated programs with that of versions generated for Omni+SCASH, LLCoMP, and OpenMP(Purdue), and find that parallel applications (especially, irregular applications) translated by KLCoMP can achieve more effective performance than other versions.
Author WANG Jue HU ChangJun ZHANG JiLin LI JianJiang
AuthorAffiliation School of Information and Engineering, University of Science and Technology Beijing, Beijing 100083, China
Author_xml – sequence: 1
  givenname: Jue
  surname: Wang
  fullname: Wang, Jue
  email: ncepu5@gmail.com
  organization: School of Information and Engineering, University of Science and Technology Beijing
– sequence: 2
  givenname: ChangJun
  surname: Hu
  fullname: Hu, ChangJun
  organization: School of Information and Engineering, University of Science and Technology Beijing
– sequence: 3
  givenname: JiLin
  surname: Zhang
  fullname: Zhang, JiLin
  organization: School of Information and Engineering, University of Science and Technology Beijing
– sequence: 4
  givenname: JianJiang
  surname: Li
  fullname: Li, JianJiang
  organization: School of Information and Engineering, University of Science and Technology Beijing
BookMark eNp9kTtPBCEUhYnRxNf-ALuJFlajPGZ5lMb4SjRaaGJHgL0oOjOswBT778XsJiYW0gA35-Pce9hH22McAaEjgs8IxuI8E9Ix2mKC23rtWryF9ojkqiWKqO165rUoGHvdRbOcP3BdjGEq5B7qHpcwPjw1Lg7L0ENqfEzNIuSSgp0KLJoBhphWjUnuPRRwZUqQD9GON32G2WY_QC_XV8-Xt-39483d5cV965iYl7bDGCwxVEnGmLeOC047JaVyQJ0Bv1BeKiKZsdx4bhXwjjA3nwsvjLXGsgN0un53meLXBLnoIWQHfW9GiFPWdUTJiMTzqjz5o_yIUxprc5pWC06k5KKqyFrlUsw5gdfLFAaTVppg_ZOkXiepa5L6J0mNKyP-MC4UU0IcSzKh_5ekazJXl_EN0m9P_0HHG7v3OL59VU5b4z59_RvNGBeYU8q-AezDk9I
CitedBy_id crossref_primary_10_1080_02533839_2012_737113
Cites_doi 10.1145/1066650.1066661
10.1016/j.parco.2006.07.001
10.1007/978-3-540-75444-2_42
10.1002/jcc.540040211
10.1006/jpdc.1997.1332
10.1145/233561.233568
10.1109/71.752780
10.1109/71.544354
10.1016/j.jpdc.2005.06.018
10.1145/1088149.1088174
10.1109/CCGRID.2003.1199400
ContentType Journal Article
Copyright Science China Press and Springer-Verlag Berlin Heidelberg 2010
Science China Press and Springer-Verlag Berlin Heidelberg 2010.
Copyright_xml – notice: Science China Press and Springer-Verlag Berlin Heidelberg 2010
– notice: Science China Press and Springer-Verlag Berlin Heidelberg 2010.
DBID 2RA
92L
CQIGP
W92
~WA
AAYXX
CITATION
8FE
8FG
AFKRA
ARAPS
AZQEC
BENPR
BGLVJ
CCPQU
DWQXO
GNUQQ
HCIFZ
JQ2
K7-
P5Z
P62
PHGZM
PHGZT
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
7SC
8FD
L7M
L~C
L~D
DOI 10.1007/s11432-010-0074-0
DatabaseName 维普期刊资源整合服务平台
中文科技期刊数据库-CALIS站点
维普中文期刊数据库
中文科技期刊数据库-工程技术
中文科技期刊数据库- 镜像站点
CrossRef
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Central UK/Ireland
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials
ProQuest Central
Technology Collection
ProQuest One Community College
ProQuest Central Korea
ProQuest Central Student
SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database (ProQuest)
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Premium
ProQuest One Academic (New)
ProQuest One Academic Middle East (New)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic (retired)
ProQuest One Academic UKI Edition
Computer and Information Systems Abstracts
Technology Research Database
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Advanced Technologies & Aerospace Collection
Computer Science Database
ProQuest Central Student
Technology Collection
ProQuest One Academic Middle East (New)
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
ProQuest One Academic Eastern Edition
SciTech Premium Collection
ProQuest One Community College
ProQuest Technology Collection
ProQuest SciTech Collection
ProQuest Central
Advanced Technologies & Aerospace Database
ProQuest One Applied & Life Sciences
ProQuest One Academic UKI Edition
ProQuest Central Korea
ProQuest Central (New)
ProQuest One Academic
ProQuest One Academic (New)
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
DatabaseTitleList Computer and Information Systems Abstracts
Advanced Technologies & Aerospace Collection


Database_xml – sequence: 1
  dbid: P5Z
  name: Advanced Technologies & Aerospace Database
  url: https://search.proquest.com/hightechjournals
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
DocumentTitleAlternate OpenMP compiler for distributed memory architectures
EISSN 1869-1919
EndPage 944
ExternalDocumentID 10_1007_s11432_010_0074_0
33670622
GroupedDBID -59
-5G
-BR
-EM
-Y2
-~C
.VR
06D
0VY
1N0
2B.
2C.
2J2
2JN
2JY
2KG
2KM
2LR
2RA
2VQ
2~H
30V
4.4
406
40D
40E
5VR
5VS
8TC
8UJ
92E
92I
92L
92Q
93N
95-
95.
96X
AAAVM
AABHQ
AAFGU
AAHNG
AAIAL
AAJKR
AANZL
AAPBV
AARHV
AARTL
AATVU
AAUYE
AAWCG
AAYIU
AAYQN
AAYTO
ABBBX
ABDZT
ABECU
ABFGW
ABFTV
ABHQN
ABJOX
ABKAS
ABKCH
ABKTR
ABMQK
ABNWP
ABQBU
ABSXP
ABTEG
ABTHY
ABTMW
ABXPI
ACBMV
ACBRV
ACBXY
ACBYP
ACGFO
ACGFS
ACHSB
ACHXU
ACIGE
ACIPQ
ACKNC
ACMDZ
ACMLO
ACOKC
ACOMO
ACREN
ACSNA
ACTTH
ACVWB
ACWMK
ADHIR
ADINQ
ADKNI
ADKPE
ADMDM
ADRFC
ADTIX
ADURQ
ADYFF
ADYOE
ADZKW
AEBTG
AEFTE
AEGAL
AEGNC
AEJHL
AEJRE
AEOHA
AEPYU
AESTI
AETLH
AEVTX
AEXYK
AFLOW
AFQWF
AFUIB
AFWTZ
AFYQB
AFZKB
AGAYW
AGDGC
AGGBP
AGJBK
AGMZJ
AGQMX
AGWIL
AGWZB
AGYKE
AHAVH
AHBYD
AHSBF
AHYZX
AIAKS
AIIXL
AILAN
AIMYW
AITGF
AJBLW
AJDOV
AJRNO
AJZVZ
AKQUC
ALMA_UNASSIGNED_HOLDINGS
ALWAN
AMKLP
AMTXH
AMYLF
ARMRJ
ASPBG
AVWKF
AXYYD
AZFZN
B-.
BDATZ
BGNMA
CAG
CCEZO
CDYEO
CHBEP
COF
CQIGP
CSCUP
CUBFJ
CW9
DNIVK
EBLON
EBS
EIOEI
EJD
ESBYG
FA0
FEDTE
FERAY
FFXSO
FIGPU
FINBP
FNLPD
FRRFC
FSGXE
FWDCC
GGCAI
GGRSB
GJIRD
GNWQR
GQ6
GQ7
H13
HG6
HMJXF
HRMNR
HVGLF
HZ~
IJ-
IPNFZ
IXD
I~X
I~Z
J-C
JBSCW
JZLTJ
KOV
LLZTM
M4Y
MA-
N2Q
NB0
NQJWS
NU0
O9J
P9O
PF0
PT4
QOS
R89
RIG
ROL
RSV
S16
S3B
SAP
SCL
SCO
SHX
SISQX
SJYHP
SNE
SNX
SOJ
SPISZ
SRMVM
SSLCW
STPWE
SZN
TCJ
TGP
TR2
TSG
TUC
U2A
UG4
UNUBA
UOJIU
UTJUX
UZXMN
VC2
VFIZW
W23
W48
W92
WK8
YLTOR
Z7R
Z7X
Z83
Z88
ZMTXR
~A9
~WA
-SI
-S~
0R~
AACDK
AAJBT
AASML
AATNV
AAXDM
AAYZH
ABAKF
ABJNI
ABQSL
ABTKH
ABWNU
ACAOD
ACDTI
ACZOJ
ADTPH
AEFQL
AEMSY
AESKC
AEVLU
AFKRA
AGQEE
AGRTI
AIGIU
AMXSW
AOCGG
ARAPS
BENPR
BGLVJ
BSONS
CAJEI
CCPQU
CJPJV
DDRTE
DPUIP
HCIFZ
IKXTQ
IWAJR
K7-
NPVJJ
Q--
SNPRN
SOHCF
U1G
U5S
AAPKM
AAYXX
ABBRH
ABDBE
ABRTQ
ADHKG
AFDZB
AFFHD
AFOHR
AGQPQ
AHPBZ
ATHPR
AYFIA
CITATION
PHGZM
PHGZT
PQGLB
8FE
8FG
AZQEC
DWQXO
GNUQQ
JQ2
P62
PKEHL
PQEST
PQQKQ
PQUKI
7SC
8FD
L7M
L~C
L~D
PUEGO
ID FETCH-LOGICAL-c375t-400eb1a298333fbc676249889ce2caefd9f89183ab6af6b9e6413c557f7abbab3
IEDL.DBID K7-
ISICitedReferencesCount 5
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000277417400005&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1674-733X
IngestDate Fri Sep 05 11:47:54 EDT 2025
Fri Nov 07 23:37:12 EST 2025
Tue Nov 18 21:47:41 EST 2025
Sat Nov 29 03:42:07 EST 2025
Fri Feb 21 02:33:10 EST 2025
Thu Nov 24 20:27:24 EST 2022
IsPeerReviewed true
IsScholarly true
Issue 5
Keywords irregular application
high performance computing
distributed memory architecture
OpenMP
parallel compiling
Language English
License http://www.springer.com/tdm
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c375t-400eb1a298333fbc676249889ce2caefd9f89183ab6af6b9e6413c557f7abbab3
Notes parallel compiling, high performance computing, distributed memory architecture, OpenMP, irregular application
11-5847/TP
P456.7
TP333.1
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Article-2
ObjectType-Feature-1
content type line 23
PQID 2918618867
PQPubID 2043626
PageCount 13
ParticipantIDs proquest_miscellaneous_869831805
proquest_journals_2918618867
crossref_primary_10_1007_s11432_010_0074_0
crossref_citationtrail_10_1007_s11432_010_0074_0
springer_journals_10_1007_s11432_010_0074_0
chongqing_backfile_33670622
PublicationCentury 2000
PublicationDate 2010-05-01
PublicationDateYYYYMMDD 2010-05-01
PublicationDate_xml – month: 05
  year: 2010
  text: 2010-05-01
  day: 01
PublicationDecade 2010
PublicationPlace Heidelberg
PublicationPlace_xml – name: Heidelberg
PublicationTitle Science China. Information sciences
PublicationTitleAbbrev Sci. China Inf. Sci
PublicationTitleAlternate SCIENCE CHINA Information Sciences
PublicationYear 2010
Publisher SP Science China Press
Springer Nature B.V
Publisher_xml – name: SP Science China Press
– name: Springer Nature B.V
References OpenMP Architecture Review Board. OpenMP Application Program Interface, version 2.5, 2005
Jeun W C, Kee Y S, Ha S. Improving performance of OpenMP for SMP clusters through overlapped page migrations. In: Proc. of International Workshop on OpenMP, Reims, France, 2006
BasumallikA.EigenmannR.Optimizing irregular shared-memory applications for distributed-memory systemsProc. of the Eleventh ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming2006New YorkACM Press11912810.1145/1122971.1122990
LiZ.Array privatization for parallel execution of loopsProc. of the ACM International Conference on Supercomputing1992New YorkACM Press313322
BasumallikA.EigenmannR.Towards automatic translation of OpenMP to MPIProc. of the 19th Annual International Conference on Supercomputing2005New YorkACM Press18919810.1145/1088149.1088174
CostaJ. J.CortesT.MartorellX.Running OpenMP applications efficiently on an everything-shared SDSMJ Parall Distrib Comput2006666476581101.6845810.1016/j.jpdc.2005.06.018
DortaA.LopezP.SandeF.Basic skeletons in llcParall Comput20063249150610.1016/j.parco.2006.07.001
HuC.LiJ.WangJ.Communication generation for irregular parallel applicationsProc. of IEEE International Symposium on Parallel Computing in Electrical Engineering2006New YorkIEEE Press263270
HaghighatM. R.PolychronopoulosC. D.Symbolic analysis for parallelizing compilersACM Trans Program Languag Syst19961847751810.1145/233561.233568
Min S J, Eigenmann R. Combined compile-time and runtime-driven, pro-active data movement in software DSM systems. In: Proc. of Seventh Workshop on Languages, Compilers, and Run-time Support for Scalable Systems, Houston, Texas, 2004. 1–6
WangJ.HuC. J.ZhangJ. L.An optimized strategy for collective communication in data parallelism (in Chinese)Chinese J Comput200823183282432879
Ojima Y, Sato M, Harada H, et al. Performance of cluster-enabled OpenMP for the SCASH software distributed shared memory system, cluster computing and the grid. In: Proc. of 3rd IEEE/ACM International Symposium on CCGrid, Tokyo, Japan, 2003. 450–456
AslotV.DomeikaM.EigenmannR.SPEComp: A new benchmark suite for measuring parallel computer performanceProc. of the Workshop on OpenMP Applications and Tools2001BelinSpringer110
EachempatiD.HuangL.ChapmanB. M.Strategies and implementation for translating OpenMP code for clustersProc. of High Performance Computing and Communications2007BelinSpringer42043110.1007/978-3-540-75444-2_42
Brandes T. ADAPTOR Users Guide, Fraunhofer Gesellschaft, Augustin, Germany, 2004
Michelle M, Barbara K, Paul D. Data-flow analysis for MPI programs. In: Proceedings of the 2006 International Conference on Parallel Processing, Columbus, Ohio, USA, 2006. 175–184
MPICH2.1.0.7, http://www.mcs.anl.gov/research/projects/mpich2/, March 21, 2008
BrooksB. R.BruccoleriR. E.OlafsonB. D.A program for macromolecular energy, minimization, and dynamics calculationsJ Comp Chem1983418721710.1002/jcc.540040211
LuH. H.Quantifying the performance differences between PVM and TreadMarksJ Parall Distrib Comput199743657810.1006/jpdc.1997.1332
EngelenR.BirchJ.ShouY.A unified framework for nonlinear dependence testing and symbolic analysisProc. of the ACM International Conference on Supercomputing2004New YorkACM Press106115
Brezany P, Dang M. CHAOS+ Runtime Library. Internal Report, Institute for Software Technology and Parallel Systems, University of Vienna, September 1997
WangJ.HuC.ZhangJ.OpenMP extensions for irregular parallel applications on cluster international workshop on OpenMP2007BerlinSpringer Publisher101111
BasumallikA.MinS.EigenmannR.Programming distributed memory systems using OpenMPProc. of International Parallel and Distributed Processing Symposium2007New YorkIEEE Press18
COSMIC group, University of Maryland. COSMIC software for irregular applications. http://www.cs.umd.edu/projects/osmic/software.html
TsengE.GaudlotJ.Communication generation for aligned and cyclic(k) distributions using integer latticeIEEE Trans Parallel Distrib Syst19991013614610.1109/71.752780
SatoM.SatohS.KusanoK.Design of OpenMP compiler for an SMP clusterProc. of the 1st European Workshop on OpenMP1999BelinSpringer3239
EigenmannR.HoeflingerJ.KuhnR. H.Is OpenMP for Grids?Proc. of International Parallel and Distributed Processing Symposium2002New YorkIEEE Press17117810.1109/IPDPS.2002.1016571
Jin H, Frumkin M, Yan J. The OpenMP implementation of NAS parallel benchmarks and its performance. Technical Report NAS-99-011, 1999
PetersenP.PaduaD. A.Static and dynamic evaluation of data dependence analysis techniquesIEEE Trans Parall Distrib Syst199671121113210.1109/71.544354
J. J. Costa (74_CR3) 2006; 66
74_CR20
A. Dorta (74_CR10) 2006; 32
H. H. Lu (74_CR5) 1997; 43
V. Aslot (74_CR15) 2001
A. Basumallik (74_CR8) 2006
R. Engelen (74_CR23) 2004
C. Hu (74_CR26) 2006
74_CR29
A. Basumallik (74_CR6) 2007
D. Eachempati (74_CR13) 2007
J. Wang (74_CR22) 2008; 2
74_CR21
74_CR4
P. Petersen (74_CR19) 1996; 7
74_CR1
A. Basumallik (74_CR7) 2005
R. Eigenmann (74_CR11) 2002
74_CR9
B. R. Brooks (74_CR17) 1983; 4
E. Tseng (74_CR28) 1999; 10
74_CR18
74_CR14
Z. Li (74_CR24) 1992
M. Sato (74_CR2) 1999
74_CR16
M. R. Haghighat (74_CR25) 1996; 18
J. Wang (74_CR27) 2007
74_CR12
References_xml – reference: Jeun W C, Kee Y S, Ha S. Improving performance of OpenMP for SMP clusters through overlapped page migrations. In: Proc. of International Workshop on OpenMP, Reims, France, 2006
– reference: WangJ.HuC. J.ZhangJ. L.An optimized strategy for collective communication in data parallelism (in Chinese)Chinese J Comput200823183282432879
– reference: BrooksB. R.BruccoleriR. E.OlafsonB. D.A program for macromolecular energy, minimization, and dynamics calculationsJ Comp Chem1983418721710.1002/jcc.540040211
– reference: MPICH2.1.0.7, http://www.mcs.anl.gov/research/projects/mpich2/, March 21, 2008
– reference: Jin H, Frumkin M, Yan J. The OpenMP implementation of NAS parallel benchmarks and its performance. Technical Report NAS-99-011, 1999
– reference: BasumallikA.EigenmannR.Optimizing irregular shared-memory applications for distributed-memory systemsProc. of the Eleventh ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming2006New YorkACM Press11912810.1145/1122971.1122990
– reference: BasumallikA.EigenmannR.Towards automatic translation of OpenMP to MPIProc. of the 19th Annual International Conference on Supercomputing2005New YorkACM Press18919810.1145/1088149.1088174
– reference: EachempatiD.HuangL.ChapmanB. M.Strategies and implementation for translating OpenMP code for clustersProc. of High Performance Computing and Communications2007BelinSpringer42043110.1007/978-3-540-75444-2_42
– reference: BasumallikA.MinS.EigenmannR.Programming distributed memory systems using OpenMPProc. of International Parallel and Distributed Processing Symposium2007New YorkIEEE Press18
– reference: LuH. H.Quantifying the performance differences between PVM and TreadMarksJ Parall Distrib Comput199743657810.1006/jpdc.1997.1332
– reference: WangJ.HuC.ZhangJ.OpenMP extensions for irregular parallel applications on cluster international workshop on OpenMP2007BerlinSpringer Publisher101111
– reference: OpenMP Architecture Review Board. OpenMP Application Program Interface, version 2.5, 2005
– reference: PetersenP.PaduaD. A.Static and dynamic evaluation of data dependence analysis techniquesIEEE Trans Parall Distrib Syst199671121113210.1109/71.544354
– reference: EngelenR.BirchJ.ShouY.A unified framework for nonlinear dependence testing and symbolic analysisProc. of the ACM International Conference on Supercomputing2004New YorkACM Press106115
– reference: Min S J, Eigenmann R. Combined compile-time and runtime-driven, pro-active data movement in software DSM systems. In: Proc. of Seventh Workshop on Languages, Compilers, and Run-time Support for Scalable Systems, Houston, Texas, 2004. 1–6
– reference: SatoM.SatohS.KusanoK.Design of OpenMP compiler for an SMP clusterProc. of the 1st European Workshop on OpenMP1999BelinSpringer3239
– reference: CostaJ. J.CortesT.MartorellX.Running OpenMP applications efficiently on an everything-shared SDSMJ Parall Distrib Comput2006666476581101.6845810.1016/j.jpdc.2005.06.018
– reference: EigenmannR.HoeflingerJ.KuhnR. H.Is OpenMP for Grids?Proc. of International Parallel and Distributed Processing Symposium2002New YorkIEEE Press17117810.1109/IPDPS.2002.1016571
– reference: Michelle M, Barbara K, Paul D. Data-flow analysis for MPI programs. In: Proceedings of the 2006 International Conference on Parallel Processing, Columbus, Ohio, USA, 2006. 175–184
– reference: TsengE.GaudlotJ.Communication generation for aligned and cyclic(k) distributions using integer latticeIEEE Trans Parallel Distrib Syst19991013614610.1109/71.752780
– reference: DortaA.LopezP.SandeF.Basic skeletons in llcParall Comput20063249150610.1016/j.parco.2006.07.001
– reference: Brezany P, Dang M. CHAOS+ Runtime Library. Internal Report, Institute for Software Technology and Parallel Systems, University of Vienna, September 1997
– reference: HaghighatM. R.PolychronopoulosC. D.Symbolic analysis for parallelizing compilersACM Trans Program Languag Syst19961847751810.1145/233561.233568
– reference: HuC.LiJ.WangJ.Communication generation for irregular parallel applicationsProc. of IEEE International Symposium on Parallel Computing in Electrical Engineering2006New YorkIEEE Press263270
– reference: LiZ.Array privatization for parallel execution of loopsProc. of the ACM International Conference on Supercomputing1992New YorkACM Press313322
– reference: AslotV.DomeikaM.EigenmannR.SPEComp: A new benchmark suite for measuring parallel computer performanceProc. of the Workshop on OpenMP Applications and Tools2001BelinSpringer110
– reference: COSMIC group, University of Maryland. COSMIC software for irregular applications. http://www.cs.umd.edu/projects/osmic/software.html
– reference: Brandes T. ADAPTOR Users Guide, Fraunhofer Gesellschaft, Augustin, Germany, 2004
– reference: Ojima Y, Sato M, Harada H, et al. Performance of cluster-enabled OpenMP for the SCASH software distributed shared memory system, cluster computing and the grid. In: Proc. of 3rd IEEE/ACM International Symposium on CCGrid, Tokyo, Japan, 2003. 450–456
– ident: 74_CR4
  doi: 10.1145/1066650.1066661
– start-page: 313
  volume-title: Proc. of the ACM International Conference on Supercomputing
  year: 1992
  ident: 74_CR24
– start-page: 1
  volume-title: Proc. of the Workshop on OpenMP Applications and Tools
  year: 2001
  ident: 74_CR15
– start-page: 263
  volume-title: Proc. of IEEE International Symposium on Parallel Computing in Electrical Engineering
  year: 2006
  ident: 74_CR26
– volume: 2
  start-page: 318
  year: 2008
  ident: 74_CR22
  publication-title: Chinese J Comput
– start-page: 106
  volume-title: Proc. of the ACM International Conference on Supercomputing
  year: 2004
  ident: 74_CR23
– ident: 74_CR21
– start-page: 32
  volume-title: Proc. of the 1st European Workshop on OpenMP
  year: 1999
  ident: 74_CR2
– volume: 32
  start-page: 491
  year: 2006
  ident: 74_CR10
  publication-title: Parall Comput
  doi: 10.1016/j.parco.2006.07.001
– start-page: 420
  volume-title: Proc. of High Performance Computing and Communications
  year: 2007
  ident: 74_CR13
  doi: 10.1007/978-3-540-75444-2_42
– volume: 4
  start-page: 187
  year: 1983
  ident: 74_CR17
  publication-title: J Comp Chem
  doi: 10.1002/jcc.540040211
– volume: 43
  start-page: 65
  year: 1997
  ident: 74_CR5
  publication-title: J Parall Distrib Comput
  doi: 10.1006/jpdc.1997.1332
– ident: 74_CR12
– ident: 74_CR14
– start-page: 119
  volume-title: Proc. of the Eleventh ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
  year: 2006
  ident: 74_CR8
– volume: 18
  start-page: 477
  year: 1996
  ident: 74_CR25
  publication-title: ACM Trans Program Languag Syst
  doi: 10.1145/233561.233568
– start-page: 101
  volume-title: OpenMP extensions for irregular parallel applications on cluster international workshop on OpenMP
  year: 2007
  ident: 74_CR27
– start-page: 171
  volume-title: Proc. of International Parallel and Distributed Processing Symposium
  year: 2002
  ident: 74_CR11
– ident: 74_CR16
– ident: 74_CR18
– volume: 10
  start-page: 136
  year: 1999
  ident: 74_CR28
  publication-title: IEEE Trans Parallel Distrib Syst
  doi: 10.1109/71.752780
– ident: 74_CR1
– volume: 7
  start-page: 1121
  year: 1996
  ident: 74_CR19
  publication-title: IEEE Trans Parall Distrib Syst
  doi: 10.1109/71.544354
– ident: 74_CR9
– ident: 74_CR20
– volume: 66
  start-page: 647
  year: 2006
  ident: 74_CR3
  publication-title: J Parall Distrib Comput
  doi: 10.1016/j.jpdc.2005.06.018
– start-page: 189
  volume-title: Proc. of the 19th Annual International Conference on Supercomputing
  year: 2005
  ident: 74_CR7
  doi: 10.1145/1088149.1088174
– start-page: 1
  volume-title: Proc. of International Parallel and Distributed Processing Symposium
  year: 2007
  ident: 74_CR6
– ident: 74_CR29
  doi: 10.1109/CCGRID.2003.1199400
SSID ssj0000330278
Score 1.8428456
Snippet OpenMP is an emerging industry standard for shared memory architectures. While OpenMP has advantages on its ease of use and incremental programming, message...
SourceID proquest
crossref
springer
chongqing
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 932
SubjectTerms Algorithms
Arrays
Computational chemistry
Computational fluid dynamics
Computer Science
Computer simulation
Distributed memory
Information Systems and Communication Service
Message passing
Molecular dynamics
Nonlinearity
OpenMP
Optimization techniques
Performance evaluation
Programming
Research Papers
Seismic phenomena
Sorting
内存架构
分布式
化学性能
程序分析
编程模型
编译器
计算流体动力学
SummonAdditionalLinks – databaseName: SpringerLINK Contemporary 1997-Present
  dbid: RSV
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3LS_QwEB9EPejBt7i-yOE7KYVu06bJUUTxoCLoJ3sLeVVF7ep2FfzvnantrooKem6atJPMKzPzG4B_Am0gLl0auS56Oql3JrJGqcgo73OPJoKoL_Qvj_PTU9nrqbOmjrtqs93bkGQtqcfFbqjaKY2AKqHzNEI_fSojsBly0c8vRxcrMadQXF0CJyjXkPNeG838ahbCVLjul1ePuOJH3TQ2OD_FSGvVczj_p49egLnG0mR7b0djESZCuQTzbRcH1jD1Esy-gyRchpQyTE7OGKWao8AYMDRqmSd0XWqMFTy7p9TcF_Y-AFGtwP_Dg4v9o6jprBA5nmdDdBpjlNEmUZJzXlgnUCSmSkrlQuJMKLwqpEJmN1aYQlgVBOo6l2V5kRtrjeWrMFn2y7AGrEAGD7F1GQrKNLhMCS-tL4RPc69iH3dgY0Rf1MzulvCmNCfcOJEkHYhbimvXgJJTb4w7PYZTJgpqpKAmCmqccGf0ysMbIsdPgzfbbdQNc1Y6wV8TXSlF3gE2eoxsRbESU4b-U6WlQNp0ZZx1YLfd2fEM3663_qvRGzDTJiPE3U2YHA6ewhZMu-fhTTXYro_1K-U47_E
  priority: 102
  providerName: Springer Nature
Title OpenMP compiler for distributed memory architectures
URI https://link.springer.com/article/10.1007/s11432-010-0074-0
https://www.proquest.com/docview/2918618867
https://www.proquest.com/docview/869831805
Volume 53
WOSCitedRecordID wos000277417400005&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVPQU
  databaseName: Advanced Technologies & Aerospace Database
  customDbUrl:
  eissn: 1869-1919
  dateEnd: 20241214
  omitProxy: false
  ssIdentifier: ssj0000330278
  issn: 1674-733X
  databaseCode: P5Z
  dateStart: 20010201
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/hightechjournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Computer Science Database
  customDbUrl:
  eissn: 1869-1919
  dateEnd: 20241214
  omitProxy: false
  ssIdentifier: ssj0000330278
  issn: 1674-733X
  databaseCode: K7-
  dateStart: 20010201
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/compscijour
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl:
  eissn: 1869-1919
  dateEnd: 20241214
  omitProxy: false
  ssIdentifier: ssj0000330278
  issn: 1674-733X
  databaseCode: BENPR
  dateStart: 20010201
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVAVX
  databaseName: SpringerLINK Contemporary 1997-Present
  customDbUrl:
  eissn: 1869-1919
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0000330278
  issn: 1674-733X
  databaseCode: RSV
  dateStart: 20100101
  isFulltext: true
  titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22
  providerName: Springer Nature
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LT9wwEB5R6KE9AH2gLrugHHpqFdWJE8c-VYBASMBqRVu06sXyK4CALGyWSv33nQnJLq0El158cWJbM_bMeGb8DcBHgTYQly6LXYI3ncw7E1ujVGyU94VHE0E0Dv2z42I4lOOxGrUOt7pNq-xkYiOo_cSRj_xLqhIpEilF8fX2LqaqURRdbUtovIAVnCqhCgZHRTz3sTBOUbnmNZygtEPOx11gs3k9h7YC5SXQ02rsZgSvcDGpzu9Qafytpha25z_h0kYLHaz97_rXYbW1P6Odhw3zBpZC9RZeP0IlfAcZJZmcjCLKNkeZMY3Qro08AexSbazgoxvKzv0dPY5B1O_hx8H-973DuC2uEDte5DO8NzIU0yZVknNeWidQKmZKSuVC6kwovSolLpkbK0wprAoC1Z3L86IsjLXG8g1YriZV-ABRiWc8MOtylJVZcLkSXlpfCp8VXjHPetCf0xWVs7siyCnNCTpOpGkPWEdp7VpcciqPca0XiMrEKI2M0sQojQN-mv9y-wDK8dzHg44huj2ftV5wowfRvBtPFoVLTBUm97WWAmmTSJb34HPH9cUIT863-fx8fXjVJSCwZADLs-l92IKX7tfssp5uw8ru_nB0ut3sY2xH-U9sT7-d_QHKEvgN
linkProvider ProQuest
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1Lb9QwEB6VLRL0wBuxbYEc4AKKcOLEsQ8I8WjVqtvVChW0N9evAIJm2822qH-K38hMNtktSPTWA2cndpQZfzP2zHwD8EygD8Sly2KX4Ekn887E1igVG-V94dFFEM2F_udBMRzK8ViNVuBXVwtDaZUdJjZA7SeO7shfpSqRIpFSFG-OT2LqGkXR1a6Fxlwt9sL5Tzyy1a93P6B8n6fp9tbB-5247SoQO17kMzwwMcQnkyrJOS-tEwgHmZJSuZA6E0qvSolrcWOFKYVVQSDOuzwvysJYayzHea_BasYzkfdg9d3WcPRxcavDOMUBm_o7QYmOnI-7UGpTr4feCWVCUDE3DjMidPg6qb6coJn60zAuvd2_ArSN3du-_b_9sTtwq_Wwo7fzLXEXVkJ1D9Yu8C7eh4zSaPZHEeXTIypOI_TcI08UwtT9K_joiPKPz6OLUZb6AXy6ks9-CL1qUoVHEJWIYoFZl6M1yILLlfDS-lL4rPCKedaHjYUc0f1w34lUS3MixxNp2gfWSVa7lnmdGoD80EvOaFIMjYqhSTE0Tvhi8crxnHbksoc3OwXQLQLVein9PkSLYcQOCgiZKkxOay0F_ptEsrwPLzstW87wz_XWL1_vKdzYOdgf6MHucG8DbnbpFizZhN5sehoew3V3NvtWT5-0uyeCw6tWv9-knVSK
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3dT9swED8hhib2sPKpdWXMDzxtikjjxLEf0bYKBKsq2FDfLH8FEFvKmoLEf89dm7Rs2iYhnuPY8tn35bv7HcCeQBuIS5dGroueTuqdiaxRKjLK-9yjiSCmD_rnJ3m_L4dDNaj7nFZNtnsTkpzVNBBKUznZv_HF_qLwDdU8pRRQVXSeRuizv0jRkaGcrtOz8_kjS8wpLDcthxOUd8j5sIls_m0Wwle4HJUXv3D13_XUwvj8I146VUO91rM3sAavawuUHcyuzDoshXIDWk13B1Yz-wa8egRVuAkpZZ58HTBKQUdBMmZo7DJPqLvUMCt49pNSdu_Z48BEtQXfe1--fTqM6o4LkeN5NkFnMkbZbRIlOeeFdQJFZaqkVC4kzoTCq0IqFALGClMIq4JAHeiyLC9yY62xfBuWy1EZ3gArkPFDbF2GAjQNLlPCS-sL4dPcq9jHbejMaY0a210TDpXmhCcnkqQNcUN97WqwcuqZ8UMvYJaJghopqImCGif8MP_lZobU8b_BO82R6pppK53g1kRXSpG3gc0_I7tRDMWUYXRbaSmQNl0ZZ2342JzyYoZ_rvf2SaPfw8vB554-Oeofd2C1yVeIuzuwPBnfhnew4u4mV9V4d3rbHwCZD_u5
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=OpenMP+compiler+for+distributed+memory+architectures&rft.jtitle=Science+China.+Information+sciences&rft.au=Wang%2C+Jue&rft.au=Hu%2C+ChangJun&rft.au=Zhang%2C+JiLin&rft.au=Li%2C+JianJiang&rft.date=2010-05-01&rft.pub=Springer+Nature+B.V&rft.issn=1674-733X&rft.eissn=1869-1919&rft.volume=53&rft.issue=5&rft.spage=932&rft.epage=944&rft_id=info:doi/10.1007%2Fs11432-010-0074-0
thumbnail_s http://cvtisr.summon.serialssolutions.com/2.0.0/image/custom?url=http%3A%2F%2Fimage.cqvip.com%2Fvip1000%2Fqk%2F84009A%2F84009A.jpg