MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory

Hybrid parallel programming with the message passing interface (MPI) for internode communication in conjunction with a shared-memory programming model to manage intranode parallelism has become a dominant approach to scalable parallel programming. While this model provides a great deal of flexibilit...

Full description

Saved in:
Bibliographic Details
Published in:Computing Vol. 95; no. 12; pp. 1121 - 1136
Main Authors: Hoefler, Torsten, Dinan, James, Buntinas, Darius, Balaji, Pavan, Barrett, Brian, Brightwell, Ron, Gropp, William, Kale, Vivek, Thakur, Rajeev
Format: Journal Article
Language:English
Published: Vienna Springer Vienna 01.12.2013
Springer Nature B.V
Subjects:
ISSN:0010-485X, 1436-5057
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Hybrid parallel programming with the message passing interface (MPI) for internode communication in conjunction with a shared-memory programming model to manage intranode parallelism has become a dominant approach to scalable parallel programming. While this model provides a great deal of flexibility and performance potential, it saddles programmers with the complexity of utilizing two parallel programming systems in the same application. We introduce an MPI-integrated shared-memory programming model that is incorporated into MPI through a small extension to the one-sided communication interface. We discuss the integration of this interface with the MPI 3.0 one-sided semantics and describe solutions for providing portable and efficient data sharing, atomic operations, and memory consistency. We describe an implementation of the new interface in the MPICH2 and Open MPI implementations and demonstrate an average performance improvement of 40 % to the communication component of a five-point stencil solver.
AbstractList Hybrid parallel programming with the message passing interface (MPI) for internode communication in conjunction with a shared-memory programming model to manage intranode parallelism has become a dominant approach to scalable parallel programming. While this model provides a great deal of flexibility and performance potential, it saddles programmers with the complexity of utilizing two parallel programming systems in the same application. We introduce an MPI-integrated shared-memory programming model that is incorporated into MPI through a small extension to the one-sided communication interface. We discuss the integration of this interface with the MPI 3.0 one-sided semantics and describe solutions for providing portable and efficient data sharing, atomic operations, and memory consistency. We describe an implementation of the new interface in the MPICH2 and Open MPI implementations and demonstrate an average performance improvement of 40 % to the communication component of a five-point stencil solver.
Hybrid parallel programming with the message passing interface (MPI) for internode communication in conjunction with a shared-memory programming model to manage intranode parallelism has become a dominant approach to scalable parallel programming. While this model provides a great deal of flexibility and performance potential, it saddles programmers with the complexity of utilizing two parallel programming systems in the same application. We introduce an MPI-integrated shared-memory programming model that is incorporated into MPI through a small extension to the one-sided communication interface. We discuss the integration of this interface with the MPI 3.0 one-sided semantics and describe solutions for providing portable and efficient data sharing, atomic operations, and memory consistency. We describe an implementation of the new interface in the MPICH2 and Open MPI implementations and demonstrate an average performance improvement of 40 % to the communication component of a five-point stencil solver.[PUBLICATION ABSTRACT]
Author Balaji, Pavan
Gropp, William
Dinan, James
Kale, Vivek
Barrett, Brian
Buntinas, Darius
Hoefler, Torsten
Thakur, Rajeev
Brightwell, Ron
Author_xml – sequence: 1
  givenname: Torsten
  surname: Hoefler
  fullname: Hoefler, Torsten
  email: htor@inf.ethz.ch
  organization: ETH Zurich
– sequence: 2
  givenname: James
  surname: Dinan
  fullname: Dinan, James
  organization: Argonne National Laboratory
– sequence: 3
  givenname: Darius
  surname: Buntinas
  fullname: Buntinas, Darius
  organization: Argonne National Laboratory
– sequence: 4
  givenname: Pavan
  surname: Balaji
  fullname: Balaji, Pavan
  organization: Argonne National Laboratory
– sequence: 5
  givenname: Brian
  surname: Barrett
  fullname: Barrett, Brian
  organization: Sandia National Laboratories
– sequence: 6
  givenname: Ron
  surname: Brightwell
  fullname: Brightwell, Ron
  organization: Sandia National Laboratories
– sequence: 7
  givenname: William
  surname: Gropp
  fullname: Gropp, William
  organization: University of Illinois at Urbana-Champaign
– sequence: 8
  givenname: Vivek
  surname: Kale
  fullname: Kale, Vivek
  organization: University of Illinois at Urbana-Champaign
– sequence: 9
  givenname: Rajeev
  surname: Thakur
  fullname: Thakur, Rajeev
  organization: Argonne National Laboratory
BookMark eNp9kEtLAzEUhYNUsK3-AHcBlxK9ecyk406Kj0JFFxXchUwm05kyL5Mppf_eDONCBN3cA5fz3Xs4MzRp2sYidEnhhgLIWw8QgyRAOQHOBGEnaEoFj0kEkZygKQAFIhbRxxmaeb8DAMYXyRRtXt5W-BqHeYc1buwBF8fUlRnWXedabQrct7jTTleVrXBYbZ2u67LZ4kPZFwOHu2rvsS-0sxmubd264zk6zXXl7cW3ztH748Nm-UzWr0-r5f2aGB4lPcmtzSCDNJHS6ChegEkTMJlkOkgSCw0solYwnmuTZnEqOOMCFjmXYE3GBJ-jq_FuyPW5t75Xu3bvmvBSUSESFkWMDi45uoxrvXc2V6bsdV-2Te90WSkKaqhQjRWqUKEaKlQskPQX2bmy1u74L8NGxgdvs7XuR6Y_oS8VKYOx
CitedBy_id crossref_primary_10_1051_epjn_2025035
crossref_primary_10_1145_3275443
crossref_primary_10_1002_cpe_4467
crossref_primary_10_1016_j_ultramic_2017_03_010
crossref_primary_10_1080_10106049_2019_1678679
crossref_primary_10_1088_1742_6596_2697_1_012035
crossref_primary_10_1016_j_future_2022_02_004
crossref_primary_10_1016_j_parco_2018_05_007
crossref_primary_10_1109_TAP_2021_3137455
crossref_primary_10_1016_j_cpc_2025_109743
crossref_primary_10_1016_j_parco_2015_07_004
crossref_primary_10_1016_j_foodchem_2021_130681
crossref_primary_10_1177_1094342014548772
crossref_primary_10_1109_TPDS_2021_3075157
crossref_primary_10_1007_s42241_019_0090_5
crossref_primary_10_1137_15M1026419
crossref_primary_10_1063_5_0094850
crossref_primary_10_1016_j_cam_2021_113706
crossref_primary_10_1145_3055600
crossref_primary_10_1016_j_applthermaleng_2025_128300
crossref_primary_10_1016_j_jpdc_2021_06_008
crossref_primary_10_1109_TPDS_2017_2703149
crossref_primary_10_1016_j_parco_2020_102669
crossref_primary_10_3390_mca26030052
Cites_doi 10.1007/978-3-642-33518-1_18
10.1145/1065010.1065042
10.1145/331532.331581
10.1109/MCSE.2010.122
10.1145/1379022.1375591
10.1109/MC.2006.180
10.1145/2076450.2076465
10.1137/0911008
10.1016/j.jcp.2009.01.017
10.1109/HiPC.2012.6507509
10.1016/S0167-8191(00)00072-7
10.1137/1.9780898718003
10.1109/PDP.2009.43
10.1002/1096-9128(200010)12:12<1121::AID-CPE531>3.0.CO;2-N
10.1145/1040305.1040336
10.1145/363911.363920
ContentType Journal Article
Copyright Springer-Verlag Wien 2013
Copyright_xml – notice: Springer-Verlag Wien 2013
DBID AAYXX
CITATION
3V.
7SC
7WY
7WZ
7XB
87Z
8AL
8AO
8FD
8FE
8FG
8FK
8FL
8G5
ABUWG
AFKRA
ARAPS
AZQEC
BENPR
BEZIV
BGLVJ
CCPQU
DWQXO
FRNLG
F~G
GNUQQ
GUQSH
HCIFZ
JQ2
K60
K6~
K7-
L.-
L.0
L7M
L~C
L~D
M0C
M0N
M2O
MBDVC
P5Z
P62
PHGZM
PHGZT
PKEHL
PQBIZ
PQBZA
PQEST
PQGLB
PQQKQ
PQUKI
Q9U
DOI 10.1007/s00607-013-0324-2
DatabaseName CrossRef
ProQuest Central (Corporate)
Computer and Information Systems Abstracts
ProQuest ABI/INFORM Collection
ABI/INFORM Global (PDF only)
ProQuest Central (purchase pre-March 2016)
ABI/INFORM Collection
Computing Database (Alumni Edition)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Central (Alumni) (purchase pre-March 2016)
ABI/INFORM Collection (Alumni Edition)
Research Library (Alumni Edition)
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials
ProQuest Central
ProQuest Business Premium Collection
Technology Collection
ProQuest One Community College
ProQuest Central Korea
Business Premium Collection (Alumni)
ABI/INFORM Global (Corporate)
ProQuest Central Student
Research Library Prep
SciTech Premium Collection
ProQuest Computer Science Collection
ProQuest Business Collection (Alumni Edition)
ProQuest Business Collection
Computer Science Database
ABI/INFORM Professional Advanced
ABI/INFORM Professional Standard
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
ABI/INFORM Global
Computing Database
Research Library
Research Library (Corporate)
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Premium
ProQuest One Academic
ProQuest One Academic Middle East (New)
ProQuest One Business
ProQuest One Business (Alumni)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic (retired)
ProQuest One Academic UKI Edition
ProQuest Central Basic
DatabaseTitle CrossRef
ABI/INFORM Global (Corporate)
ProQuest Business Collection (Alumni Edition)
ProQuest One Business
Research Library Prep
Computer Science Database
ProQuest Central Student
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest One Academic Middle East (New)
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
Research Library (Alumni Edition)
ProQuest Pharma Collection
ABI/INFORM Complete
ProQuest Central
ABI/INFORM Professional Advanced
ProQuest One Applied & Life Sciences
ABI/INFORM Professional Standard
ProQuest Central Korea
ProQuest Research Library
ProQuest Central (New)
Advanced Technologies Database with Aerospace
ABI/INFORM Complete (Alumni Edition)
Advanced Technologies & Aerospace Collection
Business Premium Collection
ABI/INFORM Global
ProQuest Computing
ABI/INFORM Global (Alumni Edition)
ProQuest Central Basic
ProQuest Computing (Alumni Edition)
ProQuest One Academic Eastern Edition
ProQuest Technology Collection
ProQuest SciTech Collection
ProQuest Business Collection
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
ProQuest One Academic UKI Edition
ProQuest One Business (Alumni)
ProQuest One Academic
ProQuest One Academic (New)
ProQuest Central (Alumni)
Business Premium Collection (Alumni)
DatabaseTitleList
ABI/INFORM Global (Corporate)
Database_xml – sequence: 1
  dbid: BENPR
  name: ProQuest Central
  url: https://www.proquest.com/central
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Mathematics
Computer Science
EISSN 1436-5057
EndPage 1136
ExternalDocumentID 3120839461
10_1007_s00607_013_0324_2
Genre Feature
GroupedDBID -4Z
-59
-5G
-BR
-EM
-Y2
-~C
-~X
.4S
.86
.DC
.VR
06D
0R~
0VY
1N0
1SB
2.D
203
28-
29F
2J2
2JN
2JY
2KG
2KM
2LR
2P1
2VQ
2~H
30V
3V.
4.4
406
408
409
40D
40E
5GY
5QI
5VS
67Z
6NX
6TJ
78A
7WY
8AO
8FE
8FG
8FL
8G5
8TC
8UJ
8VB
95-
95.
95~
96X
AAAVM
AABHQ
AACDK
AAHNG
AAIAL
AAJBT
AAJKR
AANZL
AAOBN
AARHV
AARTL
AASML
AATNV
AATVU
AAUYE
AAWCG
AAYIU
AAYQN
AAYTO
AAYZH
ABAKF
ABBBX
ABBXA
ABDBF
ABDZT
ABECU
ABFTD
ABFTV
ABHLI
ABHQN
ABJNI
ABJOX
ABKCH
ABKTR
ABMNI
ABMQK
ABNWP
ABQBU
ABQSL
ABSXP
ABTEG
ABTHY
ABTKH
ABTMW
ABULA
ABUWG
ABWNU
ABXPI
ACAOD
ACBXY
ACDTI
ACGFS
ACHSB
ACHXU
ACKNC
ACMDZ
ACMLO
ACOKC
ACOMO
ACPIV
ACUHS
ACZOJ
ADHHG
ADHIR
ADIMF
ADINQ
ADKNI
ADKPE
ADRFC
ADTPH
ADURQ
ADYFF
ADZKW
AEBTG
AEFIE
AEFQL
AEGAL
AEGNC
AEJHL
AEJRE
AEKMD
AEMOZ
AEMSY
AENEX
AEOHA
AEPYU
AESKC
AETLH
AEVLU
AEXYK
AFBBN
AFEXP
AFFNX
AFGCZ
AFKRA
AFLOW
AFQWF
AFWTZ
AFZKB
AGAYW
AGDGC
AGGDS
AGJBK
AGMZJ
AGQEE
AGQMX
AGRTI
AGWIL
AGWZB
AGYKE
AHAVH
AHBYD
AHKAY
AHQJS
AHSBF
AHYZX
AIAKS
AIGIU
AIIXL
AILAN
AITGF
AJBLW
AJRNO
AJZVZ
AKVCP
ALMA_UNASSIGNED_HOLDINGS
ALWAN
AMKLP
AMXSW
AMYLF
AMYQR
AOCGG
ARAPS
ARCSS
ARMRJ
ASPBG
AVWKF
AXYYD
AYJHY
AZFZN
AZQEC
B-.
B0M
BA0
BBWZM
BDATZ
BENPR
BEZIV
BGLVJ
BGNMA
BKOMP
BPHCQ
BSONS
CAG
CCPQU
COF
CS3
CSCUP
DDRTE
DL5
DNIVK
DPUIP
DWQXO
EAD
EAP
EBA
EBLON
EBR
EBS
EBU
ECS
EDO
EIOEI
EJD
EMK
EPL
ESBYG
EST
ESX
FEDTE
FERAY
FFXSO
FIGPU
FINBP
FNLPD
FRNLG
FRRFC
FSGXE
FWDCC
GGCAI
GGRSB
GJIRD
GNUQQ
GNWQR
GQ6
GQ7
GQ8
GROUPED_ABI_INFORM_COMPLETE
GUQSH
GXS
H13
HCIFZ
HF~
HG5
HG6
HMJXF
HQYDN
HRMNR
HVGLF
HZ~
I09
IHE
IJ-
IKXTQ
ITG
ITH
ITM
IWAJR
IXC
IZIGR
IZQ
I~X
I~Z
J-C
J0Z
JBSCW
JCJTX
JZLTJ
K1G
K60
K6V
K6~
K7-
KDC
KOV
KOW
LAS
LLZTM
M0C
M0N
M2O
M4Y
MA-
MK~
ML~
N2Q
N9A
NB0
NDZJH
NPVJJ
NQJWS
NU0
O9-
O93
O9G
O9I
O9J
OAM
P19
P2P
P62
P9O
PF0
PQBIZ
PQBZA
PQQKQ
PROAC
PT4
PT5
Q2X
QOK
QOS
QWB
R4E
R89
R9I
RHV
RIG
RNI
RNS
ROL
RPX
RSV
RZK
S16
S1Z
S26
S27
S28
S3B
SAP
SCJ
SCLPG
SCO
SDH
SDM
SHX
SISQX
SJYHP
SNE
SNPRN
SNX
SOHCF
SOJ
SPISZ
SRMVM
SSLCW
STPWE
SZN
T13
T16
TH9
TN5
TSG
TSK
TSV
TUC
TUS
U2A
UG4
UOJIU
UTJUX
UZXMN
VC2
VFIZW
W23
W48
WK8
YLTOR
Z45
Z7R
Z7X
Z7Z
Z81
Z83
Z88
Z8M
Z8N
Z8R
Z8T
Z8U
Z8W
Z92
ZL0
ZMTXR
~8M
~EX
AAPKM
AAYXX
ABBRH
ABDBE
ABFSG
ABRTQ
ACSTC
ADHKG
AEZWR
AFDZB
AFFHD
AFHIU
AFKWF
AFOHR
AGQPQ
AHPBZ
AHWEU
AIXLP
AMVHM
ATHPR
AYFIA
CITATION
PHGZM
PHGZT
PQGLB
7SC
7XB
8AL
8FD
8FK
JQ2
L.-
L.0
L7M
L~C
L~D
MBDVC
PKEHL
PQEST
PQUKI
Q9U
ID FETCH-LOGICAL-c359t-feed0d0b977ca5680cb90cd72a90c964a0251e423facbd6b4323408f370ecd243
IEDL.DBID RSV
ISICitedReferencesCount 51
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000326885100002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0010-485X
IngestDate Wed Nov 26 13:41:46 EST 2025
Sat Nov 29 03:51:35 EST 2025
Tue Nov 18 20:45:27 EST 2025
Fri Feb 21 02:26:12 EST 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 12
Keywords Hybrid parallel programming
68N19 other progamming techniques (objects-oriented, sequential, concurrent, automatic, etc.)
MPI-3.0
Shared memory
Language English
License http://www.springer.com/tdm
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c359t-feed0d0b977ca5680cb90cd72a90c964a0251e423facbd6b4323408f370ecd243
Notes SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
OpenAccessLink http://hdl.handle.net/20.500.11850/75528
PQID 1449255214
PQPubID 48322
PageCount 16
ParticipantIDs proquest_journals_1449255214
crossref_citationtrail_10_1007_s00607_013_0324_2
crossref_primary_10_1007_s00607_013_0324_2
springer_journals_10_1007_s00607_013_0324_2
PublicationCentury 2000
PublicationDate 20131200
2013-12-00
20131201
PublicationDateYYYYMMDD 2013-12-01
PublicationDate_xml – month: 12
  year: 2013
  text: 20131200
PublicationDecade 2010
PublicationPlace Vienna
PublicationPlace_xml – name: Vienna
– name: Wien
PublicationSubtitle Archives for Scientific Computing
PublicationTitle Computing
PublicationTitleAbbrev Computing
PublicationYear 2013
Publisher Springer Vienna
Springer Nature B.V
Publisher_xml – name: Springer Vienna
– name: Springer Nature B.V
References Wolf MM, Heroux MA, Boman EG (2010) Factors impacting performance of multithreaded sparse triangular solve. Technical report SAND2010-0331 presented at VECPAR’10
EslerKPKimJCeperleyDMShulenburgerLAccelerating quantum Monte Carlo simulations of real materials on GPU clustersComput Sci Eng2012141405110.1109/MCSE.2010.122
BoehmHJAdveSVYou do not know jack about shared variables or memory modelsCommun. ACM2012552485410.1145/2076450.2076465
Saad Y (2003) Iterative methods for sparse linear systems. Society for Industrial and Applied Mathematics
SmithLBullMDevelopment of mixed mode MPI/OpenMP applicationsSci Program200192,38398
Bhargava P (1997) MPI-LITE: multithreading support for MPI. http://pcl.cs.ucla.edu/projects/sesame/mpi_lite/mpi_lite.html
Shen K, Tang H, Yang T (1999) Adaptive two-level thread management for fast MPI execution on shared memory machines. In: Proceedings of the ACM/IEEE conference on supercomputing
Sandia National Laboratories (2012) Mantevo project. http://www.mantevo.org
Heroux MA, Brightwell R, Wolf MM (2011) Bi-modal MPI and MPI+threads computing on scalable multicore systems. IJHPCA (Submitted)
Boehm HJ (2005) Threads cannot be implemented as a library. In: Proceedings of the 2005 ACM SIGPLAN conference on programming language design and implementation. PLDI ’05, New York, NY, USA, ACM pp 261–268
Rabenseifner R, Hager G, Jost G (2009) Hybrid MPI/OpenMP parallel programming on clusters of multi-core SMP nodes. In: Proceedings of the 17th Euromicro international conference on parallel, distributed and network-based processing
TaftJRAchieving 60 GFLOP/s on the production CFD code OVERFLOW-MLPParallel Comput200127452153610.1016/S0167-8191(00)00072-70972.68083
BoehmHJAdveSVFoundations of the C++ concurrency memory modelSIGPLAN Not2008436687810.1145/1379022.1375591
EslerKPKimJCeperleyDMPurwantoWWalterEJKrakauerHZhangSKentPRCHennigRGUmrigarCBajdichMKolorenJMitasLSrinivasanAQuantum monte carlo algorithms for electronic structure at the petascale; the endstation projectJ Phys20081251012057
AhoAVSethiRUllmanJDCompilers: principles, techniques, and tools1986BostonAddison-Wesley Longman Publishing Co. Inc.
SmithLKentPDevelopment and performance of a mixed OpenMP/MPI quantum Monte Carlo codeConcurr Pract Exp200012121121112910.1002/1096-9128(200010)12:12<1121::AID-CPE531>3.0.CO;2-N1008.68557
Niu Q, Dinan J, Tirukkovalur S, Mitas L, Wagner L, Sadayappan P (2012) A global address space approach to automated data management for parallel quantum Monte Carlo applications. In: Proceedings 19th international conference on high performance computing. HiPC’12
Esler KP Einspline libaray. Online: http://einspline.svn.sourceforge.net
Hoefler T, Dinan J, Buntinas D, Balaji P, Barrett B, Brightwell R, Gropp W, Kale V, Thakur R (2012) Leveraging MPIs one-sided communication interface for shared-memory programming. In: Träff J, Benkner S, Dongarra J (eds) Recent advances in the message passing interface. vol 7490, pp 132–141
TangHShenKYangTProgram transformation and runtime support for threaded MPI execution on shared memory machinesACM Trans Program Lang Syst20002267370010.1145/363911.363920
MPI Forum (2012) MPI: a message-passing interface standard. version 3.0
Manson J, Pugh W, Adve SV (2005) The Java memory model. In: Proceedings of the 32nd ACM SIGPLAN-SIGACT symposium on principles of programming languages. POPL ’05, New York, ACM pp 378–391
LeeEAThe problem with threadsComputer2006395334210.1109/MC.2006.180
Pérachec M, Carribault P, Jourdren H (2009) MPC-MPI: an MPI implementation reducing the overall memory consumption. In: Proceedings of EuroPVM/MPI 2009, Springer, pp 94–103
Shirley D (2000) Enhancing MPI applications through selective use of shared memory on SMPs. In: Proceedings of the 1st SIAM conference on CSE
SaltzJHAggregation methods for solving sparse triangular systems on multiprocessorsSIAM J Sci Stat Comput1990111123144103223110.1137/09110080692.65009
Demaine E (1997) A threads-only MPI implementation for the development of parallel programs. In: Proceedings of the 11th international symposium on HPC systems. pp 153–163
WagnerLKBajdichMMitasLQwalk: a quantum monte carlo program for electronic structureJ Comput Phys200922893390340410.1016/j.jcp.2009.01.017
Los Alamos National Laboratory (2001) Unified parallel software users’ guide and reference manual. http://public.lanl.gov/ups/Doc_Directory/UserGuide/UserGuide.pdf
AV Aho (324_CR15) 1986
324_CR19
JR Taft (324_CR12) 2001; 27
KP Esler (324_CR29) 2012; 14
LK Wagner (324_CR25) 2009; 228
324_CR21
324_CR23
EA Lee (324_CR18) 2006; 39
324_CR26
324_CR27
324_CR9
JH Saltz (324_CR22) 1990; 11
H Tang (324_CR7) 2000; 22
324_CR20
L Smith (324_CR2) 2001; 9
324_CR1
324_CR4
HJ Boehm (324_CR17) 2008; 43
324_CR3
324_CR6
324_CR5
324_CR8
HJ Boehm (324_CR14) 2012; 55
KP Esler (324_CR24) 2008; 125
324_CR11
324_CR10
324_CR13
L Smith (324_CR28) 2000; 12
324_CR16
References_xml – reference: Manson J, Pugh W, Adve SV (2005) The Java memory model. In: Proceedings of the 32nd ACM SIGPLAN-SIGACT symposium on principles of programming languages. POPL ’05, New York, ACM pp 378–391
– reference: Hoefler T, Dinan J, Buntinas D, Balaji P, Barrett B, Brightwell R, Gropp W, Kale V, Thakur R (2012) Leveraging MPIs one-sided communication interface for shared-memory programming. In: Träff J, Benkner S, Dongarra J (eds) Recent advances in the message passing interface. vol 7490, pp 132–141
– reference: Bhargava P (1997) MPI-LITE: multithreading support for MPI. http://pcl.cs.ucla.edu/projects/sesame/mpi_lite/mpi_lite.html
– reference: SmithLKentPDevelopment and performance of a mixed OpenMP/MPI quantum Monte Carlo codeConcurr Pract Exp200012121121112910.1002/1096-9128(200010)12:12<1121::AID-CPE531>3.0.CO;2-N1008.68557
– reference: AhoAVSethiRUllmanJDCompilers: principles, techniques, and tools1986BostonAddison-Wesley Longman Publishing Co. Inc.
– reference: SaltzJHAggregation methods for solving sparse triangular systems on multiprocessorsSIAM J Sci Stat Comput1990111123144103223110.1137/09110080692.65009
– reference: Demaine E (1997) A threads-only MPI implementation for the development of parallel programs. In: Proceedings of the 11th international symposium on HPC systems. pp 153–163
– reference: Rabenseifner R, Hager G, Jost G (2009) Hybrid MPI/OpenMP parallel programming on clusters of multi-core SMP nodes. In: Proceedings of the 17th Euromicro international conference on parallel, distributed and network-based processing
– reference: BoehmHJAdveSVYou do not know jack about shared variables or memory modelsCommun. ACM2012552485410.1145/2076450.2076465
– reference: TaftJRAchieving 60 GFLOP/s on the production CFD code OVERFLOW-MLPParallel Comput200127452153610.1016/S0167-8191(00)00072-70972.68083
– reference: Saad Y (2003) Iterative methods for sparse linear systems. Society for Industrial and Applied Mathematics
– reference: Shirley D (2000) Enhancing MPI applications through selective use of shared memory on SMPs. In: Proceedings of the 1st SIAM conference on CSE
– reference: Niu Q, Dinan J, Tirukkovalur S, Mitas L, Wagner L, Sadayappan P (2012) A global address space approach to automated data management for parallel quantum Monte Carlo applications. In: Proceedings 19th international conference on high performance computing. HiPC’12
– reference: MPI Forum (2012) MPI: a message-passing interface standard. version 3.0
– reference: Shen K, Tang H, Yang T (1999) Adaptive two-level thread management for fast MPI execution on shared memory machines. In: Proceedings of the ACM/IEEE conference on supercomputing
– reference: EslerKPKimJCeperleyDMPurwantoWWalterEJKrakauerHZhangSKentPRCHennigRGUmrigarCBajdichMKolorenJMitasLSrinivasanAQuantum monte carlo algorithms for electronic structure at the petascale; the endstation projectJ Phys20081251012057
– reference: Heroux MA, Brightwell R, Wolf MM (2011) Bi-modal MPI and MPI+threads computing on scalable multicore systems. IJHPCA (Submitted)
– reference: Esler KP Einspline libaray. Online: http://einspline.svn.sourceforge.net/
– reference: SmithLBullMDevelopment of mixed mode MPI/OpenMP applicationsSci Program200192,38398
– reference: Pérachec M, Carribault P, Jourdren H (2009) MPC-MPI: an MPI implementation reducing the overall memory consumption. In: Proceedings of EuroPVM/MPI 2009, Springer, pp 94–103
– reference: Wolf MM, Heroux MA, Boman EG (2010) Factors impacting performance of multithreaded sparse triangular solve. Technical report SAND2010-0331 presented at VECPAR’10
– reference: Los Alamos National Laboratory (2001) Unified parallel software users’ guide and reference manual. http://public.lanl.gov/ups/Doc_Directory/UserGuide/UserGuide.pdf
– reference: Sandia National Laboratories (2012) Mantevo project. http://www.mantevo.org
– reference: LeeEAThe problem with threadsComputer2006395334210.1109/MC.2006.180
– reference: TangHShenKYangTProgram transformation and runtime support for threaded MPI execution on shared memory machinesACM Trans Program Lang Syst20002267370010.1145/363911.363920
– reference: EslerKPKimJCeperleyDMShulenburgerLAccelerating quantum Monte Carlo simulations of real materials on GPU clustersComput Sci Eng2012141405110.1109/MCSE.2010.122
– reference: Boehm HJ (2005) Threads cannot be implemented as a library. In: Proceedings of the 2005 ACM SIGPLAN conference on programming language design and implementation. PLDI ’05, New York, NY, USA, ACM pp 261–268
– reference: BoehmHJAdveSVFoundations of the C++ concurrency memory modelSIGPLAN Not2008436687810.1145/1379022.1375591
– reference: WagnerLKBajdichMMitasLQwalk: a quantum monte carlo program for electronic structureJ Comput Phys200922893390340410.1016/j.jcp.2009.01.017
– ident: 324_CR11
  doi: 10.1007/978-3-642-33518-1_18
– ident: 324_CR13
  doi: 10.1145/1065010.1065042
– ident: 324_CR6
  doi: 10.1145/331532.331581
– volume: 14
  start-page: 40
  issue: 1
  year: 2012
  ident: 324_CR29
  publication-title: Comput Sci Eng
  doi: 10.1109/MCSE.2010.122
– volume: 43
  start-page: 68
  issue: 6
  year: 2008
  ident: 324_CR17
  publication-title: SIGPLAN Not
  doi: 10.1145/1379022.1375591
– volume: 39
  start-page: 33
  issue: 5
  year: 2006
  ident: 324_CR18
  publication-title: Computer
  doi: 10.1109/MC.2006.180
– ident: 324_CR19
– volume: 55
  start-page: 48
  issue: 2
  year: 2012
  ident: 324_CR14
  publication-title: Commun. ACM
  doi: 10.1145/2076450.2076465
– volume: 11
  start-page: 123
  issue: 1
  year: 1990
  ident: 324_CR22
  publication-title: SIAM J Sci Stat Comput
  doi: 10.1137/0911008
– volume: 228
  start-page: 3390
  issue: 9
  year: 2009
  ident: 324_CR25
  publication-title: J Comput Phys
  doi: 10.1016/j.jcp.2009.01.017
– ident: 324_CR9
– ident: 324_CR27
  doi: 10.1109/HiPC.2012.6507509
– volume: 9
  start-page: 83
  issue: 2,3
  year: 2001
  ident: 324_CR2
  publication-title: Sci Program
– volume: 27
  start-page: 521
  issue: 4
  year: 2001
  ident: 324_CR12
  publication-title: Parallel Comput
  doi: 10.1016/S0167-8191(00)00072-7
– ident: 324_CR5
– ident: 324_CR23
– ident: 324_CR1
– ident: 324_CR21
  doi: 10.1137/1.9780898718003
– ident: 324_CR26
– volume: 125
  start-page: 012057
  issue: 1
  year: 2008
  ident: 324_CR24
  publication-title: J Phys
– volume-title: Compilers: principles, techniques, and tools
  year: 1986
  ident: 324_CR15
– ident: 324_CR3
  doi: 10.1109/PDP.2009.43
– ident: 324_CR8
– volume: 12
  start-page: 1121
  issue: 12
  year: 2000
  ident: 324_CR28
  publication-title: Concurr Pract Exp
  doi: 10.1002/1096-9128(200010)12:12<1121::AID-CPE531>3.0.CO;2-N
– ident: 324_CR10
– ident: 324_CR4
– ident: 324_CR20
– ident: 324_CR16
  doi: 10.1145/1040305.1040336
– volume: 22
  start-page: 673
  year: 2000
  ident: 324_CR7
  publication-title: ACM Trans Program Lang Syst
  doi: 10.1145/363911.363920
SSID ssj0002389
Score 2.300325
Snippet Hybrid parallel programming with the message passing interface (MPI) for internode communication in conjunction with a shared-memory programming model to...
SourceID proquest
crossref
springer
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 1121
SubjectTerms Artificial Intelligence
Communication
Computer Appl. in Administrative Data Processing
Computer Communication Networks
Computer Science
Information Systems Applications (incl.Internet)
Interfaces
Laboratories
Parallel processing
Semantics
Software Engineering
Studies
SummonAdditionalLinks – databaseName: ABI/INFORM Global
  dbid: M0C
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3PT4MwGG10etCD06lxOk0PnlyItT-g9WLM4qKHLTvMZDcCLWQm7IeDmey_t-0KUxO9eIEDlACv_b5H-_geANdpjLgkmHuB1DBQRlKPM03kFJVGdM44YsqaTQT9Ph-NxMBNuOVOVlnGRBuo1UyaOfJbTfyFpr_4jj7M3z3jGmVWV52FxjbYMczGSPp6qFNFYp2O1vRXxxrK2ahc1US2iKhvRZfEQ5pTePh7XtqQzR_rozbtdOv_veFDcOAIJ3xc95AjsJVMG6BemjlAN7YbYL9XFXDNj8GwN3iBbai39zCCmnrD8cr82wXLGuSwmEFTNjzLkgw6kddEPwE0E7umHZxnyxzmYyNwhxOj512dgNfu07Dz7DkDBk8SJgov1QkUKRRrjigj5nMkY4GkCnCkd8KnkflASTQhSyMZKz-mBBOKeEoClEiFKTkFtelsmpwBqJBALFEkZYLTmChOCBXWuz2OE4FxE6Dy9YfSVSc3JhlZWNVVtoiFGrHQIBbqJjdVk_m6NMdfJ7dKlEI3SvNwA1ETtEucvxz-7WLnf1_sAuxh07Gs6KUFasVimVyCXflRvOWLK9tFPwGYaugr
  priority: 102
  providerName: ProQuest
Title MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory
URI https://link.springer.com/article/10.1007/s00607-013-0324-2
https://www.proquest.com/docview/1449255214
Volume 95
WOSCitedRecordID wos000326885100002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVPQU
  databaseName: ABI/INFORM Global
  customDbUrl:
  eissn: 1436-5057
  dateEnd: 20171231
  omitProxy: false
  ssIdentifier: ssj0002389
  issn: 0010-485X
  databaseCode: M0C
  dateStart: 20020201
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/abiglobal
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Advanced Technologies & Aerospace Database
  customDbUrl:
  eissn: 1436-5057
  dateEnd: 20171231
  omitProxy: false
  ssIdentifier: ssj0002389
  issn: 0010-485X
  databaseCode: P5Z
  dateStart: 20020201
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/hightechjournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Computer Science Database
  customDbUrl:
  eissn: 1436-5057
  dateEnd: 20171231
  omitProxy: false
  ssIdentifier: ssj0002389
  issn: 0010-485X
  databaseCode: K7-
  dateStart: 20020201
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/compscijour
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest ABI/INFORM Collection
  customDbUrl:
  eissn: 1436-5057
  dateEnd: 20171231
  omitProxy: false
  ssIdentifier: ssj0002389
  issn: 0010-485X
  databaseCode: 7WY
  dateStart: 20020201
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/abicomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl:
  eissn: 1436-5057
  dateEnd: 20171231
  omitProxy: false
  ssIdentifier: ssj0002389
  issn: 0010-485X
  databaseCode: BENPR
  dateStart: 20020201
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Research Library
  customDbUrl:
  eissn: 1436-5057
  dateEnd: 20171231
  omitProxy: false
  ssIdentifier: ssj0002389
  issn: 0010-485X
  databaseCode: M2O
  dateStart: 20020201
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/pqrl
  providerName: ProQuest
– providerCode: PRVAVX
  databaseName: SpringerLINK Contemporary 1997-Present
  customDbUrl:
  eissn: 1436-5057
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0002389
  issn: 0010-485X
  databaseCode: RSV
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22
  providerName: Springer Nature
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3dS8MwED90-qAPzk-cHyMPPimF0CRr6puOiSKbY06dvpQ2aZnQzbFuwv57L11bP1BBX1JKkxDukrtfmsvvAI6igErFbGk5CtXABYssKRDIaa5M0LmQVOg02YTTaslez21n97iTPNo9P5JMLXVx2c1Qh5gwSWZRRAEW2t0l9HbS5Gvo3N4X5hd90BzzooHhUvTyo8zvuvjsjN4R5pdD0dTXXJT_Ncp1WMugJTmbz4UNWAiHm1DO0zaQbBVvwmqzoGpNtqDbbF-RE4LlKfEJgmzSn5lbXCRnGyeTF2IIwuM4jEkWzjXAIRHzC9e0I6N4mpCkb0LZycBE7s624e6i0a1fWlmqBUsx4U6sCF0l1TRANKh8UZNUBS5V2rF9fLg17putSIjQK_JVoGsBZzbjVEbMoaHSNmc7UBq-DMNdIJq6VISaRcKVPGBaMsbdNEt7EISubVeA5jL3VMZDbtJhxF7BoJzK0EMZekaGHjY5LpqM5iQcv1U-yBXpZesxwQ2OIWFEqMIrcJIr7sPnnzrb-1PtfVixjebTaJcDKE3G0_AQltXr5DkZV2HReXiswtJ5o9Xu4Nu1Y2HZpHVT2jdYtsVTNZ3Mb6Hi49s
linkProvider Springer Nature
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1NbxMxEB2VFgk4UChFFFrwAS5UVl1_ZG0khFA_1ChN1EOQclt2ba-KtE1CNwXlT_EbGTvrlCLRWw9cdg9rW1r7eebZfp4BeFuVTFvBNc0sDoNUoqJaIZFz0gbRudJMuZhsIhsM9GhkzlbgV7oLE2SVySZGQ-0mNuyR7yHxN0h_-b78NP1OQ9aocLqaUmgsYNHz85-4ZGs-dg9xfN9xfnw0PDihbVYBaoUyM1qhV2COlUh8bKE6mtnSMOsyXuDLdGQRWLdHllEVtnSdUgouJNOVyJi3jkuB7d6DNSl0FuZVL6NLy4_ub0G30bZJrUbpFJXFoKWdKPIUlCGHofymH7wmt3-dx0Y3d7z-v3XQE3jcEmryeTEDnsKKH2_AekpWQVrbtQGP-ssAtc0zGPbPumSX4PMDKQguLcj5PNxdIynGOplNSAiLXte-Jq2I7QJ7jISN61CPTOurhjTnQcBPLoJeeb4JX-7kT5_D6ngy9i-AOGaY8k5UymhZCqeFkCbmpi9LbzjfApaGO7dt9PWQBKTOl3GjI0JyREgeEJJjlffLKtNF6JHbCm8nVOStFWrya0hswW7C1R-f_9XYy9sbewMPTob90_y0O-i9goc8gDoKfLZhdXZ55Xfgvv0x-9Zcvo7Tg8DXu4bbbz8PRas
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1Nb9QwEB2VglA5UCigFgrMAS5UUY0_NjYSQoiyYlW62kORVlxCYjsqUrq7NFvQ_jV-HWNvvAUkeuuBS3KIbSn288yz_TwD8KyumLaC6yy3NAxSiTrTioickzaIzpVmysVkE_lwqMdjM1qDn-kuTJBVJpsYDbWb2rBHvk_E3xD95S_lft3JIkYH_Tezb1nIIBVOWlM6jSVEDv3iBy3f2teDAxrr55z33x-_-5B1GQYyK5SZZzV5COZYRSTIlqqnma0Msy7nJb1MT5aBgXtiHHVpK9erpOBCMl2LnHnruBTU7jW4ntMaM8gJR-rzyguQK1xSb7JzUqtxOlFlMYBpLwo-RcaIz2T8T594QXT_OpuNLq-_-T931h243RFtfLucGXdhzU-2YDMlscDOpm3BraNV4Nr2HhwfjQa4h_R8hSXSkgNPFuFOG6bY6zifYgiX3jS-wU7cdkq9h2FDO9TDWXPeYnsShP14GnTMi_vw6Ur-9AGsT6YTvw3omGHKO1Ero2UlnBZCmpizvqq84XwHWBr6wnZR2UNykKZYxZOOaCkILUVAS0FVXqyqzJYhSS4rvJsQUnTWqS0u4LEDewljv33-V2MPL2_sKdwklBUfB8PDR7DBA76j7mcX1udn5_4x3LDf51_bsydxpiB8uWq0_QJRVE7P
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=MPI+%2B+MPI%3A+a+new+hybrid+approach+to+parallel+programming+with+MPI+plus+shared+memory&rft.jtitle=Computing&rft.au=Hoefler%2C+Torsten&rft.au=Dinan%2C+James&rft.au=Buntinas%2C+Darius&rft.au=Balaji%2C+Pavan&rft.date=2013-12-01&rft.pub=Springer+Vienna&rft.issn=0010-485X&rft.eissn=1436-5057&rft.volume=95&rft.issue=12&rft.spage=1121&rft.epage=1136&rft_id=info:doi/10.1007%2Fs00607-013-0324-2&rft.externalDocID=10_1007_s00607_013_0324_2
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0010-485X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0010-485X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0010-485X&client=summon