An efficient parallel algorithm for $O(N^2)$ direct summation method and its variations on distributed-memory parallel machines

Published in: New Astronomy, Vol. 7, No. 7, pp. 373-384
Main author: Makino, Junichiro
Format: Journal Article
Language: English
Publication details: Elsevier B.V., 2002-10-01
ISSN: 1384-1076, 1384-1092
Abstract: We present a novel, highly efficient algorithm for parallelizing the $O(N^2)$ direct summation method for N-body problems with individual timesteps on distributed-memory parallel machines such as Beowulf clusters. Previously known algorithms, in which all processors hold complete copies of the N-body system, have the serious problem that the communication-to-computation ratio increases with the number of processors, since the communication cost is independent of the number of processors. In the new algorithm, $p$ processors are organized as a $\sqrt{p} \times \sqrt{p}$ two-dimensional array. Each processor holds $N/\sqrt{p}$ particles, but the data are distributed in such a way that the complete system is present in any row or column of $\sqrt{p}$ processors. In this algorithm, the communication cost scales as $N/\sqrt{p}$, while the calculation cost scales as $N^2/p$. Thus, we can use a much larger number of processors without losing efficiency than was practical with previously known algorithms.
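
The data layout described in the abstract can be illustrated with a small serial sketch. This is not the author's code: it only emulates a square processor grid of side r (with r × r = p) inside one Python process, and it omits individual timesteps entirely. The function names (`partial_acc`, `split`, `grid_acc`, `comm_to_calc_ratio`), the particle tuple layout, the softening parameter `eps`, and the choice G = 1 are all assumptions made for illustration. The cost helper uses the scalings quoted in the abstract with every constant factor set to 1.

```python
import math
import random


def partial_acc(targets, sources, eps=1e-3):
    """Softened accelerations (G = 1) on each target from all sources.

    Particles are (mass, x, y, z) tuples; eps is an assumed softening length.
    """
    out = []
    for _, xi, yi, zi in targets:
        ax = ay = az = 0.0
        for mj, xj, yj, zj in sources:
            dx, dy, dz = xj - xi, yj - yi, zj - zi
            inv_r3 = (dx * dx + dy * dy + dz * dz + eps * eps) ** -1.5
            ax += mj * dx * inv_r3
            ay += mj * dy * inv_r3
            az += mj * dz * inv_r3
        out.append((ax, ay, az))
    return out


def split(particles, r):
    """Split the particle list into r contiguous blocks of nearly equal size."""
    n = len(particles)
    bounds = [k * n // r for k in range(r + 1)]
    return [particles[bounds[k]:bounds[k + 1]] for k in range(r)]


def grid_acc(particles, r, eps=1e-3):
    """Emulate an r x r processor grid serially.

    Hypothetical "processor" (i, j) holds block i as targets and block j as
    sources, so every grid row and every grid column sees the whole system.
    """
    blocks = split(particles, r)
    result = []
    for i in range(r):
        # Partial accelerations from the r processors in grid row i.
        partials = [partial_acc(blocks[i], blocks[j], eps) for j in range(r)]
        # Stand-in for a reduction along row i: sum the partial results.
        for per_target in zip(*partials):
            result.append(tuple(sum(c) for c in zip(*per_target)))
    return result


def comm_to_calc_ratio(n, p, scheme):
    """Communication/calculation ratio per processor, constants set to 1.

    "copy": every processor holds all N particles, communication ~ N.
    "grid": two-dimensional layout, communication ~ N / sqrt(p).
    """
    calc = n * n / p
    comm = n if scheme == "copy" else n / math.sqrt(p)
    return comm / calc


if __name__ == "__main__":
    random.seed(1)
    system = [(1.0 / 64,) + tuple(random.uniform(-1, 1) for _ in range(3))
              for _ in range(64)]
    acc = grid_acc(system, r=4)  # 16 emulated processors
    print(len(acc), "accelerations computed")
    for p in (16, 256, 4096):
        print(p, comm_to_calc_ratio(10**5, p, "copy"),
              comm_to_calc_ratio(10**5, p, "grid"))
```

In this toy model the ratio grows in proportion to p for the full-copy layout but only in proportion to the square root of p for the grid layout, which is the scaling argument the abstract makes. A real implementation would run the inner loop concurrently and replace the row-wise sum with a message-passing reduction.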
Author affiliation: Department of Astronomy, School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
Author email: makino@astron.s.u-tokyo.ac.jp
Copyright: 2002 Elsevier Science B.V.
DOI: 10.1016/S1384-1076(02)00143-4
Discipline: Astronomy & Astrophysics
Peer reviewed: yes
Keywords: 98.10.+z; 02.60.Cb; 95.10.Ce; Celestial mechanics, stellar dynamics; Methods: numerical
License: https://www.elsevier.com/tdm/userlicense/1.0
Page count: 12