Minimizing Communication in All-Pairs Shortest Paths

We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both minimizing inter-processor communication as well as maximizing temporal data locality. The 2.5D APSP algorithm, which is based on the divide-and-con...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:2013 IEEE 27th International Symposium on Parallel and Distributed Processing S. 548 - 559
Hauptverfasser: Solomonik, Edgar, Buluç, Aydın, Demmel, James
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: IEEE 01.05.2013
Schlagworte:
ISBN:146736066X, 9781467360661
ISSN:1530-2075
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both minimizing inter-processor communication as well as maximizing temporal data locality. The 2.5D APSP algorithm, which is based on the divide-and-conquer paradigm, satisfies both of these requirements: it can utilize any extra available memory to perform asymptotically less communication, and it is rich in semiring matrix multiplications, which have high temporal locality. We start by introducing a block-cyclic 2D (minimal memory) APSP algorithm. With a careful choice of block-size, this algorithm achieves known communication lower-bounds for latency and bandwidth. We extend this 2D block-cyclic algorithm to a 2.5D algorithm, which can use c extra copies of data to reduce the bandwidth cost by a factor of c 1/2 , compared to its 2D counterpart. However, the 2.5D algorithm increases the latency cost by c 1/2 . We provide a tighter lower bound on latency, which dictates that the latency overhead is necessary to reduce bandwidth along the critical path of execution. Our implementation achieves impressive performance and scaling to 24,576 cores of a Cray XE6 supercomputer by utilizing well-tuned intra-node kernels within the distributed memory algorithm.
AbstractList We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both minimizing inter-processor communication as well as maximizing temporal data locality. The 2.5D APSP algorithm, which is based on the divide-and-conquer paradigm, satisfies both of these requirements: it can utilize any extra available memory to perform asymptotically less communication, and it is rich in semiring matrix multiplications, which have high temporal locality. We start by introducing a block-cyclic 2D (minimal memory) APSP algorithm. With a careful choice of block-size, this algorithm achieves known communication lower-bounds for latency and bandwidth. We extend this 2D block-cyclic algorithm to a 2.5D algorithm, which can use c extra copies of data to reduce the bandwidth cost by a factor of c 1/2 , compared to its 2D counterpart. However, the 2.5D algorithm increases the latency cost by c 1/2 . We provide a tighter lower bound on latency, which dictates that the latency overhead is necessary to reduce bandwidth along the critical path of execution. Our implementation achieves impressive performance and scaling to 24,576 cores of a Cray XE6 supercomputer by utilizing well-tuned intra-node kernels within the distributed memory algorithm.
Author Solomonik, Edgar
Buluç, Aydın
Demmel, James
Author_xml – sequence: 1
  givenname: Edgar
  surname: Solomonik
  fullname: Solomonik, Edgar
  email: solomon@eecs.berkeley.edu
  organization: Dept. of EECS, Univ. of California, Berkeley, Berkeley, CA, USA
– sequence: 2
  givenname: Aydın
  surname: Buluç
  fullname: Buluç, Aydın
  email: abuluc@lbl.gov
  organization: Comput. Res. Div., Lawrence Berkeley Nat. Lab., Berkeley, CA, USA
– sequence: 3
  givenname: James
  surname: Demmel
  fullname: Demmel, James
  email: demmel@eecs.berkeley.edu
  organization: Dept. of EECS, Univ. of California, Berkeley, Berkeley, CA, USA
BookMark eNotjrtOwzAUQI0oEm3JysLiH0i5N37FYxVelYqIVJDYKtvY1ChxUBwG-HoqwXR0lqOzILM0JE_IJcIKEfT1pr1pd6sKkB0dT0ihVQ1KasG1wuqULJBLxSRI-TojcxQMygqUOCdFzh8AgMAEcjUn_DGm2MefmN5pM_T9V4rOTHFINCa67rqyNXHMdHcYxsnnibZmOuQLchZMl33xzyV5ubt9bh7K7dP9pllvy4hKTKU3UPHKcfTc1NYIFMclLby0zFv02joeVDCgJbJacKsZBiecC07btyBrtiRXf93ovd9_jrE34_deCqlrjuwXMbJJlQ
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/IPDPS.2013.111
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9780769549712
0769549713
EndPage 559
ExternalDocumentID 6569841
Genre orig-research
GroupedDBID 29O
6IE
6IF
6IH
6IK
6IL
6IN
AAJGR
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
OCL
RIE
RIL
ID FETCH-LOGICAL-i175t-ea0242c41e4a8ba51573695e6b3eb1e9bc4f7fa09613854b931fc5ccfc9bdf683
IEDL.DBID RIE
ISBN 146736066X
9781467360661
ISICitedReferencesCount 25
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000332828000047&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1530-2075
IngestDate Wed Aug 27 04:15:52 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i175t-ea0242c41e4a8ba51573695e6b3eb1e9bc4f7fa09613854b931fc5ccfc9bdf683
PageCount 12
ParticipantIDs ieee_primary_6569841
PublicationCentury 2000
PublicationDate 2013-May
PublicationDateYYYYMMDD 2013-05-01
PublicationDate_xml – month: 05
  year: 2013
  text: 2013-May
PublicationDecade 2010
PublicationTitle 2013 IEEE 27th International Symposium on Parallel and Distributed Processing
PublicationTitleAbbrev ipdps
PublicationYear 2013
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0001035147
ssj0020349
Score 1.6891625
Snippet We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both...
SourceID ieee
SourceType Publisher
StartPage 548
SubjectTerms 2.5D algorithms
Algorithm design and analysis
all-pairs shortest paths
Bandwidth
communication-avoiding algorithms
Heuristic algorithms
Matrix decomposition
Particle separators
Partitioning algorithms
Title Minimizing Communication in All-Pairs Shortest Paths
URI https://ieeexplore.ieee.org/document/6569841
WOSCitedRecordID wos000332828000047&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELZKxcBUoEW8lYER06R-xSMCKhioIhWkblXsnEWkkqI2ZeDX40vSVpVY2OLIg504vovvexBy49cF6BQExVhKubGMaukk1RaYioQyoavNJtRoFE8mOmmR2w0XBgAq8Bnc4WVVy8_mdoVHZX2fe-gYWep7Ssmaq7U9T8GSGN_-bKHuSq2VGvqVoERF6kIQE1rnrbWemnbUqDlGoe6_JI_JGCFfDHeTHc-VKuQMO_8b7CHpbbl7QbKJSkekBcUx6azNG4LmW-4S_poX-Wf-4zsFOyyRIC-C-9mMJljpCcYfiMZdlkHiU8Vlj7wPn94enmljoUBznxeUFFKMwZZHwNPYpD558TPWAqRhfpMGbSx3yqXo-8JiwY1mkbPCWme1yZyM2QlpF_MCTkkAytnUZHHmGHAhBkbaTMtYQSgh9V3PSBcfwvSrVsmYNvM___v2BTkYVMYSCB28JO1ysYIrsm-_y3y5uK5e7S94-J3c
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3NT8IwFG8ImugJFYzf9uDRyUbbbT0alUAEsgRMuJG1e41LcBgYHvzr7duGhMSLt3Xpod26vre-3wchd3ZdgIxBOBhLHa40c6RvfEdqYIEnAuWa0mwiGI3C6VRGNXL_y4UBgAJ8Bg94WdTyk4Ve41FZ2-YeMkSW-h46Z3klW2t7ooJFMb793ULllVIt1bVrIRAFrQthTGiet1F7qtpepefoubLdj56jMYK-GO4nO64rRdDpNv433CPS2rL3aPQbl45JDbIT0tjYN9Dqa24SPkyz9CP9tp3oDk-Ephl9nM-dCGs9dPyOeNxVTiObLK5a5K37MnnqOZWJgpPazCB3IMYorLkHPA5VbNMXO2MpwFfMbtMgleYmMDE6v7BQcCWZZ7TQ2mipEuOH7JTUs0UGZ4RCYHSskjAxDLgQHeXrRPphAK4Pse16Tpr4EGafpU7GrJr_xd-3b8lBbzIczAb90eslOewUNhMIJLwi9Xy5hmuyr7_ydLW8KV7zD-XWoSM
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2013+IEEE+27th+International+Symposium+on+Parallel+and+Distributed+Processing&rft.atitle=Minimizing+Communication+in+All-Pairs+Shortest+Paths&rft.au=Solomonik%2C+Edgar&rft.au=Bulu%C3%A7%2C+Ayd%C4%B1n&rft.au=Demmel%2C+James&rft.date=2013-05-01&rft.pub=IEEE&rft.isbn=9781467360661&rft.issn=1530-2075&rft.spage=548&rft.epage=559&rft_id=info:doi/10.1109%2FIPDPS.2013.111&rft.externalDocID=6569841
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1530-2075&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1530-2075&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1530-2075&client=summon