Minimizing Communication in All-Pairs Shortest Paths
We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both minimizing inter-processor communication as well as maximizing temporal data locality. The 2.5D APSP algorithm, which is based on the divide-and-con...
Gespeichert in:
| Veröffentlicht in: | 2013 IEEE 27th International Symposium on Parallel and Distributed Processing S. 548 - 559 |
|---|---|
| Hauptverfasser: | , , |
| Format: | Tagungsbericht |
| Sprache: | Englisch |
| Veröffentlicht: |
IEEE
01.05.2013
|
| Schlagworte: | |
| ISBN: | 146736066X, 9781467360661 |
| ISSN: | 1530-2075 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Abstract | We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both minimizing inter-processor communication as well as maximizing temporal data locality. The 2.5D APSP algorithm, which is based on the divide-and-conquer paradigm, satisfies both of these requirements: it can utilize any extra available memory to perform asymptotically less communication, and it is rich in semiring matrix multiplications, which have high temporal locality. We start by introducing a block-cyclic 2D (minimal memory) APSP algorithm. With a careful choice of block-size, this algorithm achieves known communication lower-bounds for latency and bandwidth. We extend this 2D block-cyclic algorithm to a 2.5D algorithm, which can use c extra copies of data to reduce the bandwidth cost by a factor of c 1/2 , compared to its 2D counterpart. However, the 2.5D algorithm increases the latency cost by c 1/2 . We provide a tighter lower bound on latency, which dictates that the latency overhead is necessary to reduce bandwidth along the critical path of execution. Our implementation achieves impressive performance and scaling to 24,576 cores of a Cray XE6 supercomputer by utilizing well-tuned intra-node kernels within the distributed memory algorithm. |
|---|---|
| AbstractList | We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both minimizing inter-processor communication as well as maximizing temporal data locality. The 2.5D APSP algorithm, which is based on the divide-and-conquer paradigm, satisfies both of these requirements: it can utilize any extra available memory to perform asymptotically less communication, and it is rich in semiring matrix multiplications, which have high temporal locality. We start by introducing a block-cyclic 2D (minimal memory) APSP algorithm. With a careful choice of block-size, this algorithm achieves known communication lower-bounds for latency and bandwidth. We extend this 2D block-cyclic algorithm to a 2.5D algorithm, which can use c extra copies of data to reduce the bandwidth cost by a factor of c 1/2 , compared to its 2D counterpart. However, the 2.5D algorithm increases the latency cost by c 1/2 . We provide a tighter lower bound on latency, which dictates that the latency overhead is necessary to reduce bandwidth along the critical path of execution. Our implementation achieves impressive performance and scaling to 24,576 cores of a Cray XE6 supercomputer by utilizing well-tuned intra-node kernels within the distributed memory algorithm. |
| Author | Solomonik, Edgar Buluç, Aydın Demmel, James |
| Author_xml | – sequence: 1 givenname: Edgar surname: Solomonik fullname: Solomonik, Edgar email: solomon@eecs.berkeley.edu organization: Dept. of EECS, Univ. of California, Berkeley, Berkeley, CA, USA – sequence: 2 givenname: Aydın surname: Buluç fullname: Buluç, Aydın email: abuluc@lbl.gov organization: Comput. Res. Div., Lawrence Berkeley Nat. Lab., Berkeley, CA, USA – sequence: 3 givenname: James surname: Demmel fullname: Demmel, James email: demmel@eecs.berkeley.edu organization: Dept. of EECS, Univ. of California, Berkeley, Berkeley, CA, USA |
| BookMark | eNotjrtOwzAUQI0oEm3JysLiH0i5N37FYxVelYqIVJDYKtvY1ChxUBwG-HoqwXR0lqOzILM0JE_IJcIKEfT1pr1pd6sKkB0dT0ihVQ1KasG1wuqULJBLxSRI-TojcxQMygqUOCdFzh8AgMAEcjUn_DGm2MefmN5pM_T9V4rOTHFINCa67rqyNXHMdHcYxsnnibZmOuQLchZMl33xzyV5ubt9bh7K7dP9pllvy4hKTKU3UPHKcfTc1NYIFMclLby0zFv02joeVDCgJbJacKsZBiecC07btyBrtiRXf93ovd9_jrE34_deCqlrjuwXMbJJlQ |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/IPDPS.2013.111 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISBN | 9780769549712 0769549713 |
| EndPage | 559 |
| ExternalDocumentID | 6569841 |
| Genre | orig-research |
| GroupedDBID | 29O 6IE 6IF 6IH 6IK 6IL 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI OCL RIE RIL |
| ID | FETCH-LOGICAL-i175t-ea0242c41e4a8ba51573695e6b3eb1e9bc4f7fa09613854b931fc5ccfc9bdf683 |
| IEDL.DBID | RIE |
| ISBN | 146736066X 9781467360661 |
| ISICitedReferencesCount | 25 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000332828000047&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1530-2075 |
| IngestDate | Wed Aug 27 04:15:52 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i175t-ea0242c41e4a8ba51573695e6b3eb1e9bc4f7fa09613854b931fc5ccfc9bdf683 |
| PageCount | 12 |
| ParticipantIDs | ieee_primary_6569841 |
| PublicationCentury | 2000 |
| PublicationDate | 2013-May |
| PublicationDateYYYYMMDD | 2013-05-01 |
| PublicationDate_xml | – month: 05 year: 2013 text: 2013-May |
| PublicationDecade | 2010 |
| PublicationTitle | 2013 IEEE 27th International Symposium on Parallel and Distributed Processing |
| PublicationTitleAbbrev | ipdps |
| PublicationYear | 2013 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0001035147 ssj0020349 |
| Score | 1.6891625 |
| Snippet | We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 548 |
| SubjectTerms | 2.5D algorithms Algorithm design and analysis all-pairs shortest paths Bandwidth communication-avoiding algorithms Heuristic algorithms Matrix decomposition Particle separators Partitioning algorithms |
| Title | Minimizing Communication in All-Pairs Shortest Paths |
| URI | https://ieeexplore.ieee.org/document/6569841 |
| WOSCitedRecordID | wos000332828000047&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELZKxcBUoEW8lYER06R-xSMCKhioIhWkblXsnEWkkqI2ZeDX40vSVpVY2OLIg504vovvexBy49cF6BQExVhKubGMaukk1RaYioQyoavNJtRoFE8mOmmR2w0XBgAq8Bnc4WVVy8_mdoVHZX2fe-gYWep7Ssmaq7U9T8GSGN_-bKHuSq2VGvqVoERF6kIQE1rnrbWemnbUqDlGoe6_JI_JGCFfDHeTHc-VKuQMO_8b7CHpbbl7QbKJSkekBcUx6azNG4LmW-4S_poX-Wf-4zsFOyyRIC-C-9mMJljpCcYfiMZdlkHiU8Vlj7wPn94enmljoUBznxeUFFKMwZZHwNPYpD558TPWAqRhfpMGbSx3yqXo-8JiwY1mkbPCWme1yZyM2QlpF_MCTkkAytnUZHHmGHAhBkbaTMtYQSgh9V3PSBcfwvSrVsmYNvM___v2BTkYVMYSCB28JO1ysYIrsm-_y3y5uK5e7S94-J3c |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3NT8IwFG8ImugJFYzf9uDRyUbbbT0alUAEsgRMuJG1e41LcBgYHvzr7duGhMSLt3Xpod26vre-3wchd3ZdgIxBOBhLHa40c6RvfEdqYIEnAuWa0mwiGI3C6VRGNXL_y4UBgAJ8Bg94WdTyk4Ve41FZ2-YeMkSW-h46Z3klW2t7ooJFMb793ULllVIt1bVrIRAFrQthTGiet1F7qtpepefoubLdj56jMYK-GO4nO64rRdDpNv433CPS2rL3aPQbl45JDbIT0tjYN9Dqa24SPkyz9CP9tp3oDk-Ephl9nM-dCGs9dPyOeNxVTiObLK5a5K37MnnqOZWJgpPazCB3IMYorLkHPA5VbNMXO2MpwFfMbtMgleYmMDE6v7BQcCWZZ7TQ2mipEuOH7JTUs0UGZ4RCYHSskjAxDLgQHeXrRPphAK4Pse16Tpr4EGafpU7GrJr_xd-3b8lBbzIczAb90eslOewUNhMIJLwi9Xy5hmuyr7_ydLW8KV7zD-XWoSM |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2013+IEEE+27th+International+Symposium+on+Parallel+and+Distributed+Processing&rft.atitle=Minimizing+Communication+in+All-Pairs+Shortest+Paths&rft.au=Solomonik%2C+Edgar&rft.au=Bulu%C3%A7%2C+Ayd%C4%B1n&rft.au=Demmel%2C+James&rft.date=2013-05-01&rft.pub=IEEE&rft.isbn=9781467360661&rft.issn=1530-2075&rft.spage=548&rft.epage=559&rft_id=info:doi/10.1109%2FIPDPS.2013.111&rft.externalDocID=6569841 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1530-2075&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1530-2075&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1530-2075&client=summon |

