Embracing Irregular Parallelism in HPC with YGM

YGM is a general-purpose asynchronous distributed computing library for C++/MPI, designed to handle the irregular data access patterns and small messages of graph algorithms and data science applications. It uses data serialization to give an easily usable active message interface and message aggreg...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:International Conference for High Performance Computing, Networking, Storage and Analysis (Online) s. 1 - 13
Hlavní autoři: Steil, Trevor, Reza, Tahsin, Priest, Benjamin W., Pearce, Roger
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: ACM 11.11.2023
Témata:
ISSN:2167-4337
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract YGM is a general-purpose asynchronous distributed computing library for C++/MPI, designed to handle the irregular data access patterns and small messages of graph algorithms and data science applications. It uses data serialization to give an easily usable active message interface and message aggregation to maximize application throughput. Our design philosophy makes a tradeoff that increases network bandwidth utilization at the cost of added latency. We provide a suite of benchmarks showcasing YGM's performance. Compared to similar distributed active message benchmark implementations that do not provide message buffering, we are able to achieve over 10x throughput on thousands of cores at a latency cost that can be as small as 2x or as large as 100x, depending on the machine being used. For applications that can be written to be latency-tolerant, this represents a significant potential performance improvement through using YGM.
AbstractList YGM is a general-purpose asynchronous distributed computing library for C++/MPI, designed to handle the irregular data access patterns and small messages of graph algorithms and data science applications. It uses data serialization to give an easily usable active message interface and message aggregation to maximize application throughput. Our design philosophy makes a tradeoff that increases network bandwidth utilization at the cost of added latency. We provide a suite of benchmarks showcasing YGM's performance. Compared to similar distributed active message benchmark implementations that do not provide message buffering, we are able to achieve over 10x throughput on thousands of cores at a latency cost that can be as small as 2x or as large as 100x, depending on the machine being used. For applications that can be written to be latency-tolerant, this represents a significant potential performance improvement through using YGM.
Author Reza, Tahsin
Pearce, Roger
Steil, Trevor
Priest, Benjamin W.
Author_xml – sequence: 1
  givenname: Trevor
  surname: Steil
  fullname: Steil, Trevor
  email: steil1@llnl.gov
  organization: Center for Applied Scientific Computing (CASC),Lawrence Livermore National Laboratory (LLNL),Livermore,CA,USA
– sequence: 2
  givenname: Tahsin
  surname: Reza
  fullname: Reza, Tahsin
  email: tahsin.reza@uwaterloo.ca
  organization: University of Waterloo,Department of Electrical and Computer Engineering,Waterloo,Ontario,Canada
– sequence: 3
  givenname: Benjamin W.
  surname: Priest
  fullname: Priest, Benjamin W.
  email: priest2@llnl.gov
  organization: Center for Applied Scientific Computing (CASC),Lawrence Livermore National Laboratory (LLNL),Livermore,CA,USA
– sequence: 4
  givenname: Roger
  surname: Pearce
  fullname: Pearce, Roger
  email: rpearce@llnl.gov
  organization: Center for Applied Scientific Computing (CASC),Lawrence Livermore National Laboratory (LLNL),Livermore,CA,USA
BookMark eNotjD1PwzAUAA0qEqVkZmHwH0j7XvzijxFFpa1URIcyMFUviV2MkoCcIsS_pxLohtMtdyMmw8fghbhDmCNSuVClRWNprjQYBHUhMmecJQADCK64FNMCtclJKXMtsnF8BwBVAKGFqVgs-zpxE4ej3KTkj18dJ7njxF3nuzj2Mg5yvavkdzy9ydfV0624CtyNPvv3TLw8LvfVOt8-rzbVwzZnZc0pN4QlOF2jYmggNAGtBschtN4VLXmrtGuZnaO6DKam2rdETXHmXIFaNRP3f9_ovT98pthz-jkgkC3BavULvL9E3w
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1145/3581784.3607103
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library (IEL) (UW System Shared)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9798400701092
EISSN 2167-4337
EndPage 13
ExternalDocumentID 10485086
Genre orig-research
GrantInformation_xml – fundername: U.S. Department of Energy
  grantid: DE-AC52-07NA27344,LLNL-CONF-853600
  funderid: 10.13039/100000015
GroupedDBID 6IE
6IF
6IH
6IK
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
OCL
RIE
RIL
ID FETCH-LOGICAL-a387t-7415096b13a0c0fcf18609affde92d4e8369daa994b5f7b4bed44c2c2cf7bf4d3
IEDL.DBID RIE
ISICitedReferencesCount 3
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001461755900038&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 02:09:35 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a387t-7415096b13a0c0fcf18609affde92d4e8369daa994b5f7b4bed44c2c2cf7bf4d3
OpenAccessLink https://dl.acm.org/doi/pdf/10.1145/3581784.3607103
PageCount 13
ParticipantIDs ieee_primary_10485086
PublicationCentury 2000
PublicationDate 2023-Nov.-11
PublicationDateYYYYMMDD 2023-11-11
PublicationDate_xml – month: 11
  year: 2023
  text: 2023-Nov.-11
  day: 11
PublicationDecade 2020
PublicationTitle International Conference for High Performance Computing, Networking, Storage and Analysis (Online)
PublicationTitleAbbrev SC
PublicationYear 2023
Publisher ACM
Publisher_xml – name: ACM
SSID ssj0003204180
ssib053141430
Score 1.8941408
Snippet YGM is a general-purpose asynchronous distributed computing library for C++/MPI, designed to handle the irregular data access patterns and small messages of...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms asynchronous communication
Benchmark testing
Costs
distributed computing
Programming
Random access memory
Scalability
Social networking (online)
Throughput
Title Embracing Irregular Parallelism in HPC with YGM
URI https://ieeexplore.ieee.org/document/10485086
WOSCitedRecordID wos001461755900038&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV05T8MwFLZoxcBUjiJueWBNa8evsT1XlDJQZQCpTJWPF6kSTVHa8vux0wRYGJAX2_Lg6_n5-r6PkHsunSgMV4m1XCfghE2sl5BAYUTqUHrBXC02IWczNZ_rvAGr11gYRKw_n-EgRuu3fL92u3hVFiwcVNhQZB3SkTLbg7XayRPmEvCWSjwuwyJlwBVr6Hw4jIaR6ksqGIjIqRZVsn7pqdTuZNL7Z0WOSf8HmEfzb5dzQg6wPCW9VpmBNoZ6RoYPq3AKdqEMfaqqWm6-ormponDK-3KzosuSTvMxjbew9O3xuU9eJw8v42nSSCMkRii5TeI-IBw-LBeGOVa4gquMaVMUHnXqAZXItDdGa7CjQlqw6AFcGkJIFeDFOemW6xIvCJVWI3OAjnMDMGImRRxZp6y0xgmPl6QfO2DxsWe_WLRtv_oj_5ocRUn2iNfj_IZ0t9UOb8mh-9wuN9VdPWZf0HaWyw
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1JTwIxGG0UTfSEC8bdHrwOtNNvaOdMUIhA5oAJnkjXhEQGM4C_33aYUS8eTC9t00O3r1-39x5Cj5Rr5iQVkVI0jUAzFSnDIQInWawtN4zoUmyCTyZiNkuzCqxeYmGsteXnM9sO0fIt36z0NlyVeQsH4TcU3X10kADEZAfXqqePn01AazLxsBCzmAAVpCL0oZB0AtkXF9BmgVUt6GT9UlQpHcpT859VOUGtH2gezr6dzinas_kZatbaDLgy1XPU6S_9OVj7MnhYFKXgfIEzWQTplPfFeokXOR5kPRzuYfHb87iFXp_6094gqsQRIskE30RhJ-CPH4oySTRx2lHRJal0ztg0NmAF66ZGyjQFlTiuQFkDoGMffMqBYReoka9ye4kwV6klGqymVAIkRMbWJkoLxZXUzNgr1AodMP_Y8V_M67Zf_5H_gI4G0_FoPhpOXm7QcRBoD-g9Sm9RY1Ns7R061J-bxbq4L8fvC9ySmhI
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis+%28Online%29&rft.atitle=Embracing+Irregular+Parallelism+in+HPC+with+YGM&rft.au=Steil%2C+Trevor&rft.au=Reza%2C+Tahsin&rft.au=Priest%2C+Benjamin+W.&rft.au=Pearce%2C+Roger&rft.date=2023-11-11&rft.pub=ACM&rft.eissn=2167-4337&rft.spage=1&rft.epage=13&rft_id=info:doi/10.1145%2F3581784.3607103&rft.externalDocID=10485086