BAASH: Lightweight, Efficient, and Reliable Blockchain-As-A-Service for HPC Systems

Distributed resiliency becomes paramount to alleviate the growing costs of data movement and I/Os while preserving the data accuracy in HPC systems. This paper proposes to adopt blockchain-like decentralized protocols to achieve such distributed resiliency. The key challenge for such an adoption lie...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:SC21: International Conference for High Performance Computing, Networking, Storage and Analysis s. 01 - 15
Hlavní autoři: Mamun, Abdullah Al, Yan, Feng, Zhao, Dongfang
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: ACM 14.11.2021
Témata:
ISSN:2167-4337
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract Distributed resiliency becomes paramount to alleviate the growing costs of data movement and I/Os while preserving the data accuracy in HPC systems. This paper proposes to adopt blockchain-like decentralized protocols to achieve such distributed resiliency. The key challenge for such an adoption lies in the mismatch between blockchain's targeting systems (e.g., shared-nothing, loosely-coupled, TCP/IP stack) and HPC's unique design on storage subsystems, resource allocation, and programming models. We present BAASH, Blockchain-As-A-Service for HPC, deployable in a plug-n-play fashion. BAASH bridges the HPC-blockchain gap with two key components: (i) Lightweight consensus protocols for the HPC's shared-storage architecture, (ii) A new fault-tolerant mechanism compensating for the MPI to guarantee the distributed resiliency. We have implemented a prototype system and evaluated it with more than two million transactions on a 500-core HPC cluster. Results show that the prototype of the proposed techniques significantly outperforms vanilla blockchain systems and exhibits strong reliability with MPI.
AbstractList Distributed resiliency becomes paramount to alleviate the growing costs of data movement and I/Os while preserving the data accuracy in HPC systems. This paper proposes to adopt blockchain-like decentralized protocols to achieve such distributed resiliency. The key challenge for such an adoption lies in the mismatch between blockchain's targeting systems (e.g., shared-nothing, loosely-coupled, TCP/IP stack) and HPC's unique design on storage subsystems, resource allocation, and programming models. We present BAASH, Blockchain-As-A-Service for HPC, deployable in a plug-n-play fashion. BAASH bridges the HPC-blockchain gap with two key components: (i) Lightweight consensus protocols for the HPC's shared-storage architecture, (ii) A new fault-tolerant mechanism compensating for the MPI to guarantee the distributed resiliency. We have implemented a prototype system and evaluated it with more than two million transactions on a 500-core HPC cluster. Results show that the prototype of the proposed techniques significantly outperforms vanilla blockchain systems and exhibits strong reliability with MPI.
Author Yan, Feng
Zhao, Dongfang
Mamun, Abdullah Al
Author_xml – sequence: 1
  givenname: Abdullah Al
  surname: Mamun
  fullname: Mamun, Abdullah Al
  email: aalmamun@nevada.unr.edu
  organization: University of Nevada, Reno,Reno,NV,USA
– sequence: 2
  givenname: Feng
  surname: Yan
  fullname: Yan, Feng
  email: fyan@unr.edu
  organization: University of Nevada, Reno,Reno,NV,USA
– sequence: 3
  givenname: Dongfang
  surname: Zhao
  fullname: Zhao, Dongfang
  email: dzhao@unr.edu
  organization: University of Nevada, Reno,Reno,NV,USA
BookMark eNotj01LxDAYhKMouK49e_CSH2DWvPnYJN66ZbVCQbF6XtL0rRvtdqUtyv57K3qZGRgYnjknJ92-Q0IugS8AlL6RSlsLZiGVWYLWRyRxxk4Fl1YpAcdkJmBpmJLSnJFkGN4558IakILPSLlK0zK_pUV8247f-KvXdN00MUTspui7mj5jG33VIl21-_ARtj52LB1Yykrsv2JA2ux7mj9ltDwMI-6GC3La-HbA5N_n5PVu_ZLlrHi8f8jSgnmhzMg8r3RjFFobpKlBc5QVtxM291UIFhxXzk-sDjRaIyoF9fQjoHONEr7yck6u_nYjIm4--7jz_WHjHHBQQv4AoqxPSw
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1145/3458817.3476155
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9781450384421
1450384420
EISSN 2167-4337
EndPage 15
ExternalDocumentID 9910142
Genre orig-research
GrantInformation_xml – fundername: National Science Foundation
  grantid: CCF-1756013
  funderid: 10.13039/100000001
GroupedDBID 6IE
6IF
6IH
6IK
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
OCL
RIE
RIL
ID FETCH-LOGICAL-a247t-a0b5f74e88c37d150e3b085030abcc819049a000915e872b41d167ce99f42aba3
IEDL.DBID RIE
ISICitedReferencesCount 1
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000946520100102&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 02:18:35 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a247t-a0b5f74e88c37d150e3b085030abcc819049a000915e872b41d167ce99f42aba3
PageCount 15
ParticipantIDs ieee_primary_9910142
PublicationCentury 2000
PublicationDate 2021-Nov.-14
PublicationDateYYYYMMDD 2021-11-14
PublicationDate_xml – month: 11
  year: 2021
  text: 2021-Nov.-14
  day: 14
PublicationDecade 2020
PublicationTitle SC21: International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC
PublicationYear 2021
Publisher ACM
Publisher_xml – name: ACM
SSID ssj0002871320
ssj0003204180
Score 1.7836652
Snippet Distributed resiliency becomes paramount to alleviate the growing costs of data movement and I/Os while preserving the data accuracy in HPC systems. This paper...
SourceID ieee
SourceType Publisher
StartPage 01
SubjectTerms Blockchain
Costs
Fault tolerance
Fault tolerant systems
High performance computing
HPC
MPI
Programming
Prototypes
reproducibility
resilience
TCPIP
Title BAASH: Lightweight, Efficient, and Reliable Blockchain-As-A-Service for HPC Systems
URI https://ieeexplore.ieee.org/document/9910142
WOSCitedRecordID wos000946520100102&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ3PT8IwFMcbJB48oYJR_JEePFLY1m5tvQ0C4UAICZpwI29dG4nJMPzQf9--MTEmXrx1OzRLl_V99vq-30fIo9MyciFwphRIJqTVDGLsmioTk3iAVTmUPrMTOZ2qxULPaqRz1MJYa8viM9vFYXmWn6_NHlNlPc8ynuj9hnvipzpotY75FCR_XqEPXvuxCFVQufmEIu5xFGWGssuFxMO4X-1UymgyavzvOc5J60eWR2fHgHNBara4JI3vvgy0-kybZN5P0_n4iU7wx_uzzH126LD0ivDzdigUOcVSZFRN0b6PZm_mFVYFS7csZdXmQT3M0vFsQCtL8xZ5GQ2fB2NWNU9gEAm5YxBksZPCKmW4zD32WZ6hPR0PIDMGOUBoQMIKY6tklIkwDxNprNZORJABvyL1Yl3Ya0IjUBqcMplLLAKgDpxyuYtiEDlYoW5IE9do-X7wx1hWy9P--_YtOYuwLgRL6cQdqe82e3tPTs3HbrXdPJQv9QsfVJ86
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ3PS8MwFMfDmIKepm7ib3PwuGz9kTaJt25sVKxjsAm7jTRNcAid7If---bVOBG8eEt7KCUleZ--vO_3IXRnBAuML0PCuWSEMi2IjKBrKotVbAGWF7Lymc3YaMRnMzGuofZOC6O1rorPdAeG1Vl-sVRbSJV1LctYorcb7h50znJqrV1GBdg_dPAD13ZMfe45Px-fRt0QZJk-64SUwXHcr4YqVTwZNv73Jkeo9SPMw-NdyDlGNV2eoMZ3ZwbsFmoTTXpJMknvcQa_3h9V9rONB5VbhH1uG8uywFCMDLop3LPx7FW9yEVJkjVJiNs-sMVZnI772Jmat9DzcDDtp8S1TyAyoGxDpJdHhlHNuQpZYcFPhzkY1IWezJUCEqBCAmP5keYsyKlf-DFTWghDA5nL8BTVy2WpzxAOJBfScJWbWAMCCs9wU5ggkrSQmvJz1IQ5mr99OWTM3fRc_H37Fh2k06dsnj2MHi_RYQBVIlBYR69QfbPa6mu0r943i_XqpvrAn54gooM
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC21%3A+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=BAASH%3A+Lightweight%2C+Efficient%2C+and+Reliable+Blockchain-As-A-Service+for+HPC+Systems&rft.au=Mamun%2C+Abdullah+Al&rft.au=Yan%2C+Feng&rft.au=Zhao%2C+Dongfang&rft.date=2021-11-14&rft.pub=ACM&rft.eissn=2167-4337&rft.spage=01&rft.epage=15&rft_id=info:doi/10.1145%2F3458817.3476155&rft.externalDocID=9910142