Accelerating XOR-Based Erasure Coding using Program Optimization Techniques

Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a significant performance gap between the state-of-the-art XOR-based EC approach (~4.9 GB/s coding through-put) and Intel's high-performance E...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SC21: International Conference for High Performance Computing, Networking, Storage and Analysis S. 1 - 15
1. Verfasser: Uezato, Yuya
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: ACM 14.11.2021
Schlagworte:
ISSN:2167-4337
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a significant performance gap between the state-of-the-art XOR-based EC approach (~4.9 GB/s coding through-put) and Intel's high-performance EC library based on another approach (~6.7 GB/s). We propose a novel approach based on our observation that XOR-based EC virtually generates programs of a Domain Specific Language for XORing byte arrays. We formalize such programs as straight-line programs (SLPs) of compiler construction and optimize SLPs using various program optimization techniques. Our optimization flow is three-fold: 1) reducing the number of XORs using grammar compression algorithms; 2) reducing memory accesses using deforestation, a functional program optimization method; and 3) reducing cache misses using the (red-blue) pebble game of program analysis. We provide an experimentallibrary, which outperforms Intel's library with an \sim 8.92 GB/s throughput.
AbstractList Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a significant performance gap between the state-of-the-art XOR-based EC approach (~4.9 GB/s coding through-put) and Intel's high-performance EC library based on another approach (~6.7 GB/s). We propose a novel approach based on our observation that XOR-based EC virtually generates programs of a Domain Specific Language for XORing byte arrays. We formalize such programs as straight-line programs (SLPs) of compiler construction and optimize SLPs using various program optimization techniques. Our optimization flow is three-fold: 1) reducing the number of XORs using grammar compression algorithms; 2) reducing memory accesses using deforestation, a functional program optimization method; and 3) reducing cache misses using the (red-blue) pebble game of program analysis. We provide an experimentallibrary, which outperforms Intel's library with an \sim 8.92 GB/s throughput.
Author Uezato, Yuya
Author_xml – sequence: 1
  givenname: Yuya
  surname: Uezato
  fullname: Uezato, Yuya
  email: yuuya_uezato@dwango.co.jp
  organization: Dwango, Co., Ltd.,Japan
BookMark eNotj8tOwzAURA0CiVKyZsEmP5Bi-_q5LFF5iEpBqEjsKse5KUFNUuxkAV9PKtjMLI7OSHNJzrq-Q0KuGV0wJuQtCGkM0wsQWnEqTkhitZkABSMEZ6dkxpnSmQDQFySJ8ZNSyo1mwOmMPC-9xz0GNzTdLn0vXrM7F7FKV8HFMWCa99URjPGYL6HfBdemxWFo2uZncvou3aD_6JqvEeMVOa_dPmLy33Pydr_a5I_Zunh4ypfrzHGhh8xLwbgEWQO3FrXk3CHltSqdV1hahVBqxqcn4MB4rY0XzFdGWQWltw5gTm7-dhtE3B5C07rwvbWWUSoBfgEiXE5w
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1145/3458817.3476204
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9781450384421
1450384420
EISSN 2167-4337
EndPage 15
ExternalDocumentID 9910053
Genre orig-research
GroupedDBID 6IE
6IF
6IH
6IK
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
OCL
RIE
RIL
ID FETCH-LOGICAL-a247t-c5412535f3299e7522ae02f6bac6eb96e3b7122043a38c778c41cd86963bc9a33
IEDL.DBID RIE
ISICitedReferencesCount 13
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000946520100014&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 02:18:42 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a247t-c5412535f3299e7522ae02f6bac6eb96e3b7122043a38c778c41cd86963bc9a33
PageCount 15
ParticipantIDs ieee_primary_9910053
PublicationCentury 2000
PublicationDate 2021-Nov.-14
PublicationDateYYYYMMDD 2021-11-14
PublicationDate_xml – month: 11
  year: 2021
  text: 2021-Nov.-14
  day: 14
PublicationDecade 2020
PublicationTitle SC21: International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC
PublicationYear 2021
Publisher ACM
Publisher_xml – name: ACM
SSID ssj0002871320
ssj0003204180
Score 1.8683007
Snippet Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Encoding
Games
High performance computing
Optimization methods
Program processors
Redundancy
Throughput
Title Accelerating XOR-Based Erasure Coding using Program Optimization Techniques
URI https://ieeexplore.ieee.org/document/9910053
WOSCitedRecordID wos000946520100014&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PT8IwFH4B4sETKhh_pwePFtjatdtRCcTEBIjBhBtpu5Z4cBh--Pf73hgYEy_euqVbmnbd973X970HcJ9leWKtsFxINFFkMAlHUp9z1XN5CGj5KBvKYhN6NEpns2xSg4eDFsZ7Xwaf-Q41y7P8fOm25CrrIpehj6YOda3VTqt18KcQ8xcV9aFrbMso7VXZfCKZdAWJMiPdwVFREvZf5VRKNBk2_zeOE2j_yPLY5AA4p1DzxRk093UZWLVNW_Dy6ByiCa1tsWCz8St_QqzK2WBV-gNZf0nPMwp5X9ALKUCLjfHf8VGJMtl0n9l13Ya34WDaf-ZV0QRuYqk33CUSOYtIgkCg8RrplfG9OChrnPI2U15YHcWkiDUidVqnTkYuTxVuROsyI8Q5NIpl4S-AYWfkWyZHE89L64KJI--TOCCEpXSeegktmpv55y4vxryalqu_b1_DcUzxIBRCJ2-gsVlt_S0cua_N-3p1Vy7mN6ronjo
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT8JAEJ4gmugJFYxv9-DRQtvdvo5KIBgQiMGEG9nd7hIPtoaHv9-ZUjAmXrxtm22z2e32-2Z2vhmA-yRJA6W4crhAE0VYGThI6lMndHVqLVo-obJFsYloOIyn02RcgYedFsYYUwSfmSY1i7P8NNdrcpW1kMvQR7MH-4EQvrtRa-08KsT9eUl-6BrbwovdMp-PJ4IWJ1mmFzVxXJSG_VdBlQJPurX_jeQYGj_CPDbeQc4JVEx2CrVtZQZWbtQ69B-1Rjyh1c3mbDp6dZ4QrVLWWRQeQdbO6XlGQe9zeiGFaLER_j0-Slkmm2xzuy4b8NbtTNo9pyyb4EhfRCtHBwJZCw8sR6gxERIsaVzfhkrq0KgkNFxFnk-aWMljHUWxFp5O4xC3otKJ5PwMqlmemXNg2BkZl0zRyDNCaSt9z5jAtwhiMZ2oXkCd5mb2ucmMMSun5fLv23dw2Ju8DGaD52H_Co58ig6hgDpxDdXVYm1u4EB_rd6Xi9tiYb8BGHKhgQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC21%3A+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=Accelerating+XOR-Based+Erasure+Coding+using+Program+Optimization+Techniques&rft.au=Uezato%2C+Yuya&rft.date=2021-11-14&rft.pub=ACM&rft.eissn=2167-4337&rft.spage=1&rft.epage=15&rft_id=info:doi/10.1145%2F3458817.3476204&rft.externalDocID=9910053