Accelerating XOR-Based Erasure Coding using Program Optimization Techniques

Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a significant performance gap between the state-of-the-art XOR-based EC approach (~4.9 GB/s coding through-put) and Intel's high-performance E...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SC21: International Conference for High Performance Computing, Networking, Storage and Analysis S. 1 - 15
1. Verfasser: Uezato, Yuya
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: ACM 14.11.2021
Schlagworte:
ISSN:2167-4337
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a significant performance gap between the state-of-the-art XOR-based EC approach (~4.9 GB/s coding through-put) and Intel's high-performance EC library based on another approach (~6.7 GB/s). We propose a novel approach based on our observation that XOR-based EC virtually generates programs of a Domain Specific Language for XORing byte arrays. We formalize such programs as straight-line programs (SLPs) of compiler construction and optimize SLPs using various program optimization techniques. Our optimization flow is three-fold: 1) reducing the number of XORs using grammar compression algorithms; 2) reducing memory accesses using deforestation, a functional program optimization method; and 3) reducing cache misses using the (red-blue) pebble game of program analysis. We provide an experimentallibrary, which outperforms Intel's library with an \sim 8.92 GB/s throughput.
AbstractList Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a significant performance gap between the state-of-the-art XOR-based EC approach (~4.9 GB/s coding through-put) and Intel's high-performance EC library based on another approach (~6.7 GB/s). We propose a novel approach based on our observation that XOR-based EC virtually generates programs of a Domain Specific Language for XORing byte arrays. We formalize such programs as straight-line programs (SLPs) of compiler construction and optimize SLPs using various program optimization techniques. Our optimization flow is three-fold: 1) reducing the number of XORs using grammar compression algorithms; 2) reducing memory accesses using deforestation, a functional program optimization method; and 3) reducing cache misses using the (red-blue) pebble game of program analysis. We provide an experimentallibrary, which outperforms Intel's library with an \sim 8.92 GB/s throughput.
Author Uezato, Yuya
Author_xml – sequence: 1
  givenname: Yuya
  surname: Uezato
  fullname: Uezato, Yuya
  email: yuuya_uezato@dwango.co.jp
  organization: Dwango, Co., Ltd.,Japan
BookMark eNotj8tOwzAURA0CiVKyZsEmP5Bi-_q5LFF5iEpBqEjsKse5KUFNUuxkAV9PKtjMLI7OSHNJzrq-Q0KuGV0wJuQtCGkM0wsQWnEqTkhitZkABSMEZ6dkxpnSmQDQFySJ8ZNSyo1mwOmMPC-9xz0GNzTdLn0vXrM7F7FKV8HFMWCa99URjPGYL6HfBdemxWFo2uZncvou3aD_6JqvEeMVOa_dPmLy33Pydr_a5I_Zunh4ypfrzHGhh8xLwbgEWQO3FrXk3CHltSqdV1hahVBqxqcn4MB4rY0XzFdGWQWltw5gTm7-dhtE3B5C07rwvbWWUSoBfgEiXE5w
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1145/3458817.3476204
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9781450384421
1450384420
EISSN 2167-4337
EndPage 15
ExternalDocumentID 9910053
Genre orig-research
GroupedDBID 6IE
6IF
6IH
6IK
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
OCL
RIE
RIL
ID FETCH-LOGICAL-a247t-c5412535f3299e7522ae02f6bac6eb96e3b7122043a38c778c41cd86963bc9a33
IEDL.DBID RIE
ISICitedReferencesCount 13
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000946520100014&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 02:18:42 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a247t-c5412535f3299e7522ae02f6bac6eb96e3b7122043a38c778c41cd86963bc9a33
PageCount 15
ParticipantIDs ieee_primary_9910053
PublicationCentury 2000
PublicationDate 2021-Nov.-14
PublicationDateYYYYMMDD 2021-11-14
PublicationDate_xml – month: 11
  year: 2021
  text: 2021-Nov.-14
  day: 14
PublicationDecade 2020
PublicationTitle SC21: International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC
PublicationYear 2021
Publisher ACM
Publisher_xml – name: ACM
SSID ssj0002871320
ssj0003204180
Score 1.8683007
Snippet Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Encoding
Games
High performance computing
Optimization methods
Program processors
Redundancy
Throughput
Title Accelerating XOR-Based Erasure Coding using Program Optimization Techniques
URI https://ieeexplore.ieee.org/document/9910053
WOSCitedRecordID wos000946520100014&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LTwIxEJ4A8eAJFYzv9ODRwnZbtu1RCcREA8Rgwo1su4V4cCE8_P3OLAvGxIu36abdbDptv2-n8wC4z-i2JjYpj1DiKmjPjXWCC53hiRxs5H1RteRVDwZmMrGjCjwcYmFCCIXzWWiRWNzlZwu_JVNZG7kMLZoqVLVOdrFaB3sKMX9ZUh9qo6yEicpsPkJ12pKCMoVuSaUpCfuvcioFmvTr__uOE2j-hOWx0QFwTqES8jOo7-sysHKbNuDl0XtEE9JtPmeT4Rt_QqzKWG9V2ANZd0HjGbm8z-mF5KDFhnh2fJZBmWy8z-y6bsJ7vzfuPvOyaAJPY6U33HcUchbZmUkEmqCRXqUhimeJS30SnE2CdBr1EymZSuO1Nl4Jn5kEN6LzNpXyHGr5Ig8XwPBvLI1xsHAWiZ91RlnhEPES7GVjnV1Cg-ZmutzlxZiW03L19-NrOI7JH4Rc6NQN1DarbbiFI_-1-Viv7gplfgMQuJ1Y
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3NT8IwFH9BNNETKhi_7cGjhXXt1vaoBIIBgRhMuJG1K8SDw_Dh3-_rGBgTL95el3ZZ-tr-fnt9HwD3qb-tCVVCA5SocNJSpQ2jTKZ4IjsdWJtXLenJfl-Nx3pYgoddLIxzLnc-c3Uv5nf56dyuvamsgVzGL5o92I-ECINNtNbOouK5Py_Ij2-jLJgKinw-TEQN7sMymaxzIX0a9l8FVXI8aVf-9yXHUPsJzCPDHeScQMllp1DZVmYgxUatQvfRWsQTr91sRsaDV_qEaJWS1iK3CJLm3I8n3ul95l_oXbTIAE-PjyIsk4y2uV2XNXhrt0bNDi3KJtAkFHJFbSSQtfBoyhFqnESClbggnMYmsbEzOnbcSNRQIHjClZVSWcFsqmLcisbqhPMzKGfzzJ0Dwf-xJMTBzGikftoooZlBzIuxlw5legFVPzeTz01mjEkxLZd_P76Dw87opTfpPfe7V3AUeu8Q71AnrqG8WqzdDRzYr9X7cnGbK_Ybc1ugnw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC21%3A+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=Accelerating+XOR-Based+Erasure+Coding+using+Program+Optimization+Techniques&rft.au=Uezato%2C+Yuya&rft.date=2021-11-14&rft.pub=ACM&rft.eissn=2167-4337&rft.spage=1&rft.epage=15&rft_id=info:doi/10.1145%2F3458817.3476204&rft.externalDocID=9910053