Accelerating XOR-Based Erasure Coding using Program Optimization Techniques

Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a significant performance gap between the state-of-the-art XOR-based EC approach (~4.9 GB/s coding throughput) and Intel's high-performance E...

Bibliographic details
Published in: SC21: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1-15
Main author: Uezato, Yuya
Format: Conference paper
Language: English
Published: ACM, 14 November 2021
ISSN:2167-4337
Abstract Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a significant performance gap between the state-of-the-art XOR-based EC approach (~4.9 GB/s coding throughput) and Intel's high-performance EC library based on another approach (~6.7 GB/s). We propose a novel approach based on our observation that XOR-based EC virtually generates programs of a domain-specific language for XORing byte arrays. We formalize such programs as straight-line programs (SLPs) of compiler construction and optimize SLPs using various program optimization techniques. Our optimization flow is three-fold: 1) reducing the number of XORs using grammar compression algorithms; 2) reducing memory accesses using deforestation, a functional program optimization method; and 3) reducing cache misses using the (red-blue) pebble game of program analysis. We provide an experimental library, which outperforms Intel's library with a throughput of ~8.92 GB/s.
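Illustrative note: the idea of treating XOR-based encoding as a straight-line program over byte arrays can be sketched as below. This is a minimal sketch under stated assumptions, not the paper's library or its algorithms: the instruction format, the helper names (`xor_bytes`, `run_slp`, `xor_count`), and the sample data are all hypothetical, and the shared temporary `t` is introduced by hand rather than by grammar compression. It only shows why reusing a common sub-XOR lowers the XOR count while producing the same parities.

```python
from typing import Dict, List, Tuple

# Hypothetical instruction format: (target name, names of operands XORed together).
Instr = Tuple[str, List[str]]


def xor_bytes(x: bytes, y: bytes) -> bytes:
    """XOR two byte arrays element-wise (assumes equal length)."""
    return bytes(p ^ q for p, q in zip(x, y))


def run_slp(program: List[Instr], inputs: Dict[str, bytes]) -> Dict[str, bytes]:
    """Evaluate a straight-line program: no branches, each line assigns once."""
    env = dict(inputs)
    for target, operands in program:
        acc = env[operands[0]]
        for name in operands[1:]:
            acc = xor_bytes(acc, env[name])
        env[target] = acc
    return env


def xor_count(program: List[Instr]) -> int:
    """Number of binary XORs over byte arrays that the program performs."""
    return sum(len(operands) - 1 for _, operands in program)


# Two parities that share the sub-expression a ^ b.
naive: List[Instr] = [
    ("p1", ["a", "b", "c"]),
    ("p2", ["a", "b", "d"]),
]

# Same parities, with the shared part computed once into a temporary.
shared: List[Instr] = [
    ("t", ["a", "b"]),
    ("p1", ["t", "c"]),
    ("p2", ["t", "d"]),
]

# Arbitrary sample data blocks of equal length.
inputs = {
    "a": b"\x01\x02",
    "b": b"\x10\x20",
    "c": b"\xff\x00",
    "d": b"\x0f\xf0",
}

out_naive = run_slp(naive, inputs)
out_shared = run_slp(shared, inputs)
```

Here `xor_count(naive)` is 4 and `xor_count(shared)` is 3, while both programs yield identical `p1` and `p2`. The memory-access and cache-miss steps named in the abstract are not modeled in this sketch.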
Author Uezato, Yuya
Author_xml – sequence: 1
  givenname: Yuya
  surname: Uezato
  fullname: Uezato, Yuya
  email: yuuya_uezato@dwango.co.jp
  organization: Dwango, Co., Ltd.,Japan
ContentType Conference Proceeding
DOI 10.1145/3458817.3476204
Discipline Computer Science
EISBN 9781450384421
1450384420
EISSN 2167-4337
EndPage 15
ExternalDocumentID 9910053
Genre orig-research
ISICitedReferencesCount 13
IngestDate Wed Aug 27 02:18:42 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
PageCount 15
ParticipantIDs ieee_primary_9910053
PublicationCentury 2000
PublicationDate 2021-Nov.-14
PublicationDateYYYYMMDD 2021-11-14
PublicationDate_xml – month: 11
  year: 2021
  text: 2021-Nov.-14
  day: 14
PublicationDecade 2020
PublicationTitle SC21: International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC
PublicationYear 2021
Publisher ACM
Publisher_xml – name: ACM
Snippet Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Encoding
Games
High performance computing
Optimization methods
Program processors
Redundancy
Throughput
Title Accelerating XOR-Based Erasure Coding using Program Optimization Techniques
URI https://ieeexplore.ieee.org/document/9910053