A Comparative Study of Consistent Snapshot Algorithms for Main-Memory Database Systems

Gespeichert in:
Bibliographische Detailangaben
Titel: A Comparative Study of Consistent Snapshot Algorithms for Main-Memory Database Systems
Autoren: Liang Li, Guoren Wang, Gang Wu, Ye Yuan, Lei Chen, Xiang Lian
Quelle: IEEE Transactions on Knowledge and Data Engineering. 33:316-330
Publication Status: Preprint
Verlagsinformationen: Institute of Electrical and Electronics Engineers (IEEE), 2021.
Publikationsjahr: 2021
Schlagwörter: Checkpoints, FOS: Computer and information sciences, Computer Science - Databases, HTAP, Snapshot algorithms, 0202 electrical engineering, electronic engineering, information engineering, Databases (cs.DB), 02 engineering and technology, In-memory database systems
Beschreibung: In-memory databases (IMDBs) are gaining increasing popularity in big data applications, where clients commit updates intensively. Specifically, it is necessary for IMDBs to have efficient snapshot performance to support certain special applications (e.g., consistent checkpoint, HTAP). Formally, the in-memory consistent snapshot problem refers to taking an in-memory consistent time-in-point snapshot with the constraints that 1) clients can read the latest data items and 2) any data item in the snapshot should not be overwritten. Various snapshot algorithms have been proposed in academia to trade off throughput and latency, but industrial IMDBs such as Redis adhere to the simple fork algorithm. To understand this phenomenon, we conduct comprehensive performance evaluations on mainstream snapshot algorithms. Surprisingly, we observe that the simple fork algorithm indeed outperforms the state-of-the-arts in update-intensive workload scenarios. On this basis, we identify the drawbacks of existing research and propose two lightweight improvements. Extensive evaluations on synthetic data and Redis show that our lightweight improvements yield better performance than fork, the current industrial standard, and the representative snapshot algorithms from academia. Finally, we have opensourced the implementation of all the above snapshot algorithms so that practitioners are able to benchmark the performance of each algorithm and select proper methods for different application scenarios.
Publikationsart: Article
ISSN: 2326-3865
1041-4347
DOI: 10.1109/tkde.2019.2930987
DOI: 10.48550/arxiv.1810.04915
Zugangs-URL: http://arxiv.org/pdf/1810.04915
http://arxiv.org/abs/1810.04915
https://arxiv.org/abs/1810.04915
https://ieeexplore.ieee.org/document/8772140/
https://arxiv.org/pdf/1810.04915.pdf
https://dblp.uni-trier.de/db/journals/tkde/tkde33.html#LiWWYCL21
https://doi.org/10.1109/TKDE.2019.2930987
Rights: IEEE Copyright
arXiv Non-Exclusive Distribution
Dokumentencode: edsair.doi.dedup.....75b8a7c455d7a9c13aa4c89605c57a2c
Datenbank: OpenAIRE
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://explore.openaire.eu/search/publication?articleId=doi_dedup___%3A%3A75b8a7c455d7a9c13aa4c89605c57a2c
    Name: EDS - OpenAIRE (s4221598)
    Category: fullText
    Text: View record at OpenAIRE
  – Url: https://resolver.ebscohost.com/openurl?sid=EBSCO:edsair&genre=article&issn=23263865&ISBN=&volume=33&issue=&date=20210201&spage=316&pages=316-330&title=IEEE Transactions on Knowledge and Data Engineering&atitle=A%20Comparative%20Study%20of%20Consistent%20Snapshot%20Algorithms%20for%20Main-Memory%20Database%20Systems&aulast=Liang%20Li&id=DOI:10.1109/tkde.2019.2930987
    Name: Full Text Finder
    Category: fullText
    Text: Full Text Finder
    Icon: https://imageserver.ebscohost.com/branding/images/FTF.gif
    MouseOverText: Full Text Finder
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Li%20L
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edsair
DbLabel: OpenAIRE
An: edsair.doi.dedup.....75b8a7c455d7a9c13aa4c89605c57a2c
RelevancyScore: 904
AccessLevel: 3
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 904.014892578125
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: A Comparative Study of Consistent Snapshot Algorithms for Main-Memory Database Systems
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Liang+Li%22">Liang Li</searchLink><br /><searchLink fieldCode="AR" term="%22Guoren+Wang%22">Guoren Wang</searchLink><br /><searchLink fieldCode="AR" term="%22Gang+Wu%22">Gang Wu</searchLink><br /><searchLink fieldCode="AR" term="%22Ye+Yuan%22">Ye Yuan</searchLink><br /><searchLink fieldCode="AR" term="%22Lei+Chen%22">Lei Chen</searchLink><br /><searchLink fieldCode="AR" term="%22Xiang+Lian%22">Xiang Lian</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <i>IEEE Transactions on Knowledge and Data Engineering</i>. 33:316-330
– Name: Publisher
  Label: Publication Status
  Group: PubInfo
  Data: Preprint
– Name: Publisher
  Label: Publisher Information
  Group: PubInfo
  Data: Institute of Electrical and Electronics Engineers (IEEE), 2021.
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2021
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Checkpoints%22">Checkpoints</searchLink><br /><searchLink fieldCode="DE" term="%22FOS%3A+Computer+and+information+sciences%22">FOS: Computer and information sciences</searchLink><br /><searchLink fieldCode="DE" term="%22Computer+Science+-+Databases%22">Computer Science - Databases</searchLink><br /><searchLink fieldCode="DE" term="%22HTAP%22">HTAP</searchLink><br /><searchLink fieldCode="DE" term="%22Snapshot+algorithms%22">Snapshot algorithms</searchLink><br /><searchLink fieldCode="DE" term="%220202+electrical+engineering%2C+electronic+engineering%2C+information+engineering%22">0202 electrical engineering, electronic engineering, information engineering</searchLink><br /><searchLink fieldCode="DE" term="%22Databases+%28cs%2EDB%29%22">Databases (cs.DB)</searchLink><br /><searchLink fieldCode="DE" term="%2202+engineering+and+technology%22">02 engineering and technology</searchLink><br /><searchLink fieldCode="DE" term="%22In-memory+database+systems%22">In-memory database systems</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: In-memory databases (IMDBs) are gaining increasing popularity in big data applications, where clients commit updates intensively. Specifically, it is necessary for IMDBs to have efficient snapshot performance to support certain special applications (e.g., consistent checkpoint, HTAP). Formally, the in-memory consistent snapshot problem refers to taking an in-memory consistent time-in-point snapshot with the constraints that 1) clients can read the latest data items and 2) any data item in the snapshot should not be overwritten. Various snapshot algorithms have been proposed in academia to trade off throughput and latency, but industrial IMDBs such as Redis adhere to the simple fork algorithm. To understand this phenomenon, we conduct comprehensive performance evaluations on mainstream snapshot algorithms. Surprisingly, we observe that the simple fork algorithm indeed outperforms the state-of-the-arts in update-intensive workload scenarios. On this basis, we identify the drawbacks of existing research and propose two lightweight improvements. Extensive evaluations on synthetic data and Redis show that our lightweight improvements yield better performance than fork, the current industrial standard, and the representative snapshot algorithms from academia. Finally, we have opensourced the implementation of all the above snapshot algorithms so that practitioners are able to benchmark the performance of each algorithm and select proper methods for different application scenarios.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: Article
– Name: ISSN
  Label: ISSN
  Group: ISSN
  Data: 2326-3865<br />1041-4347
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.1109/tkde.2019.2930987
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.48550/arxiv.1810.04915
– Name: URL
  Label: Access URL
  Group: URL
  Data: <link linkTarget="URL" linkTerm="http://arxiv.org/pdf/1810.04915" linkWindow="_blank">http://arxiv.org/pdf/1810.04915</link><br /><link linkTarget="URL" linkTerm="http://arxiv.org/abs/1810.04915" linkWindow="_blank">http://arxiv.org/abs/1810.04915</link><br /><link linkTarget="URL" linkTerm="https://arxiv.org/abs/1810.04915" linkWindow="_blank">https://arxiv.org/abs/1810.04915</link><br /><link linkTarget="URL" linkTerm="https://ieeexplore.ieee.org/document/8772140/" linkWindow="_blank">https://ieeexplore.ieee.org/document/8772140/</link><br /><link linkTarget="URL" linkTerm="https://arxiv.org/pdf/1810.04915.pdf" linkWindow="_blank">https://arxiv.org/pdf/1810.04915.pdf</link><br /><link linkTarget="URL" linkTerm="https://dblp.uni-trier.de/db/journals/tkde/tkde33.html#LiWWYCL21" linkWindow="_blank">https://dblp.uni-trier.de/db/journals/tkde/tkde33.html#LiWWYCL21</link><br /><link linkTarget="URL" linkTerm="https://doi.org/10.1109/TKDE.2019.2930987" linkWindow="_blank">https://doi.org/10.1109/TKDE.2019.2930987</link>
– Name: Copyright
  Label: Rights
  Group: Cpyrght
  Data: IEEE Copyright<br />arXiv Non-Exclusive Distribution
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsair.doi.dedup.....75b8a7c455d7a9c13aa4c89605c57a2c
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsair&AN=edsair.doi.dedup.....75b8a7c455d7a9c13aa4c89605c57a2c
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1109/tkde.2019.2930987
    Languages:
      – Text: Undetermined
    PhysicalDescription:
      Pagination:
        PageCount: 15
        StartPage: 316
    Subjects:
      – SubjectFull: Checkpoints
        Type: general
      – SubjectFull: FOS: Computer and information sciences
        Type: general
      – SubjectFull: Computer Science - Databases
        Type: general
      – SubjectFull: HTAP
        Type: general
      – SubjectFull: Snapshot algorithms
        Type: general
      – SubjectFull: 0202 electrical engineering, electronic engineering, information engineering
        Type: general
      – SubjectFull: Databases (cs.DB)
        Type: general
      – SubjectFull: 02 engineering and technology
        Type: general
      – SubjectFull: In-memory database systems
        Type: general
    Titles:
      – TitleFull: A Comparative Study of Consistent Snapshot Algorithms for Main-Memory Database Systems
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Liang Li
      – PersonEntity:
          Name:
            NameFull: Guoren Wang
      – PersonEntity:
          Name:
            NameFull: Gang Wu
      – PersonEntity:
          Name:
            NameFull: Ye Yuan
      – PersonEntity:
          Name:
            NameFull: Lei Chen
      – PersonEntity:
          Name:
            NameFull: Xiang Lian
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 02
              Type: published
              Y: 2021
          Identifiers:
            – Type: issn-print
              Value: 23263865
            – Type: issn-print
              Value: 10414347
            – Type: issn-locals
              Value: edsair
            – Type: issn-locals
              Value: edsairFT
          Numbering:
            – Type: volume
              Value: 33
          Titles:
            – TitleFull: IEEE Transactions on Knowledge and Data Engineering
              Type: main
ResultId 1