Robust external hash aggregation in the solid state age
Gespeichert in:
| Titel: | Robust external hash aggregation in the solid state age |
|---|---|
| Autoren: | Kuiper, L.N. (Laurens), Boncz, P.A. (Peter), Mühleisen, H.F. (Hannes) |
| Publikationsjahr: | 2024 |
| Bestand: | CWI's Institutional Repository (Centrum voor Wiskunde en Informatica) |
| Schlagwörter: | Relational databases, Database query processing, Aggregation |
| Beschreibung: | Analytical database systems offer high-performance in-memory aggregation. If there are many unique groups, temporary query intermediates may not fit RAM, requiring the use of external storage. However, switching from an in-memory to an external algorithm can degrade performance sharply. We revisit external hash aggregation on modern hardware, aiming instead for robust performance that avoids a 'performance cliff' when memory runs out. To achieve this, we introduce two techniques for handling temporary query intermediates. First, we propose unifying the memory management of temporary and persistent data. Second, we propose using a page layout that can be spilled to disk despite being optimized for main memory performance. These two techniques allow operator implementations to process larger-than-memory query intermediates with only minor modifications. We integrate these into DuckDB's parallel hash aggregation. Experimental results show that our implementation gracefully degrades performance as query intermediates exceed the available memory limit, while main memory performance is competitive with other analytical database systems. |
| Publikationsart: | conference object |
| Sprache: | English |
| Relation: | https://ir.cwi.nl/pub/34360 |
| DOI: | 10.1109/ICDE60146.2024.00288 |
| Verfügbarkeit: | https://ir.cwi.nl/pub/34360 https://doi.org/10.1109/ICDE60146.2024.00288 |
| Dokumentencode: | edsbas.AFB0FD60 |
| Datenbank: | BASE |
| FullText | Text: Availability: 0 CustomLinks: – Url: https://ir.cwi.nl/pub/34360# Name: EDS - BASE (s4221598) Category: fullText Text: View record from BASE – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Kuiper%20LN Name: ISI Category: fullText Text: Nájsť tento článok vo Web of Science Icon: https://imagesrvr.epnet.com/ls/20docs.gif MouseOverText: Nájsť tento článok vo Web of Science |
|---|---|
| Header | DbId: edsbas DbLabel: BASE An: edsbas.AFB0FD60 RelevancyScore: 901 AccessLevel: 3 PubType: Conference PubTypeId: conference PreciseRelevancyScore: 901.007141113281 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Robust external hash aggregation in the solid state age – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Kuiper%2C+L%2EN%2E+%28Laurens%29%22">Kuiper, L.N. (Laurens)</searchLink><br /><searchLink fieldCode="AR" term="%22Boncz%2C+P%2EA%2E+%28Peter%29%22">Boncz, P.A. (Peter)</searchLink><br /><searchLink fieldCode="AR" term="%22Mühleisen%2C+H%2EF%2E+%28Hannes%29%22">Mühleisen, H.F. (Hannes)</searchLink> – Name: DatePubCY Label: Publication Year Group: Date Data: 2024 – Name: Subset Label: Collection Group: HoldingsInfo Data: CWI's Institutional Repository (Centrum voor Wiskunde en Informatica) – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22Relational+databases%22">Relational databases</searchLink><br /><searchLink fieldCode="DE" term="%22Database+query+processing%22">Database query processing</searchLink><br /><searchLink fieldCode="DE" term="%22Aggregation%22">Aggregation</searchLink> – Name: Abstract Label: Description Group: Ab Data: Analytical database systems offer high-performance in-memory aggregation. If there are many unique groups, temporary query intermediates may not fit RAM, requiring the use of external storage. However, switching from an in-memory to an external algorithm can degrade performance sharply. We revisit external hash aggregation on modern hardware, aiming instead for robust performance that avoids a 'performance cliff' when memory runs out. To achieve this, we introduce two techniques for handling temporary query intermediates. First, we propose unifying the memory management of temporary and persistent data. Second, we propose using a page layout that can be spilled to disk despite being optimized for main memory performance. These two techniques allow operator implementations to process larger-than-memory query intermediates with only minor modifications. We integrate these into DuckDB's parallel hash aggregation. Experimental results show that our implementation gracefully degrades performance as query intermediates exceed the available memory limit, while main memory performance is competitive with other analytical database systems. – Name: TypeDocument Label: Document Type Group: TypDoc Data: conference object – Name: Language Label: Language Group: Lang Data: English – Name: NoteTitleSource Label: Relation Group: SrcInfo Data: https://ir.cwi.nl/pub/34360 – Name: DOI Label: DOI Group: ID Data: 10.1109/ICDE60146.2024.00288 – Name: URL Label: Availability Group: URL Data: https://ir.cwi.nl/pub/34360<br />https://doi.org/10.1109/ICDE60146.2024.00288 – Name: AN Label: Accession Number Group: ID Data: edsbas.AFB0FD60 |
| PLink | https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.AFB0FD60 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1109/ICDE60146.2024.00288 Languages: – Text: English Subjects: – SubjectFull: Relational databases Type: general – SubjectFull: Database query processing Type: general – SubjectFull: Aggregation Type: general Titles: – TitleFull: Robust external hash aggregation in the solid state age Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Kuiper, L.N. (Laurens) – PersonEntity: Name: NameFull: Boncz, P.A. (Peter) – PersonEntity: Name: NameFull: Mühleisen, H.F. (Hannes) IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 01 Type: published Y: 2024 Identifiers: – Type: issn-locals Value: edsbas |
| ResultId | 1 |
Nájsť tento článok vo Web of Science