Extracting deltas from column oriented NoSQL databases for different incremental applications and diverse data targets

This paper describes the Change Data Capture (CDC) problems in the context of column-oriented NoSQL databases (CoNoSQLDBs). CDC is a term mostly used by ETL tools and data warehousing environments (DW) to depict a data processing of extracting data changes made at the data sources. Based on analyzin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Data & knowledge engineering Jg. 93; S. 42 - 59
Hauptverfasser: Hu, Yong, Dessloch, Stefan
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Elsevier B.V 01.09.2014
Schlagworte:
ISSN:0169-023X, 1872-6933
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper describes the Change Data Capture (CDC) problems in the context of column-oriented NoSQL databases (CoNoSQLDBs). CDC is a term mostly used by ETL tools and data warehousing environments (DW) to depict a data processing of extracting data changes made at the data sources. Based on analyzing the impacts and constraints caused by the core features of CoNoSQLDBs, we propose a logical change data (delta) model and the corresponding delta representations which could work with different incremental applications and diverse data targets. Moreover, we present five feasible CDC approaches, i.e. Timestamp-based approach, Audit-column approach, Log-based approach, Trigger-based approach and Snapshot differential approach and indicate the performance winners under different circumstances.
ISSN:0169-023X
1872-6933
DOI:10.1016/j.datak.2014.07.002