A survey of data partitioning and sampling methods to support big data analysis

Computer clusters with the shared-nothing architecture are the major computing platforms for big data processing and analysis. In cluster computing, data partitioning and sampling are two fundamental strategies to speed up the computation of big data and increase scalability. In this paper, we prese...

Bibliographic Details
Published in: Big Data Mining and Analytics Vol. 3; no. 2; pp. 85-101
Main Authors: Mahmud, Mohammad Sultan, Huang, Joshua Zhexue, Salloum, Salman, Emara, Tamer Z., Sadatdiynov, Kuanishbay
Format: Journal Article
Language: English
Published: Beijing, Tsinghua University Press, 01.06.2020
ISSN: 2096-0654, 2097-406X