A survey of data partitioning and sampling methods to support big data analysis
Computer clusters with the shared-nothing architecture are the major computing platforms for big data processing and analysis. In cluster computing, data partitioning and sampling are two fundamental strategies to speed up the computation of big data and increase scalability. In this paper, we prese...
Saved in:
| Published in: | Big Data Mining and Analytics Vol. 3; no. 2; pp. 85 - 101 |
|---|---|
| Main Authors: | , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
Beijing
Tsinghua University Press
01.06.2020
|
| Subjects: | |
| ISSN: | 2096-0654, 2097-406X |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!