Handling data skew in join algorithms using MapReduce

•We introduce a skew handling algorithm, called multi-dimensional range partitioning.•The proposed algorithm is more efficient than traditional MapReduce-based join algorithms.•The proposed algorithm is scalable regardless of the size of input data. One of the major obstacles hindering effective joi...

Full description

Saved in:
Bibliographic Details
Published in:Expert systems with applications Vol. 51; pp. 286 - 299
Main Authors: Myung, Jaeseok, Shim, Junho, Yeon, Jongheum, Lee, Sang-goo
Format: Journal Article
Language:English
Published: Elsevier Ltd 01.06.2016
Subjects:
ISSN:0957-4174, 1873-6793
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first