Local and Global Optimization of MapReduce Program Model

MapReduce, which was introduced by Google, provides two functional interfaces, Map and Reduce, for a user to write the user-specific code to process the large amount of data. It has been widely deployed in cloud computing systems. The parallel tasks, data partition, and data transit are automaticall...

Full description

Saved in:
Bibliographic Details
Published in:2011 IEEE World Congress on Services pp. 257 - 264
Main Authors: Congchong Liu, Shujia Zhou
Format: Conference Proceeding
Language:English
Published: IEEE 01.07.2011
Subjects:
ISBN:1457708795, 9781457708794
ISSN:2378-3818
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:MapReduce, which was introduced by Google, provides two functional interfaces, Map and Reduce, for a user to write the user-specific code to process the large amount of data. It has been widely deployed in cloud computing systems. The parallel tasks, data partition, and data transit are automatically managed by its runtime system. This paper proposes a solution to optimize the MapReduce program model and demonstrate it with X10. We develop an adaptive load distribution scheme to balance the load on each node and consequently reduce across-node communication cost occurring in the Reduce function. In addition, we exploit shared-memory in each node to further reduce the communication cost with multi-core programming.
ISBN:1457708795
9781457708794
ISSN:2378-3818
DOI:10.1109/SERVICES.2011.64