Using Performance Measurements to Improve MapReduce Algorithms

The Hadoop MapReduce software environment is used for parallel processing of distributively stored data. Data mining algorithms of increasing sophistication are being implemented in MapReduce, bringing new challenges for performance measurement and tuning. We focus on analyzing a job after completio...

Bibliographic Details
Published in: Procedia Computer Science, Vol. 9, pp. 1920-1929
Main Authors: Plantenga, Todd D., Choe, Yung Ryn, Yoshimura, Ann
Format: Journal Article
Language: English
Published: Elsevier B.V. 2012
ISSN: 1877-0509