[ACM Press the 21st international symposium - Delft, The Netherlands (2012.06.18-2012.06.22)] Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing - HPDC '12 - Coupling scheduler for MapReduce/Hadoop
โ Scribed by Tan, Jian; Meng, Xiaoqiao; Zhang, Li
- Book ID
- 120704024
- Publisher
- ACM Press
- Year
- 2012
- Weight
- 383 KB
- Category
- Article
- ISBN
- 1450308058
No coin nor oath required. For personal study only.
โฆ Synopsis
Current schedulers of MapReduce/Hadoop are quite successful in providing good performance. However improving spaces still exist: map and reduce tasks are not jointly optimized for scheduling, albeit there is a strong dependence between them. This can cause job starvation and bad data locality. We design a resource-aware scheduler for Hadoop, which couples the progresses of mappers and reducers, and jointly optimize the placements for both of them. This mitigates the starvation problem and improves the overall data locality. Our experiments demonstrate improvements to job response times by up to an order of magnitude.
๐ SIMILAR VOLUMES
Accelerators and heterogeneous architectures in general, and GPUs in particular, have recently emerged as major players in high performance computing. For many classes of applications, MapReduce has emerged as the framework for easing parallel programming and improving programmer productivity. There