𝔖 Bobbio Scriptorium
✦   LIBER   ✦

[ACM Press the third joint WOSP/SIPEW international conference - Boston, Massachusetts, USA (2012.04.22-2012.04.25)] Proceedings of the third joint WOSP/SIPEW international conference on Performance Engineering - ICPE '12 - Apache hadoop performance-tuning methodologies and best practices

✍ Scribed by Joshi, Shrinivas B.


Book ID
121432685
Publisher
ACM Press
Year
2012
Weight
374 KB
Category
Article
ISBN
1450312020

No coin nor oath required. For personal study only.

✦ Synopsis


Apache Hadoop is a Java based distributed computing framework built for applications implemented using MapReduce programming model. In recent years, Hadoop technology has experienced an unprecedented growth in its adoption. From single-node clusters to clusters with well over thousands of nodes, Hadoop technology is being used to perform myriad of functions -search optimizations, data mining, click stream analytics, machine learning to name a few. Although setting up Hadoop clusters and building applications for Hadoop is a well understood area, tuning Hadoop clusters for optimal performance is still a black art. In this demo paper, we will attempt to provide the audience with a holistic approach of Hadoop performance tuning methodologies and best practices. We discuss hardware as well as software tuning techniques including BIOS, OS, JVM and Hadoop configuration parameters tuning.


πŸ“œ SIMILAR VOLUMES


[ACM Press the third joint WOSP/SIPEW in
✍ Kounev, Samuel; Spinner, Simon; Meier, Philipp πŸ“‚ Article πŸ“… 2012 πŸ› ACM Press βš– 729 KB

Queueing Petri nets are a powerful formalism that can be exploited for modeling distributed systems and evaluating their performance and scalability. By combining the modeling power and expressiveness of queueing networks and stochastic Petri nets, queueing Petri nets provide a number of advantages.