๐”– Scriptorium
โœฆ   LIBER   โœฆ

๐Ÿ“

Optimizing Hadoop for MapReduce

โœ Scribed by Khaled Tannir


Publisher
Packt Publishing
Year
2014
Tongue
English
Leaves
120
Category
Library

โฌ‡  Acquire This Volume

No coin nor oath required. For personal study only.

โœฆ Synopsis


Learn how to configure your Hadoop cluster to run optimal MapReduce jobs

Overview

  • Optimize your MapReduce job performance
  • Identify your Hadoop cluster's weaknesses
  • Tune your MapReduce configuration

In Detail

MapReduce is the distribution system that the Hadoop MapReduce engine uses to distribute work around a cluster by working parallel on smaller data sets. It is useful in a wide range of applications, including distributed pattern-based searching, distributed sorting, web link-graph reversal, term-vector per host, web access log stats, inverted index construction, document clustering, machine learning, and statistical machine translation.

This book introduces you to advanced MapReduce concepts and teaches you everything from identifying the factors that affect MapReduce job performance to tuning the MapReduce configuration. Based on real-world experience, this book will help you to fully utilize your cluster's node resources to run MapReduce jobs optimally.

This book details the Hadoop MapReduce job performance optimization process. Through a number of clear and practical steps, it will help you to fully utilize your cluster's node resources.

Starting with how MapReduce works and the factors that affect MapReduce performance, you will be given an overview of Hadoop metrics and several performance monitoring tools. Further on, you will explore performance counters that help you identify resource bottlenecks, check cluster health, and size your Hadoop cluster. You will also learn about optimizing map and reduce tasks by using Combiners and compression.

The book ends with best practices and recommendations on how to use your Hadoop cluster optimally.

What you will learn from this book

  • Learn about the factors that affect MapReduce performance
  • Utilize the Hadoop MapReduce performance counters to identify resource bottlenecks
  • Size your Hadoop cluster's nodes
  • Set the number of mappers and reducers correctly
  • Optimize mapper and reducer task throughput and code size using compression and Combiners
  • Understand the various tuning properties and best practices to optimize clusters

Approach

This book is an example-based tutorial that deals with optimizing MapReduce job performance.

Who this book is written for

If you are a Hadoop administrator, developer, MapReduce user, or beginner, this book is the best choice available if you wish to optimize your clusters and applications. Having prior knowledge of creating MapReduce applications is not necessary, but will help you better understand the concepts and snippets of MapReduce class template code.


๐Ÿ“œ SIMILAR VOLUMES


Optimizing Hadoop for MapReduce
โœ Khaled Tannir ๐Ÿ“‚ Library ๐Ÿ“… 2014 ๐Ÿ› Packt Publishing ๐ŸŒ English

MapReduce is the distribution system that the Hadoop MapReduce engine uses to distribute work around a cluster by working parallel on smaller data sets. It is useful in a wide range of applications, including distributed pattern-based searching, distributed sorting, web link-graph reversal, term-vec

Optimizing Hadoop for MapReduce
โœ Khaled Tannir ๐Ÿ“‚ Library ๐Ÿ“… 2014 ๐Ÿ› Packt Publishing ๐ŸŒ English

<p>Learn how to configure your Hadoop cluster to run optimal MapReduce jobs</p> <p><b>Overview</b></p> <ul> <li>Optimize your MapReduce job performance</li> <li>Identify your Hadoop cluster's weaknesses</li> <li>Tune your MapReduce configuration</li> </ul> <p><b>In Detail</b></p> <p>MapReduce is the

Optimizing Hadoop for MapReduce : learn
โœ Tannir, Khaled ๐Ÿ“‚ Library ๐Ÿ“… 2014 ๐Ÿ› Packt Publishing - ebooks Account ๐ŸŒ English

<b>Learn how to configure your Hadoop cluster to run optimal MapReduce jobs</b><h2>About This Book</h2><ul> <li>Optimize your MapReduce job performance</li> <li>Identify your Hadoop cluster's weaknesses</li> <li>Tune your MapReduce configuration</li> </ul><h2>Who This Book Is For</h2><p>If you are a

Hadoop MapReduce Cookbook: Recipes for a
โœ Srinath Perera, Thilina Gunarathne ๐Ÿ“‚ Library ๐Ÿ“… 2013 ๐Ÿ› Packt Publishing ๐ŸŒ English

Learn to process large and complex data sets, starting simply, then diving in deep. Solve complex big data problems such as classifications, finding relationships, online marketing and recommendations. More than 50 Hadoop MapReduce recipes, presented in a simple and straightforward manner, with step

Hadoop MapReduce Cookbook
โœ Srinath Perera, Thilina Gunarathne ๐Ÿ“‚ Library ๐Ÿ“… 2013 ๐Ÿ› Packt Publishing ๐ŸŒ English

Recipes for analyzing large and complex datasets with Hadoop MapReduce Overview Learn to process large and complex data sets, starting simply, then diving in deep Solve complex big data problems such as classifications, finding relationships, online marketing and recommendations. More than 50 Hadoo

Hadoop MapReduce Cookbook
โœ Srinath Perera, Thilina Gunarathne ๐Ÿ“‚ Library ๐Ÿ“… 2013 ๐Ÿ› Packt Publishing ๐ŸŒ English

Recipes for analyzing large and complex datasets with Hadoop MapReduce Overview Learn to process large and complex data sets, starting simply, then diving in deep Solve complex big data problems such as classifications, finding relationships, online marketing and recommendations. More than 50 Hadoo