Big Data for Chimps: A Guide to Massive-Scale Data Processing in Practice
✍ Scribed by Philip (flip) Kromer, Russell Jurney
- Publisher
- O'Reilly Media
- Year
- 2015
- Tongue
- English
- Leaves
- 220
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
✦ Synopsis
Finding patterns in massive event streams can be difficult, but learning how to find them doesn’t have to be. This unique hands-on guide shows you how to solve this and many other problems in large-scale data processing with simple, fun, and elegant tools that leverage Apache Hadoop. You’ll gain a practical, actionable view of big data by working with real data and real problems.
Perfect for beginners, this book’s approach will also appeal to experienced practitioners who want to brush up on their skills. Part I explains how Hadoop and MapReduce work, while Part II covers many analytic patterns you can use to process any data. As you work through several exercises, you’ll also learn how to use Apache Pig to process data.
- Learn the necessary mechanics of working with Hadoop, including how data and computation move around the cluster
- Dive into map/reduce mechanics and build your first map/reduce job in Python
- Understand how to run chains of map/reduce jobs in the form of Pig scripts
- Use a real-world dataset—baseball performance statistics—throughout the book
- Work with examples of several analytic patterns, and learn when and where you might use them
✦ Subjects
Информатика и вычислительная техника;Искусственный интеллект;Интеллектуальный анализ данных;
📜 SIMILAR VOLUMES
<p>Finding patterns in massive event streams can be difficult, but learning how to find them doesn't have to be. This unique hands-on guide shows you how to solve this and many other problems in large-scale data processing with simple, fun, and elegant tools that leverage Apache Hadoop. You'll gain
<P>With this book, managers and decision makers are given the tools to make more informed decisions about big data purchasing initiatives. <STRONG>Big Data Analytics: A Practical Guide for Managers</STRONG> not only supplies descriptions of common tools, but also surveys the various products and ven
<P>With this book, managers and decision makers are given the tools to make more informed decisions about big data purchasing initiatives. <STRONG>Big Data Analytics: A Practical Guide for Managers</STRONG> not only supplies descriptions of common tools, but also surveys the various products and ven
<p><em>Big Data Analytics with Spark</em> is a step-by-step guide for learning Spark, which is an open-source fast and general-purpose cluster computing framework for large-scale data analysis. You will learn how to use Spark for different types of big data analytics projects, including batch, inter