Get a jump start on using Azure HDInsight and Hadoop Ecosystem components. As most Hadoop and Big Data projects are written in either Java, Scala, or Python, this book minimizes the effort to learn another language and is written from the perspective of a .NET developer. Hadoop components are covere
Processing Big Data with Azure HDInsight: Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem
β Scribed by Vinit Yadav (auth.)
- Publisher
- Apress
- Year
- 2017
- Tongue
- English
- Leaves
- 221
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Get a jump start on using Azure HDInsight and Hadoop Ecosystem components. As most Hadoop and Big Data projects are written in either Java, Scala, or Python, this book minimizes the effort to learn another language and is written from the perspective of a .NET developer. Hadoop components are covered, including Hive, Pig, HBase, Storm, and Spark on Azure HDInsight, and code samples are written in .NET only.
Processing Big Data with Azure HDInsight covers the fundamentals of big data, how businesses are using it to their advantage, and how Azure HDInsight fits into the big data world. This book introduces Hadoop and big data concepts and then dives into creating different solutions with HDInsight and the Hadoop Ecosystem. It covers concepts with real-world scenarios and code examples, making sure you get hands-on experience. The best way to utilize this book is to practice while reading. After reading this book you will be familiar with Azure HDInsight and how it can be utilized to build big data solutions, including batch processing, stream analytics, interactive processing, and storing and retrieving data in an efficient manner.
What You'll Learn
- Understand the fundamentals of HDInsight and Hadoop
- Work with HDInsight cluster
- Query with Apache Hive and Apache Pig
- Store and retrieve data with Apache HBase
- Stream data processing using Apache Storm
- Work with Apache Spark
Software developers, technical architects, data scientists/analyts, and Hadoop administrators who want to develop on Microsoftβs managed Hadoop offering, HDInsight
β¦ Table of Contents
Front Matter....Pages i-xix
Big Data, Hadoop, and HDInsight....Pages 1-11
Provisioning an HDInsight Cluster....Pages 13-43
Working with Data in HDInsight....Pages 45-70
Querying Data with Hive....Pages 71-110
Using Pig with HDInsight....Pages 111-122
Working with HBase....Pages 123-142
Real-Time Analytics with Storm....Pages 143-172
Exploring Data with Spark....Pages 173-202
Back Matter....Pages 203-207
β¦ Subjects
Data Storage Representation;Database Management;Data Mining and Knowledge Discovery
π SIMILAR VOLUMES
Chapter 1: Overview: Building Data Analytic Systems with Hadoop -- Chapter 2: A Scala and Python Refresher -- Chapter 3: Standard Toolkits for Hadoop and Analytics -- Chapter 4: Relational, noSQL, and Graph Databases -- Chapter 5: Data Pipelines and How to Construct Them -- Chapter 6: Advanced Searc
Learn advanced analytical techniques and leverage existing tool kits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems that go beyond the basics of cl
<p><p>Learn advanced analytical techniques and leverage existing toolkits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems which go beyond the basics
<p><b>Explore big data concepts, platforms, analytics, and their applications using the power of Hadoop 3</b></p><h4>Key Features</h4><ul><li>Learn Hadoop 3 to build effective big data analytics solutions on-premise and on cloud</li><li>Integrate Hadoop with other big data tools such as R, Python, A
Explore big data concepts, platforms, analytics, and their applications using the power of Hadoop 3 Key Features Learn Hadoop 3 to build effective big data analytics solutions on-premise and on cloud Integrate Hadoop with other big data tools such as R, Python, Apache Spark, and Apache Flink E