<p><span>Successfully navigating the data-driven economy presupposes a certain understanding of the technologies and methods to gain insights from Big Data. This book aims to help data science practitioners to successfully manage the transition to Big Data. <br>Building on familiar content from appl
Data Lake Analytics on Microsoft Azure: A Practitioner's Guide to Big Data Engineering
β Scribed by Harsh Chawla, Pankaj Khattar
- Publisher
- Apress
- Year
- 2020
- Tongue
- English
- Leaves
- 231
- Edition
- 1st ed.
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Get a 360-degree view of how the journey of data analytics solutions has evolved from monolithic data stores and enterprise data warehouses to data lakes and modern data warehouses. You will
This book includes comprehensive coverage of how:
- To architect data lake analytics solutions by choosing suitable technologies available on Microsoft Azure
- The advent of microservices applications covering ecommerce or modern solutions built on IoT and how real-time streaming data has completely disrupted this ecosystem
- These data analytics solutions have been transformed from solely understanding the trends from historical data to building predictions by infusing machine learning technologies into the solutions
Data platform professionals who have been working on relational data stores, non-relational data stores, and big data technologies will find the content in this book useful. The book also can help you start your journey into the data engineer world as it provides an overview of advanced data analytics and touches on data science concepts and various artificial intelligence and machine learning technologies available on Microsoft Azure.
What Will You Learn
- Concepts of data lake analytics, the modern data warehouse, and advanced data analytics
- Architecture patterns of the modern data warehouse and advanced data analytics solutions
- Phasesβsuch as Data Ingestion, Store, Prep and Train, and Model and Serveβof data analytics solutions and technology choices available on Azure under each phase
- In-depth coverage of real-time and batch mode data analytics solutions architecture
- Various managed services available on Azure such as Synapse analytics, event hubs, Stream analytics, CosmosDB, and managed Hadoop services such as Databricks and HDInsight
Data platform professionals, database architects, engineers, and solution architects
β¦ Table of Contents
Front Matter ....Pages i-xvii
Data Lake Analytics Concepts (Harsh Chawla, Pankaj Khattar)....Pages 1-10
Building Blocks of Data Analytics (Harsh Chawla, Pankaj Khattar)....Pages 11-25
Data Analytics on Public Cloud (Harsh Chawla, Pankaj Khattar)....Pages 27-41
Data Ingestion (Harsh Chawla, Pankaj Khattar)....Pages 43-85
Data Storage (Harsh Chawla, Pankaj Khattar)....Pages 87-98
Data Preparation and Training Part I (Harsh Chawla, Pankaj Khattar)....Pages 99-142
Data Preparation and Training Part II (Harsh Chawla, Pankaj Khattar)....Pages 143-180
Model and Serve (Harsh Chawla, Pankaj Khattar)....Pages 181-209
Summary (Harsh Chawla, Pankaj Khattar)....Pages 211-213
Back Matter ....Pages 215-222
β¦ Subjects
Computer Science; Microsoft and .NET; Big Data
π SIMILAR VOLUMES
Successfully navigating the data-driven economy presupposes a certain understanding of the technologies and methods to gain insights from Big Data. This book aims to help data science practitioners to successfully manage the transition to Big Data. Building on familiar content from applied econometr
<p><span>Successfully navigating the data-driven economy presupposes a certain understanding of the technologies and methods to gain insights from Big Data. This book aims to help data science practitioners to successfully manage the transition to Big Data. <br>Building on familiar content from appl
<p><em>Big Data Analytics with Spark</em> is a step-by-step guide for learning Spark, which is an open-source fast and general-purpose cluster computing framework for large-scale data analysis. You will learn how to use Spark for different types of big data analytics projects, including batch, inter
This book is a step-by-step guide for learning how to use Spark for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. It covers Spark core and its add-on libraries, including Spark SQL, Spark Streaming, GraphX,
<p><span>This new volume addresses the growing interest in and use of big data analytics in many industries and in many research fields around the globe; it is a comprehensive resource on the core concepts of big data analytics and the tools, techniques, and methodologies. The book gives the why and