Practical Enterprise Data Lake Insights
Scribed by Saurabh Gupta, Venkata Giri
- Publisher: Apress
- Year: 2018
- Tongue: English
- Leaves: 335
- Edition: 1st ed.
- Category: Library
Synopsis
Use this practical guide to successfully handle the challenges encountered when designing an enterprise data lake and learn industry best practices to resolve issues.
When designing an enterprise data lake, you often hit a roadblock when you must leave the comfort of the relational world and learn the nuances of handling non-relational data. Starting from sourcing data into the Hadoop ecosystem, you will work through stages that raise tough questions about data processing, data querying, and security. Concepts such as change data capture and data streaming are covered. The book takes an end-to-end solution approach in a data lake environment that includes data security, high availability, data processing, data streaming, and more.
Each chapter includes application of a concept, code snippets, and use case demonstrations to provide you with a practical approach. You will learn the concept, scope, application, and starting point.
What You'll Learn
- Get to know data lake architecture and design principles
- Implement data capture and streaming strategies
- Implement data processing strategies in Hadoop
- Understand the data lake security framework and availability model
Who This Book Is For
Big data architects and solution architects
Table of Contents
Front Matter
Introduction to Enterprise Data Lakes (Saurabh Gupta, Venkata Giri)
Data Lake Ingestion Strategies (Saurabh Gupta, Venkata Giri)
Capture Streaming Data with Change-Data-Capture (Saurabh Gupta, Venkata Giri)
Data Processing Strategies in Data Lakes (Saurabh Gupta, Venkata Giri)
Data Archiving Strategies in Data Lakes (Saurabh Gupta, Venkata Giri)
Data Security in Data Lakes (Saurabh Gupta, Venkata Giri)
Ensure High Availability of Data Lake (Saurabh Gupta, Venkata Giri)
Managing Data Lake Operations (Saurabh Gupta, Venkata Giri)
Back Matter
Subjects
Computer Science; Big Data; Computer Applications; Big Data/Analytics
SIMILAR VOLUMES
The data lake is a daring new approach for harnessing the power of big data technology and providing convenient self-service capabilities. But is it right for your company? This book is based on discussions with practitioners and executives from more than a hundred organizations, ranging from data-d