Data lakes and warehouses have become increasingly fragile, costly, and difficult to maintain as data gets bigger and moves faster. Data meshes can help your organization decentralize data, giving ownership back to the engineers who produced it. This book provides a concise yet comprehensive overvie
Streaming Data Mesh (8th Early Release)
โ Scribed by Hubert Dulay and Stephen Mooney
- Publisher
- O'Reilly Media, Inc.
- Year
- 2023
- Tongue
- English
- Leaves
- 257
- Edition
- 8
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
Data lakes and warehouses have become increasingly fragile, costly, and difficult to maintain as data gets bigger and moves faster. Data meshes can help your organization decentralize data, giving ownership back to the engineers who produced it. This book provides a concise yet comprehensive overview of data mesh patterns for streaming and real-time data services.
Authors Hubert Dulay and Stephen Mooney examine the vast differences between streaming and batch data meshes. Data engineers, architects, data product owners, and those in DevOps and MLOps roles will learn steps for implementing a streaming data mesh, from defining a data domain to building a good data product. Through the course of the book, youโll create a complete self-service data platform and devise a data governance system that enables your mesh to work seamlessly.
With this book, you will
Design a streaming data mesh using Kafka
Learn how to identify a domain
Build your first data product using self-service tools
Apply data governance to the data products you create
Learn the differences between synchronous and asynchronous data services
Implement self-services that support decentralized data
๐ SIMILAR VOLUMES
Between major privacy regulations like the GDPR and CCPA and expensive and notorious data breaches, there has never been so much pressure for data scientists to ensure data privacy. Unfortunately, integrating privacy into your data science workflow is still complicated. This essential guide will giv
Quickly detect, troubleshoot, and prevent a wide range of data issues through data observability, a set of best practices that enable data teams to gain greater visibility of data and its usage. If you're a data engineer, data architect, or Machine Learning engineer, or if the quality of your work d
Your training data has as much to do with the success of your data project as the algorithms themselves--most failures in Deep Learning systems relate to training data. But while training data is the foundation for successful Machine Learning, there are few comprehensive resources to help you ace th
<p><span>Data lakes and warehouses have become increasingly fragile, costly, and difficult to maintain as data gets bigger and moves faster. Data meshes can help your organization decentralize data, giving ownership back to the engineers who produced it. This book provides a concise yet comprehensiv
<p><span>Data lakes and warehouses have become increasingly fragile, costly, and difficult to maintain as data gets bigger and moves faster. Data meshes can help your organization decentralize data, giving ownership back to the engineers who produced it. This book provides a concise yet comprehensiv