Many enterprises are investing in a next-generation data lake, hoping to democratize data at scale to provide business insights and ultimately make automated intelligent decisions. In this practical book, author Zhamak Dehghani reveals that, despite the time, money, and effort poured into them, data
Practical DataOps: Delivering Agile Data Science At Scale
โ Scribed by Harvinder Atwal
- Publisher
- Apress
- Year
- 2020
- Tongue
- English
- Leaves
- 289
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
Gain a practical introduction to DataOps, a new discipline for delivering data science at scale inspired by practices at companies such as Facebook, Uber, LinkedIn, Twitter, and eBay. Organizations need more than the latest AI algorithms, hottest tools, and best people to turn data into insight-driven action and useful analytical data products. Processes and thinking employed to manage and use data in the 20th century are a bottleneck for working effectively with the variety of data and advanced analytical use cases that organizations have today. This book provides the approach and methods to ensure continuous rapid use of data to create analytical data products and steer decision making. Practical DataOps shows you how to optimize the data supply chain from diverse raw data sources to the final data product, whether the goal is a machine learning model or other data-orientated output. The book provides an approach to eliminate wasted effort and improve collaboration between data producers, data consumers, and the rest of the organization through the adoption of lean thinking and agile software development principles.
This book helps you to improve the speed and accuracy of analytical application development through data management and DevOps practices that securely expand data access, and rapidly increase the number of reproducible data products through automation, testing, and integration. The book also shows how to collect feedback and monitor performance to manage and continuously improve your processes and output.
โฆ Table of Contents
Front Matter ....Pages i-xxviii
Front Matter ....Pages 1-1
The Problem with Data Science (Harvinder Atwal)....Pages 3-26
Data Strategy (Harvinder Atwal)....Pages 27-53
Front Matter ....Pages 55-55
Lean Thinking (Harvinder Atwal)....Pages 57-83
Agile Collaboration (Harvinder Atwal)....Pages 85-112
Build Feedback and Measurement (Harvinder Atwal)....Pages 113-137
Front Matter ....Pages 139-139
Building Trust (Harvinder Atwal)....Pages 141-159
DevOps for DataOps (Harvinder Atwal)....Pages 161-189
Organizing for DataOps (Harvinder Atwal)....Pages 191-211
Front Matter ....Pages 213-213
DataOps Technology (Harvinder Atwal)....Pages 215-247
The DataOps Factory (Harvinder Atwal)....Pages 249-266
Back Matter ....Pages 267-275
โฆ Subjects
Database Management
๐ SIMILAR VOLUMES
<div><p>Many enterprises are investing in a next-generation data lake, hoping to democratize data at scale to provide business insights and ultimately make automated intelligent decisions. In this practical book, author Zhamak Dehghani reveals that, despite the time, money, and effort poured into th
<p><span>Many enterprises are investing in a next-generation data lake, hoping to democratize data at scale to provide business insights and ultimately make automated intelligent decisions. In this practical book, author Zhamak Dehghani reveals that, despite the time, money, and effort poured into t
We're at an inflection point in data, where our data management solutions no longer match the complexity of organizations, the proliferation of data sources, and the scope of our aspirations to get value from data with AI and analytics. In this practical book, author Zhamak Dehghani introduces data
Mining big data requires a deep investment in people and time. How can you be sure youre building the right models? With this hands-on book, youll learn a flexible toolset and methodology for building effective analytics applications with Hadoop.<br>Using lightweight tools such as Python, Apache Pig