"Extract, transform, load" (ETL) is at the center of every application of data, from business intelligence to AI. Recent shifts in the data landscapeβincluding the emergence of lakehouse architectures and the rising importance of high-scale real-time dataβmean that today's data practitioners must ap
Understanding ETL: Data Pipelines for Modern Data Architectures (Early Release)
β Scribed by Matt Palmer
- Publisher
- O'Reilly Media, Inc.
- Year
- 2024
- Tongue
- English
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Extract, transform, load (ETL) is at the center of every application of data, from business intelligence to AI. This technical guide offers data engineers, engineering managers, and architects an overview of the modern ETL process, along with the challenges you're likely to face and the strategic patterns that will help you overcome them. You will be equipped to make informed decisions when implementing ETL and choose the technology stack that will help you succeed.
π SIMILAR VOLUMES
Extract, transform, load (ETL) is at the center of every application of data, from business intelligence to AI. This technical guide offers data engineers, engineering managers, and architects an overview of the modern ETL process, along with the challenges you're likely to face and the strategic pa
If you haven't modernized your data cleaning and reporting processes in Microsoft Excel, you're missing out on big productivity gains. And if you're looking to conduct rigorous data analysis, more can be done in Excel than you think. This practical book serves as an introduction to the modern Excel
Data management is an emerging and disruptive subject. Datafication is everywhere. This transformation is happening all around us: in smartphones, TV devices, ereaders, industrial machines, self-driving cars, robots, and so on. Itβs changing our lives at an accelerating speed. As data management
Data science happens in code. The ability to write reproducible, robust, scaleable code is key to a data science project's successβand is absolutely essential for those working with production code. This practical book bridges the gap between data science and software engineering, clearly explaining
Build scalable and reliable data ecosystems using Data Mesh, Databricks Spark, and Kafka Key Features Develop modern data skills used in emerging technologies Learn pragmatic design methodologies such as Data Mesh and data lakehouses Gain a deeper understanding of data governance Purchase of