<p>Streaming data is a big deal in big data these days. As more and more businesses seek to tame the massive unbounded data sets that pervade our world, streaming systems have finally reached a level of maturity sufficient for mainstream adoption. With this practical guide, data engineers, data scie
Streaming Systems: The What, Where, When, and How of Large-Scale Data Processing
β Scribed by Tyler Akidau, Slava Chernyak, Reuven Lax
- Publisher
- O'Reilly Media
- Year
- 2018
- Tongue
- English
- Leaves
- 351
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Streaming data is a big deal in big data these days. As more and more businesses seek to tame the massive unbounded data sets that pervade our world, streaming systems have finally reached a level of maturity sufficient for mainstream adoption. With this practical guide, data engineers, data scientists, and developers will learn how to work with streaming data in a conceptual and platform-agnostic way.
Expanded from Tyler Akidauβs popular blog posts "Streaming 101" and "Streaming 102", this book takes you from an introductory level to a nuanced understanding of the what, where, when, and how of processing real-time data streams. Youβll also dive deep into watermarks and exactly-once processing with co-authors Slava Chernyak and Reuven Lax.
Youβll explore:
- How streaming and batch data processing patterns compare
- The core principles and concepts behind robust out-of-order data processing
- How watermarks track progress and completeness in infinite datasets
- How exactly-once data processing techniques ensure correctness
- How the concepts of streams and tables form the foundations of both batch and streaming data processing
- The practical motivations behind a powerful persistent state mechanism, driven by a real-world example
- How time-varying relations provide a link between stream processing and the world of SQL and relational algebra
π SIMILAR VOLUMES
Annotation<span class='showMoreLessContentElement' style='display: none;'><p>Streaming data is a big deal in big data these days, and for good reason. Businesses crave ever more timely data, and streaming is a good way to achieve lower latency. Plus, streaming is a much easier way to tame the massiv
<span><p>Data has cemented itself as a building block of daily life. However, surrounding oneself with great quantities of information heightens risks to ones personal privacy. Additionally, the presence of massive amounts of information prompts researchers into how best to handle and disseminate it
<p>This, the 11th issue of Transactions on Large-Scale Data- and Knowledge-Centered Systems, contains five selected papers focusing on Advanced Data Stream Management and Processing of Continuous Queries. The contributions cover different methods for avoiding unauthorized access to streaming data, m
Large Scale and Big data: Processing and Management provides readers with a central source of reference on the data management techniques currently available for large-scale data processing. Presenting chapters written by leading researchers, academics, and practitioners, it addresses the fundamenta
''This book provides a central source of reference on the various data management techniques of large scale data processing and its technology application. This book presents chapters written by leading researchers, academics, and practitioners in the field, all of which have been reviewed by indepe