𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Stream Processing: Hands\hyp{}on with Apache Flink

✍ Scribed by Giannis Polyzos


Publisher
Independently published
Year
2023
Tongue
English
Leaves
234
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Get onboard this journey into the land of streams. This is a complete hands-on book about Apache Flink, that follows real-life use cases and will help you learn how to create scalable end-to-end stream processing pipelines.

This is a complete hands-on book about Apache Flink.

The book follows real-life use cases and you will learn how to create end-to-end stream processing pipelines.

We will be using Redpanda and Apache Kafka - along with other technologies - so an understanding of Apache Kafka and Redpanda concepts like topics/partitions and producers/consumers is nice to have.

The book is designed to teach you the theory and the practicals as fast as possible. The reader should be able to get from zero to production-ready applications fast with enough practice on the concepts introduced in the book, along with having enough knowledge to debug and troubleshoot when things go wrong.

Hope you will enjoy it and use it as a guide in your journey in the land of streams.

✦ Table of Contents


Stream Processing: Hands\hyp{}on with Apache Flink
Stream Processing: Hands\hyp{}on with Apache Flink
Introduction
In the land of streams
The Streaming Layer: Redpanda
Flink’s Runtime
Summary
Streams and Tables
Streaming SQL Semantics
Flink SQL Logical Components
Running SQL Queries
Operators
The TableEnvironment
Summary
Watermarks & Windows
The Notion of Time
Time Windows
What is a Watermark?
How do watermarks work?
Watermark Generation
Watermark Propagation
Idle Sources
Summary
Streaming Joins
Introduction
Regular Joins
Interval Joins
Temporal Joins
Lookup Joins
Summary
User Defined Functions
Scalar Functions
Table Functions
Aggregate & Table Aggregate Functions
External Service Lookup UDF
Summary
The Datastream API
Sources
Datastream Operators
Merging Multiple Streams
Event Buffering & Enrichment
Handling Late Arriving Data
Summary
Fault Tolerance
Why the need for checkpoints?
Failure in Practise
Flink’s Checkpointing Algorithm
Aligned and Unaligned Checkpoints
Checkpoints vs. Savepoints
Summary
State Backends
State Backends
Using RocksDB
Inspecting RocksDB
Tuning and Troubleshooting
Summary
Monitoring and Troubleshooting
Metrics System
Prometheus and Grafana Setup
Setting up Flink Dashboards
Troubleshooting tips
Summary


πŸ“œ SIMILAR VOLUMES


Stream Processing with Apache Flink
✍ Hueske, Fabian;Kalavri, Vasiliki πŸ“‚ Library πŸ“… 2019 πŸ› O'Reilly Media Inc 🌐 English

With Early Release ebooks, you get books in their earliest form--the author's raw and unedited content as he or she writes--so you can take advantage of these technologies long before the official release of these titles. You'll also receive updates when significant changes are made, new chapters ar

Stream Processing with Apache Flink: Fun
✍ Fabian Hueske, Vasiliki Kalavri πŸ“‚ Library πŸ“… 2019 πŸ› O’Reilly Media 🌐 English

Get started with Apache Flink, the open source framework that powers some of the world’s largest stream processing applications. With this practical book, you’ll explore the fundamental concepts of parallel stream processing and discover how this technology differs from traditional batch data proces

Stream Processing with Apache Flink: Fun
✍ Fabian Hueske, Vasiliki Kalavri πŸ“‚ Library πŸ“… 2019 πŸ› O'Reilly Media 🌐 English

<div><p>Get started with Apache Flink, the open source framework that powers some of the world’s largest stream processing applications. With this practical book, you’ll explore the fundamental concepts of parallel stream processing and discover how this technology differs from traditional batch dat

Stream Processing with Apache Flink: Fun
✍ Fabian Hueske, Vasiliki Kalavri πŸ“‚ Library πŸ“… 2019 πŸ› O'Reilly Media 🌐 English

Get started with Apache Flink, the open source framework that powers some of the world’s largest stream processing applications. With this practical book, you’ll explore the fundamental concepts of parallel stream processing and discover how this technology differs from traditional batch data proces

Stream Processing with Apache Spark: Mas
✍ Gerard Maas, Francois Garillot πŸ“‚ Library πŸ“… 2019 πŸ› O’Reilly Media 🌐 English

Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. You’ll discover how Spark enables you to wr