𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Stream Processing with Apache Spark: Mastering Structured Streaming and Spark Streaming

✍ Scribed by Gerard Maas, Francois Garillot


Publisher
O’Reilly Media
Year
2019
Tongue
English
Leaves
452
Edition
1
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. You’ll discover how Spark enables you to write streaming jobs in almost the same way you write batch jobs.

Authors Gerard Maas and FranΓ§ois Garillot help you explore the theoretical underpinnings of Apache Spark. This comprehensive guide features two sections that compare and contrast the streaming APIs Spark now supports: the original Spark Streaming library and the newer Structured Streaming API.

β€’ Learn fundamental stream processing concepts and examine different streaming architectures
β€’ Explore Structured Streaming through practical examples; learn different aspects of stream processing in detail
β€’ Create and operate streaming jobs and applications with Spark Streaming; integrate Spark Streaming with other Spark APIs
β€’ Learn advanced Spark Streaming techniques, including approximation algorithms and machine learning algorithms
β€’ Compare Apache Spark to other stream processing projects, including Apache Storm, Apache Flink, and Apache Kafka Streams

✦ Subjects


Cloud Computing; Machine Learning; Analytics; Apache Spark; Apache Storm; Monitoring; Stream Processing; Apache Kafka; Batch Processing; Clusters; Apache Flink; Spark SQL; Resilient Distributed Datasets; Performance Tuning; Spark Streaming; Lambda Architecture; Distributed Processing; Kappa Architecture; Structured Streaming


πŸ“œ SIMILAR VOLUMES


Stream Processing with Apache Spark: Mas
✍ Gerard Maas, Francois Garillot πŸ“‚ Library πŸ“… 2019 πŸ› O'Reilly Media 🌐 English

Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. You’ll discover how Spark enables you to wr

Beginning Apache Spark 3: With DataFrame
✍ Hien Luu πŸ“‚ Library πŸ“… 2021 πŸ› Apress 🌐 English

<p>Take a journey toward discovering, learning, and using Apache Spark 3.0. In this book, you will gain expertise on the powerful and efficient distributed data processing engine inside of Apache Spark; its user-friendly, comprehensive, and flexible programming model for processing data in batch and

Beginning Apache Spark 3: With DataFrame
✍ Hien Luu πŸ“‚ Library πŸ“… 2021 πŸ› Apress 🌐 English

<p>Take a journey toward discovering, learning, and using Apache Spark 3.0. In this book, you will gain expertise on the powerful and efficient distributed data processing engine inside of Apache Spark; its user-friendly, comprehensive, and flexible programming model for processing data in batch and

Apache Spark 2: Data Processing and Real
✍ Romeo Kienzler, Md. Rezaul Karim, Sridhar Alla, Siamak Amirghodsi, Meenakshi Raj πŸ“‚ Library πŸ“… 2018 πŸ› Packt Publishing 🌐 English

<p><b>Build efficient data flow and machine learning programs with this flexible, multi-functional open-source cluster-computing framework</b></p> <h4>Key Features</h4> <ul><li>Master the art of real-time big data processing and machine learning </li> <li>Explore a wide range of use-cases to analyze

Beginning Apache Spark 2: With Resilient
✍ Hien Luu πŸ“‚ Library πŸ“… 2018 πŸ› Apress 🌐 English

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work wi

Beginning Apache Spark 2: With Resilient
✍ Hien Luu πŸ“‚ Library πŸ“… 2018 πŸ› Apress 🌐 English

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work wi