Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. Youβll discover how Spark enables you to wr
Stream Processing with Apache Spark: Mastering Structured Streaming and Spark Streaming
β Scribed by Gerard Maas, Francois Garillot
- Publisher
- OβReilly Media
- Year
- 2019
- Tongue
- English
- Leaves
- 452
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. Youβll discover how Spark enables you to write streaming jobs in almost the same way you write batch jobs.
Authors Gerard Maas and FranΓ§ois Garillot help you explore the theoretical underpinnings of Apache Spark. This comprehensive guide features two sections that compare and contrast the streaming APIs Spark now supports: the original Spark Streaming library and the newer Structured Streaming API.
β’ Learn fundamental stream processing concepts and examine different streaming architectures
β’ Explore Structured Streaming through practical examples; learn different aspects of stream processing in detail
β’ Create and operate streaming jobs and applications with Spark Streaming; integrate Spark Streaming with other Spark APIs
β’ Learn advanced Spark Streaming techniques, including approximation algorithms and machine learning algorithms
β’ Compare Apache Spark to other stream processing projects, including Apache Storm, Apache Flink, and Apache Kafka Streams
β¦ Subjects
Cloud Computing; Machine Learning; Analytics; Apache Spark; Apache Storm; Monitoring; Stream Processing; Apache Kafka; Batch Processing; Clusters; Apache Flink; Spark SQL; Resilient Distributed Datasets; Performance Tuning; Spark Streaming; Lambda Architecture; Distributed Processing; Kappa Architecture; Structured Streaming
π SIMILAR VOLUMES
<p>Take a journey toward discovering, learning, and using Apache Spark 3.0. In this book, you will gain expertise on the powerful and efficient distributed data processing engine inside of Apache Spark; its user-friendly, comprehensive, and flexible programming model for processing data in batch and
<p>Take a journey toward discovering, learning, and using Apache Spark 3.0. In this book, you will gain expertise on the powerful and efficient distributed data processing engine inside of Apache Spark; its user-friendly, comprehensive, and flexible programming model for processing data in batch and
<p><b>Build efficient data flow and machine learning programs with this flexible, multi-functional open-source cluster-computing framework</b></p> <h4>Key Features</h4> <ul><li>Master the art of real-time big data processing and machine learning </li> <li>Explore a wide range of use-cases to analyze
Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work wi
Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work wi