𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Mastering Machine Learning with Spark 2.x

✍ Scribed by Alex Tellez, Max Pumperla, Michal Malohlava


Publisher
Packt Publishing
Year
2017
Tongue
English
Leaves
419
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Unlock the complexities of machine learning algorithms in Spark to generate useful data insights through this data analysis tutorial

About This Book

  • Process and analyze big data in a distributed and scalable way
  • Write sophisticated Spark pipelines that incorporate elaborate extraction
  • Build and use regression models to predict flight delays

Who This Book Is For

Are you a developer with a background in machine learning and statistics who is feeling limited by the current slow and β€œsmall data” machine learning tools? Then this is the book for you! In this book, you will create scalable machine learning applications to power a modern data-driven business using Spark. We assume that you already know the machine learning concepts and algorithms and have Spark up and running (whether on a cluster or locally) and have a basic knowledge of the various libraries contained in Spark.

What You Will Learn

  • Use Spark streams to cluster tweets online
  • Run the PageRank algorithm to compute user influence
  • Perform complex manipulation of DataFrames using Spark
  • Define Spark pipelines to compose individual data transformations
  • Utilize generated models for off-line/on-line prediction
  • Transfer the learning from an ensemble to a simpler Neural Network
  • Understand basic graph properties and important graph operations
  • Use GraphFrames, an extension of DataFrames to graphs, to study graphs using an elegant query language
  • Use K-means algorithm to cluster movie reviews dataset

In Detail

The purpose of machine

✦ Subjects


Data Mining;Databases & Big Data;Computers & Technology;Data Processing;Databases & Big Data;Computers & Technology


πŸ“œ SIMILAR VOLUMES


Mastering Machine Learning with Spark 2.
✍ Alex Tellez, Max Pumperla, Michal Malohlava πŸ“‚ Library πŸ“… 2017 πŸ› Packt Publishing 🌐 English

Unlock the complexities of machine learning algorithms in Spark to generate useful data insights through this data analysis tutorial About This Book β€’ Process and analyze big data in a distributed and scalable way β€’ Write sophisticated Spark pipelines that incorporate elaborate extraction β€’ Bu

Apache Spark 2.x Machine Learning Cookbo
✍ Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, Shuen Mei πŸ“‚ Library πŸ“… 2017 πŸ› Packt Publishing 🌐 English

<p><b>Simplify machine learning model implementations with Spark</b></p><h2>About This Book</h2><ul><li>Solve the day-to-day problems of data science with Spark</li><li>This unique cookbook consists of exciting and intuitive numerical recipes</li><li>Optimize your work by acquiring, cleaning, analyz

Machine Learning with Spark
✍ Nick Pentreath, Manpreet Singh Ghotra, Rajdeep Dua πŸ“‚ Library πŸ“… 2017 πŸ› Packt Publishing, Limited 🌐 English

Create scalable machine learning applications to power a modern data-driven business using Spark 2.xAbout This Book* Get to the grips with the latest version of Apache Spark* Utilize Spark's machine learning library to implement predictive analytics* Leverage Spark's powerful tools to load, analyze,

Machine Learning with Spark
✍ Nick Pentreath πŸ“‚ Library πŸ“… 2014 πŸ› Packt Publishing 🌐 English

<p><b>Create scalable machine learning applications to power a modern data-driven business using Spark</b></p> <h2>About This Book</h2><ul><li>A practical tutorial with real-world use cases allowing you to develop your own machine learning systems with Spark</li><li>Combine various techniques and mo