𝔖 Scriptorium
✦   LIBER   ✦

Scala Programming for Big Data Analytics: Get Started With Big Data Analytics Using Apache Spark

✍ Scribed by Irfan Elahi


Publisher: Apress
Year: 2019
Tongue: English
Leaves: 315
Edition: 1st ed.
Category: Library

✦ Synopsis


Gain the key language concepts and programming techniques of Scala in the context of big data analytics and Apache Spark. The book begins by introducing you to Scala and establishes a firm contextual understanding of why you should learn this language, how it stands in comparison to Java, and how Scala is related to Apache Spark for big data analytics. Next, you’ll set up the Scala environment ready for examining your first Scala programs. This is followed by sections on Scala fundamentals including mutable/immutable variables, the type hierarchy system, control flow expressions and code blocks.
The author discusses functions at length and highlights a number of associated concepts such as functional programming and anonymous functions. The book then delves deeper into Scala’s powerful collections system because many of Apache Spark’s APIs bear a strong resemblance to Scala collections.
Along the way you’ll see the development life cycle of a Scala program. This involves compiling and building programs using the industry-standard Scala Build Tool (SBT). You’ll cover guidelines related to dependency management using SBT as this is critical for building large Apache Spark applications. Scala Programming for Big Data Analytics concludes by demonstrating how you can make use of the concepts to write programs that run on the Apache Spark framework. These programs will provide distributed and parallel computing, which is critical for big data analytics.
What You Will Learn

  • See the fundamentals of Scala as a general-purpose programming language
  • Understand functional programming and object-oriented programming constructs in Scala
  • Use Scala collections and functions
  • Develop, package and run Apache Spark applications for big data analytics
Who This Book Is For
Data scientists, data analysts and data engineers who intend to use Apache Spark for large-scale analytics.

✦ Table of Contents


Front Matter ....Pages i-xxi
Scala Language (Irfan Elahi)....Pages 1-13
Installing Scala (Irfan Elahi)....Pages 15-26
Using the Scala Shell (Irfan Elahi)....Pages 27-44
Variables (Irfan Elahi)....Pages 45-55
Data Types (Irfan Elahi)....Pages 57-79
Conditional Statements (Irfan Elahi)....Pages 81-93
Code Blocks (Irfan Elahi)....Pages 95-101
Functions (Irfan Elahi)....Pages 103-122
Collections (Irfan Elahi)....Pages 123-164
Loops (Irfan Elahi)....Pages 165-173
Classes and Packages (Irfan Elahi)....Pages 175-201
Exception Handling (Irfan Elahi)....Pages 203-221
Building and Packaging (Irfan Elahi)....Pages 223-260
Hello Apache Spark (Irfan Elahi)....Pages 261-299
Back Matter ....Pages 301-306

✦ Subjects


Business and Management; Big Data/Analytics; Programming Languages, Compilers, Interpreters; Open Source


📜 SIMILAR VOLUMES


Big Data Analytics with Spark: A Practit
✍ Mohammed Guller 📂 Library 📅 2015 🏛 Apress 🌐 English

Big Data Analytics with Spark is a step-by-step guide for learning Spark, an open-source, fast, general-purpose cluster computing framework for large-scale data analysis. You will learn how to use Spark for different types of big data analytics projects, including batch, inter

Big Data Analytics with Spark: A Practit
✍ Mohammed Guller 📂 Library 📅 2016 🏛 Apress 🌐 English

This book is a step-by-step guide for learning how to use Spark for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. It covers Spark core and its add-on libraries, including Spark SQL, Spark Streaming, GraphX,

Getting started with Greenplum for big d
✍ Gollapudi, Sunila 📂 Library 📅 2013 🏛 Packt Publishing 🌐 English

Standard tutorial-based approach. "Getting Started with Greenplum for Big Data Analytics" is great for data scientists and data analysts with a basic knowledge of data warehousing and business intelligence platforms who are new to big data and who are looking to get a good grounding in how to use the

Getting Started with Big Data Query usin
✍ Agus Kurniawan 📂 Library 📅 2021 🏛 Lulu Press, Inc 🌐 English

This book is designed for anyone who wants to learn how to get started with Apache Impala. It covers SQL queries and data manipulation for Apache Impala. Highlighted topics include: * Introduction to Apache Impala * Working with Apache Impala Shell * SQL Querying with Apache Hue and A

Apache Spark 2: Data Processing and Real
✍ Romeo Kienzler, Md. Rezaul Karim, Sridhar Alla, Siamak Amirghodsi, Meenakshi Raj 📂 Library 📅 2018 🏛 Packt Publishing 🌐 English

Build efficient data flow and machine learning programs with this flexible, multi-functional open-source cluster-computing framework. Key Features: * Master the art of real-time big data processing and machine learning * Explore a wide range of use-cases to analyze