Big Data Analytics with Spark is a step-by-step guide for learning Spark, which is an open-source, fast, general-purpose cluster computing framework for large-scale data analysis. You will learn how to use Spark for different types of big data analytics projects, including batch, inter
Scala Programming for Big Data Analytics: Get Started With Big Data Analytics Using Apache Spark
Scribed by Irfan Elahi
- Publisher
- Apress
- Year
- 2019
- Tongue
- English
- Leaves
- 315
- Edition
- 1st ed.
- Category
- Library
Synopsis
Gain the key language concepts and programming techniques of Scala in the context of big data analytics and Apache Spark. The book begins by introducing you to Scala and establishes a firm contextual understanding of why you should learn this language, how it stands in comparison to Java, and how Scala is related to Apache Spark for big data analytics. Next, you'll set up the Scala environment ready for examining your first Scala programs. This is followed by sections on Scala fundamentals including mutable/immutable variables, the type hierarchy system, control flow expressions and code blocks.
The author discusses functions at length and highlights a number of associated concepts such as functional programming and anonymous functions. The book then delves deeper into Scala's powerful collections system because many of Apache Spark's APIs bear a strong resemblance to Scala collections.
Along the way you'll see the development life cycle of a Scala program. This involves compiling and building programs using the industry-standard Scala Build Tool (SBT). You'll cover guidelines related to dependency management using SBT as this is critical for building large Apache Spark applications. Scala Programming for Big Data Analytics concludes by demonstrating how you can make use of the concepts to write programs that run on the Apache Spark framework. These programs will provide distributed and parallel computing, which is critical for big data analytics.
What You Will Learn
- See the fundamentals of Scala as a general-purpose programming language
- Understand functional programming and object-oriented programming constructs in Scala
- Use Scala collections and functions
- Develop, package and run Apache Spark applications for big data analytics
Who This Book Is For
Data scientists, data analysts and data engineers who intend to use Apache Spark for large-scale analytics.
Table of Contents
Front Matter, pp. i-xxi
Scala Language (Irfan Elahi), pp. 1-13
Installing Scala (Irfan Elahi), pp. 15-26
Using the Scala Shell (Irfan Elahi), pp. 27-44
Variables (Irfan Elahi), pp. 45-55
Data Types (Irfan Elahi), pp. 57-79
Conditional Statements (Irfan Elahi), pp. 81-93
Code Blocks (Irfan Elahi), pp. 95-101
Functions (Irfan Elahi), pp. 103-122
Collections (Irfan Elahi), pp. 123-164
Loops (Irfan Elahi), pp. 165-173
Classes and Packages (Irfan Elahi), pp. 175-201
Exception Handling (Irfan Elahi), pp. 203-221
Building and Packaging (Irfan Elahi), pp. 223-260
Hello Apache Spark (Irfan Elahi), pp. 261-299
Back Matter, pp. 301-306
Subjects
Business and Management; Big Data/Analytics; Programming Languages, Compilers, Interpreters; Open Source
SIMILAR VOLUMES
This book is a step-by-step guide for learning how to use Spark for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. It covers Spark core and its add-on libraries, including Spark SQL, Spark Streaming, GraphX,
Standard tutorial-based approach. "Getting Started with Greenplum for Big Data Analytics" is great for data scientists and data analysts with a basic knowledge of Data Warehousing and Business Intelligence platforms who are new to Big Data and who are looking to get a good grounding in how to use the
This book is designed for anyone who wants to get started with Apache Impala. The book covers SQL queries and data manipulation for Apache Impala. The following is a list of highlighted topics: * Introduction to Apache Impala * Working with Apache Impala Shell * SQL Querying with Apache Hue and A
Build efficient data flow and machine learning programs with this flexible, multi-functional open-source cluster-computing framework. Key Features: * Master the art of real-time big data processing and machine learning * Explore a wide range of use-cases to analyze