Data analytics with Hadoop: an introduction for data scientists
β Scribed by Bengfort, Benjamin;Kim, Jenny
- Publisher
- O'Reilly Media
- Year
- 2016
- Tongue
- English
- Edition
- First edition
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
The age of the data product -- An operating system for big data -- A framework for Python and Hadoop streaming -- In-memory computing with Spark -- Distributed analysis and patterns -- Data mining and warehousing -- Data ingestion -- Analytics with higher-level APIs -- Machine learning -- Summary : doing distributed data science.
β¦ Table of Contents
The age of the data product --
An operating system for big data --
A framework for Python and Hadoop streaming --
In-memory computing with Spark --
Distributed analysis and patterns --
Data mining and warehousing --
Data ingestion --
Analytics with higher-level APIs --
Machine learning --
Summary : doing distributed data science.
β¦ Subjects
Big data;Cluster analysis--Data processing;Electronic data processing--Distributed processing;Apache Hadoop;Cluster analysis -- Data processing;Electronic data processing -- Distributed processing
π SIMILAR VOLUMES
Ready to use statistical and machine-learning techniques across large data sets? This practical guide shows you why the Hadoop ecosystem is perfect for the job. Instead of deployment, operations, or software development usually associated with distributed computing, you'll focus on particular analys
<p>Ready to use statistical and machine-learning techniques across large data sets? This practical guide shows you why the Hadoop ecosystem is perfect for the job. Instead of deployment, operations, or software development usually associated with distributed computing, youβll focus on particular ana
xxiii, 304 pages : 25 cm
Data Science for Infectious Disease Data Analytics: An Introduction with R provides an overview of modern data science tools and methods that have been developed specifically to analyze infectious disease data. With a quick start guide to epidemiological data visualization and analysis in R, this bo
<span><p>This book providesΒ an undergraduate introduction to analysing data for data science, computer science, and quantitative social science students.Β It uniquely combines a hands-on approach to data analysis β supported by numerous real data examples and reusable [R] code β with a rigorous treat