๐”– Scriptorium
โœฆ   LIBER   โœฆ

๐Ÿ“

Practical Data Science with Hadoop and Spark: Designing and Building Effective Analytics at Scale (Addison-Wesley Data & Analytics)

โœ Scribed by Ofer Mendelevitch, Casey Stella, Douglas Eadline


Publisher
Addison-Wesley Professional
Tongue
English
Category
Library

โฌ‡  Acquire This Volume

No coin nor oath required. For personal study only.

โœฆ Synopsis


The Complete Guide to Data Science with Hadoopโ€”For Technical Professionals, Businesspeople, and Students

ย 

Demand is soaring for professionals who can solve real data science problems with Hadoop and Spark. Practical Data Science with Hadoopยฎ and Spark is your complete guide to doing just that. Drawing on immense experience with Hadoop and big data, three leading experts bring together everything you need: high-level concepts, deep-dive techniques, real-world use cases, practical applications, and hands-on tutorials.

ย 

The authors introduce the essentials of data science and the modern Hadoop ecosystem, explaining how Hadoop and Spark have evolved into an effective platform for solving data science problems at scale. In addition to comprehensive application coverage, the authors also provide useful guidance on the important steps of data ingestion, data munging, and visualization.

ย 

Once the groundwork is in place, the authors focus on specific applications, including machine learning, predictive modeling for sentiment analysis, clustering for document analysis, anomaly detection, and natural language processing (NLP).

ย 

This guide provides a strong technical foundation for those who want to do practical data science, and also presents business-driven guidance on how to apply Hadoop and Spark to optimize ROI of data science initiatives.

ย 

Learn

  • What data science is, how it has evolved, and how to plan a data science career
  • How data volume, variety, and velocity shape data science use cases
  • Hadoop and its ecosystem, including HDFS, MapReduce, YARN, and Spark
  • Data importation with Hive and Spark
  • Data quality, preprocessing, preparation, and modeling
  • Visualization: surfacing insights from huge data sets
  • Machine learning: classification, regression, clustering, and anomaly detection
  • Algorithms and Hadoop tools for predictive modeling
  • Cluster analysis and similarity functions
  • Large-scale anomaly detection
  • NLP: applying data science to human language

๐Ÿ“œ SIMILAR VOLUMES


Practical Data Science with Hadoop and S
โœ Ofer Mendelevitch, Casey Stella, Douglas Eadline ๐Ÿ“‚ Library ๐Ÿ“… 2017 ๐Ÿ› Addison-Wesley ๐ŸŒ English

<span><!--[if gte mso 9]><xml> </xml><![endif]--> <p><b>The Complete Guide to Data Science with Hadoopโ€•For Technical Professionals, Businesspeople, and Students</b></p> <p>ย </p> <p>Demand is soaring for professionals who can solve real data science problems with Hadoop and Spark. <i><b>Practical Dat

Agile data science: building data analyt
โœ Russell Jurney ๐Ÿ“‚ Library ๐Ÿ“… 2013 ๐Ÿ› O'Reilly Media ๐ŸŒ English

Mining big data requires a deep investment in people and time. How can you be sure youโ€™re building the right models? With this hands-on book, youโ€™ll learn a flexible toolset and methodology for building effective analytics applications with Hadoop. Using lightweight tools such as Python, Apache Pig

Agile data science: building data analyt
โœ Russell Jurney ๐Ÿ“‚ Library ๐Ÿ“… 2013 ๐Ÿ› O'Reilly Media ๐ŸŒ English

Mining big data requires a deep investment in people and time. How can you be sure youโ€™re building the right models? With this hands-on book, youโ€™ll learn a flexible toolset and methodology for building effective analytics applications with Hadoop.<br><br>Using lightweight tools such as Python, Apac

Agile Data Science: Building Data Analyt
โœ Russell Jurney ๐Ÿ“‚ Library ๐Ÿ“… 2013 ๐Ÿ› O'Reilly Media ๐ŸŒ English

Mining big data requires a deep investment in people and time. How can you be sure you're building the right models? With this hands-on book, you'll learn a flexible toolset and methodology for building effective analytics applications with Hadoop. Using lightweight tools such as Python, Apache Pig

Pandas for Everyone: Python Data Analysi
โœ Daniel Chen ๐Ÿ“‚ Library ๐Ÿ“… 2023 ๐Ÿ› Addison-Wesley Professional ๐ŸŒ English

<p><span>Manage and Automate Data Analysis with Pandas in Python</span></p><p><span>ย </span></p><p><span>Today, analysts must manage data characterized by extraordinary variety, velocity, and volume. Using the open source Pandas library, you can use Python to rapidly automate and perform virtually a

Pandas for Everyone: Python Data Analysi
โœ Daniel Y. Chen ๐Ÿ“‚ Library ๐Ÿ“… 2022 ๐Ÿ› Addison-Wesley Professional ๐ŸŒ English

<p><span>Manage and Automate Data Analysis with Pandas in Python</span></p><p><span>Today, analysts must manage data characterized by extraordinary variety, velocity, and volume. Using the open source Pandas library, you can use Python to rapidly automate and perform virtually any data analysis task