Learning Apache Mahout: Acquire practical skills in Big Data Analytics and explore data science with Apache Mahout

✍ Scribed by Chandramani Tiwary


Publisher: Packt Publishing
Year: 2015
Tongue: English
Leaves: 250
Category: Library

✦ Synopsis


In the past few years, the generation of data and our capability to store and process it have grown exponentially. There is a need for scalable analytics frameworks, and for people with the right skills to extract the information they need from this Big Data. Apache Mahout is one of the first and most prominent Big Data machine learning platforms. It implements machine learning algorithms on top of distributed processing platforms such as Hadoop and Spark. Starting with the basics of Mahout and machine learning, you will explore prominent algorithms and their implementation in Mahout. You will learn about Mahout's building blocks, covering feature extraction, feature reduction, and the curse of dimensionality, and then delve into classification use cases with the random forest and Naïve Bayes classifiers, as well as item-based and user-based recommendation. You will then work with clustering in Mahout using the K-means algorithm and learn to use Mahout without MapReduce. Finish with a flourish by exploring end-to-end use cases on customer analytics and text analytics to gain practical, real-world know-how of analytics projects.

✦ Subjects


Computer science and computer engineering; Artificial intelligence; Data mining


📜 SIMILAR VOLUMES


Learning Apache Mahout Classification: B
✍ Ashish Gupta 📂 Library 📅 2015 🏛 Packt Publishing 🌐 English

This book is a practical guide that explains the classification algorithms provided in Apache Mahout with the help of actual examples. Starting with the introduction of classification and model evaluation techniques, we will explore Apache Mahout and learn why it is a good choice for classification.

Apache Mahout Essentials: Implement top-
✍ Jayani Withanawasam 📂 Library 📅 2015 🏛 Packt Publishing 🌐 English

Apache Mahout is a scalable machine learning library with algorithms for clustering, classification, and recommendations. It empowers users to analyze patterns in large, diverse, and complex datasets faster and more scalably. This book is an all-inclusive guide to analyzing large and complex datase

Apache Spark 2: Data Processing and Real
✍ Romeo Kienzler, Md. Rezaul Karim, Sridhar Alla, Siamak Amirghodsi, Meenakshi Raj 📂 Library 📅 2018 🏛 Packt Publishing 🌐 English

Build efficient data flow and machine learning programs with this flexible, multi-functional open-source cluster-computing framework. Key Features: Master the art of real-time big data processing and machine learning; Explore a wide range of use-cases to analyze

Scala Programming for Big Data Analytics
✍ Irfan Elahi 📂 Library 📅 2019 🏛 Apress 🌐 English

Gain the key language concepts and programming techniques of Scala in the context of big data analytics and Apache Spark. The book begins by introducing you to Scala and establishes a firm contextual understanding of why you should learn this language, how it stands in comparison to Java, and how