𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Parallel Python with Dask: Perform distributed computing, concurrent programming and manage large dataset

✍ Scribed by Tim Peters


Publisher
Gitforgits
Year
2023
Tongue
English
Leaves
290
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Unlock the Power of Parallel Python with Dask: A Perfect Learning Guide for Aspiring Data Scientists

Dask has revolutionized parallel computing for Python, empowering data scientists to accelerate their workflows. This comprehensive guide unravels the intricacies of Dask to help you harness its capabilities for machine learning and data analysis.

Across 10 chapters, you'll master Dask's fundamentals, architecture, and integration with Python's scientific computing ecosystem.Step-by-step tutorials demonstrate parallel mapping, task scheduling, and leveraging Dask arrays for NumPy workloads.You'll discover how Dask seamlessly scales Pandas, Scikit-Learn, PyTorch, and other libraries for large datasets.

Dedicated chapters explore scaling regression, classification, hyperparameter tuning, feature engineering, and more with clear examples. You'll also learn to tap into the power of GPUs with Dask, RAPIDS, and Google JAX for orders of magnitude speedups.

This book places special emphasis on practical use cases related to scalability and distributed computing. You'll learn Dask patterns for cluster computing, managing resources efficiently, and robust data pipelines. The advanced chapters on DaskML and deep learning showcase how to build scalable models with PyTorch and TensorFlow.

With this book, you'll gain practical skills to

Accelerate Python workloads with parallel mapping and task scheduling

Speed up NumPy, Pandas, Scikit-Learn, PyTorch, and other libraries

Build scalable machine learning pipelines for large datasets

Leverage GPUs efficiently via Dask, RAPIDS and JAX

Manage Dask clusters and workflows for distributed computing

Streamline deep learning models with DaskML and DL frameworks

Packed with hands-on examples and expert insights, this book provides the complete toolkit to harness Dask's capabilities. It will empower Python programmers, data scientists, and machine learning engineers to achieve faster workflows and operationalize parallel computing.

✦ Table of Contents


Parallel Python with Dask
Chapter 1: Introduction to Dask
Chapter 2: Dask Fundamentals
Chapter 3: Batch Data Parallel Processing with Dask
Chapter 4: Distributed Systems and Dask
Chapter 5: Advanced Dask: APIs and Building Blocks
Chapter 6: Integrated Libraries: Dask with Pandas
Chapter 7: Integrated Libraries: Dask with Scikit-learn
Chapter 8: Integrated Libraries: Dask and PyTorch
Chapter 9: Dask with GPUs
Chapter 10: Scaling Machine Learning Projects with Dask
Thank You
Index


πŸ“œ SIMILAR VOLUMES


Mastering Large Datasets with Python: Pa
✍ John T. Wolohan πŸ“‚ Library πŸ“… 2020 πŸ› Manning 🌐 English

Modern data science solutions need to be clean, easy to read, and scalable. In Mastering Large Datasets with Python, author J.T. Wolohan teaches you how to take a small project and scale it up using a functionally influenced approach to Python coding. You’ll explore methods and built-in Python tools

Parallel and High Performance Programmin
✍ Fabio Nelli πŸ“‚ Library 🌐 English

<p><span>Unleash the capabilities of Python and its libraries for solving high performance computational problems.</span></p><p><span>Key Features </span></p><p><span>● Explores parallel programming concepts and techniques for high-performance computing.</span></p><p><span>● Covers parallel algorith

Concurrent, parallel and distributed com
✍ Adele Kuzmiakova πŸ“‚ Library πŸ“… 2023 πŸ› Arcler Press 🌐 English

The book "Concurrent, Parallel, and Distributed Computing" offers an excellent overview of the various areas of the computing field. There is a lot of overlap between the words "concurrent computing," "parallel computing," and "distributed computing," and there is no obvious differentiation between

Python Parallel Programming Cookbook: Ov
✍ Giancarlo Zaccone πŸ“‚ Library πŸ“… 2019 πŸ› Packt Publishing 🌐 English

Implement effective programming techniques in Python to build scalable software that saves time and memory Key Features β€’ Design distributed computing systems and massive computational tasks coherently β€’ Learn practical recipes with concise explanations that address development pain points enco

Topics in Parallel and Distributed Compu
✍ Sushil K. Prasad, Anshul Gupta, Arnold Rosenberg, Alan Sussman, Charles Weems πŸ“‚ Library πŸ“… 2018 πŸ› Springer International Publishing 🌐 English

<p><p></p><p>This book introduces beginning undergraduate students of computing and computational disciplines to modern parallel and distributed programming languages and environments, including map-reduce, general-purpose graphics processing units (GPUs), and graphical user interfaces (GUI) for mob

Hands-On GPU Programming with Python and
✍ Dr. Brian Tuomanen πŸ“‚ Library πŸ“… 2018 πŸ› Packt Publishing 🌐 English

Hands-On GPU Programming with Python and CUDA hits the ground running: you'll start by learning how to apply Amdahl's Law, use a code profiler to identify bottlenecks in your Python code, and set up an appropriate GPU programming environment. You'll then see how to β€œquery” the GPU's features and cop