𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Scaling Python with Dask (Sixth Early Release)

✍ Scribed by Holden Karau and Mika Kimmins


Publisher
O'Reilly Media, Inc.
Year
2023
Tongue
English
Leaves
26
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Dask is a free and open source library for parallel computing in Python that helps you scale your data science and machine learning workflows. With this quick but thorough resource, data scientists and Python programmers will learn how Dask provides APIs that make it easy to parallelize PyData libraries like NumPy, pandas, and scikit-learn.

Author Holden Karau shows you how you can use Dask computations in local systems and then scale to the cloud for heavier workloads. This practical book explains why Dask is popular among industry experts and academics and used by organizations that include Walmart, Capital One, Harvard Medical School, and NASA.

With this book, you'll learn about
What is Dask is, where you can use it, and how it compares to other tools
Batch data parallel processing
Key distributed system concepts for Dask users
Higher-level APIs and building blocks
Integrated libraries, such as scikit-learn, pandas, and PyTorch


πŸ“œ SIMILAR VOLUMES


Scaling Python with Dask
✍ Holden Karau; Mika Kimmins πŸ“‚ Library πŸ“… 2023 πŸ› O'Reilly Media, Inc. 🌐 English

Dask is a free and open source library for parallel computing in Python that helps you scale your data science and machine learning workflows. With this quick but thorough resource, data scientists and Python programmers will learn how Dask provides APIs that make it easy to parallelize PyData libra

Dask: The Definitive Guide - Scalable Py
✍ Matthew Rocklin, Matthew Powers, Richard Pelgrim πŸ“‚ Library πŸ“… 2022 πŸ› O’Reilly Media, Inc. 🌐 English

The burgeoning volume and complexity of data make scalability and reliability increasingly challenging issues. But while modern systems contain multicore CPUs and GPUs that have the potential for parallel computing, many Python tools weren't designed to leverage this parallelism. Using Dask to paral

Dask: The Definitive Guide - Scalable Py
✍ Matthew Rocklin, Matthew Powers, Richard Pelgrim πŸ“‚ Library πŸ“… 2022 πŸ› O’Reilly Media, Inc. 🌐 English

The burgeoning volume and complexity of data make scalability and reliability increasingly challenging issues. But while modern systems contain multicore CPUs and GPUs that have the potential for parallel computing, many Python tools weren't designed to leverage this parallelism. Using Dask to paral

Foundations for Analytics with Python (E
✍ Clinton W. Brownley πŸ“‚ Library πŸ“… 2016 πŸ› O'Reilly Media 🌐 English

Many of Excels 750 million users would like to do more with their data, such as repeating similar analyses over hundreds of files or combining the data in many files for analysis at one time. This practical guide shows ambitious non-programmers how to automate and scale data processing and analysis

Test-Driven Development with Python: Ear
✍ Percival H. πŸ“‚ Library 🌐 English

O’Reilly Media, 2013– 54 p. – ISBN: 1449367798, 9781449367794<div class="bb-sep"></div>With Early Release ebooks, you get books in their earliest form β€” the author's raw and unedited content as he or she writes β€” so you can take advantage of these technologies long before the official release of the