Dask is a free and open source library for parallel computing in Python that helps you scale your data science and machine learning workflows. With this quick but thorough resource, data scientists and Python programmers will learn how Dask provides APIs that make it easy to parallelize PyData libra
Scaling Python with Dask (Sixth Early Release)
β Scribed by Holden Karau and Mika Kimmins
- Publisher
- O'Reilly Media, Inc.
- Year
- 2023
- Tongue
- English
- Leaves
- 26
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Dask is a free and open source library for parallel computing in Python that helps you scale your data science and machine learning workflows. With this quick but thorough resource, data scientists and Python programmers will learn how Dask provides APIs that make it easy to parallelize PyData libraries like NumPy, pandas, and scikit-learn.
Author Holden Karau shows you how you can use Dask computations in local systems and then scale to the cloud for heavier workloads. This practical book explains why Dask is popular among industry experts and academics and used by organizations that include Walmart, Capital One, Harvard Medical School, and NASA.
With this book, you'll learn about
What is Dask is, where you can use it, and how it compares to other tools
Batch data parallel processing
Key distributed system concepts for Dask users
Higher-level APIs and building blocks
Integrated libraries, such as scikit-learn, pandas, and PyTorch
π SIMILAR VOLUMES
The burgeoning volume and complexity of data make scalability and reliability increasingly challenging issues. But while modern systems contain multicore CPUs and GPUs that have the potential for parallel computing, many Python tools weren't designed to leverage this parallelism. Using Dask to paral
The burgeoning volume and complexity of data make scalability and reliability increasingly challenging issues. But while modern systems contain multicore CPUs and GPUs that have the potential for parallel computing, many Python tools weren't designed to leverage this parallelism. Using Dask to paral
Many of Excels 750 million users would like to do more with their data, such as repeating similar analyses over hundreds of files or combining the data in many files for analysis at one time. This practical guide shows ambitious non-programmers how to automate and scale data processing and analysis
OβReilly Media, 2013β 54 p. β ISBN: 1449367798, 9781449367794<div class="bb-sep"></div>With Early Release ebooks, you get books in their earliest form β the author's raw and unedited content as he or she writes β so you can take advantage of these technologies long before the official release of the