Dask is a free and open source library for parallel computing in Python that helps you scale your data science and machine learning workflows. With this quick but thorough resource, data scientists and Python programmers will learn how Dask provides APIs that make it easy to parallelize PyData libra
Scaling Python with Dask
โ Scribed by Holden Karau; Mika Kimmins
- Publisher
- O'Reilly Media, Inc.
- Year
- 2023
- Tongue
- English
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
Dask is a free and open source library for parallel computing in Python that helps you scale your data science and machine learning workflows. With this quick but thorough resource, data scientists and Python programmers will learn how Dask provides APIs that make it easy to parallelize PyData libraries like NumPy, pandas, and scikit-learn.
Author Holden Karau shows you how you can use Dask computations in local systems and then scale to the cloud for heavier workloads. This practical book explains why Dask is popular among industry experts and academics and used by organizations that include Walmart, Capital One, Harvard Medical School, and NASA.
With this book, you'll learn about
What is Dask is, where you can use it, and how it compares to other tools
Batch data parallel processing
Key distributed system concepts for Dask users
Higher-level APIs and building blocks
Integrated libraries, such as scikit-learn, pandas, and PyTorch
How to use Dask with GPUs
๐ SIMILAR VOLUMES
<b>Summary</b><br /><br />Dask is a native parallel analytics tool designed to integrate seamlessly with the libraries you're already using, including Pandas, NumPy, and Scikit-Learn. With Dask you can crunch and work with huge datasets, using the tools you already have. And<i>Data Science with Pyth
<p><span>Modern systems contain multicore CPUs and GPUs that have the potential for parallel computing. But many scientific Python tools were not designed to leverage this parallelism. With this short but thorough resource, data scientists and Python programmers will learn how the Dask open source l
<p><span>Modern systems contain multi-core CPUs and GPUs that have the potential for parallel computing. But many scientific Python tools were not designed to leverage this parallelism. With this short but thorough resource, data scientists and Python programmers will learn how the Dask open source
<p><span>Modern systems contain multi-core CPUs and GPUs that have the potential for parallel computing. But many scientific Python tools were not designed to leverage this parallelism. With this short but thorough resource, data scientists and Python programmers will learn how the Dask open source
Modern systems contain multi-core CPUs and GPUs that have the potential for parallel computing. But many scientific Python tools were not designed to leverage this parallelism. With this short but thorough resource, data scientists and Python programmers will learn how the Dask open source library f