Fr. 73.00

Parallel Python with Dask - Perform distributed computing, concurrent programming and manage large dataset

English · Paperback / Softback

Shipping usually within 2 to 3 weeks (title will be printed to order)

Description

Read more










Unlock the Power of Parallel Python with Dask: A Perfect Learning Guide for Aspiring Data Scientists

Dask has revolutionized parallel computing for Python, empowering data scientists to accelerate their workflows. This comprehensive guide unravels the intricacies of Dask to help you harness its capabilities for machine learning and data analysis.

Across 10 chapters, you'll master Dask's fundamentals, architecture, and integration with Python's scientific computing ecosystem. Step-by-step tutorials demonstrate parallel mapping, task scheduling, and leveraging Dask arrays for NumPy workloads. You'll discover how Dask seamlessly scales Pandas, Scikit-Learn, PyTorch, and other libraries for large datasets.

Dedicated chapters explore scaling regression, classification, hyperparameter tuning, feature engineering, and more with clear examples. You'll also learn to tap into the power of GPUs with Dask, RAPIDS, and Google JAX for orders of magnitude speedups.

This book places special emphasis on practical use cases related to scalability and distributed computing. You'll learn Dask patterns for cluster computing, managing resources efficiently, and robust data pipelines. The advanced chapters on DaskML and deep learning showcase how to build scalable models with PyTorch and TensorFlow.

With this book, you'll gain practical skills to:Accelerate Python workloads with parallel mapping and task scheduling
Speed up NumPy, Pandas, Scikit-Learn, PyTorch, and other libraries
Build scalable machine learning pipelines for large datasets
Leverage GPUs efficiently via Dask, RAPIDS and JAX
Manage Dask clusters and workflows for distributed computing
Streamline deep learning models with DaskML and DL frameworks

Packed with hands-on examples and expert insights, this book provides the complete toolkit to harness Dask's capabilities. It will empower Python programmers, data scientists, and machine learning engineers to achieve faster workflows and operationalize parallel computing.

Table of ContentIntroduction to Dask
Dask Fundamentals
Batch Data Parallel Processing with Dask
Distributed Systems and Dask
Advanced Dask: APIs and Building Blocks
Dask with Pandas
Dask with Scikit-learn
Dask and PyTorch
Dask with GPUs
Scaling Machine Learning Projects with Dask

Product details

Authors Tim Peters
Publisher GitforGits
 
Languages English
Product format Paperback / Softback
Released 19.10.2023
 
EAN 9788119177653
ISBN 978-81-19177-65-3
No. of pages 174
Dimensions 191 mm x 235 mm x 10 mm
Weight 338 g
Subject Natural sciences, medicine, IT, technology > IT, data processing > Programming languages

Customer reviews

No reviews have been written for this item yet. Write the first review and be helpful to other users when they decide on a purchase.

Write a review

Thumbs up or thumbs down? Write your own review.

For messages to CeDe.ch please use the contact form.

The input fields marked * are obligatory

By submitting this form you agree to our data privacy statement.