Content, tutorials, and more on how to use Dask effectively.

Dask is a flexible open-source Python library for parallel computing. Dask scales Python code from multi-core local machines to large distributed clusters in the cloud. Dask provides a familiar user interface by mirroring the APIs of other libraries in the PyData ecosystem including Pandas, Scikit-learn, and NumPy. It also exposes low-level APIs that help programmers run custom algorithms in parallel.

Dask was created by Matthew Rocklin in 2014 and is used by retail, financial, and governmental organizations, as well as life science and geophysical institutes.