Scalable Scientific Computing using Dask

Pandas and NumPy are great tools for exploring data, doing analysis, and training machine learning models. They provide intuitive APIs and superb performance. Sadly, both are restricted to the main memory of a single machine and, for the most part, to a single CPU. Dask is a flexible tool for parallelizing NumPy and Pandas code on a single machine or a cluster.
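
As a rough illustration of what parallelizing NumPy code with Dask can look like, here is a minimal sketch that is not taken from the talk itself. It assumes `dask` and `numpy` are installed; the array size and chunk size are arbitrary values chosen for the example.

```python
import numpy as np
import dask.array as da

# NumPy: the whole array lives in memory and is evaluated eagerly.
x_np = np.arange(1_000_000, dtype="float64")
mean_np = x_np.mean()

# Dask: the same data, split into 10 chunks of 100,000 elements each.
# (Sizes are illustrative assumptions, not recommendations.)
x_da = da.arange(1_000_000, chunks=100_000, dtype="float64")

# Calling .mean() only builds a task graph; nothing is computed yet.
mean_lazy = x_da.mean()

# .compute() executes the graph; independent chunks may run in parallel.
mean_da = mean_lazy.compute()
```

The Dask version mirrors the NumPy API closely. The main visible differences are the `chunks` argument, which controls how the array is partitioned, and the explicit `.compute()` call, which triggers execution.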

Uwe L. Korn

October 24, 2018