Lock in $30 Savings on PRO—Offer Ends Soon! ⏳
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Deploying Dask Distributed
Search
Jacob Tomlinson
May 19, 2021
Technology
0
290
Deploying Dask Distributed
Jacob Tomlinson
May 19, 2021
Tweet
Share
More Decks by Jacob Tomlinson
See All by Jacob Tomlinson
EffVer - Version your code by the effort required to upgrade
jacobtomlinson
0
26
Tech Exeter - Intro to Kubernetes 10 Year Update
jacobtomlinson
0
46
Who Builds the PyData Ecosystem?
jacobtomlinson
0
68
The Art of Wrangling Your GPU Python Environments
jacobtomlinson
0
64
Getting science done with accelerated Python computing platforms
jacobtomlinson
0
65
Dask on HPC in 2024 - Lightning Talk
jacobtomlinson
0
80
GPU Acceleration in the PyData community
jacobtomlinson
0
79
Dask on HPC in 2024
jacobtomlinson
0
58
GPU Acceleration in the PyData community
jacobtomlinson
0
57
Other Decks in Technology
See All in Technology
MLflowで始めるプロンプト管理、評価、最適化
databricksjapan
1
230
SSO方式とJumpアカウント方式の比較と設計方針
yuobayashi
7
680
AIプラットフォームにおけるMLflowの利用について
lycorptech_jp
PRO
1
150
re:Invent 2025 ふりかえり 生成AI版
takaakikakei
1
210
5分で知るMicrosoft Ignite
taiponrock
PRO
0
370
Database イノベーショントークを振り返る/reinvent-2025-database-innovation-talk-recap
emiki
0
180
第4回 「メタデータ通り」 リアル開催
datayokocho
0
130
エンジニアリングをやめたくないので問い続ける
estie
2
1.2k
re:Invent 2025 ~何をする者であり、どこへいくのか~
tetutetu214
0
220
Lookerで実現するセキュアな外部データ提供
zozotech
PRO
0
110
乗りこなせAI駆動開発の波
eltociear
1
1.1k
MapKitとオープンデータで実現する地図情報の拡張と可視化
zozotech
PRO
1
140
Featured
See All Featured
Agile that works and the tools we love
rasmusluckow
331
21k
What’s in a name? Adding method to the madness
productmarketing
PRO
24
3.8k
Context Engineering - Making Every Token Count
addyosmani
9
510
Navigating Team Friction
lara
191
16k
Optimising Largest Contentful Paint
csswizardry
37
3.5k
Scaling GitHub
holman
464
140k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
26
3.2k
Testing 201, or: Great Expectations
jmmastey
46
7.8k
Large-scale JavaScript Application Architecture
addyosmani
515
110k
Java REST API Framework Comparison - PWX 2021
mraible
34
9k
GraphQLの誤解/rethinking-graphql
sonatard
73
11k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.6k
Transcript
Deployment Workshop Deploying Dask Distributed Jacob Tomlinson
Dask Distributed A centrally managed, distributed, dynamic task scheduler
Dask Overview
None
Worker Worker Worker Scheduler Client Protocols TCP UCX Websocket Dask
components can communicate via a variety of different protocols.
Scheduler Starting a scheduler
Connecting a worker Worker Scheduler
Client Scheduler Worker Connecting a client
Client Scheduler Worker Submitting work
Dask Dashboard
JupyterLab Extension
Cluster Managers Utility classes to simplify cluster creation
Local Cluster Scheduler Worker Worker Worker Worker LocalCluster creates everything
for you. It will break down a large CPU into multiple workers withy multiple threads as this can be more performant.
Client Local Cluster Scheduler Worker Worker Worker Worker
Get logs
Scaling
How do I get more resource? Moving beyond a single
machine
SSH ... You could SSH to a bunch of machines
and start the Dask components manually.
SSHCluster Or you could use SSHCluster which will bootstrap a
cluster for you on a list of machines. All you need is passwordless SSH configured for each machine.
None
Deployment Workshop Thank you! @_jacobtomlinson