Distributed TensorFlow: Scaling Deep Learning L...

December 28, 2017

84

Distributed TensorFlow: Scaling Deep Learning Library

#tensorflow #scale #distributed

mactiendinh

December 28, 2017

Tweet

More Decks by mactiendinh

See All by mactiendinh

Chapter 5: Good design = Flexible softwave

0

14

Overview chapter 4 Head First Object Oriented Design and Analysis

0

41

Overview chapter 3 Head First Object Oriented Design and Analysis

0

70

Other Decks in Technology

See All in Technology

能登半島災害現場エンジニアクロストーク【JAWS FESTA 2025 in 金沢】

0

830

大規模サーバーレスAPIの堅牢性・信頼性設計〜AWSのベストプラクティスから始まる現実的制約との向き合い方〜

10

4.7k

Digitization部紹介資料

PRO

1

5.5k

これがLambdaレス時代のChatOpsだ！実例で学ぶAmazon Q Developerカスタムアクション活用法

PRO

6

1.1k

20251014_Pythonを実務で徹底的に使いこなした話

0

200

いまからでも遅くない！SSL/TLS証明書超入門（It's not too late to start! SSL/TLS Certificates: The Absolute Beginner's Guide）

0

240

Node.js 2025: What's new and what's next

0

340

リセラー企業のテクサポ担当が考える、生成 AI 時代のトラブルシュート 2025

1

340

プレーリーカードを活用しよう❗❗デジタル名刺交換からはじまるイベント会場交流のススメ

0

160

やる気のない自分との向き合い方/How to Deal with Your Unmotivated Self

0

510

Introduction to Sansan for Engineers / エンジニア向け会社紹介

PRO

5

43k

20201008_ファインディ_品質意識を育てる役目は人かAIか___2_.pdf

findy_eventslides

2

640

Featured

See All Featured

A better future with KSS

239

18k

Balancing Empowerment & Direction

4

690

Fantastic passwords and where to find them - at NoRuKo

52

3.4k

Context Engineering - Making Every Token Count

6

250

Understanding Cognitive Biases in Performance Measurement

31

2.7k

Into the Great Unknown - MozCon

40

2.1k

No one is an island. Learnings from fostering a developers community.

21

3.5k

Refactoring Trust on Your Teams (GOTO; Chicago 2020)

35

3.2k

Rebuilding a faster, lazier Slack

84

9.2k

Gamification - CAS2011

81

5.5k

Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure

48

9.7k

How STYLIGHT went responsive

100

5.8k

Transcript

Distributed TensorFlow Tien Dinh
None
None
None
None
TensorFlow: Expressing High-Level ML Computations Core in C++ • Very
• low overhead Different • front ends for specifying/driving the computation Python • and C++ today, easy to add more
Computation is a dataflow graph Graph of Nodes • ,
called Operations or ops Edges are N • -dimensional arrays: Tensors
Computation is a dataflow graph WITH STATE
Computation is a dataflow graph Distributed
Computation is a dataflow graph Assign Devices to Ops •
TensorFlow inserts Send/Recv Ops to transport tensors across devices • Recv ops pull data from Send ops
Computation is a dataflow graph Assign Devices to Ops TensorFlow
inserts Send/Recv Ops to transport tensors across devices • Recv • ops pull data from Send ops
Distrubuted Training with TensorFlow
Distrubuted Training with TensorFlow
Model Parallelism = split model, share data
Distrubuted Training
Distrubuted Training with TensorFlow
Data Parallelism
Data Parallelism
Data Parallelism
Data Parallelism
Data Parallelism
Data Parallelism
Distributed training mechanisms Graph structure and low-level graph primitives (queues)
allow us to play with synchronous vs. asynchronous update algorithms.
Thanks for your attention!