Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Rancherでkubeflow構築
Search
nakayamam
March 16, 2019
Technology
3
19k
Rancherでkubeflow構築
Rancher Meetup #07 in Osakaでの発表資料です
nakayamam
March 16, 2019
Tweet
Share
More Decks by nakayamam
See All by nakayamam
rancher/system-toolsを試してみた
nakayamam
0
380
Other Decks in Technology
See All in Technology
【新卒研修資料】LLM・生成AI研修 / Large Language Model・Generative AI
brainpadpr
23
17k
o11yで育てる、強い内製開発組織
_awache
3
120
KAGのLT会 #8 - 東京リージョンでGAしたAmazon Q in QuickSightを使って、報告用の資料を作ってみた
0air
0
200
成長自己責任時代のあるきかた/How to navigate the era of personal responsibility for growth
kwappa
3
270
Goにおける 生成AIによるコード生成の ベンチマーク評価入門
daisuketakeda
2
100
業務自動化プラットフォーム Google Agentspace に入門してみる #devio2025
maroon1st
0
190
PLaMo2シリーズのvLLM実装 / PFN LLM セミナー
pfn
PRO
2
970
LLMアプリケーション開発におけるセキュリティリスクと対策 / LLM Application Security
flatt_security
7
1.8k
Exadata Database Service on Dedicated Infrastructure(ExaDB-D) UI スクリーン・キャプチャ集
oracle4engineer
PRO
2
5.4k
Modern_Data_Stack最新動向クイズ_買収_AI_激動の2025年_.pdf
sagara
0
200
いま注目しているデータエンジニアリングの論点
ikkimiyazaki
0
590
Goに育てられ開発者向けセキュリティ事業を立ち上げた僕が今向き合う、AI × セキュリティの最前線 / Go Conference 2025
flatt_security
0
350
Featured
See All Featured
RailsConf 2023
tenderlove
30
1.2k
StorybookのUI Testing Handbookを読んだ
zakiyama
31
6.2k
Learning to Love Humans: Emotional Interface Design
aarron
274
40k
Unsuck your backbone
ammeep
671
58k
I Don’t Have Time: Getting Over the Fear to Launch Your Podcast
jcasabona
33
2.5k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
35
3.2k
Into the Great Unknown - MozCon
thekraken
40
2.1k
Building Adaptive Systems
keathley
43
2.8k
Docker and Python
trallard
46
3.6k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
132
19k
Done Done
chrislema
185
16k
Transcript
RancherͰkubeflowߏங Rancher Meetup #07 in Osaka Masaki-Nakayama
ࣗݾհ • Masaki-Nakayama @nakayamam2 • KAGOYA JAPAN • Rancher Meetup,
CNJP Kansai
kubeflowʁ IUUQTXXXLVCFqPXPSHEPDTBCPVULVCFqPX
kubeflow? KubernetesͷͨΊͷػցֶशπʔϧΩοτ
kubeflow? ԼهͷํʹΦεεϝ by ެࣜ • TensorFlowϞσϧΛ͞·͟·ͳڥʢϩʔΧϧɺΦϯϓ ϨɺΫϥυͳͲʣͰτϨʔχϯά/ఏڙ͍ͨ͠ • TensorFlowτϨʔχϯάδϣϒΛཧ͢ΔͨΊʹJupyter ϊʔτϒοΫΛ͍͍ͨ
• TensorFlowΛଞͷϓϩηεͱΈ߹Θ͍ͤͨ
kubeflow? kubeflowͷμογϡϘʔυ
kubeflow? Լهͷͷ͕͋Β͔͡Ίೖ͍ͬͯ·͢ • JupyterHub : Jupyter NotebookʹϢʔβʔೝূՃͯ͠ෳਓͰ͑ΔΑ ͏ʹͨ͠ͷ • TFjob
Dashboard: k8sͰTensorFlowτϨʔχϯάδϣϒΛཧͰ͖Δ • Katib Dashboard: ϋΠύʔύϥϝʔλʔνϡʔχϯάͷπʔϧ https://www.slideshare.net/Oshima0x3fd/katib ͕ৄͦ͠͏
ߏங·ͰಓͷΓͦ͏ɾɾɾ
͋ΔRancherΧλϩάΛݟ͍ͯΔͱɾɾ
͋ʂ
None
͔ͯ͠͠Chainer͙͑͢Δɾɾɾʁ
ྲྀΕ 1. GPUΫϥελʔͷߏங on GKE 2. ΫϥελʔΛRancherΠϯϙʔτ 3. RancherΧλϩάͰkubeflowσϓϩΠ ※ͪͳΈʹGKEʹೖΕΔ͚ͩͳΒઐ༻ͷϫϯΫϦοΫσϓϩΠ͕༻ҙ͞ΕͯΔͷͰ
ͦͪΒΛ͏ํ͕ૣ͍͔ https://deploy.kubeflow.cloud/#/
GPUΫϥελʔͷߏங on GKE
GPUબΜͰ࡞ KZVQZUFSIVC͕Ϧιʔε Λ৯͏ͷͰεϖοΫ͕͋ Μ·Γ͍ͱࢮʹ·͢
GPUΫϥελʔ͕Ͱ͖·ͨ͠
ΫϥελʔΛ RancherΠϯϙʔτ
None
ࣗݾॺ໊ͳͷͰͪ͜Β Λ࣮ߦ DMVTUFSBENJOΛϢʔβʔ ʹCJOEJOH
ΠϯϙʔτͰ͖·ͨ͠
RancherΧλϩάͰ kubeflowσϓϩΠ
None
֤ػೳͷ0/0''͕Ͱ͖ΔͬΆ͍ʁ
֤ػೳͷ0/0''͕Ͱ͖ΔͬΆ͍ʁ
None
6*
None
Ϣʔβʔ࡞͢Δ
ΠϝʔδΛࢦఆͯ͠TQBXO
ϫʔΫεϖʔε͕࡞͞ΕΔ
ϫʔΫεϖʔε͕ग़དྷ্͕Δ
QZUIPOίʔυΛ͙࣮͢ߦͰ͖Δ
None
UFOTPSqPXͷδϣϒͷ࡞͕Ͱ͖Δ
> kubectl get crd NAME AGE backendconfigs.cloud.google.com 2h scalingpolicies.scalingpolicy.kope.io 2h
studyjobs.kubeflow.org 1h tfjobs.kubeflow.org 1h ͋Εɺchainer operator͕͍ͳ͍ɾɾɾ
None
{"log":"2019/03/14 17:57:17 info: manifest \"kubeflow/templates/ chainer-rbac.yaml\" is empty. Skipping. \n","stream":"stderr","time":"2019-03-14T17:57:17.702047111Z"}
{"log":"2019/03/14 17:57:17 info: manifest \"kubeflow/templates/ chainer-crd.yaml\" is empty. Skipping. \n","stream":"stderr","time":"2019-03-14T17:57:17.702898257Z"} {"log":"2019/03/14 17:57:17 info: manifest \"kubeflow/templates/ chainer-operator.yaml\" is empty. Skipping. \n","stream":"stderr","time":"2019-03-14T17:57:17.702904447Z"} RancherαʔόʔͷϩάΛݟΔͱ Ͳ͏manifestʹө͞Ε͍ͯͳ͍Α͏ͩ
issueग़͠ͱ͖·ͨ͠ʂ
Thanks!