Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
入門AlphaGo
Search
na-o-ys
April 22, 2016
Technology
3.8k
5
Share
入門AlphaGo
"Mastering the game of Go with deep neural networks and tree search" の概要
na-o-ys
April 22, 2016
More Decks by na-o-ys
See All by na-o-ys
IoTと監視
naoys
1
830
RubyとJIT
naoys
0
180
将棋盤を画像認識したかった
naoys
0
1.6k
Rust で乗り換え案内
naoys
0
650
疎行列と Jaccard 類似度の高速計算
naoys
1
670
有理数集合の濃度
naoys
2
160
YARVの最適化について調べた
naoys
0
160
転職会議サービスのAWS移行記録
naoys
0
91
Anonymous Recursion in C++
naoys
0
440
Other Decks in Technology
See All in Technology
はじめてのDatadog
kairim0
0
180
Datadog 認定試験の概要と対策
uechishingo
0
150
eBPF Can Do It! A 5-Minute Tour of 5 Real-World PHP Issues Solved with eBPF
egmc
0
330
AI-DLCを活用した高品質・安全なAI駆動開発実践 / AI Driven Development
yoshidashingo
0
210
GitHub Copilot のこれまでとこれから: From Copilot to Collaborative Agents
yuriemori
1
230
JICUG あなたのAI駆動開発パートナー IBM Bob を使ったアプリ開発
1ftseabass
PRO
0
120
テストコードのないプロジェクトにテストを根付かせる
tttol
0
220
コードレビューを制するチームがソフトウェアデリバリーのフローを制す / Beyond Code Review: Distributing Its Responsibilities Across the SDLC
mtx2s
1
300
イベントで大活躍する電子ペーパー名札 〜その3〜 / ビジュアルプログラミングIoTLT vol.23
you
PRO
0
160
Copilot CLI・IDE・Web・スマホで途切れない開発フローを目指して / One Copilot flow - CLI IDE Web Mobile
aeonpeople
1
1.1k
oracle-to-databricks-migration-with-llm-and-dbt
casek
0
350
Gradle×GitHub_ActionsでCI時間を約50%短縮 ジョブ分割の設計と落とし穴 / Cutting CI Time by ~50% with Gradle and GitHub Actions: Job-Splitting Design and Pitfalls
takatty
0
510
Featured
See All Featured
Optimising Largest Contentful Paint
csswizardry
37
3.7k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
133
19k
Measuring & Analyzing Core Web Vitals
bluesmoon
9
840
Discover your Explorer Soul
emna__ayadi
2
1.1k
How GitHub (no longer) Works
holman
316
150k
Balancing Empowerment & Direction
lara
6
1.1k
Unsuck your backbone
ammeep
672
58k
Test your architecture with Archunit
thirion
1
2.2k
What Being in a Rock Band Can Teach Us About Real World SEO
427marketing
0
240
Bash Introduction
62gerente
615
210k
Winning Ecommerce Organic Search in an AI Era - #searchnstuff2025
aleyda
1
2k
Amusing Abliteration
ianozsvald
1
180
Transcript
ೖAlphaGo 0x64ޠ ୈ07 “AI” @na_o_ys
͝ҙ • จʹॻ͔Ε͍ͯͳ͍ಠࣗௐࠪਪଌؚ͕· Ε·͢ • Ұߟͩͱࢥͬͯݟ͍ͯͩ͘͞
AlphaGoͱ • ॳΊͯϓϩع࢜ΛഁͬͨғޟAI
ୈҰ෦: AlphaGoʹࢸΔ·Ͱ
શใήʔϜ • ΦηϩɺνΣεɺকعɺғޟɺetc • ϥϯμϜੑ͕ແ͘ɺ࠷ળख͕ଘࡏ͢Δ • (ݪཧతʹ) ઌखඞউɾޙखඞউɾҾ͖͚
ήʔϜ • શ୳ࡧͰ࠷ળख͕ٻ·Δ • ܭࢉྔతʹෆՄೳ … ݱہ໘ 1खޙ 2खޙ
ධՁؔ • ൫໘Λ༩͑ΔͱείΞ (༧উͳͲ) Λฦؔ͢ • কعνΣεͳΒɺۨͷଛಘޮ͖ͷΛݩʹܭࢉ • ήʔϜͷ୳ࡧൣғ(ਂ͞)ΛݶఆͰ͖Δ ݱہ໘
1खޙ 2खޙ ධՁˠ 0.1 0.8 0.3 0.4
ධՁؔͷ༗ޮੑ • ύϥϝʔλͷબఆɾઃఆ͕ΩϞ • ख࡞ۀ: νΣεͰਓؒΛ͑ͨ • ػցֶश: কعͰਓؒΛ͑ͨ •
ғޟෳࡶੑͷͨΊʹ·ͱͳධՁؔΛ࡞Εͳ͔ͬ ͨ
ݪ࢝ϞϯςΧϧϩ๏ • ϥϯμϜʹऴہ·Ͱଧͭ (ϩʔϧΞτ) Λ܁Γฦͯ͠ɺউΛܭ ࢉ͢Δํ๏ ϩʔϧΞτΛ܁Γฦͯ͠ উΛܭࢉ উ 7/10
উ 3/10
ϞϯςΧϧϩ୳ࡧ (MCTS) • ݪ࢝ϞϯςΧϧϩ๏ΛධՁؔతʹ͏ • n खઌͰϩʔϧΞτ • ༿ͷউΛܭࢉ ※͞Βʹ༿ͷউʹԠͯ͡ಈతʹࢬמΓɾల։͠ɺ୳ࡧਫ਼Λ্͛Δ
ϙϦγʔؔ • f (ہ໘, ࣍ͷҰख) • ࣍ͷҰखͷࣗવ͞Λ͋ΒΘ֬͢ີؔ • عේσʔλ͔Βͷֶश͕༰қ •
ϩʔϧΞτ࣌ʹ͑Δ • ϥϯμϜʹଧͭͷͰͳ͘ɺ·ͱͳखΛଧͨͤΔ • ͨͩ͠ߴʹಈ࡞͢Δඞཁ͕͋Δ
MCTSͷڧ͞ • ϙϦγʔؔͷͳͲͰΞϚνϡΞߴஈʹඖఢ͢Δڧ͞· Ͱਐา • ϓϩʹٴͳ͍ • େہ؍ʹ༏ΕΔ • ʮڱ͘ਂ͍ಡΈʯ͕ऑ͍
• खΛ͘ಡΉͨΊ
AlphaGo͕ͬͨ͜ͱ • جຊMCTS • ༷ʑͳ • CNN(ΈࠐΈχϡʔϥϧωοτϫʔΫ) • ڧԽֶश •
ධՁؔ • ฒྻࢄΞϧΰϦζϜ • MCTS ʹͦΕΒΛΈࠐΜͩ
ୈೋ෦: AlphaGo
2ͭͷϙϦγʔؔͱ 1ͭͷධՁؔ ϩʔϧΞτϙϦγʔ ϩʔϧΞτʹ͏ ߴɾਫ਼ 4-ϙϦγʔ ୳ࡧॱংΛܾΊΔ ɾߴਫ਼ ධՁؔ ༿ͷධՁ(উ)Λܭࢉ
ϩʔϧΞτʹΑΔউͱ͠߹ΘͤΔ
ϩʔϧΞτϙϦγʔ • ϩʔϧΞτ(ϥϯμϜϓϨΠ)ʹ͏ϙϦγʔؔ • ߴੑɹʼɹਫ਼ • ਓؒͷعේ800ສہ໘͔Βֶश • ઢܗιϑτϚοΫεؔ •
2ϚΠΫϩඵ (ߴ) • عේͱͷࢦ͠खҰக: 24.2%
SLϙϦγʔ • ͷ୳ࡧॱংΛܾΊΔϙϦγʔؔ • ਫ਼ɹʼɹߴੑ • ਓؒͷعේ3000ສہ໘͔Βֶश • 13CNN(ΈࠐΈχϡʔϥϧωοτϫʔΫ) •
ը૾ೝࣝͰΑ͘ΘΕΔ • : 3ϛϦඵ • عේͱͷࢦ͠खҰக: 57%
ධՁؔ • 14CNN • SLϙϦγʔΛڧԽֶशͨ͠ͷ (RLϙϦγʔ) Λݩʹɺճؼͯ͠࡞Δ 4-ϙϦγʔ 3-ϙϦγʔ ධՁؔ
1. ڧԽֶश 2. ϥϯμϜعේੜ (3000ສہ໘) 3. ճؼ
ධՁؔͷଊ͑ํ • ϩʔϧΞτʹΑΔউܭࢉΛิ͏ͷ • ୯ମͰͦ͜·Ͱڧ͘ͳ͍ • ධՁؔͷಛ (ߟ) • ʮڱ͘ਂ͍ಡΈʯʹڧ͍
• ʮRLϙϦγʔ(ڧԽֶश݁Ռ)Λऴہ·ͰଧͨͤͨࡍͷউʯͱՁ • େہ؍͕ແ͍ • Ұຊಓ͔͠ಡ·ͳ͍ .$54ͷಛੑ େہ؍ʹ༏Εͯʮਂ͍ಡΈʯ͕ऑ͍ ͱ ͏·͘ิ͍͍͋ͬͯΔ
ڧ͞ (2015/10࣌)
ڧ͞ (2016/3 ࣌) R3500+ ͷΠɾηυϧʹউ
ࢀߟ • Mastering the game of Go with deep neural
networks and tree search (http://www.nature.com/nature/journal/v529/n7587/full/ nature16961.html) • Google AlphaGoͷΈΛཧղ͢Δ | IT Leaders (http://it.impressbm.co.jp/articles/-/13474)
ऴΘΓ