Lock in $30 Savings on PRO—Offer Ends Soon! ⏳
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
入門AlphaGo
Search
na-o-ys
April 22, 2016
Technology
5
3.8k
入門AlphaGo
"Mastering the game of Go with deep neural networks and tree search" の概要
na-o-ys
April 22, 2016
Tweet
Share
More Decks by na-o-ys
See All by na-o-ys
IoTと監視
naoys
1
800
RubyとJIT
naoys
0
170
将棋盤を画像認識したかった
naoys
0
1.6k
Rust で乗り換え案内
naoys
0
630
疎行列と Jaccard 類似度の高速計算
naoys
1
640
有理数集合の濃度
naoys
2
140
YARVの最適化について調べた
naoys
0
140
転職会議サービスのAWS移行記録
naoys
0
78
Anonymous Recursion in C++
naoys
0
430
Other Decks in Technology
See All in Technology
日本の AI 開発と世界の潮流 / GenAI Development in Japan
hariby
1
250
ExpoのインダストリーブースでみたAWSが見せる製造業の未来
hamadakoji
0
190
通勤手当申請チェックエージェント開発のリアル
whisaiyo
3
410
マイクロサービスへの5年間 ぶっちゃけ何をしてどうなったか
joker1007
18
7.5k
2025年 開発生産「可能」性向上報告 サイロ解消からチームが能動性を獲得するまで/ 20251216 Naoki Takahashi
shift_evolve
PRO
2
220
Bedrock AgentCore Evaluationsで学ぶLLM as a judge入門
shichijoyuhi
2
190
Oracle Database@Google Cloud:サービス概要のご紹介
oracle4engineer
PRO
1
760
Microsoft Agent Frameworkの可観測性
tomokusaba
1
100
Strands Agents × インタリーブ思考 で変わるAIエージェント設計 / Strands Agents x Interleaved Thinking AI Agents
takanorig
4
1.9k
LayerX QA Night#1
koyaman2
0
240
まだ間に合う! Agentic AI on AWSの現在地をやさしく一挙おさらい
minorun365
17
2.5k
『君の名は』と聞く君の名は。 / Your name, you who asks for mine.
nttcom
1
110
Featured
See All Featured
<Decoding/> the Language of Devs - We Love SEO 2024
nikkihalliwell
0
100
The Organizational Zoo: Understanding Human Behavior Agility Through Metaphoric Constructive Conversations (based on the works of Arthur Shelley, Ph.D)
kimpetersen
PRO
0
200
実際に使うSQLの書き方 徹底解説 / pgcon21j-tutorial
soudai
PRO
196
70k
Effective software design: The role of men in debugging patriarchy in IT @ Voxxed Days AMS
baasie
0
170
Leveraging LLMs for student feedback in introductory data science courses - posit::conf(2025)
minecr
0
88
Data-driven link building: lessons from a $708K investment (BrightonSEO talk)
szymonslowik
1
850
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
54k
SEOcharity - Dark patterns in SEO and UX: How to avoid them and build a more ethical web
sarafernandez
0
88
Making the Leap to Tech Lead
cromwellryan
135
9.7k
Statistics for Hackers
jakevdp
799
230k
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
21
1.3k
Un-Boring Meetings
codingconduct
0
160
Transcript
ೖAlphaGo 0x64ޠ ୈ07 “AI” @na_o_ys
͝ҙ • จʹॻ͔Ε͍ͯͳ͍ಠࣗௐࠪਪଌؚ͕· Ε·͢ • Ұߟͩͱࢥͬͯݟ͍ͯͩ͘͞
AlphaGoͱ • ॳΊͯϓϩع࢜ΛഁͬͨғޟAI
ୈҰ෦: AlphaGoʹࢸΔ·Ͱ
શใήʔϜ • ΦηϩɺνΣεɺকعɺғޟɺetc • ϥϯμϜੑ͕ແ͘ɺ࠷ળख͕ଘࡏ͢Δ • (ݪཧతʹ) ઌखඞউɾޙखඞউɾҾ͖͚
ήʔϜ • શ୳ࡧͰ࠷ળख͕ٻ·Δ • ܭࢉྔతʹෆՄೳ … ݱہ໘ 1खޙ 2खޙ
ධՁؔ • ൫໘Λ༩͑ΔͱείΞ (༧উͳͲ) Λฦؔ͢ • কعνΣεͳΒɺۨͷଛಘޮ͖ͷΛݩʹܭࢉ • ήʔϜͷ୳ࡧൣғ(ਂ͞)ΛݶఆͰ͖Δ ݱہ໘
1खޙ 2खޙ ධՁˠ 0.1 0.8 0.3 0.4
ධՁؔͷ༗ޮੑ • ύϥϝʔλͷબఆɾઃఆ͕ΩϞ • ख࡞ۀ: νΣεͰਓؒΛ͑ͨ • ػցֶश: কعͰਓؒΛ͑ͨ •
ғޟෳࡶੑͷͨΊʹ·ͱͳධՁؔΛ࡞Εͳ͔ͬ ͨ
ݪ࢝ϞϯςΧϧϩ๏ • ϥϯμϜʹऴہ·Ͱଧͭ (ϩʔϧΞτ) Λ܁Γฦͯ͠ɺউΛܭ ࢉ͢Δํ๏ ϩʔϧΞτΛ܁Γฦͯ͠ উΛܭࢉ উ 7/10
উ 3/10
ϞϯςΧϧϩ୳ࡧ (MCTS) • ݪ࢝ϞϯςΧϧϩ๏ΛධՁؔతʹ͏ • n खઌͰϩʔϧΞτ • ༿ͷউΛܭࢉ ※͞Βʹ༿ͷউʹԠͯ͡ಈతʹࢬמΓɾల։͠ɺ୳ࡧਫ਼Λ্͛Δ
ϙϦγʔؔ • f (ہ໘, ࣍ͷҰख) • ࣍ͷҰखͷࣗવ͞Λ͋ΒΘ֬͢ີؔ • عේσʔλ͔Βͷֶश͕༰қ •
ϩʔϧΞτ࣌ʹ͑Δ • ϥϯμϜʹଧͭͷͰͳ͘ɺ·ͱͳखΛଧͨͤΔ • ͨͩ͠ߴʹಈ࡞͢Δඞཁ͕͋Δ
MCTSͷڧ͞ • ϙϦγʔؔͷͳͲͰΞϚνϡΞߴஈʹඖఢ͢Δڧ͞· Ͱਐา • ϓϩʹٴͳ͍ • େہ؍ʹ༏ΕΔ • ʮڱ͘ਂ͍ಡΈʯ͕ऑ͍
• खΛ͘ಡΉͨΊ
AlphaGo͕ͬͨ͜ͱ • جຊMCTS • ༷ʑͳ • CNN(ΈࠐΈχϡʔϥϧωοτϫʔΫ) • ڧԽֶश •
ධՁؔ • ฒྻࢄΞϧΰϦζϜ • MCTS ʹͦΕΒΛΈࠐΜͩ
ୈೋ෦: AlphaGo
2ͭͷϙϦγʔؔͱ 1ͭͷධՁؔ ϩʔϧΞτϙϦγʔ ϩʔϧΞτʹ͏ ߴɾਫ਼ 4-ϙϦγʔ ୳ࡧॱংΛܾΊΔ ɾߴਫ਼ ධՁؔ ༿ͷධՁ(উ)Λܭࢉ
ϩʔϧΞτʹΑΔউͱ͠߹ΘͤΔ
ϩʔϧΞτϙϦγʔ • ϩʔϧΞτ(ϥϯμϜϓϨΠ)ʹ͏ϙϦγʔؔ • ߴੑɹʼɹਫ਼ • ਓؒͷعේ800ສہ໘͔Βֶश • ઢܗιϑτϚοΫεؔ •
2ϚΠΫϩඵ (ߴ) • عේͱͷࢦ͠खҰக: 24.2%
SLϙϦγʔ • ͷ୳ࡧॱংΛܾΊΔϙϦγʔؔ • ਫ਼ɹʼɹߴੑ • ਓؒͷعේ3000ສہ໘͔Βֶश • 13CNN(ΈࠐΈχϡʔϥϧωοτϫʔΫ) •
ը૾ೝࣝͰΑ͘ΘΕΔ • : 3ϛϦඵ • عේͱͷࢦ͠खҰக: 57%
ධՁؔ • 14CNN • SLϙϦγʔΛڧԽֶशͨ͠ͷ (RLϙϦγʔ) Λݩʹɺճؼͯ͠࡞Δ 4-ϙϦγʔ 3-ϙϦγʔ ධՁؔ
1. ڧԽֶश 2. ϥϯμϜعේੜ (3000ສہ໘) 3. ճؼ
ධՁؔͷଊ͑ํ • ϩʔϧΞτʹΑΔউܭࢉΛิ͏ͷ • ୯ମͰͦ͜·Ͱڧ͘ͳ͍ • ධՁؔͷಛ (ߟ) • ʮڱ͘ਂ͍ಡΈʯʹڧ͍
• ʮRLϙϦγʔ(ڧԽֶश݁Ռ)Λऴہ·ͰଧͨͤͨࡍͷউʯͱՁ • େہ؍͕ແ͍ • Ұຊಓ͔͠ಡ·ͳ͍ .$54ͷಛੑ େہ؍ʹ༏Εͯʮਂ͍ಡΈʯ͕ऑ͍ ͱ ͏·͘ิ͍͍͋ͬͯΔ
ڧ͞ (2015/10࣌)
ڧ͞ (2016/3 ࣌) R3500+ ͷΠɾηυϧʹউ
ࢀߟ • Mastering the game of Go with deep neural
networks and tree search (http://www.nature.com/nature/journal/v529/n7587/full/ nature16961.html) • Google AlphaGoͷΈΛཧղ͢Δ | IT Leaders (http://it.impressbm.co.jp/articles/-/13474)
ऴΘΓ