Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
投球を可視化する技術〜Analyzing Pitching Data With Python
Search
Shinichi Nakagawa
March 22, 2016
Research
1
1.1k
投球を可視化する技術〜Analyzing Pitching Data With Python
MLBの一球速報データを使った投球データの可視化をPython他でやってみました.
BPStudy #103 2016/3/22 発表資料
Shinichi Nakagawa
March 22, 2016
Tweet
Share
More Decks by Shinichi Nakagawa
See All by Shinichi Nakagawa
Terraform, GitHub Actions, Cloud Buildでデータ基盤をProvisioningする / Data Platform provisioning for Google Cloud and Terraform
shinyorke
2
2k
Cloud RunとCloud PubSubでサーバレスなデータ基盤2024 with Terraform / Cloud Run and PubSub with Terraform
shinyorke
8
2.1k
自らを強いエンジニアにするための3つの習慣 / I need to be myself, I can't be no one else
shinyorke
76
53k
阪神タイガース優勝のひみつ - Pythonでシュッと調べた件 / SABRmetrics for Python
shinyorke
1
970
Pythonとクラウドと野球の推し活. / Baseball Data Platform for Python and Google Cloud
shinyorke
2
2.3k
月額コーヒー3.34杯分のコストでオオタニサンの活躍を見守るデータ基盤のはなし / Pyhack Con
shinyorke
2
380
俺のDXを実現するためのサーバレスなデータ基盤開発と運用 / Serverless Data Platform and Baseball
shinyorke
5
11k
機械学習エンジニアが目指すキャリアパスとその実話 / My Journey to Become a ML Engineer
shinyorke
6
14k
一人でも小さく始められるGoogle Cloudで実現するほぼサーバレスなデータ基盤 / Serverless Dataplatform for Google Cloud
shinyorke
0
440
Other Decks in Research
See All in Research
第4回ナレッジグラフ勉強会:ISWC2023論文読み会
kg_wakate
1
200
Julia Tokyo #11 トーク: 「Juliaで歩く自動微分」
abap34
2
1.3k
Introduction of NII S. Koyama's Lab (AY2024)
skoyamalab
0
110
Trezor Safe 3 ファーストインプレッション
toshihr
0
190
メタ動画データセットによる動作認識の現状と可能性
yuyay
0
180
CSC590 Lecture 01
javiergs
PRO
0
130
ICLR2024 LLMエージェントの研究動向
masatoto
3
670
Generative AI - practice and theory
gpeyre
1
560
CASCON 2023 Most Influential Paper Award Talk
tsantalis
0
120
People Driven Transformation / 人が起点の、社会の変え方
dmattsun
0
150
「EBPMエコシステム」の可能性
daimoriwaki
0
200
説明可能AI:代表的手法と最近の動向
yuyay
1
590
Featured
See All Featured
GraphQLとの向き合い方2022年版
quramy
32
12k
From Idea to $5000 a Month in 5 Months
shpigford
377
45k
Clear Off the Table
cherdarchuk
84
310k
Web development in the modern age
philhawksworth
202
10k
Mobile First: as difficult as doing things right
swwweet
216
8.6k
Navigating Team Friction
lara
178
13k
Making Projects Easy
brettharned
108
5.5k
Creating an realtime collaboration tool: Agile Flush - .NET Oxford
marcduiker
14
1.5k
Building Adaptive Systems
keathley
31
1.9k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
17
1.4k
What's in a price? How to price your products and services
michaelherold
237
11k
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
20
1.9k
Transcript
None
Who am I? • Shinichi Nakagawa(@shinyorke) • Pythonista/Agile Software Development/Baseball
Analyst • visasQ(ϏβεΫ) Python Engineer/Scrum Master • ւಓຊϋϜϑΝΠλʔζ/Oakland Athletics • ιχʔɾάϨΠ(OAK)ͷαΠϠϯάड &Ԭւ(ϋϜ)ͷελϝϯୣऔΛ৴͍ͯ͡·͢.
ࠓγʔζϯݟͲ͜Ζ ݟͲ͜Ζ ੈؒͷ෩ை தͷݟղ ༏উνʔϜ ɾιϑτόϯΫ ɾϠΫϧτ ɾϋϜ ɾڊਓPSౡ τϦϓϧεϦʔ
ɾ༄ా༔ذ ࿈ଓ ɾࢁాਓ ࿈ଓ ࢁాਓ ࿈ଓ ΪʔλࡾףͲ͏ͧ ΰʔϧσϯάϥϒ ɾ༄ా༔ذ $' ɾௗ୩ܟ 44 ɾೋਓڞऩ ɾγϣʔτ୭͕ʁ ۙ౻݈հ ϋϜ ɾׂຊ͍͚ΔͰʂ ɾࢦ໊ଧऀPSϥΠτ ۙ౻ ࢦcӈcัcࡾc༡ ˠॅॴෆఆʹͳΔ
Starting Member • ٿHack!2015ৼΓฦΓ • MLBҰٿใσʔλͱٿHack • MLBҰٿใσʔλΛPythonͰHackͯ͠ΈΔ ʙpitchpxͱJupyter +
pandas + matplotlibʙ • ར༻ྫʙؠ۾ٱࢤϊʔώοτϊʔϥϯ • ݁ͼʙࠓޙͷٿHack(PyCon JP 2016ʹ͚ͯ) • ʲΦϚέʳ2016ϓϩٿେ༧
ٿHack!1.0(PyCon JP 2015) • MLBͷࢼ߹͝ͱͷଧ੮σʔλΛHack! • ࢄาʢ࢛ٿʣͷʢΠονVSϘοτʣ • ϐονϟʔͷ݄ผউͪʢδϣϯɾϨελʔʣ •
ຖຖࢼ߹ͷσʔλΛऔಘ&ੳ • ΞμϜɾμϯʢଧऀʣ • ඃΞμϜɾμϯʢखʣ • ৄ͘͠εϥΠυΛޚཡ͍ͩ͘͞ or ʮٿ PythonʯͰάάΖ͏
ٿHack!ʙPythonΛ༻͍ͨσʔλੳͱՄࢹԽ PyCon JP 2015ൃදࢿྉ http://www.slideshare.net/shinyorke/hackpython-pyconjp
ٿHack!ʙPythonΛ༻͍ͨσʔλੳͱՄࢹԽ PyCon JP 2015ൃදࢿྉ http://www.slideshare.net/shinyorke/hackpython-pyconjp ͷωλ
ٿHack!ʙPythonΛ༻͍ͨσʔλੳͱՄࢹԽ PyCon JP 2015ൃදࢿྉ http://www.slideshare.net/shinyorke/hackpython-pyconjp ҰٿใΓ͍ͨϯΰ ˠͷςʔϚʂ
ٿHack!ͱҰٿใ • ࢼ߹ɾଧ੮ͷ݁Ռetc…είΞͰଌΕΔωλΓͬͨײ͋Δ • બखͷނোɾෆௐʢௐʣείΞͰଌΕͳ͍ˠΓ͍ͨ • खͳΒٿɾίϯτϩʔϧɾϘʔϧͷճసɺ खकඋൣғ()ɾεΠϯάεϐʔυͰଌΕΔͷͰʂʁ • Ұٿใͷσʔλ͕͋ΕͰ͖ͦ͏…͋ͬͨʂʂʂ
• ࢼ͠ʹͬͯΈΑ͏ʂʂʂˡࠓίί
MLB at BATʙMLBҰٿใ • MLB࣮گҰٿใαʔϏε • PCαΠτɾεϚϗΞϓϦɾApple TVͳͲ • MLB.TVͱ߹ΘͤͯܖͰ࣮گಈըݟΒΕΔ
• σʔλ͕ͱʹ͔͘ॆ࣮
Analyzing Baseball Data with R • MLBͷΦʔϓϯσʔλʮRetrosheetʯ, MLB at BATใσʔλΛ༻͍ͨσʔλੳɾՄࢹ
Խʹ͍ͭͯॻ͔Ε͍ͯΔॻ੶ʢӳޠʣ • RݴޠΛͬͨੳͱՄࢹԽͷωλ͕ϝΠϯ • ʮpitchRxʯͱ͍͏ɺRݴޠͷϥΠϒϥϦΛ༻͍ͯ at BATσʔλΛऔಘ&ՄࢹԽ
“ʮpitchRxʯͱ͍͏ɺ RݴޠͷϥΠϒϥϦΛ༻͍ͯ at BATσʔλΛऔಘ&ՄࢹԽ”
ʁʁʁʮPythonͰΓ͍ͨΜ͡Όʂʯ ※RΛͲ͏͜͏ݴ͏ͱ͔ͦΜͳҙਤ(ry
pitchpx - Getting MLB dataset • MLB at BATͷҰٿใσʔλΛऔಘ&εΫϨΠϐϯάͯ͠ CSVσʔληοτʹམͱ͢PythonϥΠϒϥϦ.
• pitchRx(R)ͳͲΛࢀߟʹࢲ͕։ൃ͠·ͨ͠. • ίϚϯυϥΠϯπʔϧͰ͢. • Python 3.3.xҎ্ઐ༻ˡڧ͍ͩ͜ΘΓ • PyPIͰެ։͍ͯ͠·͢ʂʂʂʢ୭Ͱ͑Δʣ
͍ํ $ # Python 3.3Ҏ্(ਪPython 3.4Ҏ্)͕ಈ͘ڥͰͬͯͶ $ pip install pitchpx
$ # ྫɿ2015/8/1-8/12·Ͱͷࢼ߹݁ՌΛऔಘ͢Δ $ pitchpx -s 20150801 -e 20150812 -o .
ʲྫʳؠ۾ϊʔώοτϊʔϥϯ • ϚϦφʔζ-ΦϦΦʔϧζͷࢼ߹(2015/8/12)ʹͯɺ ϊʔώοτϊʔϥϯΛܾΊͨؠ۾ٱࢤखͷٿΛੳ • ٿɺϘʔϧͷճసɺετϥΠΫκʔϯɺetc… • pitchpxͰऔಘͨ͠σʔλΛpandasͱ matplotlib(&seaborn)Ͱલॲཧ&ՄࢹԽ •
ڥJupyter notebook(Python 3.5.1)
σϞ (লུ)
ৄ͘͠QiitaͰʂʂʂ ؠ۾ٱࢤ(SEA)ͷφΠεϐονϯάΛPythonͰՄࢹԽ http://qiita.com/shinyorke/items/2c2e2c3976fc2d1ed051
݁ͼʙ2016ͷٿHack! • ͦΒʢࠓٿσʔλͷՄࢹԽ͔ͩΒʣ ͦ͏ʢͭ͗कඋσʔλͷՄࢹԽʹʣ Αɹʢܾ·͍ͬͯΔ͡Όͳ͍͔ʣ • PyCon JP 2016(9/21,22)ɺ ʮAnalyzing
Baseball Data With Pythonʯ ͱ͔ͦΜͳλΠτϧͰͬͱ໘ന͍͕Ͱ͖Δϋζ. • ຊެ։ͨ͠ωλੋඇ༡ΜͰΈͯʂ ˠػցֶशͷࡐͱ͔ʹΠέΔΜ͡Όͳ͍ʁ
ʮҰٿใσʔλͷϥΠηϯεʁେৎͳͷʁʯ ※Ұ൪͋Γͦ͏ͳ࣭
ɿ(ݸਓར༻ఔͳΒ)OK ʲެࣜʳ http://gd2.mlb.com/components/copyright.txt ʲ༁&ղઆʳ http://qiita.com/shinyorke/items/566f1b7e7687492a0c7f
ήʔϜηοτʂʂʂ ͝ਗ਼ௌ͋Γ͕ͱ͏͍͟͝·ͨ͠. Shinichi Nakagawa(Twitter/Facebook/hatena:@shinyorke)