Why horse racing data?
• Over 30 years official data.
• Clean data. No need for scraping.
• Some chances of making money?
Slide 16
Slide 16 text
Let’s take a look at running speed
Slide 17
Slide 17 text
Speed
• Faster horse wins the race
• Distance and Time
• Regression analysis (linear model)
Slide 18
Slide 18 text
No content
Slide 19
Slide 19 text
• Horses run about 60km/h.
• I want to compare the speed of
horse A runs 1km and horse B runs
2km
Slide 20
Slide 20 text
Is the relationship linear?
Slide 21
Slide 21 text
Hypothesis
• They must get tired if run long distance.
• Regression analysis with quadratic model.
Slide 22
Slide 22 text
%JTUBODF
5JNF
Slide 23
Slide 23 text
quadratic coefficient is negative
convex upwards
Slide 24
Slide 24 text
Check other years
Slide 25
Slide 25 text
No content
Slide 26
Slide 26 text
No content
Slide 27
Slide 27 text
Expert advice
• Every racecourse has different shape, straight
line length and corner radius, etc.
• It is not right to compare the data of various
racecourses together.
Slide 28
Slide 28 text
Analysis by racecourse
First off from Tokyo Racecourse
*O/PSUI"NFSJDB
Public betting favorites win approximately 33 percent of all
races and finish second 53 percent of the time. Second choices
win approximately 21 percent of all races and finish second 42
percent of the time. So the top two choices win 54 percent of
the races and finish second 74 percent of the time. You might
even want to consider the fact that third choices win
approximately 14 percent of all races run over the course of a
year.
http://www.predictem.com/horse/profit.php
Slide 45
Slide 45 text
No content
Slide 46
Slide 46 text
Strategy
• We humans sometime put too much emphasis
on certain factors and ignore others.
• That is where data analysis can make a
difference.
Slide 47
Slide 47 text
Strategy X
Slide 48
Slide 48 text
Accumulated
Payback
Sequence of tickets bought from Jan.1 - Dec. 31
Slide 49
Slide 49 text
)JTUPSJDBM3FDPSE
1BZCBDL
Slide 50
Slide 50 text
No content
Slide 51
Slide 51 text
No content
Slide 52
Slide 52 text
:FBS
1SPpU :FO
Slide 53
Slide 53 text
Lessons learned
•Find under evaluated horses.
•Win rate of strategy X is 8.8%.
Slide 54
Slide 54 text
Next Goal
• Use machine learning to tune parameters of
strategy X