anonaka
September 03, 2017
950

# Introduction to the data analysis using python

PyCon APAC 2017 presentation

## anonaka

September 03, 2017

## Transcript

3. ### Who am I ? • XOXZO Evangelist • A Flying

Python Programmer
4. ### About XOXZO • Provide SMS, Telephony API • No office

• Everybody works remotely

analysis

8. ### Tools • Python • numpy • pandas • matplotlib •

Jupyter notebook

10. ### Why horse racing data? • Over 30 years official data.

• Clean data. No need for scraping. • Some chances of making money?

12. ### Speed • Faster horse wins the race • Distance and

Time • Regression analysis (linear model)
13. ### • Horses run about 60km/h. • I want to compare

the speed of horse A runs 1km and horse B runs 2km

15. ### Hypothesis • They must get tired if run long distance.

• Regression analysis with quadratic model.

19. ### Expert advice • Every racecourse has different shape, straight line

length and corner radius, etc. • It is not right to compare the data of various racecourses together.

8 5

27. ### It is almost meaningless to compare the speed if the

racecourse/distance is different
28. ### Lessons learned •Fatigue is not signiﬁcant factor in horse racing.

•Knowing the target domain is very important.

33. ### Win Fav Win Rate 1 32.69 2 18.85 3 13.22

4 9.41 5 7.08 6 5.45 7 3.93 8 2.86 9 2.12 10 1.45
34. ### *O/PSUI"NFSJDB Public betting favorites win approximately 33 percent of all

races and ﬁnish second 53 percent of the time. Second choices win approximately 21 percent of all races and ﬁnish second 42 percent of the time. So the top two choices win 54 percent of the races and ﬁnish second 74 percent of the time. You might even want to consider the fact that third choices win approximately 14 percent of all races run over the course of a year. http://www.predictem.com/horse/proﬁt.php
35. ### Strategy • We humans sometime put too much emphasis on

certain factors and ignore others. • That is where data analysis can make a difference.

31

39. ###          

   :FBS                 1SPpU :FO

X is 8.8%.

strategy X