wk04_d01

Still needs work, this is an early draft.

Seiji Armstrong

June 22, 2015

Transcript

  1. Basketball Performance Predictor: a real-time dashboard for coaches

    Coaches have to rely on intuition to make real-time decisions. Performance can instead be quantified by looking at data. Built on unique play-by-play data.
  2. Data and Algorithms

    Data sources:
    1. Download individual files from an archive in CSV format.
    2. Scrape a public basketball reference website (Beautiful Soup).
    3. Free account on a premium basketball API site (queries limited to 2000).

    Pipeline (input: one game.csv file per game):
    • Read in > 4000 files
    • Munge, produce clean pandas DataFrames
    • Store tables in a MySQL database
    • Run analytics and produce different SQL tables
    • Web app: make SQL queries, run analysis, update display

    Event-based data: every single play is logged.
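
The read, store, and query steps above can be sketched roughly as follows. This is only an illustration under stated assumptions: it uses Python's built-in `csv` and `sqlite3` modules as stand-ins for pandas and MySQL so that it runs without extra installs or a database server, and the column names (`game_id`, `player`, `event`, `made`) and sample rows are hypothetical, since the slides do not show the real file layout.

```python
import csv
import io
import sqlite3

# Hypothetical play-by-play rows; the real game.csv columns are not shown in the slides.
SAMPLE_GAME_CSV = """game_id,player,event,made
g001,Player A,shot,1
g001,Player A,shot,0
g001,Player B,shot,1
g001,Player A,shot,1
"""


def load_plays(csv_text):
    """Parse one game's CSV text into a list of cleaned row tuples."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        (row["game_id"], row["player"].strip(), row["event"], int(row["made"]))
        for row in reader
    ]


def store_plays(conn, plays):
    """Create the plays table if needed and insert the rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS plays "
        "(game_id TEXT, player TEXT, event TEXT, made INTEGER)"
    )
    conn.executemany("INSERT INTO plays VALUES (?, ?, ?, ?)", plays)
    conn.commit()


def shooting_summary(conn):
    """Return (player, attempts, makes) for each player, ordered by name."""
    cursor = conn.execute(
        "SELECT player, COUNT(*), SUM(made) FROM plays "
        "WHERE event = 'shot' GROUP BY player ORDER BY player"
    )
    return cursor.fetchall()


# In-memory SQLite database standing in for the MySQL server (an assumption).
conn = sqlite3.connect(":memory:")
store_plays(conn, load_plays(SAMPLE_GAME_CSV))
summary = shooting_summary(conn)
```

With the real data, `load_plays` would be called once per file and the summary query would be one of several analytics tables.
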
  3. Data and Algorithms

    Performance might appear to be stochastic. However, there are patterns in basketball games, and these patterns can be exploited. When a player has been performing well or poorly, it affects their next few plays.

    Feedback loop for positive performance:
    • Positive gain: momentum, increased confidence, team-mates will pass
    • Negative gain: opponents play better defence
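
One way to make "performing well for several plays in a row" concrete is to extract streaks from a player's sequence of plays. The sketch below assumes each play has already been labelled 1 (positive) or 0 (negative); that labelling, the function name `streak_lengths`, and the sample sequence are all hypothetical and not taken from the slides.

```python
from itertools import groupby


def streak_lengths(outcomes):
    """Return the length of each run of consecutive positive plays."""
    return [
        sum(1 for _ in run)  # count the plays in this run
        for is_positive, run in groupby(outcomes, key=bool)
        if is_positive  # keep only runs of positive plays
    ]


# Hypothetical sequence for one player: 1 = positive play, 0 = negative play.
plays = [1, 1, 0, 1, 1, 1, 0, 0, 1]
streaks = streak_lengths(plays)
```

The same function applied to the inverted sequence would give streaks of negative plays.
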
  4. Data Story

    Decaying exponential, but with unique features. Features to exploit in week 4.

    Count[n] = count of all streaks of length n
    Count[>n] = count of all streaks of length greater than n
    Performance[n+1] = Count[>n] / (Count[n] + Count[>n])
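
The Performance formula can be read as the fraction of streaks that reached length n and then continued to at least n+1. A minimal sketch of that calculation follows, assuming the streak lengths have already been extracted into a list; the function name, the `None` return for the no-data case, and the sample values are hypothetical.

```python
from collections import Counter


def performance_after_streak(streaks, n):
    """Compute Count[>n] / (Count[n] + Count[>n]) from a list of streak lengths.

    Returns None when no streak reached length n, since the ratio is undefined.
    """
    counts = Counter(streaks)
    count_n = counts[n]  # streaks that stopped at exactly n
    count_gt_n = sum(c for length, c in counts.items() if length > n)
    total = count_n + count_gt_n
    if total == 0:
        return None
    return count_gt_n / total


# Hypothetical streak lengths, not real game data.
sample_streaks = [1, 1, 2, 3, 1, 2, 4]
p_after_two = performance_after_streak(sample_streaks, 2)
```

Here two streaks stopped at length 2 and two went longer, so `p_after_two` is 0.5.
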