Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Ben_Leathers_Lightning_Talk_PyData_07_20_2016

 Ben_Leathers_Lightning_Talk_PyData_07_20_2016

Cupid Charities funds research for Neurofibromatosis (NF) by putting on fundraising events. Their main event, Cupid's Undie Run (www.CupidsUndieRun.com), is a 1-ish mile fun run in your underwear that starts and ends at a bar. These events take place in 36 cities around the US on the Saturday after Valentine's Day. Runners raise money to earn incentives like t-shirts, underwear, hats, etc. Since winter conditions can cause massive delays in shipping we are forced to ship the incentives to the runs almost a month in advance. The problem, however, is that the majority of the fundraising happens after the incentives have shipped, so predicting how many of each incentive to ship to each location is quite a challenge. It's important that every earned incentive is delivered, but we want to avoid purchasing too much extra.
I used historical data and machine learning to help me improve this prediction.

Ben Leathers

July 20, 2016
Tweet

Other Decks in Technology

Transcript

  1. Finding Predictors •  Total Raised •  Number of fundraisers • 

    Number of runners at previous incentive level •  Number of Men •  Number women •  Number of NF affected •  Number of teams •  Etc. … none of them were good predictors
  2. TOOLS import pandas as pd import numpy as np import

    csv import matplotlib.pyplot as plt from sklearn.ensemble import RandomForestRegressor from sklearn import cross_validation from sklearn.learning_curve import learning_curve from sklearn.metrics import mean_squared_error from sklearn.metrics import make_scorer