Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Safe Dining

ahmad0510
February 15, 2016

Safe Dining

Finds safe places to eat in your neighborhood

ahmad0510

February 15, 2016
Tweet

More Decks by ahmad0510

Other Decks in Technology

Transcript

  1. Personal Story What if I could find which restaurants are

    safe and which are not? Mugged @Parking lot ! L Hungry for Pizza! Closest pizza location
  2. I want to eat Pizza at 3+ rated restaurant within

    2 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability
  3. I want to eat Pizza at 2+ rated restaurant within

    3 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability 1. pRest 1. pRest 1. pRest 1. pRest pRest Crime Probability at restaurant
  4. I want to eat Pizza at 2+ rated restaurant within

    3 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability 1. pRest 1. pRest 1. pRest 1. pRest pRest Crime Probability at restaurant pRest pRest pRest pRest pRest pRest pRest
  5. I want to eat Pizza at 2+ rated restaurant within

    3 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability 1. pRest 1. pRest 1. pRest 1. pRest pRest Crime Probability at restaurant pRest pRest pRest pRest pRest pRest pRest 2. Relative Safety Index 2. Relative Safety Index 2. Relative Safety Index 2. Relative Safety Index
  6. Workflow Crime dataset Yelp dataset Preprocessing Defining classification problem Feature

    Engineering Choosing a model Validation Python Pandas Regular expr. Multiclass 24 classes (hour of day) Standardization PCA Logistic Regression scikit-learn 10-fold cross validation Log loss score APD: 2009-2015 Yelp search API
  7. ™ Multiclass Classification –  Predict the hour at which crime happens

    at given location –  Features: Location (Lat., Long.) , Time (dd, wk, mm, yyyy) –  Labels: 24 classes (hour of day) –  Logistic Regression –  Cross entropy loss measure = 2.95 Algorithm
  8. •  Ahmad Haider •  PhD in “Measurement of energy landscapes

    of biological interactions using boltzmann sampling” •  Georgia Tech •  Love hiking and reading fiction/non-fiction About Me