Upgrade to Pro — share decks privately, control downloads, hide ads and more …

SafeDining2

ahmad0510
February 12, 2016

 SafeDining2

Recommender for safe restaurants

ahmad0510

February 12, 2016
Tweet

More Decks by ahmad0510

Other Decks in Technology

Transcript

  1. Personal Story What if I could find which restaurants are

    safe and which are not? Mugged @Parking lot ! L Hungry for Pizza! Closest pizza location
  2. I want to eat Pizza at 3+ rated restaurant within

    2 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability
  3. I want to eat Pizza at 2+ rated restaurant within

    3 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability 1. pRest 1. pRest 1. pRest 1. pRest pRest Crime Probability at restaurant
  4. I want to eat Pizza at 2+ rated restaurant within

    3 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability 1. pRest 1. pRest 1. pRest 1. pRest pRest Crime Probability at restaurant pRest pRest pRest pRest pRest pRest pRest
  5. I want to eat Pizza at 2+ rated restaurant within

    3 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability 1. pRest 1. pRest 1. pRest 1. pRest pRest Crime Probability at restaurant pRest pRest pRest pRest pRest pRest pRest 2. Relative Safety Index 2. Relative Safety Index 2. Relative Safety Index 2. Relative Safety Index
  6. Workflow Crime dataset Yelp dataset Preprocessing Defining classification problem Feature

    Engineering Choosing a model Validation Python Pandas Regular expr. Multiclass 24 classes (hour of day) Standardization PCA Logistic Regression scikit-learn 10-fold cross validation Log loss score 2009-2015 Yelp search API
  7. ™  Multiclass Classification –  Predict the hour at which crime

    happens at given location –  Features: Location (lat., long.), Address, Day, Week, Month, Year –  Labels: 24 classes (hour of day) –  Logistic Regression –  Cross entropy loss measure = 2.95 Algorithm
  8. •  Ahmad Haider •  PhD in “Measurement of energy landscapes

    of biological interactions using boltzmann sampling” •  Georgia Tech •  Love hiking and reading fiction/non-fiction About Me