Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
SafeDining2
Search
ahmad0510
February 12, 2016
Technology
0
32
SafeDining2
Recommender for safe restaurants
ahmad0510
February 12, 2016
Tweet
Share
More Decks by ahmad0510
See All by ahmad0510
Safe Dining Talk
ahmad0510
0
49
Safe Dining Talk
ahmad0510
0
33
Safe Dining
ahmad0510
1
32
Safe Dining
ahmad0510
0
40
Safe Dining
ahmad0510
0
44
Safe Dining
ahmad0510
0
43
Safe Dining
ahmad0510
1
39
SafeDining
ahmad0510
0
48
Safedining3
ahmad0510
0
41
Other Decks in Technology
See All in Technology
初めてのDatabricks Apps開発
taka_aki
1
260
「REALITY」3Dアバターシステムの7年分の拡張の歴史について
gree_tech
PRO
0
140
Databricks AI/BI Genie の「値ディクショナリー」をAmazonの奥地(S3)まで見に行く
kameitomohiro
1
400
事業開発におけるDify活用事例
kentarofujii
5
1.4k
AI時代におけるデータの重要性 ~データマネジメントの第一歩~
ryoichi_ota
0
710
もう外には出ない。より快適なフルリモート環境を目指して
mottyzzz
13
9.5k
「タコピーの原罪」から学ぶ間違った”支援” / the bad support of Takopii
piyonakajima
0
130
ソースを読むプロセスの例
sat
PRO
15
9.9k
ViteとTypeScriptのProject Referencesで 大規模モノレポのUIカタログのリリースサイクルを高速化する
shuta13
3
170
Bill One 開発エンジニア 紹介資料
sansan33
PRO
4
14k
SQLAlchemy の select(User).where(User.id =="123") を理解してみる/sqlalchemy deep dive
3l4l5
3
310
Introduction to Sansan for Engineers / エンジニア向け会社紹介
sansan33
PRO
5
43k
Featured
See All Featured
Rails Girls Zürich Keynote
gr2m
95
14k
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
359
30k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.5k
Java REST API Framework Comparison - PWX 2021
mraible
34
8.9k
Building Better People: How to give real-time feedback that sticks.
wjessup
369
20k
Building a Modern Day E-commerce SEO Strategy
aleyda
44
7.8k
Gamification - CAS2011
davidbonilla
81
5.5k
Learning to Love Humans: Emotional Interface Design
aarron
274
41k
実際に使うSQLの書き方 徹底解説 / pgcon21j-tutorial
soudai
PRO
190
55k
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
657
61k
Typedesign – Prime Four
hannesfritz
42
2.8k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
285
14k
Transcript
Ahmad Haider Insight Data Science 2016
Personal Story What if I could find which restaurants are
safe and which are not? Mugged @Parking lot ! L Hungry for Pizza! Closest pizza location
Makes Real-Time Recommendations for Safety Ratings Restaurants in a Neighborhood
SafeDining
I want to eat Pizza at 2+ rated restaurant within
3 miles of my location
I want to eat Pizza at 2+ rated restaurant within
3 miles of my location
I want to eat Pizza at 2+ rated restaurant within
3 miles of my location
I want to eat Pizza at 3+ rated restaurant within
2 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability
I want to eat Pizza at 2+ rated restaurant within
3 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability 1. pRest 1. pRest 1. pRest 1. pRest pRest Crime Probability at restaurant
I want to eat Pizza at 2+ rated restaurant within
3 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability 1. pRest 1. pRest 1. pRest 1. pRest pRest Crime Probability at restaurant pRest pRest pRest pRest pRest pRest pRest
I want to eat Pizza at 2+ rated restaurant within
3 miles of my location p1 p2 p3 p4 p5 p1 p2 p3 p4 p1 p2 p1 p2 p3 p4 p5 p6 p7 p8 p1 Crime Location Crime Probability 1. pRest 1. pRest 1. pRest 1. pRest pRest Crime Probability at restaurant pRest pRest pRest pRest pRest pRest pRest 2. Relative Safety Index 2. Relative Safety Index 2. Relative Safety Index 2. Relative Safety Index
Workflow Crime dataset Yelp dataset Preprocessing Defining classification problem Feature
Engineering Choosing a model Validation Python Pandas Regular expr. Multiclass 24 classes (hour of day) Standardization PCA Logistic Regression scikit-learn 10-fold cross validation Log loss score 2009-2015 Yelp search API
Multiclass Classification Predict the hour at which crime
happens at given location Features: Location (lat., long.), Address, Day, Week, Month, Year Labels: 24 classes (hour of day) Logistic Regression Cross entropy loss measure = 2.95 Algorithm
Theft Residential Burglary Robbery Assault Source: mylocalcrime.com Validation My analysis
Independent source
• Ahmad Haider • PhD in “Measurement of energy landscapes
of biological interactions using boltzmann sampling” • Georgia Tech • Love hiking and reading fiction/non-fiction About Me
None