Ivory - Data Modelling
Ambiata
October 20, 2014
Technology
Transcript
IVORY DATA MODELLING http://github.com/ambiata/ivory © Ambiata 2014
WHAT WE START WITH
WHAT WE NEED
Feature vectors (rows: instance, columns: feature)

instance  gender  balance  purchases  zipcode  prop_online  num_accs
89340218  M       0.00     3          3001     1.00         1
48149407  -       634.83   16         4670     0.6875       3
18452274  F       15.12    2          -        0.50         1
07499337  M       33.56    2          -        1.00         1
62948721  F       98.34    12         3303     0.8333       4
93754723  -       523.81   23         2046     0.4782       1
00272446  F       1086.05  17         -        1.00         2
13374497  F       224.81   9          -        0.2222       1
31989993  M       78.21    2          2134     0.50         1
46474236  -       126.48   4          -        0.0          1
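A minimal sketch of how a table like this could be assembled, assuming feature values arrive as (instance, feature, value) triples. The function name `to_feature_vectors`, the triple layout, and the reading of "-" as a missing value are illustrative assumptions, not Ivory's API or output format.

```python
from typing import Dict, Iterable, List, Tuple

MISSING = "-"  # assumption: placeholder for a feature with no value


def to_feature_vectors(
    triples: Iterable[Tuple[str, str, str]],
    features: List[str],
) -> Dict[str, List[str]]:
    """Pivot (instance, feature, value) triples into one row per instance."""
    rows: Dict[str, Dict[str, str]] = {}
    for instance, feature, value in triples:
        rows.setdefault(instance, {})[feature] = value
    # Emit values in a fixed column order, filling gaps with MISSING.
    return {
        instance: [values.get(f, MISSING) for f in features]
        for instance, values in rows.items()
    }


# Hypothetical sample triples, for illustration only.
triples = [
    ("89340218", "gender", "M"),
    ("89340218", "balance", "0.00"),
    ("48149407", "balance", "634.83"),
]
vectors = to_feature_vectors(triples, ["gender", "balance"])
```

Every instance gets a value in every column, so downstream code can treat each row as a fixed-width vector.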
Ingest facts → Ivory Repository → Extract features
Source data → Fact ETL (entity resolution + attribution) → Factset → Ingest facts → Ivory Repository → Extract features
WHAT’S A FACT?
WHAT’S A FEATURE?
FACT
• Atomic piece of information attributed to an entity
• 2 types: states and events
• Captured as close to the “source” as possible
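The definition above could be modelled as a small immutable record. This is a sketch under stated assumptions: the field names (`entity`, `attribute`, `value`, `when`, `kind`) and the `FactKind` enum are hypothetical and are not taken from Ivory's actual schema.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum


class FactKind(Enum):
    STATE = "state"  # e.g. gender, account status
    EVENT = "event"  # e.g. a purchase or a page view


@dataclass(frozen=True)
class Fact:
    entity: str      # who the fact is attributed to
    attribute: str   # what was observed, e.g. "gender" or "purchase"
    value: str       # raw observed value, kept close to the source
    when: date       # assumption: every fact carries an observation date
    kind: FactKind


# Hypothetical sample facts, for illustration only.
facts = [
    Fact("89340218", "gender", "M", date(2014, 1, 1), FactKind.STATE),
    Fact("89340218", "purchase", "15.12", date(2014, 3, 9), FactKind.EVENT),
]
```

Keeping the record frozen reflects the "atomic" wording: a fact is recorded once and not edited afterwards.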
• State facts
  • Demographics, e.g. gender, DOB, zipcode, etc.
  • Account statuses
  • Subscription states
  • Snapshots, e.g. account balance at end of month
  • Segments
• Event facts
  • Purchases
  • Page views
  • Phone calls
  • Queries
FEATURE
• Attribute that describes one aspect of an entity
• Derived from facts
• Simplest feature is “latest value before ‘date’”
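The "latest value before 'date'" feature can be sketched as a filter followed by a maximum over dates. The tuple layout, the function name `latest_before`, and the choice to treat "before" as strictly earlier than the cutoff are assumptions made for illustration.

```python
from datetime import date
from typing import List, Optional, Tuple

# Assumption: a fact is an (entity, attribute, value, date) tuple.
FactTuple = Tuple[str, str, str, date]


def latest_before(
    facts: List[FactTuple], entity: str, attribute: str, cutoff: date
) -> Optional[str]:
    """Return the most recent value recorded strictly before `cutoff`."""
    candidates = [
        f for f in facts
        if f[0] == entity and f[1] == attribute and f[3] < cutoff
    ]
    if not candidates:
        return None  # no fact yet: the feature is missing for this entity
    return max(candidates, key=lambda f: f[3])[2]


# Hypothetical sample facts, for illustration only.
facts = [
    ("89340218", "zipcode", "3001", date(2014, 1, 5)),
    ("89340218", "zipcode", "2134", date(2014, 6, 1)),
    ("48149407", "zipcode", "4670", date(2014, 2, 1)),
]
early = latest_before(facts, "89340218", "zipcode", date(2014, 3, 1))
late = latest_before(facts, "89340218", "zipcode", date(2014, 7, 1))
none_yet = latest_before(facts, "89340218", "zipcode", date(2014, 1, 5))
```

The same facts yield different feature values depending on the cutoff, which is why the date is part of the feature definition.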
• Latest
• Days since latest, days since earliest
• Count, sum
• Mean, quantile, proportion
• Gradient, state changes
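Several of the feature types listed above can be computed from one entity's event facts in a single pass. The sketch below assumes purchase events shaped as (date, amount, channel); that shape, the function name `purchase_features`, and the output key names are hypothetical and only loosely echo the column names in the feature vector table.

```python
from datetime import date
from statistics import mean
from typing import Dict, List, Tuple

# Assumption: each purchase event is a (date, amount, channel) tuple.
Purchase = Tuple[date, float, str]


def purchase_features(events: List[Purchase], as_of: date) -> Dict[str, float]:
    """Aggregate one entity's purchase events recorded before `as_of`."""
    past = [e for e in events if e[0] < as_of]
    if not past:
        return {"count": 0}
    dates = [e[0] for e in past]
    amounts = [e[1] for e in past]
    return {
        "count": len(past),
        "sum": sum(amounts),
        "mean": mean(amounts),
        # Proportion of purchases made through the "online" channel.
        "prop_online": sum(1 for e in past if e[2] == "online") / len(past),
        "days_since_latest": (as_of - max(dates)).days,
        "days_since_earliest": (as_of - min(dates)).days,
    }


# Hypothetical sample events, for illustration only.
events = [
    (date(2014, 1, 10), 10.0, "online"),
    (date(2014, 2, 10), 20.0, "store"),
    (date(2014, 3, 10), 30.0, "online"),
    (date(2014, 12, 1), 99.0, "online"),  # after the cutoff, so ignored
]
features = purchase_features(events, date(2014, 3, 20))
```

Quantile, gradient, and state-change features follow the same pattern (filter by date, then reduce) and are omitted here for brevity.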