Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Ivory - Data Modelling
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
Ambiata
October 20, 2014
Technology
520
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Ivory - Data Modelling
Ambiata
October 20, 2014
More Decks by Ambiata
See All by Ambiata
Improving feature engineering in the lab and production with Ivory
ambiata
3
680
Ivory - A Data Store for Data Science
ambiata
1
740
Ivory - Concepts
ambiata
0
920
Ivory - An Introduction
ambiata
1
1.3k
Other Decks in Technology
See All in Technology
スキルと MCP ツール、責務をどう分けるか? AI が迷わないインターフェース設計の戦略
cdataj
1
970
機械学習を「社会実装」するということ 2026年夏版 / Social Implementation of Machine Learning June 2026 Version
moepy_stats
4
1.6k
なぜ Platform Engineering の土台に Kubernetes を選ぶのか
r4ynode
2
590
タクシーアプリ『GO』の実践的データ活用
mot_techtalk
3
190
AIはどのように 組織のアジリティを変えるのか?
junki
0
400
AIのReact習熟度を測る
uhyo
2
190
Dario Amodi『Policy on the AI Exponential』を理解する
nagatsu
0
230
中期計画、2回作ってみた ~業務委託と正社員、両方の視点から~
demaecan
1
680
自宅LLMの話
jacopen
1
400
Bucharest Tech Week 2026 - Reinventing testing practices in the AI era
edeandrea
PRO
1
140
エンジニアリング戦略の作り方 / Crafting Engineering Strategy
iwashi86
20
6.6k
Chainlitで作るお手軽チャットUI
ynt0485
0
200
Featured
See All Featured
Context Engineering - Making Every Token Count
addyosmani
9
960
HDC tutorial
michielstock
2
700
GitHub's CSS Performance
jonrohan
1033
470k
Taking LLMs out of the black box: A practical guide to human-in-the-loop distillation
inesmontani
PRO
3
2.3k
Fantastic passwords and where to find them - at NoRuKo
philnash
52
3.7k
A Soul's Torment
seathinner
6
2.9k
Optimizing for Happiness
mojombo
378
71k
Navigating the moral maze — ethical principles for Al-driven product design
skipperchong
2
390
What's in a price? How to price your products and services
michaelherold
247
13k
Efficient Content Optimization with Google Search Console & Apps Script
katarinadahlin
PRO
1
610
Chasing Engaging Ingredients in Design
codingconduct
0
220
First, design no harm
axbom
PRO
2
1.2k
Transcript
IVORY DATA MODELLING http://github.com/ambiata/ivory © Ambiata 2014
WHAT WE START WITH © Ambiata 2014
© Ambiata 2014
WHAT WE NEED © Ambiata 2014
Feature vectors © Ambiata 2014 0.00 3 3001 1.00 634.83
16 4670 0.6875 15.12 2 - 0.50 33.56 2 - 1.00 98.34 12 3303 0.8333 523.81 23 2046 0.4782 1086.05 17 - 1.00 224.81 9 - 0.2222 78.21 2 2134 0.50 126.48 4 - 0.0 1 3 1 1 4 1 2 1 1 1 M - F M F - F F M - gender balance purchases zipcode prop_online num_accs 89340218 feature instance 48149407 18452274 07499337 62948721 93754723 00272446 13374497 31989993 46474236
Ivory Repository Ingest facts Extract features © Ambiata 2014
© Ambiata 2014 Fact ETL Source data Entity resolution +
attribution Factset Ivory Repository Ingest facts Extract features
WHAT’S A FACT? © Ambiata 2014
WHAT’S A FEATURE? © Ambiata 2014
FACT • Atomic piece of information attributed to an entity
• 2 types: states and events • Captured as close to the “source” as possible © Ambiata 2014
• State facts • Demographics, e.g.: gender, DOB, zipcode, etc
• Account statuses • Subscription states • Snapshots, e.g. account balance at end of month • Segments © Ambiata 2014
• Event facts • Purchases • Page views • Phone
calls • Queries © Ambiata 2014
FEATURE • Attribute that describes one aspect of an entity
• Derived from facts • Simplest feature is “latest value before ‘date’” © Ambiata 2014
• Latest • Days since latest, days since earliest •
Count, sum • Mean, quantile, proportion • Gradient, state changes © Ambiata 2014