Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fast In-memory Analytics for Retail Data with C...
Search
ernestoarbitrio
April 09, 2017
Technology
0
69
Fast In-memory Analytics for Retail Data with Columnar Databases
ernestoarbitrio
April 09, 2017
Tweet
Share
More Decks by ernestoarbitrio
See All by ernestoarbitrio
Enable effective Observability with Python
pamaron
0
130
PyConZA 2022 Best practices for good(ish) and clean(ish) code
pamaron
0
100
PyCon Italia 2022 Best practices for good(ish) and clean(ish) code
pamaron
0
240
Bokeh: Using python for interactive data visualization
pamaron
1
160
Keystroke Behavioural Analysis For Fraud Detection: Deep Learning as-a-service Infrastructure
pamaron
0
59
Indexing and search tons of data with ElasticSearch and Django
pamaron
0
400
Interactive plot with django and highchart JS (without JS)
pamaron
0
350
Other Decks in Technology
See All in Technology
AWSでAgentic AIを開発するための前提知識の整理
nasuvitz
2
170
AWS Control Tower に学ぶ! IAM Identity Center 権限設計の第一歩 / IAM Identity Center with Control Tower
y___u
0
170
プロダクトのコードから見るGoによるデザインパターンの実践 #go_night_talk
bengo4com
1
2.6k
ガバメントクラウド(AWS)へのデータ移行戦略の立て方【虎の巻】 / 20251011 Mitsutosi Matsuo
shift_evolve
PRO
2
200
カンファレンスに託児サポートがあるということ / Having Childcare Support at Conferences
nobu09
1
590
アイテムレビュー機能導入からの学びと改善
zozotech
PRO
0
170
WEBサービスを成り立たせるAWSサービス
takano0131
1
170
物体検出モデルでシイタケの収穫時期を自動判定してみた。 #devio2025
lamaglama39
0
160
業務効率化をさらに加速させる、ノーコードツールとStep Functionsのハイブリッド化
smt7174
2
140
プレーリーカードを活用しよう❗❗デジタル名刺交換からはじまるイベント会場交流のススメ
tsukaman
0
170
リセラー企業のテクサポ担当が考える、生成 AI 時代のトラブルシュート 2025
kazzpapa3
1
350
ニッポンの人に知ってもらいたいGISスポット
sakaik
0
150
Featured
See All Featured
Being A Developer After 40
akosma
91
590k
I Don’t Have Time: Getting Over the Fear to Launch Your Podcast
jcasabona
33
2.5k
Optimising Largest Contentful Paint
csswizardry
37
3.5k
GitHub's CSS Performance
jonrohan
1032
470k
GraphQLとの向き合い方2022年版
quramy
49
14k
A better future with KSS
kneath
239
18k
The Invisible Side of Design
smashingmag
302
51k
Testing 201, or: Great Expectations
jmmastey
45
7.7k
Practical Orchestrator
shlominoach
190
11k
Imperfection Machines: The Place of Print at Facebook
scottboms
269
13k
KATA
mclloyd
32
15k
The Illustrated Children's Guide to Kubernetes
chrisshort
49
51k
Transcript
Fast In-memory Analytics for Retail Data with Columnar Databases Ernesto
Arbitrio - Valerio Maggio arbitrio |
[email protected]
Florence April 6, 2017
Retail Data • Overview of data we have • granularity
• refresh/update rate • Quantity and storage required (space) • services developed around these data
“Materialized Views” • Description of what they are (non-technical) •
Some examples of Analytics we do on this data
The Problem! ~1 TByte Data We need OLAP Performance: 75M
rows -> 5hours
The Solution! Use a Column-oriented Database (i.e. Just swap Rows
with Columns) Chuck Norris Test Passed!
None
Query
None
Thank you get in touch @__pamaron__ @leriomaggio