Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fast In-memory Analytics for Retail Data with C...
Search
ernestoarbitrio
April 09, 2017
Technology
0
65
Fast In-memory Analytics for Retail Data with Columnar Databases
ernestoarbitrio
April 09, 2017
Tweet
Share
More Decks by ernestoarbitrio
See All by ernestoarbitrio
Enable effective Observability with Python
pamaron
0
110
PyConZA 2022 Best practices for good(ish) and clean(ish) code
pamaron
0
89
PyCon Italia 2022 Best practices for good(ish) and clean(ish) code
pamaron
0
220
Bokeh: Using python for interactive data visualization
pamaron
1
150
Keystroke Behavioural Analysis For Fraud Detection: Deep Learning as-a-service Infrastructure
pamaron
0
51
Indexing and search tons of data with ElasticSearch and Django
pamaron
0
360
Interactive plot with django and highchart JS (without JS)
pamaron
0
320
Other Decks in Technology
See All in Technology
プロダクト観点で考えるデータ基盤の育成戦略 / Growth Strategy of Data Analytics Platforms from a Product Perspective
yamamotoyuta
0
420
技術負債の「予兆検知」と「状況異変」のススメ / Technology Dept
i35_267
1
340
バクラクの組織とアーキテクチャ(要約)2025/01版
shkomine
13
3.3k
EDRからERM: PFN-SIRTが関わるセキュリティとリスクへの取り組み
pfn
PRO
0
140
talk_about_wasmwasi
junkishigaki
0
100
Women in Agile
kawaguti
PRO
3
190
DeepSeek on AWS
hariby
1
200
パブリッククラウドのプロダクトマネジメントとアーキテクト
tagomoris
4
960
Kubernetes x k6 で負荷試験基盤を開発して 負荷試験を民主化した話 / Kubernetes x k6
sansan_randd
1
610
Bounded Context: Problem or Solution?
ewolff
1
200
これからSREになる人と、これからもSREをやっていく人へ
masayoshi
5
3.8k
Creative Pair
kawaguti
PRO
1
150
Featured
See All Featured
How to Ace a Technical Interview
jacobian
276
23k
BBQ
matthewcrist
86
9.4k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
98
18k
Understanding Cognitive Biases in Performance Measurement
bluesmoon
27
1.5k
The MySQL Ecosystem @ GitHub 2015
samlambert
250
12k
Imperfection Machines: The Place of Print at Facebook
scottboms
267
13k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
232
17k
ReactJS: Keep Simple. Everything can be a component!
pedronauck
666
120k
Rebuilding a faster, lazier Slack
samanthasiow
79
8.8k
Into the Great Unknown - MozCon
thekraken
34
1.6k
Music & Morning Musume
bryan
46
6.3k
Statistics for Hackers
jakevdp
797
220k
Transcript
Fast In-memory Analytics for Retail Data with Columnar Databases Ernesto
Arbitrio - Valerio Maggio arbitrio |
[email protected]
Florence April 6, 2017
Retail Data • Overview of data we have • granularity
• refresh/update rate • Quantity and storage required (space) • services developed around these data
“Materialized Views” • Description of what they are (non-technical) •
Some examples of Analytics we do on this data
The Problem! ~1 TByte Data We need OLAP Performance: 75M
rows -> 5hours
The Solution! Use a Column-oriented Database (i.e. Just swap Rows
with Columns) Chuck Norris Test Passed!
None
Query
None
Thank you get in touch @__pamaron__ @leriomaggio