Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fast In-memory Analytics for Retail Data with C...
Search
ernestoarbitrio
April 09, 2017
Technology
0
65
Fast In-memory Analytics for Retail Data with Columnar Databases
ernestoarbitrio
April 09, 2017
Tweet
Share
More Decks by ernestoarbitrio
See All by ernestoarbitrio
Enable effective Observability with Python
pamaron
0
100
PyConZA 2022 Best practices for good(ish) and clean(ish) code
pamaron
0
83
PyCon Italia 2022 Best practices for good(ish) and clean(ish) code
pamaron
0
210
Bokeh: Using python for interactive data visualization
pamaron
1
150
Keystroke Behavioural Analysis For Fraud Detection: Deep Learning as-a-service Infrastructure
pamaron
0
47
Indexing and search tons of data with ElasticSearch and Django
pamaron
0
340
Interactive plot with django and highchart JS (without JS)
pamaron
0
310
Other Decks in Technology
See All in Technology
Making your applications cross-environment - OSCG 2024 NA
salaboy
0
180
Exadata Database Service on Dedicated Infrastructure(ExaDB-D) UI スクリーン・キャプチャ集
oracle4engineer
PRO
2
3.2k
dev 補講: プロダクトセキュリティ / Product security overview
wa6sn
1
2.3k
Platform Engineering for Software Developers and Architects
syntasso
1
510
rootlessコンテナのすゝめ - 研究室サーバーでもできる安全なコンテナ管理
kitsuya0828
3
380
ドメイン名の終活について - JPAAWG 7th -
mikit
33
20k
インフラとバックエンドとフロントエンドをくまなく調べて遅いアプリを早くした件
tubone24
1
430
誰も全体を知らない ~ ロールの垣根を超えて引き上げる開発生産性 / Boosting Development Productivity Across Roles
kakehashi
1
220
Terraform Stacks入門 #HashiTalks
msato
0
350
障害対応指揮の意思決定と情報共有における価値観 / Waroom Meetup #2
arthur1
5
470
EventHub Startup CTO of the year 2024 ピッチ資料
eventhub
0
110
スクラムチームを立ち上げる〜チーム開発で得られたもの・得られなかったもの〜
ohnoeight
2
350
Featured
See All Featured
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
356
29k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
232
17k
Raft: Consensus for Rubyists
vanstee
136
6.6k
Principles of Awesome APIs and How to Build Them.
keavy
126
17k
Build The Right Thing And Hit Your Dates
maggiecrowley
33
2.4k
Intergalactic Javascript Robots from Outer Space
tanoku
269
27k
Testing 201, or: Great Expectations
jmmastey
38
7.1k
Product Roadmaps are Hard
iamctodd
PRO
49
11k
The Straight Up "How To Draw Better" Workshop
denniskardys
232
140k
Unsuck your backbone
ammeep
668
57k
Let's Do A Bunch of Simple Stuff to Make Websites Faster
chriscoyier
506
140k
Docker and Python
trallard
40
3.1k
Transcript
Fast In-memory Analytics for Retail Data with Columnar Databases Ernesto
Arbitrio - Valerio Maggio arbitrio |
[email protected]
Florence April 6, 2017
Retail Data • Overview of data we have • granularity
• refresh/update rate • Quantity and storage required (space) • services developed around these data
“Materialized Views” • Description of what they are (non-technical) •
Some examples of Analytics we do on this data
The Problem! ~1 TByte Data We need OLAP Performance: 75M
rows -> 5hours
The Solution! Use a Column-oriented Database (i.e. Just swap Rows
with Columns) Chuck Norris Test Passed!
None
Query
None
Thank you get in touch @__pamaron__ @leriomaggio