Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fast In-memory Analytics for Retail Data with C...
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
ernestoarbitrio
April 09, 2017
Technology
73
0
Share
Fast In-memory Analytics for Retail Data with Columnar Databases
ernestoarbitrio
April 09, 2017
More Decks by ernestoarbitrio
See All by ernestoarbitrio
Enable effective Observability with Python
pamaron
0
140
PyConZA 2022 Best practices for good(ish) and clean(ish) code
pamaron
0
120
PyCon Italia 2022 Best practices for good(ish) and clean(ish) code
pamaron
0
270
Bokeh: Using python for interactive data visualization
pamaron
1
180
Keystroke Behavioural Analysis For Fraud Detection: Deep Learning as-a-service Infrastructure
pamaron
0
65
Indexing and search tons of data with ElasticSearch and Django
pamaron
0
420
Interactive plot with django and highchart JS (without JS)
pamaron
0
380
Other Decks in Technology
See All in Technology
2026-05-14 要件定義からソース管理まで!IBM Bob基礎ハンズオン
yutanonaka
0
160
なぜ、私がCommunity Builderに?〜活動期間1か月半でも選出されたワケ〜
yama3133
0
130
Terragrunt x Snowflake + dbt で作るマルチテナントなデータ基盤構築プラットフォーム
gak_t12
0
170
Vision Banana: Image Generators are Generalist Vision Learners
kzykmyzw
0
390
Claude Code / Codex / Kiro に AWS 権限を 渡すとき、何を設計すべきか
k_adachi_01
5
1.5k
Claude Code で使える DuckDB Skills を試してみた / DuckDB Skills and Claude Code
masahirokawahara
1
360
マンション備え付けのネットワークとLTE回線を組み合わせた ネットワークの安定化の考案
harutiro
1
130
"スキルファースト"で作る、AIの自走環境
subroh0508
0
510
サンプリングは「作る」のか「使う」のか? 分散トレースのコストと運用を両立する実践的戦略 / Why you need the tail sampling and why you don't want it
ymotongpoo
4
180
20260516_SecJAWS_Days
takuyay0ne
2
430
Oracle Base Database Service 技術詳細
oracle4engineer
PRO
15
100k
ServiceによるKubernetes通信制御ーClusterIPを例に
miku01
1
170
Featured
See All Featured
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
49
9.9k
Public Speaking Without Barfing On Your Shoes - THAT 2023
reverentgeek
1
390
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
35
3.5k
Designing Experiences People Love
moore
143
24k
Groundhog Day: Seeking Process in Gaming for Health
codingconduct
0
180
Design of three-dimensional binary manipulators for pick-and-place task avoiding obstacles (IECON2024)
konakalab
0
420
Building Adaptive Systems
keathley
44
3k
Building Experiences: Design Systems, User Experience, and Full Site Editing
marktimemedia
0
500
Ten Tips & Tricks for a 🌱 transition
stuffmc
0
110
Ethics towards AI in product and experience design
skipperchong
2
270
Data-driven link building: lessons from a $708K investment (BrightonSEO talk)
szymonslowik
1
1.1k
What’s in a name? Adding method to the madness
productmarketing
PRO
24
4k
Transcript
Fast In-memory Analytics for Retail Data with Columnar Databases Ernesto
Arbitrio - Valerio Maggio arbitrio |
[email protected]
Florence April 6, 2017
Retail Data • Overview of data we have • granularity
• refresh/update rate • Quantity and storage required (space) • services developed around these data
“Materialized Views” • Description of what they are (non-technical) •
Some examples of Analytics we do on this data
The Problem! ~1 TByte Data We need OLAP Performance: 75M
rows -> 5hours
The Solution! Use a Column-oriented Database (i.e. Just swap Rows
with Columns) Chuck Norris Test Passed!
None
Query
None
Thank you get in touch @__pamaron__ @leriomaggio