Agile Lab - Massive real time analytics with Spark
Mattia Bertorello
September 22, 2015
Transcript
Mattia Bertorello
Who is Agile Lab? GO BIG (data) or GO HOME
Summary
• Why streaming matters
• Why prediction?
• Streaming architecture
• Spark Streaming
• Demo time
Why streaming matters ©2015 Agile Lab S.r.l.
Why streaming matters
Typical Big Data workflow: Data → Big Data → Batch Analysis → Business Reaction
Why streaming matters
Streaming Big Data workflow: Data → Real Time Processing → Business Reaction
FASTER REACTIONS! MORE PROFITS
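To make the "faster reactions" claim concrete, here is a minimal sketch in plain Python (not Spark code). It assumes a result only becomes available at the end of the processing window in which an event arrived, and it ignores processing time. The event times, the hourly batch period, and the 2-second interval are all made-up values for illustration.

```python
# Hypothetical event arrival times, in seconds since the start of the hour.
event_times = [5, 620, 1790, 3599]


def window_delays(times, period):
    """Seconds each event waits until the end of the window it arrived in."""
    delays = []
    for t in times:
        window_end = (t // period + 1) * period
        delays.append(window_end - t)
    return delays


# One batch job per hour versus an assumed 2-second streaming interval.
hourly = window_delays(event_times, period=3600)
streaming = window_delays(event_times, period=2)

print("worst wait, hourly batch:", max(hourly), "s")
print("worst wait, 2 s interval:", max(streaming), "s")
```

Under these assumptions, the first event of the hour waits almost the whole hour for a batch result, while no event waits longer than one interval in the streaming case.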
Why streaming matters
• Fleet Management
• Insurance
• Recommendation
• Etc...
Why prediction?
Why prediction?
• Rule-based categorization and clustering is obsolete
• Pattern discovery
• Adaptation to fast-changing data
• Smart thinking: no dummies
• Prediction is more valuable
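The contrast between a fixed rule and adaptation to changing data can be sketched as follows. This is not the approach used in the talk, which the slides do not describe. It is a simple stand-in that compares a static threshold with a threshold derived from running statistics (Welford's online algorithm). The amounts, `FIXED_LIMIT`, `k`, and `warmup` are all hypothetical.

```python
import math


class RunningStats:
    """Welford's online algorithm for a running mean and sample variance."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self.m2 = 0.0  # sum of squared deviations from the current mean

    def update(self, x):
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)

    def std(self):
        return math.sqrt(self.m2 / (self.n - 1)) if self.n > 1 else 0.0


FIXED_LIMIT = 500.0  # hypothetical static rule: flag anything above this


def fixed_rule(amount):
    return amount > FIXED_LIMIT


def adaptive_rule(stats, amount, k=3.0, warmup=5):
    """Flag amounts more than k standard deviations above the running mean."""
    if stats.n < warmup:
        return False  # not enough history yet
    return amount > stats.mean + k * stats.std()


stats = RunningStats()
amounts = [20, 25, 22, 30, 18, 24, 300, 21]  # made-up transaction amounts
flags = []
for amount in amounts:
    # Decide using only past data, then learn from the new value.
    flags.append((amount, fixed_rule(amount), adaptive_rule(stats, amount)))
    stats.update(amount)

for amount, fixed, adaptive in flags:
    print(amount, "fixed:", fixed, "adaptive:", adaptive)
```

With this data the amount 300 stays below the fixed limit and is missed by the static rule, but it stands out against the history seen so far and is flagged by the adaptive one.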
Streaming architecture
Streaming architecture
Ingestion Layer → Processing Layer → Serving Layer
Spark Streaming
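Spark Streaming processes a live stream as a sequence of small batches, one per fixed batch interval. The sketch below imitates that idea across the three layers in plain Python. It is not Spark code, and for simplicity it collects all events before grouping them, which a real stream would not do. The sample events, the 1-second interval, and the function names are assumptions made for illustration.

```python
from collections import defaultdict


def ingest():
    """Ingestion layer: yields (timestamp_seconds, value) events (made-up data)."""
    yield from [(0.2, 10), (0.9, 5), (1.1, 7), (2.5, 1), (2.7, 2), (4.0, 8)]


def micro_batches(events, interval):
    """Group events into fixed-interval batches keyed by batch index."""
    batches = defaultdict(list)
    for ts, value in events:
        batches[int(ts // interval)].append(value)
    return dict(sorted(batches.items()))


def process(batch):
    """Processing layer: compute one aggregate per batch."""
    return {"count": len(batch), "total": sum(batch)}


# Serving layer: results kept by batch index, ready to be queried.
serving_store = {}
for index, batch in micro_batches(ingest(), interval=1.0).items():
    serving_store[index] = process(batch)

for index, result in serving_store.items():
    print("batch", index, result)
```

Intervals with no events (batch 3 here) simply produce no entry in this sketch.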
Spark Notebook: use Apache Spark straight from the browser
Demo time...
Card transaction analysis
ENCRYPTED PAN | AMOUNT | DESCRIPTION | TIMESTAMP
Classification of online/offline transactions
ENCRYPTED PAN | AMOUNT | DESCRIPTION | TIMESTAMP | ISONLINE
Fraud detection algorithm
SQL aggregation
Real-time alert generation
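The demo steps can be sketched end to end in plain Python, with the standard-library `sqlite3` module standing in for the SQL aggregation. The slides give neither the classification rule nor the fraud detection algorithm, so both rules here are assumptions: a description starting with `WWW.` counts as online, and three or more online transactions on one card raises an alert. The sample rows are invented, and the encrypted PAN (PAN CIFRATO) is stored in a column named `encrypted_pan`.

```python
import sqlite3

# Hypothetical sample rows: (encrypted_pan, amount, description, timestamp).
transactions = [
    ("a1f3", 12.50, "POS GROCERY STORE", 1000),
    ("a1f3", 80.00, "WWW.SHOP.EXAMPLE", 1030),
    ("a1f3", 95.00, "WWW.GAMES.EXAMPLE", 1045),
    ("a1f3", 99.00, "WWW.TRAVEL.EXAMPLE", 1050),
    ("b7c9", 40.00, "WWW.BOOKS.EXAMPLE", 1060),
]


def is_online(description):
    """Assumed rule: a description starting with 'WWW.' means online."""
    return description.upper().startswith("WWW.")


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE tx (encrypted_pan TEXT, amount REAL, description TEXT, "
    "ts INTEGER, isonline INTEGER)"
)
# Classification step: add the ISONLINE flag to each row.
conn.executemany(
    "INSERT INTO tx VALUES (?, ?, ?, ?, ?)",
    [(pan, amt, desc, ts, int(is_online(desc)))
     for pan, amt, desc, ts in transactions],
)

# SQL aggregation plus the assumed alert rule (3 or more online transactions).
alerts = conn.execute(
    """
    SELECT encrypted_pan, COUNT(*) AS n, SUM(amount) AS total
    FROM tx
    WHERE isonline = 1
    GROUP BY encrypted_pan
    HAVING COUNT(*) >= 3
    ORDER BY encrypted_pan
    """
).fetchall()

for pan, n, total in alerts:
    print(f"ALERT card={pan} online_tx={n} total={total:.2f}")
```

In a streaming setting the same query would run on each new batch of transactions rather than once over a fixed table.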
We are hiring...
[email protected]