Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Albert Bifet
August 25, 2012
Research
1
280
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
260
Regression
abifet
0
260
Evaluation
abifet
1
230
Stream Algorithmics
abifet
1
390
Introduction to Data Stream Mining
abifet
1
240
Clustering
abifet
2
310
Ensemble Methods
abifet
0
260
Classification
abifet
0
340
Concept Drift
abifet
0
410
Other Decks in Research
See All in Research
財務諸表監査のための逐次検定
masakat0
0
160
Hiding What from Whom? A Critical Review of the History of Programming languages for Music
tomoyanonymous
0
210
PhD Defense 2025: Visual Understanding of Human Hands in Interactions
tkhkaeio
1
270
一人称視点映像解析の最先端(MIRU2025 チュートリアル)
takumayagi
6
4k
「どう育てるか」より「どう働きたいか」〜スクラムマスターの最初の一歩〜
hirakawa51
0
980
AI in Enterprises - Java and Open Source to the Rescue
ivargrimstad
0
840
長期・短期メモリを活用したエージェントの個別最適化
isidaitc
0
220
引力・斥力を制御可能なランダム部分集合の確率分布
wasyro
0
270
言語モデルの地図:確率分布と情報幾何による類似性の可視化
shimosan
8
2k
Time to Cash: The Full Stack Breakdown of Modern ATM Attacks
ratatata
0
160
論文紹介:Safety Alignment Should be Made More Than Just a Few Tokens Deep
kazutoshishinoda
0
110
音声感情認識技術の進展と展望
nagase
0
310
Featured
See All Featured
The Illustrated Children's Guide to Kubernetes
chrisshort
51
51k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
10
900
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.5k
Evolution of real-time – Irina Nazarova, EuRuKo, 2024
irinanazarova
9
1k
The Straight Up "How To Draw Better" Workshop
denniskardys
239
140k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
253
22k
Building a Modern Day E-commerce SEO Strategy
aleyda
44
7.9k
Building Adaptive Systems
keathley
44
2.8k
Measuring & Analyzing Core Web Vitals
bluesmoon
9
650
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
658
61k
A designer walks into a library…
pauljervisheath
209
24k
Automating Front-end Workflow
addyosmani
1371
200k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology