Distributed Systems
Albert Bifet
August 25, 2012
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining
Outline
1. Introduction
2. Stream Algorithmics
3. Concept drift
4. Evaluation
5. Classification
6. Ensemble Methods
7. Regression
8. Clustering
9. Frequent Pattern Mining
10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
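Pig
-- Load three integer columns from the file 'data'.
A = LOAD 'data' USING PigStorage() AS (f1:int, f2:int, f3:int);
-- Group the rows by the first column.
B = GROUP A BY f1;
-- COUNT takes a bag, so count the grouped rows in A, not the group key $0.
C = FOREACH B GENERATE COUNT(A);
DUMP C;
Pig: Similar to SQL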
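For readers who do not know Pig Latin, the group-and-count in the script above can be sketched in plain Python. This is only an illustration: the sample rows and the function name `count_by_first_field` are hypothetical and stand in for the 'data' file, and the sketch keeps the group key next to each count for readability, whereas the Pig script emits only the counts.

```python
from collections import Counter

# Hypothetical sample rows standing in for the 'data' file: (f1, f2, f3).
rows = [(1, 2, 3), (4, 2, 1), (8, 3, 4), (4, 3, 3), (7, 2, 5), (8, 4, 3)]


def count_by_first_field(records):
    """Group records by their first field and count the records per group."""
    return Counter(f1 for f1, _f2, _f3 in records)


counts = count_by_first_field(rows)
for key in sorted(counts):
    print(key, counts[key])
```

On a cluster, Pig compiles the same logic into distributed jobs, so the grouping runs in parallel instead of in a single in-memory pass.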
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology
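The four Storm terms above can be illustrated with a small single-process sketch, assuming the usual reading: a stream is a sequence of tuples, a spout is a source of a stream, a bolt consumes streams and emits new ones, and a topology is the wiring between them. This is not the Storm API. The function names are hypothetical, Python generators stand in for streams, and the input is finite so the demo terminates.

```python
from collections import Counter
from typing import Dict, Iterable, Iterator, Tuple


def sentence_spout(sentences: Iterable[str]) -> Iterator[str]:
    """Spout: a source that emits tuples into a stream."""
    for sentence in sentences:
        yield sentence


def split_bolt(stream: Iterable[str]) -> Iterator[str]:
    """Bolt: consumes a stream of sentences and emits a stream of words."""
    for sentence in stream:
        for word in sentence.split():
            yield word


def count_bolt(stream: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Bolt: keeps running counts and emits (word, count) for every input."""
    counts: Counter = Counter()
    for word in stream:
        counts[word] += 1
        yield word, counts[word]


def run_topology(sentences: Iterable[str]) -> Dict[str, int]:
    """Topology: wire spout -> split bolt -> count bolt, keep the latest counts."""
    latest: Dict[str, int] = {}
    for word, count in count_bolt(split_bolt(sentence_spout(sentences))):
        latest[word] = count
    return latest


result = run_topology(["storm processes streams", "streams of tuples"])
print(result)
```

In a real deployment the spouts and bolts run as parallel tasks across machines and the stream is unbounded. The sketch only shows how the pieces connect.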