Distributed Systems
Albert Bifet
August 25, 2012
Transcript
Distributed Streaming
Albert Bifet, May 2012
COMP423A/COMP523A Data Stream Mining
Outline
1. Introduction
2. Stream Algorithmics
3. Concept drift
4. Evaluation
5. Classification
6. Ensemble Methods
7. Regression
8. Clustering
9. Frequent Pattern Mining
10. Distributed Streaming
Data Streams: Big Data & Real Time
Distributed Systems: Hadoop, S4 and Storm
Hadoop
Hadoop: Hadoop architecture
Apache Mahout: open source framework
Pig: Similar to SQL
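Pig
-- load records with three integer fields
A = LOAD 'data' USING PigStorage() AS (f1:int, f2:int, f3:int);
-- group the records by the first field
B = GROUP A BY f1;
-- count the tuples in each group's bag
C = FOREACH B GENERATE COUNT(A);
-- print the result
DUMP C;
Pig: Similar to SQL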
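For readers who do not know Pig Latin, the group-then-count logic of the script can be mirrored in plain Python. This is only a single-machine sketch: the sample rows and the function name `count_by_first_field` are hypothetical, and nothing here runs on Hadoop.

```python
from collections import Counter

# Hypothetical rows standing in for the 'data' file: (f1, f2, f3)
rows = [(1, 2, 3), (4, 2, 1), (8, 3, 4), (4, 3, 3), (7, 2, 5), (8, 4, 3)]

def count_by_first_field(records):
    """Group records by f1 and count the records in each group."""
    return Counter(f1 for f1, _f2, _f3 in records)

counts = count_by_first_field(rows)
for key in sorted(counts):
    print(key, counts[key])
```

The Pig version expresses the same steps declaratively and leaves it to the Pig runtime to execute them as distributed jobs over large inputs.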
Apache S4
Apache S4
Storm: Storm from Twitter
Storm: Stream, Spout, Bolt, Topology
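The four Storm terms fit together roughly as follows: a spout is a source that emits a stream of tuples, a bolt consumes a stream and emits a new one, and a topology is the wiring between them. The sketch below illustrates that idea with Python generators. It is not Storm's API; every name in it (`sentence_spout`, `split_bolt`, `count_bolt`, `run_topology`) is hypothetical, it runs in one process, and it supports only a linear chain, whereas real topologies are graphs executed across a cluster.

```python
from collections import Counter

def sentence_spout():
    """Spout: a source that emits a stream of tuples (here, sentences)."""
    for sentence in ["storm processes streams", "streams of tuples"]:
        yield sentence

def split_bolt(stream):
    """Bolt: consumes sentences and emits one word per tuple."""
    for sentence in stream:
        for word in sentence.split():
            yield word

def count_bolt(stream):
    """Bolt: keeps a running count and emits (word, count so far)."""
    counts = Counter()
    for word in stream:
        counts[word] += 1
        yield word, counts[word]

def run_topology(spout, bolts):
    """Topology: wire the spout into a linear chain of bolts and run it."""
    stream = spout()
    for bolt in bolts:
        stream = bolt(stream)
    return list(stream)

result = run_topology(sentence_spout, [split_bolt, count_bolt])
print(result)
```

Because each stage is a generator, tuples flow through the chain one at a time rather than in batches, which is the property that distinguishes stream processing from a batch job.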