Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Albert Bifet
August 25, 2012
Research
1
270
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
250
Regression
abifet
0
260
Evaluation
abifet
1
230
Stream Algorithmics
abifet
1
380
Introduction to Data Stream Mining
abifet
1
240
Clustering
abifet
2
300
Ensemble Methods
abifet
0
250
Classification
abifet
0
330
Concept Drift
abifet
0
400
Other Decks in Research
See All in Research
数理最適化と機械学習の融合
mickey_kubo
15
9.1k
When Submarine Cables Go Dark: Examining the Web Services Resilience Amid Global Internet Disruptions
irvin
0
280
PhD Defense 2025: Visual Understanding of Human Hands in Interactions
tkhkaeio
1
140
Generative Models 2025
takahashihiroshi
23
13k
Submeter-level land cover mapping of Japan
satai
3
200
集合間Bregmanダイバージェンスと置換不変NNによるその学習
wasyro
0
130
SSII2025 [TS1] 光学・物理原理に基づく深層画像生成
ssii
PRO
4
4.1k
時系列データに対する解釈可能な 決定木クラスタリング
mickey_kubo
2
870
能動適応的実験計画
masakat0
2
760
問いを起点に、社会と共鳴する知を育む場へ
matsumoto_r
PRO
0
560
引力・斥力を制御可能なランダム部分集合の確率分布
wasyro
0
220
「エージェントって何?」から「実際の開発現場で役立つ考え方やベストプラクティス」まで
mickey_kubo
0
140
Featured
See All Featured
How STYLIGHT went responsive
nonsquared
100
5.7k
The MySQL Ecosystem @ GitHub 2015
samlambert
251
13k
How To Stay Up To Date on Web Technology
chriscoyier
790
250k
Optimizing for Happiness
mojombo
379
70k
Gamification - CAS2011
davidbonilla
81
5.4k
A Modern Web Designer's Workflow
chriscoyier
695
190k
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
30
9.6k
KATA
mclloyd
32
14k
The Cost Of JavaScript in 2023
addyosmani
53
8.8k
Fantastic passwords and where to find them - at NoRuKo
philnash
51
3.4k
The Pragmatic Product Professional
lauravandoore
36
6.8k
Art, The Web, and Tiny UX
lynnandtonic
301
21k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology