Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Albert Bifet
August 25, 2012
Research
1
270
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
250
Regression
abifet
0
260
Evaluation
abifet
1
230
Stream Algorithmics
abifet
1
380
Introduction to Data Stream Mining
abifet
1
240
Clustering
abifet
2
300
Ensemble Methods
abifet
0
250
Classification
abifet
0
330
Concept Drift
abifet
0
400
Other Decks in Research
See All in Research
SSII2025 [TS1] 光学・物理原理に基づく深層画像生成
ssii
PRO
4
4.1k
在庫管理のための機械学習と最適化の融合
mickey_kubo
3
1.1k
集合間Bregmanダイバージェンスと置換不変NNによるその学習
wasyro
0
130
A scalable, annual aboveground biomass product for monitoring carbon impacts of ecosystem restoration projects
satai
3
150
数理最適化に基づく制御
mickey_kubo
6
710
Ad-DS Paper Circle #1
ykaneko1992
0
5.8k
SSII2025 [TS2] リモートセンシング画像処理の最前線
ssii
PRO
7
3k
診断前の病歴テキストを対象としたLLMによるエンティティリンキング精度検証
hagino3000
1
120
投資戦略202508
pw
0
150
Galileo: Learning Global & Local Features of Many Remote Sensing Modalities
satai
3
140
Self-supervised audiovisual representation learning for remote sensing data
satai
3
250
20250624_熊本経済同友会6月例会講演
trafficbrain
1
550
Featured
See All Featured
A Modern Web Designer's Workflow
chriscoyier
695
190k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
49
3k
VelocityConf: Rendering Performance Case Studies
addyosmani
332
24k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.5k
Reflections from 52 weeks, 52 projects
jeffersonlam
351
21k
Chrome DevTools: State of the Union 2024 - Debugging React & Beyond
addyosmani
7
810
Music & Morning Musume
bryan
46
6.7k
Fantastic passwords and where to find them - at NoRuKo
philnash
51
3.4k
The World Runs on Bad Software
bkeepers
PRO
70
11k
Fireside Chat
paigeccino
39
3.6k
Statistics for Hackers
jakevdp
799
220k
The MySQL Ecosystem @ GitHub 2015
samlambert
251
13k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology