Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Albert Bifet
August 25, 2012
Research
1
270
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
250
Regression
abifet
0
260
Evaluation
abifet
1
220
Stream Algorithmics
abifet
1
380
Introduction to Data Stream Mining
abifet
1
240
Clustering
abifet
2
300
Ensemble Methods
abifet
0
250
Classification
abifet
0
330
Concept Drift
abifet
0
390
Other Decks in Research
See All in Research
cvpaper.challenge 10年の軌跡 / cvpaper.challenge a decade-long journey
gatheluck
1
260
2025年度 生成AIの使い方/接し方
hkefka385
1
740
「エージェントって何?」から「実際の開発現場で役立つ考え方やベストプラクティス」まで
mickey_kubo
0
130
心理言語学の視点から再考する言語モデルの学習過程
chemical_tree
2
510
とあるSREの博士「過程」 / A Certain SRE’s Ph.D. Journey
yuukit
7
3.6k
大規模な2値整数計画問題に対する 効率的な重み付き局所探索法
mickey_kubo
1
300
一人称視点映像解析の最先端(MIRU2025 チュートリアル)
takumayagi
5
2.2k
EOGS: Gaussian Splatting for Efficient Satellite Image Photogrammetry
satai
4
360
公立高校入試等に対する受入保留アルゴリズム(DA)導入の提言
shunyanoda
0
6.4k
2025年度人工知能学会全国大会チュートリアル講演「深層基盤モデルの数理」
taiji_suzuki
24
17k
LLM-as-a-Judge: 文章をLLMで評価する@教育機関DXシンポ
k141303
3
840
CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations
satai
3
230
Featured
See All Featured
ピンチをチャンスに:未来をつくるプロダクトロードマップ #pmconf2020
aki_iinuma
126
53k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
34
3.1k
The Pragmatic Product Professional
lauravandoore
35
6.8k
Statistics for Hackers
jakevdp
799
220k
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.4k
Gamification - CAS2011
davidbonilla
81
5.4k
A Tale of Four Properties
chriscoyier
160
23k
The Art of Programming - Codeland 2020
erikaheidi
54
13k
Imperfection Machines: The Place of Print at Facebook
scottboms
267
13k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.4k
The Straight Up "How To Draw Better" Workshop
denniskardys
235
140k
Let's Do A Bunch of Simple Stuff to Make Websites Faster
chriscoyier
507
140k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology