Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Albert Bifet
August 25, 2012
Research
1
290
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
270
Regression
abifet
0
280
Evaluation
abifet
1
240
Stream Algorithmics
abifet
1
400
Introduction to Data Stream Mining
abifet
1
250
Clustering
abifet
2
320
Ensemble Methods
abifet
0
290
Classification
abifet
0
350
Concept Drift
abifet
0
420
Other Decks in Research
See All in Research
教師あり学習と強化学習で作る 最強の数学特化LLM
analokmaus
2
820
データサイエンティストの業務変化
datascientistsociety
PRO
0
120
第二言語習得研究における 明示的・暗示的知識の再検討:この分類は何に役に立つか,何に役に立たないか
tam07pb915
0
690
Language Models Are Implicitly Continuous
eumesy
PRO
0
370
AI in Enterprises - Java and Open Source to the Rescue
ivargrimstad
0
1.1k
超高速データサイエンス
matsui_528
1
340
ドメイン知識がない領域での自然言語処理の始め方
hargon24
1
230
J-RAGBench: 日本語RAGにおける Generator評価ベンチマークの構築
koki_itai
0
1.1k
自動運転におけるデータ駆動型AIに対する安全性の考え方 / Safety Engineering for Data-Driven AI in Autonomous Driving Systems
ishikawafyu
0
110
GPUを利用したStein Particle Filterによる点群6自由度モンテカルロSLAM
takuminakao
0
780
Pythonでジオを使い倒そう! 〜それとFOSS4G Hiroshima 2026のご紹介を少し〜
wata909
0
1.2k
AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data
satai
3
670
Featured
See All Featured
Ruling the World: When Life Gets Gamed
codingconduct
0
120
Chasing Engaging Ingredients in Design
codingconduct
0
93
Optimizing for Happiness
mojombo
379
70k
State of Search Keynote: SEO is Dead Long Live SEO
ryanjones
0
80
Effective software design: The role of men in debugging patriarchy in IT @ Voxxed Days AMS
baasie
0
190
Un-Boring Meetings
codingconduct
0
170
Why Our Code Smells
bkeepers
PRO
340
58k
Writing Fast Ruby
sferik
630
62k
How to Get Subject Matter Experts Bought In and Actively Contributing to SEO & PR Initiatives.
livdayseo
0
39
Fantastic passwords and where to find them - at NoRuKo
philnash
52
3.5k
It's Worth the Effort
3n
187
29k
Become a Pro
speakerdeck
PRO
31
5.8k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology