Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Albert Bifet
August 25, 2012
Research
1
290
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
280
Regression
abifet
0
290
Evaluation
abifet
1
250
Stream Algorithmics
abifet
1
410
Introduction to Data Stream Mining
abifet
1
250
Clustering
abifet
2
330
Ensemble Methods
abifet
0
300
Classification
abifet
0
360
Concept Drift
abifet
0
430
Other Decks in Research
See All in Research
[SITA2025 Workshop] 空中計算による高速・低遅延な分散回帰分析
k_sato
0
120
姫路市 -都市OSの「再実装」-
hopin
0
1.7k
A History of Approximate Nearest Neighbor Search from an Applications Perspective
matsui_528
1
190
一般道の交通量減少と速度低下についての全国分析と熊本市におけるケーススタディ(20251122 土木計画学研究発表会)
trafficbrain
0
180
ローテーション別のサイドアウト戦略 ~なぜあのローテは回らないのか?~
vball_panda
0
300
「なんとなく」の顧客理解から脱却する ──顧客の解像度を武器にするインサイトマネジメント
tajima_kaho
10
6.8k
ペットのかわいい瞬間を撮影する オートシャッターAIアプリへの スマートラベリングの適用
mssmkmr
0
370
Akamaiのキャッシュ効率を支えるAdaptSizeについての論文を読んでみた
bootjp
1
490
病院向け生成AIプロダクト開発の実践と課題
hagino3000
0
570
その推薦システムの評価指標、ユーザーの感覚とズレてるかも
kuri8ive
1
340
離散凸解析に基づく予測付き離散最適化手法 (IBIS '25)
taihei_oki
PRO
1
720
「車1割削減、渋滞半減、公共交通2倍」を 熊本から岡山へ@RACDA設立30周年記念都市交通フォーラム2026
trafficbrain
1
720
Featured
See All Featured
Un-Boring Meetings
codingconduct
0
220
The Cost Of JavaScript in 2023
addyosmani
55
9.8k
Making the Leap to Tech Lead
cromwellryan
135
9.8k
Organizational Design Perspectives: An Ontology of Organizational Design Elements
kimpetersen
PRO
1
630
Visualization
eitanlees
150
17k
How to Get Subject Matter Experts Bought In and Actively Contributing to SEO & PR Initiatives.
livdayseo
0
82
How STYLIGHT went responsive
nonsquared
100
6k
Game over? The fight for quality and originality in the time of robots
wayneb77
1
130
VelocityConf: Rendering Performance Case Studies
addyosmani
333
24k
What does AI have to do with Human Rights?
axbom
PRO
1
2k
<Decoding/> the Language of Devs - We Love SEO 2024
nikkihalliwell
1
150
Measuring Dark Social's Impact On Conversion and Attribution
stephenakadiri
1
150
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology