Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Albert Bifet
August 25, 2012
Research
290
1
Share
Distributed Systems
Albert Bifet
August 25, 2012
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
290
Regression
abifet
0
290
Evaluation
abifet
1
260
Stream Algorithmics
abifet
1
420
Introduction to Data Stream Mining
abifet
1
260
Clustering
abifet
2
330
Ensemble Methods
abifet
0
300
Classification
abifet
0
370
Concept Drift
abifet
0
430
Other Decks in Research
See All in Research
2026-01-30-MandSL-textbook-jp-cos-lod
yegusa
1
1.1k
衛星×エッジAI勉強会 衛星上におけるAI処理制約とそ取組について
satai
4
470
さくらインターネット研究所テックトーク2026春、研究開発Gr.25年度成果26年度方針
kikuzo
0
130
非試合日の野球場を楽しむためのARホームランボールキャッチ体験システムの開発 / EC79-miyazaki
yumulab
0
170
通時的な類似度行列に基づく単語の意味変化の分析
rudorudo11
0
270
An Open and Reproducible Deep Research Agent for Long-Form Question Answering
ikuyamada
0
440
2026年3月1日(日)福島「除染土」の公共利用をかんがえる
atsukomasano2026
0
580
R&Dチームを起ち上げる
shibuiwilliam
1
240
生成AI による論文執筆サポート・ワークショップ 論文執筆・推敲編 / Generative AI-Assisted Paper Writing Support Workshop: Drafting and Revision Edition
ks91
PRO
0
200
「AIとWhyを深堀る」をAIと深堀る
iflection
0
350
製造業主導型経済からサービス経済化における中間層形成メカニズムのパラダイムシフト
yamotty
0
570
はじまりの クエスチョンブック —余暇と豊かさにあふれた社会とは?
culturaltransition
PRO
0
400
Featured
See All Featured
Navigating Weather and Climate Data
rabernat
0
180
Writing Fast Ruby
sferik
630
63k
Documentation Writing (for coders)
carmenintech
77
5.3k
Digital Ethics as a Driver of Design Innovation
axbom
PRO
1
280
Ten Tips & Tricks for a 🌱 transition
stuffmc
0
110
Statistics for Hackers
jakevdp
799
230k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
Tips & Tricks on How to Get Your First Job In Tech
honzajavorek
1
500
Collaborative Software Design: How to facilitate domain modelling decisions
baasie
1
200
New Earth Scene 8
popppiees
3
2.2k
The Limits of Empathy - UXLibs8
cassininazir
1
320
Why You Should Never Use an ORM
jnunemaker
PRO
61
9.8k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology