Distributed Systems
Albert Bifet
August 25, 2012
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining
Outline
1. Introduction
2. Stream Algorithmics
3. Concept drift
4. Evaluation
5. Classification
6. Ensemble Methods
7. Regression
8. Clustering
9. Frequent Pattern Mining
10. Distributed Streaming
Data Streams: Big Data & Real Time
Distributed Systems: Hadoop, S4 and Storm
Hadoop
Hadoop: architecture
Apache Mahout: open source framework
Pig: similar to SQL
Pig: similar to SQL
A = LOAD 'data' USING PigStorage() AS (f1:int, f2:int, f3:int);
B = GROUP A BY f1;
-- count the rows in each group (the bag A, not the group key $0)
C = FOREACH B GENERATE COUNT(A);
DUMP C;
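The group-and-count step in the Pig script can be sketched in plain Python to show what it computes. This is only an illustration under stated assumptions: the `rows` list is made-up stand-in data for the `'data'` file, and `count_by_first_field` is a hypothetical helper name, not part of Pig or Hadoop. Unlike the Pig script, which emits only the counts, the sketch also keeps the group key so the result is easier to read.

```python
from collections import Counter

# Hypothetical stand-in for the rows of 'data': (f1, f2, f3) integer tuples.
rows = [(1, 2, 3), (4, 2, 1), (8, 3, 4), (4, 3, 3), (7, 2, 5), (8, 4, 3)]


def count_by_first_field(records):
    """Group records by f1 and count the rows in each group."""
    return Counter(f1 for f1, _f2, _f3 in records)


counts = count_by_first_field(rows)
for key in sorted(counts):
    print(key, counts[key])
```

On a cluster, Pig would compile the same logic into MapReduce jobs; the in-memory `Counter` here only mirrors the result, not the distributed execution.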
Apache S4
Apache S4
Storm: from Twitter
Storm: Stream, Spout, Bolt, Topology
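The four Storm terms on this slide can be illustrated with a toy, single-process sketch. It does not use the Storm API: the function names (`sentence_spout`, `split_bolt`, `count_bolt`, `run_topology`) and the input sentences are assumptions chosen for illustration. A spout emits a stream of tuples, bolts transform or aggregate that stream, and the topology is the wiring between them.

```python
from collections import Counter
from typing import Iterable, Iterator


def sentence_spout() -> Iterator[str]:
    """Spout: the source of a stream (here a fixed, made-up list)."""
    yield from ["big data", "real time data", "data streams"]


def split_bolt(stream: Iterable[str]) -> Iterator[str]:
    """Bolt: consumes sentences and emits one word per output tuple."""
    for sentence in stream:
        yield from sentence.split()


def count_bolt(stream: Iterable[str]) -> Counter:
    """Bolt: keeps a running count of each word seen on the stream."""
    counts: Counter = Counter()
    for word in stream:
        counts[word] += 1
    return counts


def run_topology() -> Counter:
    """Topology: the graph wiring spout -> split bolt -> count bolt."""
    return count_bolt(split_bolt(sentence_spout()))


word_counts = run_topology()
print(word_counts.most_common())
```

In a real Storm deployment the stream is unbounded and each spout and bolt runs as parallel tasks across machines; the generators above only mimic the dataflow shape.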