Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Albert Bifet
August 25, 2012
Research
1
260
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
230
Regression
abifet
0
240
Evaluation
abifet
1
200
Stream Algorithmics
abifet
1
350
Introduction to Data Stream Mining
abifet
1
220
Clustering
abifet
2
270
Ensemble Methods
abifet
0
230
Classification
abifet
0
310
Concept Drift
abifet
0
360
Other Decks in Research
See All in Research
Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping
satai
3
190
Batch Processing Algorithm for Elliptic Curve Operations and Its AVX-512 Implementation
herumi
0
140
資産間の相関関係を頑健に評価する指標を用いたファクターアローケーション戦略の構築
nomamist
0
170
Weekly AI Agents News! 1月号 アーカイブ
masatoto
1
240
Weekly AI Agents News! 2月号 アーカイブ
masatoto
1
130
【NLPコロキウム】Stepwise Alignment for Constrained Language Model Policy Optimization (NeurIPS 2024)
akifumi_wachi
3
570
A Segment Anything Model based weakly supervised learning method for crop mapping using Sentinel-2 time series images
satai
3
190
さくらインターネット研究所 アップデート2025年
matsumoto_r
PRO
0
480
Segment Any Change
satai
3
280
ことばの意味を計算するしくみ
verypluming
10
2.2k
Introduction of NII S. Koyama's Lab (AY2025)
skoyamalab
0
180
Neural Fieldの紹介
nnchiba
2
830
Featured
See All Featured
Testing 201, or: Great Expectations
jmmastey
42
7.4k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
PRO
17
1.1k
How to Think Like a Performance Engineer
csswizardry
22
1.5k
YesSQL, Process and Tooling at Scale
rocio
172
14k
The Cost Of JavaScript in 2023
addyosmani
48
7.6k
Designing for humans not robots
tammielis
251
25k
Making Projects Easy
brettharned
116
6.1k
Designing for Performance
lara
606
69k
Intergalactic Javascript Robots from Outer Space
tanoku
270
27k
Unsuck your backbone
ammeep
670
57k
Faster Mobile Websites
deanohume
306
31k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
135
33k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology