Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Albert Bifet
August 25, 2012
Research
1
270
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
250
Regression
abifet
0
260
Evaluation
abifet
1
230
Stream Algorithmics
abifet
1
380
Introduction to Data Stream Mining
abifet
1
240
Clustering
abifet
2
300
Ensemble Methods
abifet
0
260
Classification
abifet
0
340
Concept Drift
abifet
0
400
Other Decks in Research
See All in Research
snlp2025_prevent_llm_spikes
takase
0
350
Stealing LUKS Keys via TPM and UUID Spoofing in 10 Minutes - BSides 2025
anykeyshik
0
130
GPUを利用したStein Particle Filterによる点群6自由度モンテカルロSLAM
takuminakao
0
350
Mechanistic Interpretability:解釈可能性研究の新たな潮流
koshiro_aoki
1
440
AWSで実現した大規模日本語VLM学習用データセット "MOMIJI" 構築パイプライン/buiding-momiji
studio_graph
2
640
Minimax and Bayes Optimal Best-arm Identification: Adaptive Experimental Design for Treatment Choice
masakat0
0
180
EarthSynth: Generating Informative Earth Observation with Diffusion Models
satai
3
360
世界の人気アプリ100個を分析して見えたペイウォール設計の心得
akihiro_kokubo
PRO
62
30k
Towards a More Efficient Reasoning LLM: AIMO2 Solution Summary and Introduction to Fast-Math Models
analokmaus
2
870
Hiding What from Whom? A Critical Review of the History of Programming languages for Music
tomoyanonymous
0
190
問いを起点に、社会と共鳴する知を育む場へ
matsumoto_r
PRO
0
640
Large Language Model Agent: A Survey on Methodology, Applications and Challenges
shunk031
17
10k
Featured
See All Featured
Site-Speed That Sticks
csswizardry
11
880
Faster Mobile Websites
deanohume
310
31k
Code Review Best Practice
trishagee
72
19k
Fireside Chat
paigeccino
40
3.7k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
45
2.5k
Optimizing for Happiness
mojombo
379
70k
Why Our Code Smells
bkeepers
PRO
339
57k
Visualization
eitanlees
148
16k
Building a Modern Day E-commerce SEO Strategy
aleyda
43
7.7k
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
4k
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
31
9.7k
How GitHub (no longer) Works
holman
315
140k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology