Distributed Systems
Albert Bifet
August 25, 2012
Research
Transcript
Distributed Streaming, Albert Bifet, May 2012
COMP423A/COMP523A Data Stream Mining
Outline
1. Introduction
2. Stream Algorithmics
3. Concept drift
4. Evaluation
5. Classification
6. Ensemble Methods
7. Regression
8. Clustering
9. Frequent Pattern Mining
10. Distributed Streaming
Data Streams: Big Data & Real Time
Distributed Systems: Hadoop, S4 and Storm
Hadoop
Hadoop: Hadoop architecture
Apache Mahout: open source framework
Pig: Similar to SQL
Pig
A = LOAD 'data' USING PigStorage() AS (f1:int, f2:int, f3:int);
B = GROUP A BY f1;
C = FOREACH B GENERATE COUNT ($0);
DUMP C;
Pig: Similar to SQL
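For readers who do not know Pig Latin, the script above can be read as a group-and-count pipeline. The following is a minimal Python sketch of that reading, assuming the intent is to count the rows in each f1 group. The sample rows and the helper names group_by_f1 and count_per_group are hypothetical and are not part of the slides or of Pig.

```python
from collections import defaultdict

# Hypothetical sample rows standing in for the 'data' file: (f1, f2, f3).
rows = [(1, 2, 3), (4, 2, 1), (8, 3, 4), (4, 3, 3), (7, 2, 5), (8, 4, 3)]


def group_by_f1(records):
    """Rough analogue of GROUP A BY f1: map each f1 value to its bag of rows."""
    groups = defaultdict(list)
    for record in records:
        groups[record[0]].append(record)
    return dict(groups)


def count_per_group(groups):
    """Rough analogue of FOREACH B GENERATE COUNT(...): one count per group."""
    return {key: len(bag) for key, bag in groups.items()}


grouped = group_by_f1(rows)
counts = count_per_group(grouped)
print(counts)
```

The sketch runs in a single process. Pig would instead compile the same logical steps into jobs that run on a Hadoop cluster.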
Apache S4
Storm: Storm from Twitter
Storm: Stream, Spout, Bolt, Topology
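The four Storm terms fit together roughly as follows: a stream is a sequence of tuples, a spout is a source that emits a stream, a bolt consumes a stream and may emit a new one, and a topology is the graph that wires spouts to bolts. The sketch below illustrates these roles with plain Python generators. It is not the Storm API. The names sentence_spout, split_bolt, count_bolt and run_topology, and the sample sentences, are hypothetical.

```python
from collections import Counter


def sentence_spout():
    """Spout: the source of a stream; emits one tuple per sentence."""
    for sentence in ["the cow jumped", "the moon", "the cow"]:
        yield (sentence,)


def split_bolt(stream):
    """Bolt: consumes sentence tuples and emits one tuple per word."""
    for (sentence,) in stream:
        for word in sentence.split():
            yield (word,)


def count_bolt(stream):
    """Bolt: keeps a running count per word and emits (word, count) updates."""
    counts = Counter()
    for (word,) in stream:
        counts[word] += 1
        yield (word, counts[word])


def run_topology():
    """Topology: wires spout -> split bolt -> count bolt and drains the stream."""
    latest = {}
    for word, count in count_bolt(split_bolt(sentence_spout())):
        latest[word] = count
    return latest


result = run_topology()
print(result)
```

A real Storm topology runs its spouts and bolts as parallel tasks across a cluster and processes unbounded streams. This sketch is sequential and finite so that it can be run and checked locally.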