$30 off During Our Annual Pro Sale. View Details »
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Albert Bifet
August 25, 2012
Research
1
280
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
260
Regression
abifet
0
270
Evaluation
abifet
1
240
Stream Algorithmics
abifet
1
390
Introduction to Data Stream Mining
abifet
1
250
Clustering
abifet
2
310
Ensemble Methods
abifet
0
270
Classification
abifet
0
350
Concept Drift
abifet
0
410
Other Decks in Research
See All in Research
AI in Enterprises - Java and Open Source to the Rescue
ivargrimstad
0
1k
ウェブ・ソーシャルメディア論文読み会 第31回: The rising entropy of English in the attention economy. (Commun Psychology, 2024)
hkefka385
1
120
一人称視点映像解析の最先端(MIRU2025 チュートリアル)
takumayagi
6
4.3k
財務諸表監査のための逐次検定
masakat0
0
200
SREのためのテレメトリー技術の探究 / Telemetry for SRE
yuukit
12
2.2k
Satellites Reveal Mobility: A Commuting Origin-destination Flow Generator for Global Cities
satai
3
160
Open Gateway 5GC利用への期待と不安
stellarcraft
2
160
スキマバイトサービスにおける現場起点でのデザインアプローチ
yoshioshingyouji
0
270
MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation
satai
4
460
単施設でできる臨床研究の考え方
shuntaros
0
3.3k
When Learned Data Structures Meet Computer Vision
matsui_528
1
890
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
satai
3
450
Featured
See All Featured
Fireside Chat
paigeccino
41
3.7k
The Cult of Friendly URLs
andyhume
79
6.7k
RailsConf 2023
tenderlove
30
1.3k
Product Roadmaps are Hard
iamctodd
PRO
55
12k
How to train your dragon (web standard)
notwaldorf
97
6.4k
I Don’t Have Time: Getting Over the Fear to Launch Your Podcast
jcasabona
34
2.5k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
54k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
12
960
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
21
1.3k
Why You Should Never Use an ORM
jnunemaker
PRO
60
9.6k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
132
19k
Agile that works and the tools we love
rasmusluckow
331
21k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology