Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Distributed Systems
Search
Albert Bifet
August 25, 2012
Research
1
200
Distributed Systems
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Frequent Pattern Mining
abifet
1
170
Regression
abifet
0
180
Evaluation
abifet
1
140
Stream Algorithmics
abifet
1
260
Introduction to Data Stream Mining
abifet
1
180
Clustering
abifet
2
210
Ensemble Methods
abifet
0
140
Classification
abifet
0
250
Concept Drift
abifet
0
260
Other Decks in Research
See All in Research
AIを前提とした体験の実現に向けて/toward_ai_based_experiences
monochromegane
1
230
時系列解析と疫学
kingqwert
2
900
自己教師あり学習による事前学習(CVIMチュートリアル)
naok615
2
1.4k
Rの機械学習フレームワークの紹介〜tidymodelsを中心に〜 / machine_learning_with_r2024
s_uryu
0
210
[KDD2023論文読み会] BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR Prediction / KDD2023 LY Tech Reading
shunk031
0
430
200名の育児中男性の声 「僕たちは、キャリアとライフをトレードオフにしたくない」共働き3.0世代の男性が 本当に求める働き方とは【ワーキングペアレンツの転職意識調査2023|XTalent株式会社】
xtalent
0
460
マルチモーダルLLMの応用動向の論文調査
masatoto
7
2.7k
Refactoring Mining - The key to unlock software evolution
tsantalis
0
240
20240127_熊本から今いちど真面目に都市交通~めざせ「車1割削減、渋滞半減、公共交通2倍」~ 全国路面電車サミット2024宇都宮
trafficbrain
1
650
第14回対話システムシンポジウム EMNLP 2023 参加報告
atsumoto
0
140
Prompt Tuning から Fine Tuning への移行時期推定
icoxfog417
17
6.8k
第12回全日本コンピュータビジョン勉強会:画像の自己教師あり学習における大規模データセット
naok615
0
500
Featured
See All Featured
Why Our Code Smells
bkeepers
PRO
331
56k
For a Future-Friendly Web
brad_frost
171
8.9k
Producing Creativity
orderedlist
PRO
336
39k
The Illustrated Children's Guide to Kubernetes
chrisshort
29
46k
Music & Morning Musume
bryan
41
5.6k
Typedesign – Prime Four
hannesfritz
36
2.1k
The Cult of Friendly URLs
andyhume
74
5.7k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
18
1.7k
Raft: Consensus for Rubyists
vanstee
132
6.2k
Done Done
chrislema
178
15k
The MySQL Ecosystem @ GitHub 2015
samlambert
242
12k
The Mythical Team-Month
searls
215
42k
Transcript
Distributed Streaming Albert Bifet May 2012
COMP423A/COMP523A Data Stream Mining Outline 1. Introduction 2. Stream Algorithmics
3. Concept drift 4. Evaluation 5. Classification 6. Ensemble Methods 7. Regression 8. Clustering 9. Frequent Pattern Mining 10. Distributed Streaming
Data Streams Big Data & Real Time
Distributed Systems Hadoop, S4 and Storm
Hadoop Hadoop
Hadoop Hadoop architecture
Apache Mahout Mahout: open source framework
Pig Pig: Similar to SQL
Pig A = LOAD ’data’ USING PigStorage() AS (f1:int, f2:int,
f3:int); B = GROUP A BY f1; C = FOREACH B GENERATE COUNT ($0); DUMP C; Pig: Similar to SQL
Apache S4 Apache S4
Apache S4
Storm Storm from Twitter
Storm Stream, Spout, Bolt, Topology