Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Introduction to Data Stream Mining
Search
Albert Bifet
August 25, 2012
Research
1
190
Introduction to Data Stream Mining
Albert Bifet
August 25, 2012
Tweet
Share
More Decks by Albert Bifet
See All by Albert Bifet
Distributed Systems
abifet
1
220
Frequent Pattern Mining
abifet
1
190
Regression
abifet
0
200
Evaluation
abifet
1
150
Stream Algorithmics
abifet
1
290
Clustering
abifet
2
230
Ensemble Methods
abifet
0
160
Classification
abifet
0
280
Concept Drift
abifet
0
280
Other Decks in Research
See All in Research
JMED-LLM: 日本語医療LLM評価データセットの公開
fta98
1
350
SSII2024 [TS3] 画像認識におけるマルチモーダル基盤モデル ~基盤モデル、あなたのタスクに役立つかも?~
ssii
PRO
0
810
DroidKaigi CfP分析
yukihiromori
0
110
LayerXにおけるAI・機械学習技術の活用と展望 / layerx-ai-jsai2024
shimacos
2
2.5k
SSII2024 [OS1] 画像認識におけるモデル・データの共進化
ssii
PRO
0
380
"多様な推薦"はユーザーの目にどう映るか
kuri8ive
3
260
Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction(日本語版)
aiueola
0
120
機械学習を用いたポケモン対戦選出予測
fufufukakaka
1
560
SSII2024 [OS1] 研究紹介100連発(オープンニング)
ssii
PRO
0
420
大規模言語モデルを用いた その場での要約に基づく レビュー探索インタフェース
yamamotolab
0
240
デジタルツインによる ネイチャーポジティブへの挑戦
fullfull
0
210
機械学習と最適化の融合動的ロットサイズ決定問題を例として
mickey_kubo
2
360
Featured
See All Featured
Designing Experiences People Love
moore
136
23k
jQuery: Nuts, Bolts and Bling
dougneiner
61
7.4k
Unsuck your backbone
ammeep
666
57k
Ruby is Unlike a Banana
tanoku
96
10k
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
26
2.1k
Statistics for Hackers
jakevdp
792
220k
What's new in Ruby 2.0
geeforr
338
31k
Side Projects
sachag
451
42k
Designing on Purpose - Digital PM Summit 2013
jponch
113
6.6k
The MySQL Ecosystem @ GitHub 2015
samlambert
248
12k
Learning to Love Humans: Emotional Interface Design
aarron
269
39k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
224
21k
Transcript
Introduction to Data Stream Mining Albert Bifet March 2012
Motivation Source: IDC’s Digital Universe Study (EMC), June 2011 Data
is growing
Motivation Memory unit Size Binary size kilobyte (kB/KB) 103 210
megabyte (MB) 106 220 gigabyte (GB) 109 230 terabyte (TB) 1012 240 petabyte (PB) 1015 250 exabyte (EB) 1018 260 zettabyte (ZB) 1021 270 yottabyte (YB) 1024 280 Data is growing
Motivation Source: IDC’s Digital Universe Study (EMC), June 2011 Data
is growing
Motivation Source: IDC’s Digital Universe Study (EMC), June 2011 Data
is growing
Motivation Source: IDC’s Digital Universe Study (EMC), June 2011 Data
is growing
Streaming Data Big Data & Real Time
Big Data McKinsey Global Institute (MGI) Report on Big Data,
2011. Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze.
Big Data McKinsey Global Institute (MGI) Report on Big Data,
2011. Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze.
Methodology Sampling and distributed systems
Methodology Paolo Boldi Big Data does not need big machines,
it needs big intelligence
Real time analytics We want to analyze what is happening
now.
Real time analytics We want to analyze what is happening
now.
Time and Memory Number 8 Wire Mentality Time and memory
are the resource dimensions of the process.
Time and Memory Time and memory are the resource dimensions
of the process.
Algorithms Classification, Regression, Clustering, Frequent Pattern Mining.
Applications sensor data: industry, cities telecomm data social networks: twitter,
facebook, yahoo marketing: sales business Data may come from: humans, sensors, or machines.
Data Streams Big Data & Real Time