Introduction to Data Stream Mining
Albert Bifet
August 25, 2012
Transcript
Introduction to Data Stream Mining Albert Bifet March 2012
Motivation: Data is growing. Source: IDC’s Digital Universe Study (EMC), June 2011.
Motivation: Data is growing
Memory unit       Size    Binary size
kilobyte (kB/KB)  10^3    2^10
megabyte (MB)     10^6    2^20
gigabyte (GB)     10^9    2^30
terabyte (TB)     10^12   2^40
petabyte (PB)     10^15   2^50
exabyte (EB)      10^18   2^60
zettabyte (ZB)    10^21   2^70
yottabyte (YB)    10^24   2^80
Streaming Data: Big Data & Real Time
Big Data: "Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze." Source: McKinsey Global Institute (MGI) Report on Big Data, 2011.
Methodology: Sampling and distributed systems.
Methodology: "Big Data does not need big machines, it needs big intelligence." (Paolo Boldi)
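The slides name sampling as a methodology but do not say which technique is meant. As one illustrative possibility, the sketch below uses reservoir sampling, which keeps a uniform random sample of fixed size k from a stream of unknown length using O(k) memory. The function name `reservoir_sample` and its parameters are assumptions made for this example, not something taken from the deck.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(stream: Iterable[T], k: int, seed: Optional[int] = None) -> List[T]:
    """Return a uniform random sample of up to k items from a stream.

    Hypothetical helper for illustration: it reads the stream once and
    never stores more than k items, regardless of the stream's length.
    """
    rng = random.Random(seed)
    reservoir: List[T] = []
    for i, item in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Pick a slot in [0, i]; the new item replaces an old one
            # only if the slot falls inside the reservoir (probability k / (i + 1)).
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = item
    return reservoir


# Example: sample 5 items from a stream of one million integers.
sample = reservoir_sample(range(1_000_000), k=5, seed=42)
```

Because the stream is consumed in a single pass, the same function works on a generator whose length is not known in advance.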
Real time analytics: We want to analyze what is happening now.
Time and Memory (Number 8 Wire Mentality): Time and memory are the resource dimensions of the process.
Algorithms: Classification, Regression, Clustering, Frequent Pattern Mining.
Applications: sensor data (industry, cities); telecom data; social networks (Twitter, Facebook, Yahoo); marketing (sales, business). Data may come from humans, sensors, or machines.
Data Streams: Big Data & Real Time