Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Introduction to Data Stream Mining
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Albert Bifet
August 25, 2012
Research
270
1
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Introduction to Data Stream Mining
Albert Bifet
August 25, 2012
More Decks by Albert Bifet
See All by Albert Bifet
Distributed Systems
abifet
1
300
Frequent Pattern Mining
abifet
1
290
Regression
abifet
0
300
Evaluation
abifet
1
270
Stream Algorithmics
abifet
1
430
Clustering
abifet
2
340
Ensemble Methods
abifet
0
310
Classification
abifet
0
380
Concept Drift
abifet
0
440
Other Decks in Research
See All in Research
typst の使い方:言語学を研究する学生のために
gitomochang
0
460
AI Agentの精度改善に見るML開発との共通点 / commonalities in accuracy improvements in agentic era
shimacos
6
1.7k
AIエージェント時代のLLM-jpモデルのあるべき姿
k141303
0
470
人間中心の意思決定支援AI
yukinobaba
PRO
6
2.8k
Apache Gravitinoで実現する Icebergカタログ統合とアクセスの一元化
matsumooon
0
290
COFFEE-Japan PROJECT Impact Report(海ノ向こうコーヒー)
ontheslope
0
1.9k
(SIGQS17) Frasco-VS:フラグメントに基づく薬剤候補化合物選抜の量子アニーリングによる実現
keisukeyanagisawa
PRO
0
110
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
shunk031
4
1k
Language and AI
ayaniwa
0
120
FUSE-RSVLM: Feature Fusion Vision-Language Model for Remote Sensing
satai
3
860
長時間動画QAにおけるマルチエージェント推論 ・SVAgent: Storyline-Guided Long Video Understanding via Cross-Modal Multi-Agent Collaboration
murakawatakuya
1
130
進学校の生徒にはア行の苗字が多いのか
ozekinote
0
450
Featured
See All Featured
Redefining SEO in the New Era of Traffic Generation
szymonslowik
1
340
KATA
mclloyd
PRO
35
15k
The Hidden Cost of Media on the Web [PixelPalooza 2025]
tammyeverts
2
330
Leading Effective Engineering Teams in the AI Era
addyosmani
9
2.1k
Intergalactic Javascript Robots from Outer Space
tanoku
273
27k
The Art of Programming - Codeland 2020
erikaheidi
57
14k
RailsConf 2023
tenderlove
30
1.5k
Public Speaking Without Barfing On Your Shoes - THAT 2023
reverentgeek
1
420
Avoiding the “Bad Training, Faster” Trap in the Age of AI
tmiket
0
180
What the history of the web can teach us about the future of AI
inesmontani
PRO
1
610
[SF Ruby Conf 2025] Rails X
palkan
2
1.1k
The Pragmatic Product Professional
lauravandoore
37
7.3k
Transcript
Introduction to Data Stream Mining Albert Bifet March 2012
Motivation Source: IDC’s Digital Universe Study (EMC), June 2011 Data
is growing
Motivation Memory unit Size Binary size kilobyte (kB/KB) 103 210
megabyte (MB) 106 220 gigabyte (GB) 109 230 terabyte (TB) 1012 240 petabyte (PB) 1015 250 exabyte (EB) 1018 260 zettabyte (ZB) 1021 270 yottabyte (YB) 1024 280 Data is growing
Motivation Source: IDC’s Digital Universe Study (EMC), June 2011 Data
is growing
Motivation Source: IDC’s Digital Universe Study (EMC), June 2011 Data
is growing
Motivation Source: IDC’s Digital Universe Study (EMC), June 2011 Data
is growing
Streaming Data Big Data & Real Time
Big Data McKinsey Global Institute (MGI) Report on Big Data,
2011. Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze.
Big Data McKinsey Global Institute (MGI) Report on Big Data,
2011. Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze.
Methodology Sampling and distributed systems
Methodology Paolo Boldi Big Data does not need big machines,
it needs big intelligence
Real time analytics We want to analyze what is happening
now.
Real time analytics We want to analyze what is happening
now.
Time and Memory Number 8 Wire Mentality Time and memory
are the resource dimensions of the process.
Time and Memory Time and memory are the resource dimensions
of the process.
Algorithms Classification, Regression, Clustering, Frequent Pattern Mining.
Applications sensor data: industry, cities telecomm data social networks: twitter,
facebook, yahoo marketing: sales business Data may come from: humans, sensors, or machines.
Data Streams Big Data & Real Time