Introduction to Data Mining!
Kevin Hale
December 05, 2011
Technology
Transcript
Data Mining! An Introduction
Wufoo.com
What is data mining?
Collection? No! Extraction. Yup.
Eyes: 324 - 576 megapixels
Ears: Stereo audio, 20 - 20,000 Hz
Nose: 10,000 chemical compounds
Mouth: 5 - 6 flavors
Skin: Temperature / Pressure / Texture
Memory: 2.5 petabytes
The process of extracting patterns from large data sets.
What are some examples of large data sets?
Astronomy, Biology, Business, Internet, Government, Religion
Online Surveys
Individuals, Developers, Designers, Non-Profits, Teachers, Students, Universities, Research, Real Estate, Marketing, Healthcare, Banks, SMBs
What do they do with all that data?
Positive / Negative Likert Scale Ratings Multiple Choice Open Feedback
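To illustrate what can be done with Likert scale answers like those named above, here is a minimal Python sketch. The responses, the 5-point scale, and the `summarize_likert` helper are all assumptions made for illustration; none of them come from the talk.

```python
from collections import Counter
from statistics import mean, median

# Hypothetical answers on a 5-point Likert scale
# (1 = strongly disagree, 5 = strongly agree). Not data from the talk.
responses = [5, 4, 4, 3, 5, 2, 4, 1, 4, 5]


def summarize_likert(values, points=5):
    """Summarize Likert answers: per-score counts, mean, median, top-two share."""
    counts = Counter(values)
    # Include scores nobody picked so the distribution always has every point.
    distribution = {score: counts.get(score, 0) for score in range(1, points + 1)}
    # Share of answers in the two most favorable categories.
    top_two_share = sum(1 for v in values if v >= points - 1) / len(values)
    return {
        "distribution": distribution,
        "mean": mean(values),
        "median": median(values),
        "top_two_share": top_two_share,
    }


summary = summarize_likert(responses)
print(summary)
```

The median and the top-two share are often reported alongside the mean because Likert answers are ordinal: the gap between 4 and 5 is not necessarily the same as the gap between 1 and 2.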
What are some potential problems with data collected by asking?
Data collection is just the first part.
Association Rule Learning, Clustering, Classification, Regression, Visualization
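Of the tasks named above, association rule learning is perhaps the easiest to show in a few lines. The sketch below computes the two standard measures, support and confidence, for a rule such as "diapers -> beer". The shopping baskets and function names are hypothetical and chosen only for illustration.

```python
# Hypothetical shopping baskets. Not data from the talk.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer"},
    {"milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Estimate of P(consequent | antecedent) from the transactions."""
    base = support(antecedent, transactions)
    if base == 0:
        return 0.0  # the rule never applies, so report no confidence
    return support(set(antecedent) | set(consequent), transactions) / base


rule_support = support({"diapers", "beer"}, transactions)
rule_confidence = confidence({"diapers"}, {"beer"}, transactions)
print(f"support={rule_support:.2f} confidence={rule_confidence:.2f}")
```

Real association rule miners (Apriori, for example) add a search over candidate itemsets on top of these two measures. This sketch only scores a single rule.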
Statistics, Artificial Intelligence, Database Management
Bayes' Theorem (1700s)
Regression Analysis (1800s)
Neural Networks (1940s)
Genetic Algorithms (1950s)
Decision Tree Learning (1960s)
Support Vector Machines (1990s)
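Bayes' theorem, the oldest technique on the timeline above, fits in one function. The sketch below applies it to a yes/no test. The `posterior` helper and the example rates are assumptions made for illustration and do not come from the talk.

```python
def posterior(prior, true_positive_rate, false_positive_rate):
    """P(hypothesis | positive evidence) by Bayes' theorem.

    prior:               P(H)
    true_positive_rate:  P(E | H)
    false_positive_rate: P(E | not H)
    """
    # Total probability of seeing the evidence at all.
    evidence = true_positive_rate * prior + false_positive_rate * (1 - prior)
    return true_positive_rate * prior / evidence


# Hypothetical numbers: 1% of messages are spam, the filter flags 90% of
# spam, and it wrongly flags 5% of legitimate mail.
p_spam_given_flag = posterior(prior=0.01, true_positive_rate=0.9, false_positive_rate=0.05)
print(f"{p_spam_given_flag:.3f}")
```

With a low prior, even a fairly accurate signal leaves the posterior well below one half, which is why base rates matter when mining rare patterns.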
Google Flu Trends
Hans Rosling
Recommendation Engines
Relationships!
Will my date have sex on the first date?
Do you like the taste of beer?
Assuming you were in the position to do so, would you launch nuclear weapons under any circumstances? 82%
In a certain light, wouldn't nuclear war be exciting? 83%
The Social Graph
Privacy & Confidentiality Issues
So that’s data mining!
Thanks!