Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Bootstrapping Data Science at uSwitch
Search
Data Science London
July 03, 2012
Technology
2
160
Bootstrapping Data Science at uSwitch
Presentation by Paul Lam Data Scientist @Forwardtek presentation at Data Science London 21/03/12
Data Science London
July 03, 2012
Tweet
Share
More Decks by Data Science London
See All by Data Science London
Semi-Supervised Anomaly Detection
datasciencelondon
0
870
Hacking the Rail: Ingesting, analysing & visualising realtime streaming data
datasciencelondon
1
47k
Stateful Data-Parallel Processing
datasciencelondon
0
47k
Semantic web warmed up: Ontologies for the IoT
datasciencelondon
0
110
IoT data ingestion pipelines and Clojure transducers
datasciencelondon
0
250
TrendCalculus: A data science for trends
datasciencelondon
1
48k
Data Science in Mobile Health
datasciencelondon
1
8.3k
Large-scale Recommender Systems on Just a PC (with GraphChi)
datasciencelondon
1
17k
Taming Graph Dynamics at Scale
datasciencelondon
0
8.1k
Other Decks in Technology
See All in Technology
よく聞くけど使ったことないソフトウェアNo.1 KafkaとSnowflake
foursue
4
350
私が trocco を推す理由
__allllllllez__
1
210
APIファーストなプロダクトマネジメントの実践 〜SaaSus Platformでの例〜 / "Practicing API-First Product Management - An Example with SaaSus Platform
oztick139
0
100
アクセシビリティを考慮したUI/CSSフレームワーク・ライブラリ選定
yajihum
2
1k
EMとして2023年度に頑張ったこと / What we did well in FY2023 as a EM
pauli
1
160
長期間TiDBを使ってきた話 @ 私たちはなぜNewSQLを使うのかTiDB選定5社が語る選定理由と活用LT / Experiences with TiDB Over Time
chibiegg
2
880
複雑な構成要素を持つUIとの向き合い方 〜新・支出グラフでの実例〜 / B43 TECH TALK
nakamuuu
0
140
一生覚えておきたい「システム開発=コミュニケーション」〜初めての実務案件振り返りLT〜
maimyyym
0
120
エンジニアのキャリアをちょっと楽しくする3本の軸/Three Pillars to Make an Engineer's Career More Enjoyable
kwappa
0
2.6k
コンパウンドスタートアップのためのスケーラブルでセキュアなInfrastructure as Codeパイプラインを考える / Scalable and Secure Infrastructure as Code Pipeline for a Compound Startup
yuyatakeyama
4
4.7k
Compose Compiler Metricsを使った実践的なコードレビュー
tomorrowkey
1
220
レガシーをぶっ壊せ。AEONで始めるDevRelの話 / Qiita Night 2024-2-22
aeonpeople
3
1.3k
Featured
See All Featured
Building Your Own Lightsaber
phodgson
99
5.7k
Teambox: Starting and Learning
jrom
128
8.4k
Debugging Ruby Performance
tmm1
70
11k
Being A Developer After 40
akosma
57
580k
How to Ace a Technical Interview
jacobian
272
22k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
274
13k
It's Worth the Effort
3n
180
27k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
226
51k
Designing with Data
zakiwarfel
96
4.8k
Pencils Down: Stop Designing & Start Developing
hursman
117
11k
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
9
8.3k
No one is an island. Learnings from fostering a developers community.
thoeni
16
2.1k
Transcript
%RRWVWUDSSLQJ 'DWD6FLHQFHDWX6ZLWFK 3DXO/DP 'DWD6FLHQWLVWX6ZLWFKFRP 'DWD6FLHQFH/RQGRQ0DUFK
:KDWWRGRZLWKDOOWKHVH'DWD"
Ɣ 6HFRQGELJJHVWSULFHFRPSDULVRQVLWHLQ8. Ɣ *%RIGDWDSHUPRQWK X6ZLWFKFRP
6R:KDW'RHVD'DWD6FLHQWLVWGR" 6KRUWYHUVLRQ ,GRSURJUDPPLQJDQGVWDWLVWLFV /RQJYHUVLRQ 7HOOPHZKDW\RXGRZLWK\RXUGDWDILUVW
)LUVWIHZZHHNV RQPHHWLQJV $VNLQJORWVRI TXHVWLRQV 8QGHUVWDQGLQJ KRZGDWDLVEHLQJXVHGRUQRWXVHG 6WDUWLQWHUURJDWLQJWKHGDWDEDVHV
,QIRUPDWLRQ)XQQHOV 'DWD %,'HYHORSHU $QDO\VW 3URGXFW0DQDJHU ,QIRUPDWLRQ 'HFLVLRQ
6FLHQWLILF$SSURDFKWR6WXG\'DWD Ɣ 'DWDQRWDVDE\SURGXFWRIEXVLQHVV Ɣ ([DPSOH%LRWHFKUHVHDUFK D &(2ZDQWVFXUHIRUFDQFHU E 5HVHDFKHUVUXQH[SHULPHQWV
F 5HILQHPRGHOEDVHGRQUHVXOWVUHSHDW E XQWLO; G 'HOLYHUUHVXOWV Ɣ &(2SRLQWVGLUHFWLRQEXWGRHVQRWGLFWDWH ZKDWUHVHDUFKHUVGRZLWKWKHGDWD
'DWD6LORV
4XHVWLRQ +RZZRXOG\RXPHUJH\RXUGDWDLQWRXVDEOH PHVKHV LQDQDJLOHRUJDQL]DWLRQZLWK GHOLEHUDWHGDWDVLORV" 'RQ W
5HSODFHZLWKDPRQROLWKLFGDWDZDUHKRXVH 3XVKGDWDLQWR+')6ZKLOHNHHSLQJDOO GHSDUWPHQWVDXWRQRPRXVZLWKWKHLUGDWD VLORVIRURSHUDWLRQ )XOWRQ,%093$QMXO%KDPEKUL0XVW%LJ'DWD$OWHUWKH(QWHUSULVH"5HDG:ULWH:HE0DUFK
6HFRQG0RQWK&DSWXULQJGDWD
&XUDWLQJGDWD (7/
$SDFKH+LYH64/RQ+DGRRS
3URJUDPPLQJDGKRFTXHULHV Ɣ &RPSRVLELOLW\ Ɣ ([WHQVLELOLW\ Ɣ 0DLQWDLQLELOLW\ Ɣ ,QWHUDFWLYLW\
([DPSOH +RZDUHZHGRLQJRQFURVVVHOOLQJ"
+HUH VDJUDSK &XVWRPHUIORZEHWZHHQFKDQQHOV KWWSFLUFRVFD UDQGRPGDWD
&DSWXUH!&XUDWH!0RGHO!&RPPXQLFDWH 0RVWRIWKHZRUN 0RVWRIWKHUHVXOW
6RPHGDWDSUREOHPVFRQVLGHUHG Ɣ 6WUXFWXULQJ ż SDUVLQJHYHQWVLQWRRUGHUHGYHFWRUV Ɣ &OHDQLQJ ż XSWRRIRXUZHEORJUHFRUGVDUHIURPERWVDQG FUDZOHUV
Ɣ 9DOLGDWLQJ ż ZKHQMRLQLQJWLPHEDVHGGDWDGRQRWDVVXPHWKDW WKHWLPHVWDPSVEHWZHHQVHUYHUVDUHV\QFKURQLVHG Ɣ 0LVVLQJGDWD ż RIUHFRUGVKDYHQRXVHU,'
PRQWKVODWHU :KDWKDYHZHHQDEOHG"
(YLGHQFHEDVHG'HFLVLRQPDNLQJ Ɣ UHJUHVVLRQDQDO\VLV ż ZKDWLVUHWXUQRQLQYHVWPHQWRI;" Ɣ H[SHFWHGYDOXHFDOFXODWLRQ ż ZKDWLVWKHYDOXHRIGRLQJ;"
Ɣ K\SRWKHVLVWHVWLQJ ż GLGGRLQJ;PDNHDGLIIHUHQFH" Ɣ RSWLPLVDWLRQ ż KRZWRGHOLYHUDEHWWHUH[SHULHQFHWRFXVWRPHU;"
,QIRUPDWLRQLV(DV\ 6ZHHW,GRQ WKDYHWRGRDQ\WKLQJ +HPDOX6ZLWFK'HYHORSHU
6XPPDU\ 2XUGDWDVFLHQFHSURFHVV +\SRWKHVL]H &DSWXUH &XUDWH
0RGHO &RPPXQLFDWH