Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Analytics for Developers
Search
Trent Hauck
May 05, 2013
Programming
1
420
Analytics for Developers
A talk I gave at Kansas City Developer Conference 2013.
Trent Hauck
May 05, 2013
Tweet
Share
Other Decks in Programming
See All in Programming
なぜ適用するか、移行して理解するClean Architecture 〜構造を超えて設計を継承する〜 / Why Apply, Migrate and Understand Clean Architecture - Inherit Design Beyond Structure
seike460
PRO
3
770
20250704_教育事業におけるアジャイルなデータ基盤構築
hanon52_
5
790
「テストは愚直&&網羅的に書くほどよい」という誤解 / Test Smarter, Not Harder
munetoshi
0
170
Agentic Coding: The Future of Software Development with Agents
mitsuhiko
0
100
PicoRuby on Rails
makicamel
2
130
PHPでWebSocketサーバーを実装しよう2025
kubotak
0
290
PipeCDのプラグイン化で目指すところ
warashi
1
280
システム成長を止めない!本番無停止テーブル移行の全貌
sakawe_ee
1
200
初学者でも今すぐできる、Claude Codeの生産性を10倍上げるTips
s4yuba
16
11k
RailsGirls IZUMO スポンサーLT
16bitidol
0
190
Railsアプリケーションと パフォーマンスチューニング ー 秒間5万リクエストの モバイルオーダーシステムを支える事例 ー Rubyセミナー 大阪
falcon8823
5
1.1k
AIエージェントはこう育てる - GitHub Copilot Agentとチームの共進化サイクル
koboriakira
0
590
Featured
See All Featured
Designing Experiences People Love
moore
142
24k
Visualization
eitanlees
146
16k
Automating Front-end Workflow
addyosmani
1370
200k
Making the Leap to Tech Lead
cromwellryan
134
9.4k
The Illustrated Children's Guide to Kubernetes
chrisshort
48
50k
Scaling GitHub
holman
460
140k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
281
13k
Building Flexible Design Systems
yeseniaperezcruz
328
39k
Designing for humans not robots
tammielis
253
25k
Docker and Python
trallard
44
3.5k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
48
2.9k
Understanding Cognitive Biases in Performance Measurement
bluesmoon
29
1.8k
Transcript
Analytics for Developers and Developing for Analytics
About Me 2006-2011: Educated Accounting & Finance 2011-Present: Reeducated Marketing
& Operations Twitter: @trent_hauck Work: @AlightAnalytics Other: Contribute (now and then) to Pandas & StatsModels
Two Parts Analytics (more) Development (less)
Why should you care?
“In God we trust; all others must bring data.”
To do analytics you need x
Where x is data collection...
Site Analytics Should be a 1st Class Citizen of Development
Collect More Than You Need Now
Now Some GA Code <script type="text/javascript"> var _gaq = _gaq
|| []; _gaq.push(['_setAccount', 'UA-31465642-1']); _gaq.push(['_setDomainName', 'trenthauck.com']); _gaq.push(['_setAllowLinker', true]); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); </script>
Next Steps Events _gaq.push(['_trackEvent', 'Cat', 'Act', ‘Label’]); Custom Variables _gaq.push(['_setCustomVar',
1, ‘key’, ‘value’, 1])
Where x is data analysis...
Differences in Data Small Data == Math Problem Big Data
== Engineering Problem
The Math Problem
Descriptive Stats (please compute these) Max, Min Quartiles Mean Variance
Mode
Web Stats are Easy A user converts or not... what
are the chances of that? p or q (=1-p) 3 users convert or not... what are the changes of that? p^3 or (p^2)q or p(q^2) or q^3
Hypothetical Worlds Trials = 100, Size = 100, p =
.1
Back to real world Stats: p-bar = .08, SE =
0.027
So then AB Testing 500 Trials A B p 0.1
0.2 SE 0.01 0.01 95% CI .1 +/- .02 .2 +/- 0.02
The Engineering Problem
Build Data Pipelines • Repeatable Flows of Data • Handles
Initial Analysis For You • Literate Programming
Programming For Data Analysis • Scripting good for Discovery •
Larger Jobs need Types • Mapping high dimensional space to lower dimensional space... then add
Where x is visualization....
Visualization Types •Distributions •Comparisons •Time Series •Other (Match Domain)
Distributions Single Variable: Histograms Multiple Variables: Scatter plot
Comparisons Categorical Variables
TimeSeries X Axis is Time
Match Domain with Analysis
Where x is storytelling...
Storytelling
3 Temporal Stages 1. What happened 2. What is happening
3. What will happen (Plus a tease)
Start With the Simple Stuff Friday Saturday Sunday 40º 42º
(Why do I live in KC)º
Build Up to Complex Idea
Thanks... Questions?