Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Analytics for Developers
Search
Trent Hauck
May 05, 2013
Programming
1
420
Analytics for Developers
A talk I gave at Kansas City Developer Conference 2013.
Trent Hauck
May 05, 2013
Tweet
Share
Other Decks in Programming
See All in Programming
Oxlintはいいぞ
yug1224
5
1.3k
今から始めるClaude Code超入門
448jp
7
8.5k
AI Agent Tool のためのバックエンドアーキテクチャを考える #encraft
izumin5210
6
1.8k
dchart: charts from deck markup
ajstarks
3
990
なるべく楽してバックエンドに型をつけたい!(楽とは言ってない)
hibiki_cube
0
140
プロダクトオーナーから見たSOC2 _SOC2ゆるミートアップ#2
kekekenta
0
200
HTTPプロトコル正しく理解していますか? 〜かわいい猫と共に学ぼう。ฅ^•ω•^ฅ ニャ〜
hekuchan
2
680
AI Schema Enrichment for your Oracle AI Database
thatjeffsmith
0
250
AIによる高速開発をどう制御するか? ガードレール設置で開発速度と品質を両立させたチームの事例
tonkotsuboy_com
7
2k
LLM Observabilityによる 対話型音声AIアプリケーションの安定運用
gekko0114
2
420
AIと一緒にレガシーに向き合ってみた
nyafunta9858
0
170
それ、本当に安全? ファイルアップロードで見落としがちなセキュリティリスクと対策
penpeen
7
2.4k
Featured
See All Featured
Understanding Cognitive Biases in Performance Measurement
bluesmoon
32
2.8k
Keith and Marios Guide to Fast Websites
keithpitt
413
23k
Dealing with People You Can't Stand - Big Design 2015
cassininazir
367
27k
Exploring the Power of Turbo Streams & Action Cable | RailsConf2023
kevinliebholz
37
6.3k
Context Engineering - Making Every Token Count
addyosmani
9
650
How to Build an AI Search Optimization Roadmap - Criteria and Steps to Take #SEOIRL
aleyda
1
1.9k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
35
3.3k
Self-Hosted WebAssembly Runtime for Runtime-Neutral Checkpoint/Restore in Edge–Cloud Continuum
chikuwait
0
320
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
52
5.8k
Build your cross-platform service in a week with App Engine
jlugia
234
18k
Fantastic passwords and where to find them - at NoRuKo
philnash
52
3.6k
The Straight Up "How To Draw Better" Workshop
denniskardys
239
140k
Transcript
Analytics for Developers and Developing for Analytics
About Me 2006-2011: Educated Accounting & Finance 2011-Present: Reeducated Marketing
& Operations Twitter: @trent_hauck Work: @AlightAnalytics Other: Contribute (now and then) to Pandas & StatsModels
Two Parts Analytics (more) Development (less)
Why should you care?
“In God we trust; all others must bring data.”
To do analytics you need x
Where x is data collection...
Site Analytics Should be a 1st Class Citizen of Development
Collect More Than You Need Now
Now Some GA Code <script type="text/javascript"> var _gaq = _gaq
|| []; _gaq.push(['_setAccount', 'UA-31465642-1']); _gaq.push(['_setDomainName', 'trenthauck.com']); _gaq.push(['_setAllowLinker', true]); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); </script>
Next Steps Events _gaq.push(['_trackEvent', 'Cat', 'Act', ‘Label’]); Custom Variables _gaq.push(['_setCustomVar',
1, ‘key’, ‘value’, 1])
Where x is data analysis...
Differences in Data Small Data == Math Problem Big Data
== Engineering Problem
The Math Problem
Descriptive Stats (please compute these) Max, Min Quartiles Mean Variance
Mode
Web Stats are Easy A user converts or not... what
are the chances of that? p or q (=1-p) 3 users convert or not... what are the changes of that? p^3 or (p^2)q or p(q^2) or q^3
Hypothetical Worlds Trials = 100, Size = 100, p =
.1
Back to real world Stats: p-bar = .08, SE =
0.027
So then AB Testing 500 Trials A B p 0.1
0.2 SE 0.01 0.01 95% CI .1 +/- .02 .2 +/- 0.02
The Engineering Problem
Build Data Pipelines • Repeatable Flows of Data • Handles
Initial Analysis For You • Literate Programming
Programming For Data Analysis • Scripting good for Discovery •
Larger Jobs need Types • Mapping high dimensional space to lower dimensional space... then add
Where x is visualization....
Visualization Types •Distributions •Comparisons •Time Series •Other (Match Domain)
Distributions Single Variable: Histograms Multiple Variables: Scatter plot
Comparisons Categorical Variables
TimeSeries X Axis is Time
Match Domain with Analysis
Where x is storytelling...
Storytelling
3 Temporal Stages 1. What happened 2. What is happening
3. What will happen (Plus a tease)
Start With the Simple Stuff Friday Saturday Sunday 40º 42º
(Why do I live in KC)º
Build Up to Complex Idea
Thanks... Questions?