Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Berlin 2013 - Session - Jeff Weinstein
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Monitorama
September 20, 2013
670
2
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Berlin 2013 - Session - Jeff Weinstein
Monitorama
September 20, 2013
More Decks by Monitorama
See All by Monitorama
Monitorama PDX 2017 - Ian Bennett
monitorama
1
630
PDX 2017 - Pedro Andrade
monitorama
0
830
PDX 2017 - Roy Rapoport
monitorama
4
990
PDX 2017 - Julia Evans
monitorama
0
520
Berlin 2013 - Session - Brad Lhotsky
monitorama
5
770
Berlin 2013 - Session - Alex Petrov
monitorama
6
730
Berlin 2013 - Session - Oliver Hankeln
monitorama
1
590
Berlin 2013 - Session - David Goodlad
monitorama
0
510
Berlin 2013 - Session - Radu Gheorghe
monitorama
0
670
Featured
See All Featured
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
47
8.2k
How To Stay Up To Date on Web Technology
chriscoyier
790
250k
Hiding What from Whom? A Critical Review of the History of Programming languages for Music
tomoyanonymous
2
850
The B2B funnel & how to create a winning content strategy
katarinadahlin
PRO
1
380
Chasing Engaging Ingredients in Design
codingconduct
0
220
Keith and Marios Guide to Fast Websites
keithpitt
413
23k
The Impact of AI in SEO - AI Overviews June 2024 Edition
aleyda
5
1.1k
How To Speak Unicorn (iThemes Webinar)
marktimemedia
1
480
Utilizing Notion as your number one productivity tool
mfonobong
4
320
Heart Work Chapter 1 - Part 1
lfama
PRO
7
36k
Git: the NoSQL Database
bkeepers
PRO
432
67k
Leveraging Curiosity to Care for An Aging Population
cassininazir
1
270
Transcript
How monitoring can improve the rest of the company
Monitorama EU 2013 @jeff_weinstein
I real-time and batch data analytics
Monitoring can wildly improve the whole company by
sharing data and sharing techniques.
Monitoring Folks Developers Business Analysts
ExecuIves & Product Data ScienIsts Data
Apps & Services & Systems Users
Data Code & Config Monitoring
Some problems…
Data Processing Apps Systems Logs /
Events Metrics Graphs & Alerts Apps 3rd Party Reports & Queries ETL AnalyIc Systems Monitoring: Streaming BI: Batch
Data Needs Logs Metrics Logs Metrics
Streaming Batch Data Monitoring BI
Data Tools Stack Monitoring • Ad hoc
– sed, grep, awk – ES, LogStash, Splunk, … • Storage – Hosts, Ganglia, OTSDB – Central syslog server • VisualizaIon/ReporIng – Graphite, RRDTool, 3rd party – Homegrown • AlerIng/EscalaIon – Nagios, Sensu, PagerDuty, … Rest of company • Ad hoc – Excel, SQL, Hive – MapReduce, … • Storage – Lots o’ databases, Excel – Hadoop, RDBMS… • VisualizaIon/ReporIng – Excel, R, Tableau ... – Dinosaur apps, … • AlerIng/EscalaIon – nada
Metrics
Views Unintelligible generated views Too granular for long
term trends Lack of historical Intolerant to anomalies
Team and incenIves • What team? • Change
vs. reliability • Planning • Budget • Churn
Good or bad? • Specific Tools • Decentralized
• Focus • Ownership • Lost context • Siloed work • Data dark • Misunderstanding
Some fixes
End to End Data Pipeline ü Structured logs ü (Config)
ü Measure once ü AutomaIc metrics ü API ü Graph tools ü Glossary ü AnnotaIons and tags ü Pipeline
Structured events • JSON (or whatever) • (opIonal)
config • Tags per key – Type – Tag: latency, funnel,… – DescripIon – Storage
Auto: Graphs, Glossary, & Storage • Graphs and dashboards
• * templates • Views and stats • Glossary • Batch analyIcs • Long term storage
build learn communicate inspire
Developers • Logging toolkit • Data pipeline
• Pain points • Outage causes • Deployment pracIces • EscalaIon playbook • Measurement as TDD • Monitor staging env
Business Analysts • Structured logs • Config
for ETL • Metrics definiIons • Slices and visualizaIons • Data size and cardinality • Outages and delays • Flexibility • VisualizaIon and tools
Data ScienIsts • Access to (meta)data • Query
monitoring • StaIsIcs and models • New data streams • Context of data issues • What’s in the logs • Validate algorithms • Teach stats and models!
Product & ExecuIves • Curated dashboards • Graph/alert
tools • Learn the business • PrioriIze alerts by $ • Incident post mortems • Metrics granularity • Data driven decisions • Recognize and celebrate
Monitoring can become the data plahorm and improve all
teams with its techniques.
Icons from The Noun Project: Dmitry Baranovskiy, Benjamin Orlovski, Luis
Prado, MikaDo Nguyen, Yarden Gilboa, Javier Cabezas, Icons Pusher, Jeremy Bristol, Blake Thomas, RiIka Khasgiwale, Mayene de Leon, Yorlmar Campos, Sergey Shmid @jeff_weinstein Thanks! hiring ;)