Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Dashboard Design Using Information Architecture

Joy Bhattacherjee
March 16, 2024
99

Dashboard Design Using Information Architecture

Joy Bhattacherjee

March 16, 2024
Tweet

Transcript

  1. Dashboards - Why? Global Overview of Information Data Functions Controls

    Leverage State Stakeholders Insight Use alter
  2. Design SRE Thinking Empathy Priority Prototype Test Validate Iterate Empathy

    Map J2BD User Journey No Sprint Time for Engineers Skills Abilities KPIs Attitude Knowledge Environment Urgency Water-cooler chats Slack Banter 1:1s Outages
  3. Care BIZ TECH CEO, CFO PMs CTO Dir. Eng EMs

    Engineers Are things up? What’s the cost? When can we go live? Does it scale? Are we fast and secure? How’s our code quality Performance? Will it break? Do we have enough compute?
  4. Information Architecture & Hierarchy Are things up? What’s the cost?

    When can we go live? Does it scale? Are we fast and secure? Code Quality? Performance? Will it break? Enough compute? Grain & Precision ++ RED USE Latency Bug count CVEs / deploy Deploys/week OK 10K/mo Biz Critical ++
  5. Information Hierarchy Grain & Precision ++ RED USE Latency Bug

    count CVEs / deploy Deploys/week OK 10K/mo KPIs & SLOs SLIs Debug, MTTR, MTBF
  6. Only the Most Important Information OK Gateway 200.5K RPS Time

    Req Gateway Time Req Payments Time Req Auth DOWN Payments 10.8K 5xx OK Auth 50.5K RPS
  7. Data Ink Ratio Time (Hr) Req/s Gateway 5k X Gridlines

    X Icons X Colors X Labels Time (Hr) Req/ sec Gateway 00:09:00 00:10:00 00:11:00 00:12:00 00:13:00 00:14:00 00:15:00 00:16:00 1000 2000 3000 4000 5000 6000 7000 8000 7.2K
  8. Precision vs Cognition 1349745.667 1009345.667 Users (Q1) USA UK 901236.667

    EU 801236.667 ASIA 1.3M 1.0M Users (Q1) USA UK 0.9M EU 0.8M ASIA
  9. Right Visualization For Insights OK Gateway 200.5K RPS 200.5K 5xx

    Payments KPIs Status, Rounded Numbers 1.3M 1.0M Users (Q1) USA UK 0.9M EU 0.8M ASIA Comparisons Bar Graphs Composition Pie, Donut, Area charts Releases P95 latency Relationship Correlations Distributions
  10. F/Z Reading Patterns OK Gateway 200.5K RPS DOWN Payments 10.8K

    5xx OK Auth 50.5K RPS P95 Latency num(req) OK MySQL READS 100K WARN MySQL Slow Query 15 Uptime Gateway 99.99% Auth 99.99% payment 80.65% Cart 95.28% Review 99.28% OK MySQL WRITES 50K Scan Primary KPIs Skim Scan Secondary SLIs Scan Primary KPIs Skim Scan Secondary SLIs
  11. Emphasize Reading Patterns OK Gateway 200.5K RPS DOWN Payments 10.8K

    5xx OK Auth 50.5K RPS P95 Latency num(req) OK MySQL READS 100K WARN MySQL Slow Query 15 Uptime 99.99% Gateway OK MySQL WRITES 50K Color Size 80.65% Payment 95.28% Cart 99.28% Review Complexity We always look first for complex shapes Font Weight
  12. Contextual Grouping OK Gateway 200.5K RPS DOWN Payments 10.8K 5xx

    OK Auth 50.5K RPS P95 Latency num(req) OK MySQL READS 100K WARN MySQL Slow Query 15 Uptime 99.99% Gateway OK MySQL WRITES 50K 80.65% Payment 95.28% Cart 99.28% Review Database performance Application performance User Experience
  13. Dashboards Targeting Stakeholders BIZ TECH CEO PMs CTO Dir. Eng

    EMs Engineers Executive Summary - Tech + Biz Feature Performance Global Status - KPIs, SLOs SLI RED, USE
  14. Executive Summaries 1.8M Global Users +10% 4.5M Gateway Requests +20%

    50K Signups +15% 10K Live Users +12% 30K Payments +10% 10.5K $ Cost +10% 1K Payment Failures +90%
  15. Global Status 4.5M Gateway RPS +20% 10.8K 5xx Payments +30%

    100K Auth RPS +30% P95 Latency num(req) Uptime 99.99% Gateway 80.65% Payment 95.28% Cart 99.28% Review 100K Payment RPS +20% 400K Cart RPS +30% 50K Review RPS +20% 1.5K 4xx Auth +20% 0 5xx Review -35% 0 5xx Cart 0% 99.87% Auth 1M Reads 100K Writes 1K Slow MySQL Queries Auth Cart Payment Review
  16. Service Level Indicators 10.8K 5xx Payments +30% P95 Latency num(req)

    99.99% Gateway 100K RPS +20% 99.87% Auth 1M Reads GET /refund/:id GET /payment/:id POST /create POST /refund 80.65% Uptime -20% 1.5hrs Error Budget -45% 100K MySQL READS 15 MySQL Slow Queries 50K MySQL WRITES 5.8K 5xx SlyPay +30% 0.1K 5xx SharpPay +0% Top K Errors Unable to Connect to MySQL Endpoint - 1K Query timed out - 500 Unable to resolve DNS payment.slypay.com - 4K
  17. Debugging & RCA • RED Method ◦ Rate ◦ Errors

    ◦ Duration • USE Method ◦ Utilization ◦ Saturation ◦ Errors • Golden Signals ◦ Latency ◦ Traffic ◦ Errors ◦ Saturation • Create an inverted Pyramid of abstractions ◦ L7 Metrics ◦ L4 Metrics ◦ CPU, Memory, Disk, Network ◦ Logs, traces, events • Plot SLIs of dependent services & dependencies ◦ SLIs for 3rd party APIs ◦ SLIs for Cloud and K8s APIs ◦ SLIs for DBs, Caches and queues ▪ Separate Read / write paths ▪ Slow queries filtered for the service ◦ DNS calls
  18. Key Takeaways Shape Cognition Complexity Talk to Stakeholders Only Emphasize

    Important Data Use Information Architecture Color Size Separate Dashboards Drill-Down Contextual Relationships Storytelling using correlation