Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Datadog - IT Press Tour February 2018

Datadog - IT Press Tour February 2018

The IT Press Tour

March 01, 2018
Tweet

More Decks by The IT Press Tour

Other Decks in Technology

Transcript

  1. Real-time performance visualization What is Datadog A monitoring service for

    large – scale applications running on dynamic infrastructure Robust alerting Public Dashboarding and Collaboration Root Cause Correlation Historical Analysis Impossible d'afficher l'image. Votre ordinateur manque peut-être de mémoire pour ouvrir l'image ou l'image est endommagée. Redémarrez l'ordinateur, puis ouvrez à nouveau le fichier. Si le x rouge est toujours affiché, vous devrez peut-être supprimer l'image avant de la réinsérer.
  2. Quick Facts •  Founded in 2010 by Olivier Pomel and

    Alexis Le-Quoc •  Headquartered in NYC with offices in Boston and Paris •  Over 500 employees •  Over 6,000 enterprise customers •  $147.9 Million in venture capital funding to date •  100% growth in ARR from 6/1/2016 – 5/31/2017
  3. Infrastructure Architecture Development Stack Participants Monitoring Centralized Monolithic Waterfall Standardized,

    using stable, on-premise vendor software Infra (Governance) Dev (Participants)) NEXT GEN Distributed Microservice Agile Diverse, using quickly-evolving OSS and SaaS components Multiple Infra and Dev teams LEGACY
  4. Infrastructure-wide visibility Your Servers, Your Clouds, Yours Metrics, Your Apps,

    Your team. Together in one place. Compare and correlate metrics from multiple IT components Create custom KPIs and composite metrics Track events from the systems in your environment A B
  5. Scaling a unified monitoring platform through an AWS transition. Existing

    challenges •  AOL's systems were constantly failing due to increased load •  Ops and dev teams were missing key data contained in other team’s tools •  A patchwork of open source and custom-built monitoring tools made team collaboration challenging Compelling events •  AOL was transitioning to AWS company-wide •  The platform shift would also change processes and team structure •  Existing monitoring tools could not scale with the new dynamic platform and/or connect with data from AWS and other new support systems Solution •  Datadog provides unified infrastructure monitoring for AOL teams and their cloud environments, with data access for all teams that need it •  Product owners and corporate management receive high-level reporting and business metric visibility for all products monitored by Datadog Business impacts •  Engineers can correlate the monitoring data from upstream and downstream components to holistically troubleshoot issues in AOL's products •  The recurring time spent by AOL engineers from 45 different teams developing monitoring tools, are now applied to core AOL products
  6. Metrics Monitoring & Alerting -  Summarization of lots of data

    into Dashboards -  Long range storage & analysis -  Performance & efficient alert triggers
  7. APM Application Performance Profiling & Tracing -  Measuring lots of

    performance points at the source code level -  Tracing transactions into source code and across remote services (Service map representation) -  Architecture runtime monitoring
  8. Metrics Monitoring & Alerting APM Profiling & Tracing Logs Troubleshooting,

    Debugging, Support & Auditing Troubleshoot Understand Identify trends Long range metrics New alerts See Func. calls See Service calls See operation details See error details Analyze Txns Analyze Services interactions Perf. Statistics New alerts Tech. & Services Integrations working hand-in-hand brings: -  Easy set-ups -  Same infrastructure tags -  Better default dashboards Well controlled data acquisition that unlocks all the potential (eg. deep linking, machine learning, etc.)
  9. Logs Explorer Search bar: -Contains all filters -In sync with

    URL -Auto-complete Facets: -Filtering -Analytics Time range Filter "pills" Columns: -Log attributes -Infra. tags