<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="/feed.rss.xml" type="text/xsl" media="screen"?>
<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Maxim Schepelin</title>
    <description></description>
    <link>https://speakerdeck.com/schepelin</link>
    <atom:link rel="self" type="application/rss+xml" href="https://speakerdeck.com/schepelin.rss"/>
    <lastBuildDate>2026-05-08 08:23:57 -0400</lastBuildDate>
    <item>
      <title>How to set SLOs, drive improvements, and make friends with business stakeholders</title>
      <description>Make reliability a shared priority, not just tech-speak. This session shows you how to frame SLOs in business terms, engage stakeholders, and use clear metrics to align technical and business priorities.</description>
      <media:content url="https://files.speakerdeck.com/presentations/e3a40b5c226a4c389d4b1e60ae7f26b4/preview_slide_0.jpg?39637744" type="image/jpeg" medium="image"/>
      <content:encoded>Make reliability a shared priority, not just tech-speak. This session shows you how to frame SLOs in business terms, engage stakeholders, and use clear metrics to align technical and business priorities.</content:encoded>
      <pubDate>Sun, 07 Jun 2026 00:00:00 -0400</pubDate>
      <link>https://speakerdeck.com/schepelin/how-to-set-slos-drive-improvements-and-make-friends-with-business-stakeholders</link>
      <guid>https://speakerdeck.com/schepelin/how-to-set-slos-drive-improvements-and-make-friends-with-business-stakeholders</guid>
    </item>
    <item>
      <title>The Formula for Faster Outage Recovery</title>
      <description>Production outages can be extremely costly. For a global business, just one minute of downtime can exceed an engineer’s annual compensation. So, how can engineering leaders strengthen incident response across their organizations?


After over a decade of building and operating systems used by millions of people, I’ve distilled an approach that helps engineering leaders strengthen their teams’ incident management practices, based on a simple formula:


Outage Duration = Time to Detect + Time to Acknowledge + Time to Repair.


To resolve outages quickly, we need to be efficient in all three stages. But shortening the time of each stage requires a coordinated mix of technical, process, and cultural changes. This is where engineering leaders can truly enable and empower their teams.


We’ll unpack each component and examine practical strategies—from tooling investments and observability practices to cultural habits and on-call readiness—that can dramatically shorten outage duration. By the end of the talk, you’ll learn how to set up your teams for success when they face an outage.

More engineering insight in my book: &lt;a href="https://leanpub.com/managers-compass"&gt;Engineering Manager’s Compass: Insights for building effective engineering organizations&lt;/a&gt;</description>
      <media:content url="https://files.speakerdeck.com/presentations/54b7df9d42cf4c04aec044fc648d0a3c/preview_slide_0.jpg?39340998" type="image/jpeg" medium="image"/>
      <content:encoded>Production outages can be extremely costly. For a global business, just one minute of downtime can exceed an engineer’s annual compensation. So, how can engineering leaders strengthen incident response across their organizations?


After over a decade of building and operating systems used by millions of people, I’ve distilled an approach that helps engineering leaders strengthen their teams’ incident management practices, based on a simple formula:


Outage Duration = Time to Detect + Time to Acknowledge + Time to Repair.


To resolve outages quickly, we need to be efficient in all three stages. But shortening the time of each stage requires a coordinated mix of technical, process, and cultural changes. This is where engineering leaders can truly enable and empower their teams.


We’ll unpack each component and examine practical strategies—from tooling investments and observability practices to cultural habits and on-call readiness—that can dramatically shorten outage duration. By the end of the talk, you’ll learn how to set up your teams for success when they face an outage.

More engineering insight in my book: &lt;a href="https://leanpub.com/managers-compass"&gt;Engineering Manager’s Compass: Insights for building effective engineering organizations&lt;/a&gt;</content:encoded>
      <pubDate>Fri, 08 May 2026 00:00:00 -0400</pubDate>
      <link>https://speakerdeck.com/schepelin/the-formula-for-faster-outage-recovery</link>
      <guid>https://speakerdeck.com/schepelin/the-formula-for-faster-outage-recovery</guid>
    </item>
  </channel>
</rss>
