Keep Calm and Carry On: Scaling Your Org With Microservices

Keep Calm and Carry On: Scaling Your Org With Microservices
Charity Majors, @mipsytipsy Bridget Kromhout, @bridgetkromhout

@mipsytipsy engineer, cofounder, CEO @bridgetkromhout #opslife, storyteller

What even is a microservice (No one knows)

What are microservices? • Independently deployable, small modular services •
Monorepo vs multiple repos • Decentralized governance • Small teams, up to maybe a dozen people (“two pizzas”) • Operating independently, interacting with other teams via APIs

Microservices are about changes.

Conway’s “Law”

Conway’s Law, post-Jobs

“Conway’s Law” is not a law (and the most important
word in Conway’s Law is “communication”)

Growing your microservices org • Interfaces and abstractions • Data
is just another service. • Ops is just another software engineering skill. • Implicit communication channels matter too. • Observability must be democratized.

• Team structure (Conway’s Law?) • Communication pathways • “Smarter
Edges”: For individual contributors • “Dumb Pipes”: for managers • Transitions are hard i can haz microservices?

YAS! Has microservices: just the good parts • Don’t get
religious. It’s not all or nothing. • What are your team’s strengths? What are their weaknesses? • Account for the operational cost

How many engineers do you have? How good are they
at operations? ** you need to be REALLY GOOD at operations to do microservices.

How many products/services do you really have? Use a big
fat service if it helps, plus some smaller ones Don’t microservice your shared libs, storage, or registry

Don’t reinvent too many wheels. new wheels have too many
unknown-unknowns (“choose boring technology”: still applies)

“Dear Twitter …”

“Software deploys … that take days to run, when they
run.” “I’m responsible for it, but I can’t log in to it.” Hard things are hard.

Interfaces and abstractions

Scaling considerations for services (also teams!!) • Scalability • Redundancy
and resiliency • Consensus knowledge of processes and arch • Load balancing, early warning alerts, graceful degradation • Communication problems to debug, black-box debugging with remote hands Interfaces

Your team is a service, your humans are nodes Interfaces

Interfaces Ownership is super key. Every service must be owned
by a human just like it must be served by a dedicated set of resources. we’ve all been on teams that spend more time circularly routing jobs around or blackholing them than actually fixing them

Management role #1: deﬁne the mission. Repeat the mission. Bore
everyone to death with the mission. Interfaces Management role #2: routing, load balancing, health checking

You can chaos monkey your people! You should!! \o/ Interfaces

Interfaces

(what the actual fuck? do it anyway.) Interfaces

Interfaces

Communication channels

Implicit communication channels matter just as much (more?) than explicit
channels. Comms

You can’t debug something you can’t name, describe, or understand.
Comms Understand your communication channels.

Pull requests, code reviews. Reporting structures. Work schedules. WFH vs
in-ofﬁce. 1x1s. On-call rotations, escalation paths. Promotions, interviews, recruiting, hiring pipelines, mentorship. Gossip. Happy hours. Comms Examples of communication ﬂows:

smart nodes, dumb pipes provision automatedly Managers’ job is primarily
facilitating nodes Comms

The more you map and understand these, the more power
you have to eﬀect change. Comms

On call questions • Who is on call? Is it
a necessary part of being an engineer? • How many rotations are there? • How often do people get woken up? *who* gets woken up? • How do you know? Who keeps track? • Are there diﬀerent rotations for stateful and stateless services, front-end and backend? • Is there an escalation path? Comms

Operations is just another software engineering skill

You can’t be an eﬀective SWE in a modern organization
without ops skills. Ops

Empowerment and responsibility go hand in hand … you can’t
ask someone to care about something and ﬁx it without also giving them the power Ops

Do SWEs have to be on call? shrug. it helps.
but it’s all about creating virtuous cause/effect loops Ops

Snowﬂakes are enormously costly. The larger your org gets, the
fewer snowﬂakes you are allowed to have. Ops

networking: common theme

Probe every software engineering candidate for their ops experience &
attitude. … yep, even FE/mobile devs!

“Operations is valued here.” you are signaling …

Senior software engineers should be reasonably good at these things.
So if they are not, don’t promote them. Operations engineering is about making systems maintainable, reliable, and comprehensible.

Data ... is just another service.

1) it’s impossible to treat stateful services exactly like stateless
services. Data

YOU SHOULD TRY. The more you treat your stateful services
like stateless ones, the more you win the future Data

Common pattern: state is the last to microserviceify. Monolith db
layer serves many small services. (because it’s hard, and usually is not the most evident source of pain) Data

In the future, YOU are the DBA.** Data ** (Everything
is going to be okay. Trust me.)

and from a DBA at a different company … …
Data

Observability … is the rock on which your castle is
built

Technical observability: debugging, monitoring, metrics, instrumentation Observability

People observability: 1x1s, email, asking questions. Looking at their face
and seeing if they are ok. Observability

If you’re doing microservices, you’re signing up for hard people
problems. Observability

Observability

You have a responsibility to your team’s well-being whether you’re
a manager or not. Observability

#truestory Observability

Observability

Talk to people BEFORE you launch any grand initiatives. Get
their buy-in.

if you didn’t … #truestory

seek feedback move forward <3 change is the only constant

Get buy-in from *all* stakeholders.

Tech leads, senior ICs

Most failures happen around transitions. • unpacking a monolith ->
microservices • rewriting from node.js into golang • acquiring or being acquired • migrating from hdfs in to new datastore • becoming a manager, or moving back to IC • getting married or divorced, having a kid

Choose the problems you are not going to solve, or
they will choose you.

Making decisions: Get ready to talk to people a lot
more about microservices. Sorry!

TL;DR: • Innovate only where you need to/where you'll gain
(and yes, this includes microservices, function-as-a service, and whatever's next) • Empower yourself; don't wait. Actively decentralize power and you'll decentralize points of failure. • Ask for permission strategically; move your org towards assume- yes. • Communication (implicit and explicit) is key to decentralizing & microservices • Look for the uncomfortable places. Be happy when you ﬁnd them; that's where you and the org can grow.

There is no fairy-tale answer Microservices give you ﬂexibility; the
rest is up to you, because hard things are still hard even when they're distributed and small.

Operability / Teams. • The mission • Build a cult
(j/k) (no really) • Let your team innovate.

most outages are triggered by “events”, from humans. draw a
line.

Pair responsibility with empowerment.

Have you considered … valuing non generalist SWEs and their
work?

Deploys On-Call Pull requests, arch reviews Observability Communication channels

Deploys

Deploys must be: • Fast. Rolling. Roll-back-able. • Reliable. Breaks
rarely. • Draws a tagged vertical line in graphs. • *Anyone* should be able to invoke deploy • For bonus points: canarying or automated

Revisit these tools regularly. part of every post mortem.

On Call

Haha, no.

What should leaders know? Managers, tech leads, and engineers

Things about leadership • Leadership is not a zero sum
game. The best leaders try to empower literally everyone to perform a leadership role in at least some areas. • Create guard-rails, not walls. • Be conventional in the big things (salary, org), unconventional in the small. • If you give a shit about diversity, don’t wait 'til you’re “ready” to hire them … look for ways to support underrepresented groups now. Make friends. Help people. Diversify your friend groups and personal networks. Be creative.

Management • Put the humans ﬁrst, and the mission a
close second • Be an enabler. Don’t starve your tech leads of growth opportunities by sucking all oxygen. • Reward intentionally. • Leadership is not zero-sum; encourage leadership everywhere • Managers, be friends with each other! Tolerance is not enough

The most powerful weapon in your arsenal is always cause
and eﬀect.

Engineers should be on call for their own services.

Yes but …. Yes, microservices helps you drift a little
bit and innovate independently … BUT, not as much as you might think. You all still share a fabric, after all. Stateful still gonna ruin your party. (and IPC, sec discovery, caching, cd pipelines, databases etc.)

• I don’t think anyone should approach management as a
thing they move in to permanently. It’s psychologically disﬁguring. • Nor is the maturation process one way. New teams within the company should be springing up. Hackathons can be a great way, esp if it involves dogfooding. Empathy needs constant renewal. • Practice making mistakes together. Practice cheerful apologies, asking questions, giving awkward feedback. It gets easier.

Charity Majors @mipsytipsy Bridget Kromhout @bridgetkromhout

Keep Calm and Carry On: Scaling Your Org With M...

Keep Calm and Carry On: Scaling Your Org With Microservices

More Decks by Charity Majors

Other Decks in Technology

Featured

Transcript