Breaking Down The Monolith

- CTO, Co-founder of RisingStack - @slashdotpeter on Twitter -
https://blog.risingstack.com $ whoami

Microservices Img: http://ryanjbaxter.com

Moving to microservices

- Co-located teams can work more efficiently - different space
/ time zone - Smaller teams are usually more efficient - More focused people - Faster onboarding for new developers Possible business reasons to move

- Improved fault isolation - Independent development - Independent deployment
- Allows technology diversity Technology benefits

- Increasing architectural complexity - Increasing operational complexity - Monitoring
and debugging are much harder - Handling eventual Consistency Technology drawbacks

microservice architecture is not a silver bullet

Why we moved?

- Microservice monitoring tool - Node.js focused - Built in
Node.js - We migrated to microservices architecture Trace by RisingStack

- We wanted to have more focused teams - We
have very different challenges - Fault tolerance is a key in our business Why we moved?

Before we moved

How we moved?

Services and teams

- Separate DB per service - Maximize the depth of
service call chains Service principles

- We create backward compatible endpoints - We don’t version
services (only endpoints) - Use feature flippers/toggles (dark launches) Service principles

- Good and available documentation - Update docs and code
together Good to start here: https://github.com/Yelp/service-principles Service principles

Automation

- Automation is a key in building and operating microservices
- Easy and fast to deploy - Easy and fast to rollback - Testing, Service bootstrap, DB migration etc. Automation

Proxy / API Gateway

- Client(s) specific things: - Authentication: Cookie headers, JWT token
etc. - Protocols: http, WebSocket etc. - Response format: JSON, XML etc. - Combining resources: from multiple services API Gateway

API Gateway Img: http://bits.citrusbyte.com/microservices

Fault tolerance

- Services fail separately (in theory) - Critical resources should
be cached - Messaging queues can help (HTTP request is not recoverable) Fault tolerance

Fault tolerant data collection - CQRS read write

Fault tolerant data collecting Queue size increasing during issue Queue
size decreasing after issue resolved

Caching

- On service level - Automatically via response headers -
Via our communication package - Multi store caching (in memory, Redis): https://www.npmjs.com/package/cache-manager Caching

Security

- Trusted sources (services) on public channel - Request signing
between services - Significant CPU overhead Request signing

Request signing

Infrastructure

- Started with PaaS - We moved to Kubernetes -
Zero downtime deployment Infrastructure

Zero downtime deployment Deploy

- Graceful shutdown - Rolling deployment - Self healing applications
- Horizontal autoscaling Infrastructure

Monitoring and debugging

Microservices producing lot’s of data - Logs, Errors - Metrics
per service - Transactions - Logical connections - Side effects

microservice monitoring is impossible for humans

How to find an issue? Alerting, Dashboard Topology Metrics, Errors
Traces Profiler Investigate a service Locate on code level Understand connections Identify participating services Always know about it

Distributed tracing

- Transaction ID, Correlation ID - Google Dapper white paper
- Trace by RisingStack - Zipkin - OpenTracing Distributed tracing

Case study: Network delay

Case study: Network delay 3.8ms 564.8ms 95th response time: Call
depth: 1 service 2 services

Case study: Network delay network delay

network delay is evil in microservices

- https://blog.risingstack.com/tag/node-js-at-scale - https://www.martinfowler.com/microservices - https://microserviceweekly.com/ - https://trace.risingstack.com What’s next?

Thanks!

Breaking Down The Monolith - NodeConfBP

Breaking Down The Monolith - NodeConfBP

More Decks by Peter Marton

Other Decks in Programming

Featured

Transcript